Automatic Interlinear Glossing for Otomi Language

Total Page:16

File Type:pdf, Size:1020Kb

Automatic Interlinear Glossing for Otomi Language Automatic Interlinear Glossing for Otomi language Diego Barriga1;3 Victor Mijangos1;3 Ximena Gutierrez-Vasques2;3 1Universidad Nacional Autónoma de México (UNAM) 2URPP Language and Space, University of Zürich 3Comunidad Elotl {dbarriga, vmijangosc}@ciencias.unam.mx [email protected] Abstract Sentence hí tó=tsogí Glossing NEG 3.PRF=leave In linguistics, interlinear glossing is an essen- Translation ’I have not left it’ tial procedure for analyzing the morphology of languages. This type of annotation is useful Table 1: Example of morpheme-by-morpheme glosses for language documentation, and it can also for Otomi provide valuable data for NLP applications. We perform automatic glossing for Otomi, an under-resourced language. Our work also com- time consuming task that requires linguistic exper- prises the pre-processing and annotation of the tise. In particular, low-resource languages lack of corpus. documentation and language technologies (Mager We implement different sequential labelers. et al., 2018). CRF models represented an efficient and good solution for our task (accuracy above 90%). Our aim is to successfully produce automatic Two main observations emerged from our glossing annotation in a low resource scenario. We work: 1) models with a higher number of pa- focus on Otomi of Toluca, an indigenous language rameters (RNNs) performed worse in our low- spoken in Mexico (Oto-Manguean family). This resource scenario; and 2) the information en- is a morphological rich language with fusional coded in the CRF feature function plays an im- tendency. Moreover, it has scarcity of digital re- portant role in the prediction of labels; how- sources, e.g., monolingual and parallel corpora. ever, even in cases where POS tags are not available it is still possible to achieve competi- Our initial resource is a small corpus transcribed tive results. into a phonetic alphabet. We pre-process it and we perform manual glossing. Once we have this 1 Introduction dataset, we use it for training an automatic glossing system for Otomi. One of the important steps of linguistic documenta- tion is to describe the grammar of a language. Mor- By using different variations of Conditional Ran- phological analysis constitutes one of the stages dom Fields (CRFs), we were able to achieve good for building this description. Traditionally, this is accuracy in the automatic glossing task (above done by means of interlinear glossing. This is an 90%), regardless the low-resource scenario. Fur- annotation task where linguists analyze sentences thermore, computationally more expensive meth- in a given language and they segment each word ods, i.e., neural networks, did not perform as well. with the aim of annotating the morphosyntactic cat- We also performed an analysis of the results egories of the morphemes within this word (see from the linguistics perspective. We explored the example in Table1). automatic glossing performance for a subset of la- This type of linguistic annotated data is a valu- bels to understand the errors that the model makes. able resource not only for documenting a language Our work can be a helpful tool for reducing the but it can also enable NLP technologies, e.g., by workload when manually glossing. This would providing training data for automatic morphologi- have an impact on language documentation. It can cal analyzers, taggers, morphological segmentation, also lead to an increment of annotated resources etc. for Otomi, which could be a starting point for de- However, not all languages have this type of an- veloping NLP technologies that nowadays are not notated corpora readily available. Glossing is a yet available for this language. 34 Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 34–43 June 11, 2021. ©2021 Association for Computational Linguistics 2 Background able source of morphological information for sev- eral NLP tasks. For instance, it could be used As we have mentioned before, glossing comprises to train state-of-the-art morphological segmenta- describing the morphological structure of a sen- tion systems for low-resource languages (Kann and tence by associating every morpheme with a mor- Schütze, 2018). The information contained in the phological label or gloss. In a linguistic gloss, there glosses is also helpful for training morphological are usually three levels of analysis: a) the segmen- reinflection systems (Cotterell et al., 2016), this tation by morphemes; b) the glosses describing consists in predicting the inflected form of a word these morphemes; and c) the translation or lexical given its lemma. It also can help in the automatic correspondences in a reference language. generation of morphological paradigms (Moeller Several works have tried to automatize this task et al., 2020). by using computational methods. In Snoek et al. These morphological tools can then be used to (2014), they use a rule-based approach (Finite State build downstream applications, e.g., machine trans- Transducer) to obtain glosses for Plains Cree, an lation, text generation. It is noteworthy that these Algonquian language. They focus only on the anal- are language technologies that are not yet available ysis of nouns. Samardzic et al.(2015) propose a for all languages, especially for under-resourced method for glossing Chintang language; they di- ones. vide the task into grammatical and lexical gloss- ing. Grammatical glossing is approached as a su- 3 Methodology pervised part-of-speech tagging, while for lexical glossing, they use a dictionary. A fully automatized 3.1 Corpus procedure is not performed since word segmenta- Otomi is considered a group of languages spoken tion is not addressed. in Mexico (around 300,000 speakers). It belongs Some other works have approached the whole to the Oto-Pamean branch of the Oto-Manguean pipeline of automatic glossing as a supervised tag- family (Barrientos López, 2004). It is a morpho- ging task using machine learning sequential mod- logically rich language that shows particular phe- els, and they have particularly focused on under- nomena (Baerman et al., 2019; Lastra, 2001): resourced languages (Moeller and Hulden, 2018; Anastasopoulos et al., 2018; Zhao et al., 2020). In • fusional patterns for the inflection of the verbs Anastasopoulos et al.(2018), they make use of (it fuses person, aspect, tense and mood in a neural-based models with dual sources, they lever- single affix); age easy-to-collect translations. In Moeller and Hulden(2018), they perform au- • a complex system of inflectional classes; tomatic glossing for Lezgi (Nakh-Daghestanian • stem alternation, e.g., dí=pädi ‘I know’ and family) under challenging low-resource condi- bi=mbädi ‘He knew’; tions. They implement different methods, i.e., CRF, CRF+SVM, Seq2Seq neural network. The best re- • complex morphophnological patterns, e.g., sults are obtained with a CRF model that leverages dí=pädi ‘I know’, dí=pä-hu ‘We know’; POS tags. The glossing is mainly focused on tag- ¯ ging grammatical (functional) morphemes. While • complex noun inflectional patterns. the lexical items are tagged simply as stems. This latter approach especially influences our Furthermore, digital resources are scarce for this work. In fact, Moeller and Hulden(2018) highlight language. the importance of testing these models on other lan- We focus on the Otomi of Toluca variety.1 Our guages, particularly polysynthetic languages with starting point is the corpus compiled by Lastra fusion and complex morphonology. Our case of (1992), which is comprised of narrations and dia- study, Otomi, is precisely a language highly fu- logues. The corpus was originally transcribed into sional with complex morphophonological patterns, a phonetic alphabet. We pre-processed this corpus, as we will discuss on Section3. i.e., we performed digitization and orthographic Finally, automatic glossing is not only crucial 1An Otomi language spoken in the region of San Andrés for aiding linguistic research and language docu- Cuexcontitlán, Toluca, State of Mexico. Usually regarded as mentation. This type of annotation is also a valu- ots (iso639). 35 normalization.2 We used the orthographic standard each of the morphemes within the words, as it is proposed by INALI (INALI, 2014), although we shown in the Example1. Translation implies a had problems in processing the appropriate UTF-8 different level of analysis and, due to the scarce representations for some of the vocals (Otomi has digital resources, it is not addressed here. a wide range of vowels). Similar to previous works, we use a closed set of The corpus, then, was manually tagged,3 i.e., labels, i.e., we have labels for all the grammatical interlinear glossing and Part Of Speech (POS). We (functional) morphemes and a single label for all followed the Leipzig glossing rules (Comrie et al., the lexical morphemes (stem). We can see in the 2008). Example1 that morphemes like tsogí (‘leave’) are labeled as stem. Domain Count Narrative 32 (1) hí tó=tsogí Dialogues 4 NEG 3.PRF=stem Total sentences 1769 Once we have a gloss label associated to each Total words (tokens) 8550 morpheme, we prepare the training data, i.e., we pair each letter with a BIO-label. BIO-labeling Table 2: General information about the Otomi corpus consists on associating each original label with a Beginning-Inside-Outside (BIO) label. This means In addition to this corpus, we included 81 extra that each position of a morpheme is declared either short sentences that a linguist annotated; these ex- as the beginning (B) or inside (I). We neglected O amples contained particularly difficult phenomena, (outside). BIO-labels include the morpheme cate- e.g., stem alternation, reduction of the stem and gory (e.g. B-stem) or affix glosses (e.g. B-PST, for others. Table2 contains general information about past tense). For example, the labeled representation the final corpus size. of the word tótsogí would be as follows: We also show in Table3 the top ten most com- mon POS tags and gloss labels in the corpus.
Recommended publications
  • Malayalam Noun and Verb Morphological Analyzer: a Simple Approach
    Malayalam Noun and Verb Morphological Analyzer: A Simple Approach Nimal J Valath1, Narsheedha Beegum2 M.Tech Student1, M.Tech Student2 ABSTRACT Malayalam is a language of the Dravidian family and is one of This paper discusses the methods involved in the the four major languages of this family with a rich literary development of a Simple Malayalam Verb and Noun tradition. It is very close to Tamil, one of the major languages of Morphological Analyzer. Since in Malayalam, words can be the same family. This was due to the extensive cultural derived from a root word, a purely dictionary based approach synthesis that took place between the speakers of the two for Morphological analysis is not practical. Hence, a ‘Rule-cum- languages. The origin of Malayalam as a distinct language may Dictionary’ based approach is followed along with the Suffix be traced to the last quarter of 9th Century A.D. Throughout its Stripping concept. The grammatical behavior of the language, gradual evolution Malayalam has been influenced by the the formation of words with multiple suffixes and the various circumstances prevailed on different periods. preparation of the language are dealt with here, with examples of noun and verb forms in detail. Mainly Malayalam was influenced by Sanskrit and Prakrit brought into Kerala by Brahmins. After the 11th century a unique mixture of the native languages of Kerala and Sanskrit Keywords known as Manipravalam served as the medium of literary Morphological Analyzer, Malayalam, Suffix stripping, expression. Malayalam absorbed a lot from Sanskrit, not only in Transliteration, Retransliteration, Verb and Noun, Sandhi rules, the lexical level, but also in the phonemic, morphemic and Word Formation, Noun Cases, Algorithm.
    [Show full text]
  • Unsupervised Separation of Transliterable and Native Words For
    Unsupervised Separation of Transliterable and Native Words for Malayalam Deepak P Queen’s University Belfast, UK [email protected] Abstract names that need to be transliterated than translated to correlate with English text. On a manual analy- Differentiating intrinsic language words sis of a news article dataset, we found that translit- from transliterable words is a key step erated words and proper nouns each form 10-12% aiding text processing tasks involving dif- of all distinct words. It is useful to transliterate ferent natural languages. We consider such words for scenarios that involve processing the problem of unsupervised separation of Malayalam text in the company of English text; transliterable words from native words for this will avoid them being treated as separate index text in Malayalam language. Outlining a terms (wrt their transliteration) in a multi-lingual key observation on the diversity of char- retrieval engine, and help a statistical translation acters beyond the word stem, we develop system to make use of the link to improve effec- an optimization method to score words tiveness. In this context, it ia notable that there has based on their nativeness. Our method re- been recent interest in devising specialized meth- lies on the usage of probability distribu- ods to translate words that fall outside the core vo- tions over character n-grams that are re- cabulary (Tsvetkov and Dyer, 2015). fined in step with the nativeness scorings In this paper, we consider the problem of sepa- in an iterative optimization formulation. rating out such transliterable words from the other Using an empirical evaluation, we illus- words within an unlabeled dataset; we refer to trate that our method, DTIM, provides sig- the latter as “native” words.
    [Show full text]
  • Otomi and Mazahua
    Language and Culture Archives Bartholomew Collection of Unpublished Materials SIL International - Mexico Branch © SIL International NOTICE This document is part of the archive of unpublished language data created by members of the Mexico Branch of SIL International. While it does not meet SIL standards for publication, it is shared “as is” under the Creative Commons Attribution- NonCommercial-ShareAlike license (http://creativecommons.org/licenses/by-nc- sa/4.0/) to make the content available to the language community and to researchers. SIL International claims copyright to the analysis and presentation of the data contained in this document, but not to the authorship of the original vernacular language content. AVISO Este documento forma parte del archivo de datos lingüísticos inéditos creados por miembros de la filial de SIL International en México. Aunque no cumple con las normas de publicación de SIL, se presenta aquí tal cual de acuerdo con la licencia "Creative Commons Atribución-NoComercial-CompartirIgual" (http://creativecommons.org/licenses/by-nc- sa/4.0/) para que esté accesible a la comunidad y a los investigadores. Los derechos reservados por SIL International abarcan el análisis y la presentación de los datos incluidos en este documento, pero no abarcan los derechos de autor del contenido original en la lengua indígena. THE UNIVERSITY OF CHICAGO THE RECONSTRUCTION OF OTOPAMEAN (MEXICO) A DISSERTATION SUBMITTED TO THE FACULTY OF THE DIVISION OF THE HUMANITIES IN CANDIDACY FOR THE DEGREE OF DOCTOR OF PHILOSOPHY DEPARTMENT
    [Show full text]
  • Children's Possessive Structures
    Ling 404 Lecture Notes No.4 Synchronic Clines in Morphology Productivity Clines: Japanese, Navajo, Mohawk, Eskimo, vs. Spanish, Italian, English One of the questions will be asking throughout is where do the above languages fall on the productivity cline and why (providing data and analyses.) One question to ask is to what degree is the lexicon ‘morpheme-based’ or ‘word-based’ (Chapters 3-4)? ‘Working Memory’ is implicated in the choice. For instance, are speakers all equally productive with their morpheme-units as tucked within word, a [-Fusion] language, or do some languages rather require morphemes within words to be memorized and incorporated as part of the lexical item, a [+Fusion] language? In addition to labeling languages as [+]Synthetic (as in Turkish, Hungarian, Spanish, English, Mohawk) or [-]Synthetic (as in Chinese, Vietnamese), the following data provide an additional exercise in teasing out what might be going on in languages which carry such a large number of embedded morphemes. Question leading to mid-term: How might such morphemes in Polysynthetic-type languages be stored and processed? The notion of [+/-Fusion] will be the central question here as we move to our mid-term material of this class. Typically, agglutinative morphemes are considered to be ‘loosely’ structured in that one morpheme has a one-to-one meaning. But the question might rather be how productive are the morphemes in isolation—viz., do they allow for movement? The question of movement, both at the word-level and at the morpheme-level will allow us to determine the nature of +/- productivity and +/- fusion. Regarding Movement.
    [Show full text]
  • Abstracts of the Psychonomic Society — Volume 13 — November 2008 49Th Annual Meeting — November 13–16, 2008 — Chicago, Illinois
    Abstracts of the Psychonomic Society — Volume 13 — November 2008 49th Annual Meeting — November 13–16, 2008 — Chicago, Illinois Papers 1–7 Friday Morning Motion and Attention motion is mixed. In the present study, target stimuli were associated with Grand Ballroom, Friday Morning, 8:00–9:40 objects that loomed, receded or remained static in arrays of varying size. Results showed that both motion types received equal prioritization, as Chaired by Gary Chon-Wen Shyi, National Chung Cheng University evidenced by their parallel search slopes, yet looming targets benefited from an overall reduction in reaction time (RT). Further investigation 8:00–8:15 (1) ruled out possible confounding explanations for this RT advantage, Differential Distribution of Visuospatial Attention in Tracking Mul- while a perceptual measure of performance confirmed the attentional tiple Moving Objects. GARY C.-W. SHYI & SUNG-EN CHIEN, Na- equivalence of the two motion types. Taken together, these results in- tional Chung Cheng University—Paying attention to a relatively com- dicate that looming and receding objects receive equal prioritization plex object has shown evidence for differential distribution within the during attentional selection. However, it may be that postattentional pro- object. Here, we explored differential distribution of attention when mul- cesses, possibly those involved in motor preparation, facilitate responses tiple moving objects were visually tracked. In Experiments 1 and 2, we to looming motion. not only replicated the findings reported by Alvarez and Scholl (2005), demonstrating both attentional concentration and attentional amplifica- 9:20–9:35 (5) tion, but also generalized the effects to uniformed circular shapes. In Individual Differences in Voluntary Visual Attention.
    [Show full text]
  • Studie V Příruční Studovně Separátů
    Studie v příručce separátů LING (č. 419a) Sestavil: Jan Křivan Afrika (africké jazyky) • Kamwe (Higi) Mohrlang, Roger (1971). VECTORS, PROSODIES, AND HIGI VOWELS. Journal of African Languages . Téma: fonetika. Více informací. • Ejagham (Etung) Edmondson, T. & Bendor-Samuel, J.T. (1966). TONE PATTERNS OF ETUNG. Journal of African Languages . Téma: fonetika. Více informací. • Igede (Igede) Bergman, Richard (1971). VOWEL SANDHI AND WORD DIVISION IN IGEDE. J.W.A.L.. Téma: fonologie. Více informací. • Hun-Saare (Duka) Bendor-Samuel, John & Cressman, Esther & Skitch, Donna (1971). THE NOMINAL PHRASE IN DUKA. J.W.A.L.. Téma: syntax. Více informací. • Nuclear Igbo (Izi) Bendor-Samuel, J.T. (1968). VERB CLUSTERS IN IZI. J.W.A.L.. Téma: syntax. Více informací. • Nuclear Igbo (Izi) Bendor-Samuel, J.T. & Meier, Inge (1967). SOME CONTRASTING FEATURES OF THE IZI VERBAL SYSTEM. Institute of linguistics. Téma: syntax. Více informací. • Legbo (Agbo) Bendor-Samuel J.T. & Spreda, K.W. (1969). FORTIS ARTICULATION: A FEATURE OF THE PRESENT CONTINUOUS VERB IN AGBO. Linguistics An International Review. Téma: fonetika. Více informací. • Lokaa (Yakur) Bendor-Samuel, J.T. (1969). YAKUR SYLLABLE PATTERNS. Word. Téma: fonologie. Více informací. • Mossi – Dagbani – Gourmanchema (Afrikanische Sprachen) Westermann, Diedrich (1947). PLURALBILDUNG UND NOMINALKLASSEN IN EINIGEN AFRIKANISCHEN SPRACHEN. Abhandlungen der deutschen Akademie der Wissenschaften zu Berlin. Téma: srovnání. Více informací. • Dagbani (Dagbani) Wilson, W.A.A. & Bendor-Samuel J.T. (1969). THE PHONOLOGY OF THE NOMINAL IN DAGBANI. Linguistics An International Review. Téma: fonologie. Více informací. • Bimoba – Moba (Bimoba) Bendor-Samuel, John T. (1965). PROBLEMS IN THE ANALYSIS OF SENTENCES AND CLAUSES IN BIMOBA. Word. Téma: syntax. Více informací.
    [Show full text]
  • Abstracts (PDF)
    Abstracts of the Psychonomic Society — Volume 12 — November 2007 48th Annual Meeting — November 15–18, 2007 — Long Beach, California Papers 1–7 Friday Morning Implicit Memory is a widely used experimental procedure that has been modeled in Regency ABC, Friday Morning, 8:00–9:20 order to obtain ostensibly separate measures for explicit memory and implicit memory. Existing models are critiqued here and a new con- Chaired by Neil W. Mulligan fidence process-dissociation (CPD) model is developed for a modi- University of North Carolina, Chapel Hill fied process-dissociation task that uses confidence ratings in con- junction with the usual old/new recognition response. With the new 8:00–8:15 (1) model there are several direct tests for the key assumptions of the stan- Attention and Auditory Implicit Memory. NEIL W. MULLIGAN, dard models for this task. Experimental results are reported here that University of North Carolina, Chapel Hill—Traditional theorizing are contrary to the critical assumptions of the earlier models for the stresses the importance of attentional state during encoding for later process-dissociation task. These results cast doubt on the suitability memory, based primarily on research with explicit memory. Recent re- of the process-dissociation procedure itself for obtaining separate search has investigated the role of attention in implicit memory in the measures for implicit and explicit memory. visual modality. The present experiments examined the effect of divided attention on auditory implicit memory, using auditory perceptual iden- Attentional Selection and Priming tification, word-stem completion, and word-fragment completion. Par- Regency DEFH, Friday Morning, 8:00–10:00 ticipants heard study words under full attention conditions or while si- multaneously carrying out a distractor task.
    [Show full text]
  • Processing Distinctions Between Stems and Affixes: Evidence from a Non-Fluent Aphasic Patient*
    cognition, 36 (1990) 129-153 Processing distinctions between stems and affixes: Evidence from a non-fluent aphasic patient* LORRAINE K. NLER SUSANBEHRENS HOWARD COBB University of Cambridge WlLLlAM MARSLEN-WILSON MRC Applied Psychology Unit, Cambridge Received June 2, 1989, final revision accepted February 19, 1990 Abstract Tyler, L.K., Behrens, S., Cobb, H., and Marsten-Witson, W., 1990. Processing distinctions between stems and affixes: Evidence from a non-fluent aphasic patient. Cognition, 36: 129-153. In this paper, we study the ability of a non-fluent aphasic patient, BX, to comprehend morphologically complex words when they appear in utterance contexts. Wefirst establish that he is insensitive to the contextual appropriateness of both derived and inflected words. In a further experiment we show that he has no difficulty processing the stems of complex words and conclude th-zt his problem is with the bound morphemes themselves. We then ask whether this problem is due to his inability to access either the phonological form of a morphologically complex word or its semantic andlor syntactic content. We find that only the access of semantic and syntactic content is impaired. We conclude from these six studies that: (a) BN presents a counter-example to the claim that non-fluent patients have particular dificulty with those aspects of morphology which have a syntactic function; (b) BN processes both derived and inflected words by mapping the sensory input onto the entire full-form of a complex word, but the semantic and syntactic content of the stem alone is accessed and integrated into the context. The semantic and syntactic implica- tions of the suffix are never evaluated.
    [Show full text]
  • Mathematics of the Hñähñu: the Otomies Thomas E. Gilsdorf
    Mathematics of the Hñähñu: the Otomies Thomas E. Gilsdorf Department of Mathematics University of North Dakota 101 Cornell Street, Mail Stop 8376 Grand Forks, ND 58202-8376 [email protected] Abstract: English: This article is about the mathematics of the cultural group known as the Otomies, of present day central Mexico. In particular, I discuss the Otomi number system and a comparison of that system with Aztec counting, Otomi art and decoration, mathematical symbols that appear in some Mesoamerican codices, and the Otomi calendar. Español: Este artículo es del tema de matemáticas culturales de los Otomíes, de la región central de lo que hoy en día es México. En particular, describo el sistema numérico de los Otomíes y una comparación con el sistema numérico de los Aztecas, el arte de los Otomíes, los símbolos matemáticos que aparecen en unos códices mesoamericanos, y el calendario otomí. 1. Who are the Hñähñu? The cultural group we are going to consider is generally referred to as the Otomies (pronounced “oh- toh- MEES”). The name Hñähñu (“hñ” is pronounced like the “ny” in “canyon”, preceded by a nasalized “h”) is one of the names by which people of this culture refer to themselves. In general, people of Hñähñu culture use a variant of this word when referring to themselves with other members of the same culture, but use the word Otomi in communications in Spanish or with people from outside the culture. The term Otomi has been used in a derogatory way since before the arrival of the Spanish. Nevertheless, there is some consensus among anthropologists who study Hñähñu culture that it is better to use the word Otomi, presumably in a positive way, so as to work toward creating a more positive image of the people who are often referred to with this word.
    [Show full text]
  • The Genetic History of the Otomi in the Central Mexican Valley
    University of Pennsylvania ScholarlyCommons Anthropology Senior Theses Department of Anthropology Spring 2013 The Genetic History Of The Otomi In The Central Mexican Valley Haleigh Zillges University of Pennsylvania Follow this and additional works at: https://repository.upenn.edu/anthro_seniortheses Part of the Anthropology Commons Recommended Citation Zillges, Haleigh, "The Genetic History Of The Otomi In The Central Mexican Valley" (2013). Anthropology Senior Theses. Paper 133. This paper is posted at ScholarlyCommons. https://repository.upenn.edu/anthro_seniortheses/133 For more information, please contact [email protected]. The Genetic History Of The Otomi In The Central Mexican Valley Abstract The Otomí, or Hñäñhü, is an indigenous ethnic group in the Central Mexican Valley that has been historically marginalized since before Spanish colonization. To investigate the extent by which historical, geographic, linguistic, and cultural influences shaped biological ancestry, I analyzed the genetic variation of 224 Otomí individuals residing in thirteen Otomí villages. Results indicate that the majority of the mitochondrial DNA (mtDNA) haplotypes belong to the four major founding lineages, A2, B2, C1, and D1, reflecting an overwhelming lack of maternal admixture with Spanish colonizers. Results also indicate that at an intra-population level, neither geography nor linguistics played a prominent role in shaping maternal biological ancestry. However, at an inter-population level, geography was found to be a more influential determinant. Comparisons of Otomí genetic variation allow us to reconstruct the ethnic history of this group, and to place it within a broader-based Mesoamerican history. Disciplines Anthropology This thesis or dissertation is available at ScholarlyCommons: https://repository.upenn.edu/anthro_seniortheses/133 THE GENETIC HISTORY OF THE OTOMI IN THE CENTRAL MEXICAN VALLEY By Haleigh Zillges In Anthropology Submitted to the Department of Anthropology University of Pennsylvania Thesis Advisor: Dr.
    [Show full text]
  • Mitochondrial DNA Analysis of Mazahua and Otomi Indigenous
    Wayne State University Human Biology Open Access Pre-Prints WSU Press 1-1-2018 Mitochondrial DNA Analysis of Mazahua and Otomi Indigenous Populations From Estado de Mexico Suggests a Distant Common Ancestry Angelica Gonzalé z-Oliver Facultad de Ciencias, Departamento de Biologiá Celular, Universidad Nacional Autoń oma de Mex́ ico, Ciudad de Mex́ ico, Mex́ ico, [email protected] Ernesto Garfias-Morales Facultad de Ciencias, Departamento de Biologiá Celular, Universidad Nacional Autoń oma de Mex́ ico, Ciudad de Mex́ ico, Mex́ ico, [email protected] D G. Smith Department of Anthropology, University of California, Davis, Davis, California, USA, [email protected] Mirsha Quinto-Sánchez Facultad de Medicina, Departamento de Biologiá Celular, Universidad Nacional Autoń oma de Mex́ ico, Ciudad de Mex́ ico, Mex́ ico, [email protected] Recommended Citation Gonzaleź -Oliver, Angelica; Garfias-Morales, Ernesto; Smith, D G.; and Quinto-Sánchez, Mirsha, "Mitochondrial DNA Analysis of Mazahua and Otomi Indigenous Populations From Estado de Mexico Suggests a Distant Common Ancestry" (2018). Human Biology Open Access Pre-Prints. 125. https://digitalcommons.wayne.edu/humbiol_preprints/125 This Article is brought to you for free and open access by the WSU Press at DigitalCommons@WayneState. It has been accepted for inclusion in Human Biology Open Access Pre-Prints by an authorized administrator of DigitalCommons@WayneState. Mitochondrial DNA Analysis of Mazahua and Otomi Indigenous Populations from Estado de Mexico Suggests a Distant Common Ancestry Angélica González-Oliver,1* Ernesto Garfias-Morales,1 D. G. Smith,2 Mirsha Quinto-Sánchez3 1Facultad de Ciencias, Departamento de Biología Celular, Universidad Nacional Autónoma de México, Ciudad de México, México.
    [Show full text]
  • Word Formation
    English 256 Spring 2005 Word (Stem) Formation I. Derivation A. Affixation 1. Prefixation a. English i. write/rewrite read/reread injure/reinjure shoot/reshoot play/replay do/redo move/remove tell/retell ii. large/enlarge dear/endear able/enable purple/empurple rich/enrich iii. do/undo tie/untie fasten/unfasten wrap/unwrap cover/uncover dress/undress latch/unlatch lock/unlock iv. kind/unkind easy/uneasy fit/unfit believable/unbelievable gentle/ungentle afraid/unafraid bound/unbound b. Maricopa (Southwest American Indian) i. a’uly ‘carry (something long)’ aduuly ‘hide (something long)’ ch’uly ‘carry (things)’ chduuly ‘hide (things)’ t’uly ‘carry (something round)’ tduuly ‘hide (something round)’ a’or ‘put (something long) on’ anak ‘put (something long) in liquid’ t’or ‘put (something round) on’ chnak ‘put (things) in liquid’ tnak ‘put (something round) in liquid’ ii. puy ‘die’ tpuy ‘kill’ pom ‘burn’ tpom ‘burn (something)’ ‘er ‘be scared’ t’er ‘scare’ herher ‘be wrinkled’ therher ‘wrinkle (something)’ hmaaly ‘be white’ thmaaly ‘white’ hpily ‘be dirty’ thpily ‘make dirty’ 2. Suffixation a. English i. writer/writer read/reader do/doer work/worker fly/flier sing/singer dance/dancer drive/driver run/runner ii. type/typist art/artist violin/violinist manicure/manicurist left/leftist race/racist duel/duelist iii. quick/quickly childish/childishly true/truly angry/angrily final/finally truthful/truthfully false/falsely amazing/amazingly happy/happily correct/correctly real/really English 256 Spring 2005 b. Kurdish (Near Eastern) aaqil ‘wise’ aaqilii ‘forethought’ draiz ‘long’ draizii ‘length’ diz ‘robber’ dizii ‘robbery’ garm ‘warm’ garmii ‘warmth’ c. Hanunoo (Philippines) usa ‘one’ ?usahi ‘make it one!’ upat ‘four’ upati ‘make it four!’ duwa ‘two’ duwahi ‘make it two!’ unum ‘six’ unumi ‘make it six!’ tulu ‘three’ tuluhi ‘make it three!’ 3.
    [Show full text]