Linguistic Distance among Gurage Language Varieties

doi:10.21203/rs.3.rs-2469632/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

This article measures linguistic distance and classifies the Gurage language varieties, which are branches of the South Ethiosemitic languages. Previous classifications were inconsistent in sub-grouping the language varieties because of a shortage of data on each variety and because of the methodologies used. The present study included data from all Gurage varieties and used two methods: phonetic distance measured statistically with Cog.1.3.6.1 and Gabmap, and morphological affixes discussed qualitatively to strengthen the findings from the statistical measures. We had 60 informants, 4 speakers from each language variety. Data were collected over three years. The findings showed that there are two major groups: Dumi Gurage (Zay, Silt’e, and Wolane), and Gunnӓn Gurage, which is grouped into West Gurage and North Gurage. West Gurage is sub-classified into the Teʃә-group (Chaha, Gura, Gumer, Ezha, and Gyeto) and the K^wese-group (Inor, Ener, and Endegagn). North Gurage is sub-grouped into the Nәgda-group (Kistane and Dobbi) and the Bazәna-group (Mesqan and Muher).

Gurage-Classification

Phonetic-distance

Morphological-affix

Background: Gurage refers both to pockets of South Ethiosemitic language speakers living in the Gurage Zone of the Southern Nations, Nationalities and Peoples’ Regional State, and to the cluster of languages and dialects that these people speak in the Gurage Zone, Silt’e Zone, and the Oromo region. The Gurage clusters are grouped into two major branches: Gunnӓn-Gurage (Hetzron, 1977), which includes North Gurage and West Gurage, and East Gurage, which we may call Dumi-Gurage in parallel with Gunnӓn-Gurage. The words Gunnӓn and Dumi etymologically refer to ‘hair’ or ‘head’ in Gunnӓn-Gurage and East Gurage, respectively. Zay, however, has two different terms: dabasa for ‘hair’ and oxәt for ‘head’.

The Gurage languages are enclosed by the Cushitic languages Afan Oromo in the north, northeast, and southeast, Libido (also called Mareqo) in the east, K’abeena in the northwest, Hadiyya in the southwest, and Alaba in the south, and by the Omotic language Yemsa in the west. Fellman (1993, p. 673) metaphorically described the location of Gurage as “a tiny Semitic island floating in a vast Cushitic sea”. Zay is completely enclosed by Afan Oromo, as it is located in the Oromo Regional State, specifically on the Island of Lake Zway. Figure 1 shows the Gurage languages and the surrounding Omotic language Yemsa, and the Cushitic languages Oromo, Hadiyya, and Alaba.

Other Gurage languages that should be mentioned are Galila, which was spoken around Lake Wenchi in the Ambo area of the Oromo region, and two extinct languages: +Gafat, a North Gurage language (cf. Faber, 1997, p. 6), and +Mesmes, a Peripheral West Gurage language.

Gurage as a linguistic group has been complex to classify. The linguistic features used to characterize each of the sub-groups of the Gurage languages were diverse and sometimes incorrect (Demeke, 2001). The linguistic complexity has been attributed to different factors. One factor is the origin of the speakers, who came into the Gurage land from different sources and at different times, bringing languages that in due time intermingled with the languages already spoken in the area (Menuta, 2015, p. 11). The second factor is language contact with the neighboring Omotic language Yemsa and with the Cushitic languages, namely Oromo and the Sidamo group (Leslau, 1952), such as Hadiyya, Alaba, Qabena, and Mareqo. Amharic, the national official language of Ethiopia, and Classical Arabic, the language of Islam, were also in contact with the Gurage languages. Furthermore, there are contacts among the Gurage languages themselves (Nurga, 2021). In addition to the linguistic complexity, the methodology used by scholars who worked on the classification of Gurage languages was a challenge to accuracy and consistency. Some scholars based their classification on lexical items (Hudson, 2013; Bender, 1971, 1966), while others used shared innovations and morphological affixes (Demeke, 2001; Hetzron, 1972; Leslau, 1969). Another source of inaccuracy in the classification of Gurage languages was the lack of data on each of the language varieties; hence, some language varieties were not sampled in the investigations.

Objective: The objective of this study is to revisit the classification of Gurage languages and characterize the sub-groups with linguistic features. It also attempts to correlate linguistic distance with geographical distance, as language variation is influenced by areal distance in addition to other variables such as language contact. It also compares the findings with pre-existing classifications.

Significance: The study will help to document the existing works, thereby acknowledging the scholars who contributed to our knowledge of Gurage language classification, show the existing mismatches, and provide an alternative classification based on phonetic distance and area-specific morphemes. It can be used as a base for future research and refinements.

Previous studies: As there are several studies on the description of Gurage languages, we will focus on literature dealing with intelligibility tests that somehow show language grouping and language distance, and on literature directly concerned with language classification. The major works on intelligibility study include Gutt (1980), Ahland (2010), and Menuta (2015).

Ahland (2010) compares Kistane, Chaha, and Silt’e, representing North Gurage, West Gurage, and East Gurage, respectively. He reports that intelligibility among the varieties is low. In contrast, Hetzron (1997, p. 536) reports that communication among the Gunnӓn Gurage is easy. Ahland (2010) compared eleven Gunnӓn Gurage languages using an intelligibility test. He used the test results as a means of classification and grouped the languages into four: the Kistane-group (Kistane and Dobbi); Mesqan; the Sebatbet-group (Chaha, Gura, Gumer, Ezha, Geto, and Muher with sub-groups Aklil and Dessa); and the Inor-group (Inor, Ener, Endegagn, and Mesmes). The problem with Ahland’s (2010) grouping is the use of intelligibility tests for language classification. Intelligibility is affected by inter-lingual comprehension, frequency of contact among speakers, and intergroup relationships. What is more, the author’s test results were not strictly bi-directional, as not every group of speakers tested took the test for every language variety compared. The terminology used for Sebatbet also seems misleading because Sebatbet includes the Inor-group linguistically as well as sociologically.

Menuta (2014, 2015) compares inherent intelligibility using lexical items, phonological features, and morphological affixes of six purposefully sampled Gurage language varieties: Kistane (North Gurage); Chaha (Central West Gurage); Inor (Peripheral West Gurage); Mesqan, which is grouped genetically as West Gurage but lies, together with Dobbi, in the eastern part of the Gurage area; Muher, which is controversially grouped as West Gurage by some authors (Demeke, 2001; Leslau, 1969) and as North Gurage by others (Hetzron, 1972); and Wolane, representing the East Gurage languages. Menuta (2015) also conducts intelligibility tests among the languages. As the purpose of that study was to cluster the languages based on the ease or difficulty of inter-group communication, it did not attempt to provide a classification.

Studies pertinent to the classification of Gurage languages are those dealing with the classification of Semitic languages in general and Ethiosemitic languages in particular. When it comes to the classification of Ethiosemitic languages, there are two competing views. The first view, the traditional theory of the origin of Ethiosemitic, assumes that Ethiosemitic languages emerged from South Arabia (Tadesse, 2009, p. 11; Hetzron, 1972, p. 17; Ullendorff, 1960, p. 34). The second view, also called the Africanist or alternative theory, assumes that the Ethiosemitic languages were in Ethiopia together with other Afro-Asiatic languages, namely the Cushitic and Omotic languages (Tadesse, 2009, p. 11; Murtonen, 1967, p. 74; Hudson, 2000, pp. 78-79; 2002, p. 1770). The proponents of the latter view base their argument on the principle of the wave theory, which assumes that, just as waves travel from high-pressure to low-pressure areas, languages spread from high-diversity to low-diversity areas. According to this claim, language varieties are more diverse in the Ethiosemitic branch than in the other Semitic-speaking areas. However, no substantial linguistic evidence has been provided as to which of the Ethiosemitic languages is the proto-language and how the other Ethiosemitic languages and/or other Semitic languages branched out from it. The language mentioned as a proto-form for the Ethiosemitic languages from the Africanists’ perspective is Ge’ez. Fellman (1996, p. 205), for instance, states that all Ethiosemitic languages “… are descendants of a Proto Ethiopic most closely resembling Ge'ez”. Thus, he groups the Ethiosemitic languages by treating Ge’ez as a proto-form that branches into Northern Ethiopic (Tigre and Tigrinya) and Southern Ethiopic, which in turn branches into three: Western (Gafat and West Gurage), Central (Amharic and Argoba), and Eastern (East Gurage and Harari). Fellman (1996) had no data of his own; he based his classification on reviews of Cohen (1931), Polotsky (1949), Leslau (1960), Goldenberg (1968), and Hetzron (1972). Thus, no empirical, data-driven, and well-established linguistic evidence has been provided for considering Ge’ez a proto-Ethiopic language (cf. Feleke, 2021).

In the present study, we adhere to the traditional theory regarding the origin of Gurage languages and focus on linguistic distance and its correlation with geographical distance. The most notable works on the classification of Ethiosemitic languages based on the traditional theory include Bender (1966, 1971), Leslau (1969), Hetzron (1972), and Goldenberg (1977), and more recent works (Demeke, 2001; Kitchen et al., 2009; Hudson, 2013; Feleke et al., 2020; Feleke, 2021). The most cited work on Ethiosemitic language classification is Hetzron (1972), shown in Figure 2. The other works reviewed are follow-up classifications that argued against Hetzron (1972) and provided alternative proposals.

Hetzron’s (1972) classification shows that Gurage languages emerged by two routes: the Eastern branch of Transversal South Ethiopic, that is, East Gurage (the Dumi-group), and Outer South Ethiopic (the Gunnӓn Gurage-group). From the perspective of the traditional theory, the origin of the Gurage languages from these two directions, probably at different times, is relatively uncontested; however, their internal classification has been controversial, ever-changing, and not yet settled. The following studies attempted to amend the works of Cohen (1931), Leslau (1969), and Hetzron (1972, 1977).

Fellman (1996) used literature from different authors and provided the alternative classification discussed above. Goldenberg (1977, 2013) discusses the problems in the internal classification of Semitic languages and the inter-language contacts that influenced the classification. Demeke (2001) regrouped Gurage languages into the Eastern group (East Gurage, which consists of Wolane and Silt’e as sisters, and Harari and Zay, two different languages that are sisters to the East Gurage group) and the Outer South Ethiopic group, which branches into the GMS group and the West Gurage group. The GMS group, whose name is taken from the initials of Gafat, Gogot, Mesqan, and Soddo (with G representing both Gafat and Gogot), is divided into Gafat and North Gurage; North Gurage in turn branches into the AMCM (affirmative main clause marker) group (Gogot and Soddo) and Mesqan. West Gurage is divided into Central West Gurage (CWG) and Peripheral West Gurage (PWG), consistent with Hetzron (1972), but the languages assigned to each group differ slightly. The CWG consists of Muher and its sisters (Ezha, Chaha, Gumer, Gura, and Ennemor, an alternative name for Inor), and the PWG consists of Geyto, Endegeñ, Ener, and Mesmes. With the other groupings being relatively similar, Demeke (2001) differs from Hetzron (1972) in the grouping of the following language varieties:

Table 1: Differences in findings of Hetzron (1972) and Demeke (2001)

Languages	Hetzron, 1972	Demeke, 2001
Mesqan	West Gurage	North Gurage
Muher	North Gurage	West Gurage
Inor (Ennemor)	PWG	CWG

The reasons Demeke (2001) provided for regrouping Mesqan with North Gurage were (1) that it has only two tenses, like Kistane and Dobbi, a feature also reported by Hetzron (1972), and (2) the presence of gemination in the perfective forms of verb patterns A, B, and C in Kistane and Mesqan; according to the author, this feature is lacking in the West Gurage languages. Gemination in perfective verbs, however, is also a feature of some West Gurage languages such as Ezha, Muher, and partly Gumer. Muher is grouped with the West Gurage languages based on verb root patterns (Demeke, 2001, p. 79). Demeke (2001) did not provide any strong morphological evidence for grouping Inor (Ennemor) with Central West Gurage rather than Peripheral West Gurage.

The Kitchen et al. (2009) classification is not comprehensive and considers only 15 selected Ethiosemitic languages. These authors group Mesmes, a Peripheral West Gurage language, with Kistane (a North Gurage language); Zay and Wolane belong to the same group, but Silt’e is not in the sample. The other Gurage languages clustered together are Chaha, Gyeto, Mesqan, and Inor.

Hudson (2013) used lexical comparison and intelligibility tests to classify Ethiosemitic languages. Like Hetzron (1977), he groups Gurage languages into Gunnӓn Gurage (Kistane, Mesqan, Muher, Chaha, and Inor) and East Gurage (Silt’e and Zay). Several Gurage varieties are missing from his classification.

Feleke et al. (2020) compare some Gurage languages and South Ethiosemitic languages based on phonetic distance, lexical distance, functional distance, and perceptual distance. They grouped the Gurage languages into five: (1) Silt’e (Wolane and Zay are not included in the sample); (2) Kistane (Dobbi is not included in the sample); (3) Inor and Endegagn (Ener and Gyeto are not included in the sample); (4) Gumer, Gura, Ezha, and Chaha; and (5) Mesqan and Muher. Though this study contributes to grouping the Gurage languages with their closer relatives, it does not include all the language varieties. Because multiple criteria are used, the authors have to draw conclusions from both the intelligibility tests (the functional and probably the perceptual measures) and the phonetic and lexical comparisons (the structural measures). It is not clear how the authors reconciled the differences, for instance in Kistane’s position, which is a sister to Mesqan and Muher (p.16a), a sister to Inor, Endegagn, Mesqan, Muher, Gumer, Gura, Ezha, and Chaha (p.16b), a sister to Mesqan (p.16c), and a sister to Silt’e (p.16d).

A more recent study of the classification of Ethiosemitic is Feleke (2021) which is based on phonetic and lexical distance computation. The finding of this study is more similar to Hetzron (1972) with some differences. It groups Gurage languages into Silt’e, Wolane, and Zay in one group, and all the Gunnӓn-Gurage languages into another group. Of the latter groups, Inor, Endegagn and Gyeto are sisters to the North Gurage languages (Kistane and Dobbi) and the West Gurage languages; namely, Muher, Mesqan, Gumer, Gura, Ezha, Chaha (Feleke, 2021, p.8).

The repeated back-and-forth in Gurage language classification, caused by the methods of classification and the shortage of data on each language variety, together with advances in computing language variation, necessitates further study. The present study provides data for phonetic and morphological comparison while building on the previous works. This approach will help provide a more comprehensive and reliable classification of the Gurage varieties by offering linguistic features that characterize each sub-group.

Methodology: The scope of the study covers all the Gurage language varieties except the extinct +Gafat and +Mesmes, and Galila, for which we could not access linguistic data. We used 255 lexical items (Appendix A) to compare the phonetic distance of the language varieties. We had four informants, two male and two female, at each language variety site, for a total of 60 informants. The data were collected over three years (2019-2022).

Data were analyzed using a mixed-methods approach. In the quantitative method, string variables (lexical items) are converted into numeric variables and then computed with the Cog.1.3.6.1 and Gabmap software. In Cog.1.3.6.1, phonetic distance is measured by aligning the words compared and using a threshold assigned to sounds based on their sonority value. The sonority scale values provided as thresholds are given in Table 2:

Table 2: Sonority Scale for vowels and consonants

Name	Type	Description	Sonority
Pre-nasal	Segment	m, n, ŋ	1
Stop	Consonant	Manner: stop, nasal	2
Affricate	Consonant	Manner: affricate	3
Fricative	Consonant	Manner: fricative, lateral	4
Nasal	Consonant	Nasal: +	5
Trill	Consonant	Manner: trill	6
Lateral	Consonant	Lateral:+	7
Flap	Consonant	Manner: flap	8
Glide	Segment	j, ɥ, ɰ, w	9
Non-syllabic vowel	Vowel	[Syllabic:-]	9
Close vowel	Vowel	[Syllabic:+, Height: close]	10
Near-close vowel	Vowel	[Syllabic:+, Height: near-close]	11
Close-mid vowel	Vowel	[Syllabic:+, Height: close-mid]	12
Mid-vowel	Vowel	[Syllabic:+, Height: mid]	13
Open-mid vowel	Vowel	[Syllabic: +, Height: open-mid]	14
Near-open vowel	Vowel	[Syllabic: +, Height: near-open]	15
Open vowel	Vowel	[Syllabic: +, Height: open]	16

The system aligns the word pairs and/or multi-words and assigns them the values given above for computation. The aligned words are also grouped as cognates and non-cognates, as in the alignments for ‘house’ below:

In this alignment, we have three groups: Cognate-1 with bet and bid ‘house’; Cognate-2 with gar ‘house’; and the non-cognates bejt and ge ‘house’. Without the software, one might align bejt ‘house’ with Cognate-1 and ge with Cognate-2. It seems that the application treated /jt/ of bejt as a single grapheme, similar to an affricate; otherwise, ge ‘house’ would have been represented as ge - - rather than ge-. Such groupings may influence the output concerning the cognate groups and the statistics, as affricates and non-affricates have different sonority values.
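To make the idea of a string-based distance between word forms concrete, the following is a minimal sketch that compares the ‘house’ forms quoted above using a plain normalized Levenshtein (edit) distance. This is an assumption made for illustration only: it is not the sonority-based alignment that Cog.1.3.6.1 performs, it treats every character as one segment, and the function names are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Plain Levenshtein distance: each insertion, deletion, or substitution costs 1."""
    prev = list(range(len(b) + 1))
    for i, seg_a in enumerate(a, start=1):
        curr = [i]
        for j, seg_b in enumerate(b, start=1):
            cost = 0 if seg_a == seg_b else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def normalized_distance(a: str, b: str) -> float:
    """Edit distance divided by the length of the longer form (0 = identical)."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0


# The 'house' forms mentioned in the text; one character is treated as one segment.
forms = ["bet", "bid", "bejt", "gar", "ge"]
for i, first in enumerate(forms):
    for second in forms[i + 1:]:
        print(first, second, round(normalized_distance(first, second), 2))
```

Under this simplified measure, bejt is only one edit away from bet, which is consistent with the remark above that a manual alignment might place bejt with Cognate-1.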

Cog.1.3.6.1 provides the results of the computation in several forms, including a matrix of the percentage of shared words and dendrograms. Two approaches are used to analyze linguistic distance with dendrograms: the unweighted pair group method with arithmetic mean (UPGMA) and neighbor-joining (NJ). The two approaches differ in that UPGMA models the evolution of languages based on average linkage, whereas NJ assumes minimum evolution. They also differ in the phylogenetic trees they produce: UPGMA provides rooted trees with equally aligned branch tips, due to the assumption of equal rates of evolution, whereas NJ produces unrooted trees whose branch lengths vary in proportion to the amount of change in the languages. Cog.1.3.6.1 also provides network graphs. Although we focused on minimum evolutionary change, and hence on a synchronic description, we provide both the UPGMA and NJ forms in the present study.
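The average-linkage idea behind UPGMA can be sketched in a few lines. The example below is a toy illustration under stated assumptions, not the Cog.1.3.6.1 implementation: the function name `upgma` is hypothetical, and all distances are invented for illustration except the Silt’e-Wolane value, which is 1 minus the 79% similarity reported in this article. Neighbor-joining is not shown.

```python
from itertools import combinations


def upgma(names, pair_dist):
    """Average-linkage (UPGMA) clustering; returns the tree as nested tuples."""
    size = {name: 1 for name in names}
    d = {frozenset(pair): value for pair, value in pair_dist.items()}
    active = list(names)
    while len(active) > 1:
        # Merge the two closest clusters.
        a, b = min(combinations(active, 2), key=lambda p: d[frozenset(p)])
        merged = (a, b)
        size[merged] = size[a] + size[b]
        # The distance to the new cluster is the size-weighted mean of the old ones.
        for c in active:
            if c == a or c == b:
                continue
            d[frozenset((merged, c))] = (
                size[a] * d[frozenset((a, c))] + size[b] * d[frozenset((b, c))]
            ) / size[merged]
        active = [c for c in active if c != a and c != b] + [merged]
    return active[0]


# Toy distances (1 - similarity). Only Silt'e-Wolane (0.21) comes from the text;
# every other value is hypothetical.
names = ["Silt'e", "Wolane", "Chaha", "Gura"]
pair_dist = {
    ("Silt'e", "Wolane"): 0.21,
    ("Chaha", "Gura"): 0.05,
    ("Silt'e", "Chaha"): 0.45,
    ("Silt'e", "Gura"): 0.46,
    ("Wolane", "Chaha"): 0.44,
    ("Wolane", "Gura"): 0.45,
}
tree = upgma(names, pair_dist)
print(tree)
```

With these toy values, the two closest pairs are merged first and the final join links the two pairs, mirroring the way a rooted UPGMA dendrogram is read from the tips toward the root.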

Gabmap, like the Cog.1.3.6.1 application, provides phonetic distance. However, Gabmap has additional features, as it compares the linguistic distance and the geographical (map) distance of the languages compared. It also provides output in the form of plots, bar graphs, and regression lines showing the fit between linguistic and geographical distances. In Gabmap, dendrograms can be produced with fuzzy clustering rather than plain cluster analysis. Fuzzy clustering makes the otherwise unstable output of clustering more stable (cf. §2.1.3). We used both Cog.1.3.6.1 and Gabmap as they support one another.

We used qualitative data, the morphological affixes, to substantiate the results obtained from phonetic similarities. We focused on the affixes that characterize the different language clusters rather than the affixes that are shared equally across the language varieties.

In this section, we present the phonetic distance and the correlation of linguistic distance with geographical distance, and some morphological pieces of evidence for clustering language varieties, and then conclude.

2.1 Phonetic Distance

It is possible to run statistics for both lexical similarity and phonetic similarity, and the two approaches provide relatively similar results. However, as lexical similarity focuses on the sound correspondence of given word pairs whereas phonetic similarity focuses on the phonetic features of each sound that constitutes the lexical items compared, phonetic similarity is more detailed. In this article, we present only the phonetic distance, to avoid presenting lexical and phonetic similarity redundantly.

2.1.1 Phonetic similarity matrix

Based on the phonetic threshold values assigned to each segment forming the 255 lexical items, we computed the phonetic similarity across the language varieties and for each pair of language varieties, as shown in Table 3.

Table 3: Percentage of the phonetic similarity matrix

Based on the phonetic similarity, the language varieties shown in purple in the matrix (Ener, Endegagn, Inor, Chaha, Gumer, Gura, Gyeto, Ezha, and Mesqan) share the most phonetic features (81%-99%). The language varieties shown in light yellow (Muher, Dobbi, and Kistane) tend to form a similar group, though they also share phonetic features to a comparable degree (70%-77%) with Ener, Inor, Chaha, Gumer, Gura, Gyeto, and Ezha. Silt’e and Wolane share 79% phonetic similarity, but Zay seems different from the rest of the Dumi group. In the matrix, the values in green mark relatively low sharing (48%-68%) among the language varieties.

2.1.2 Cluster Analysis

The classification of Gurage languages based on phonetic distance is demonstrated with cluster analysis that groups the language varieties to their close and distant relatives using UPGMA (Figure 3) and Neighbor-joining (Figure 4) dendrograms, respectively.

The UPGMA clusters the Gurage languages into two major groups: Dumi-Gurage (Silt’e, Wolane, and Zay) and Gunnӓn-Gurage (all the others). The latter is further grouped into Ener and Endegagn on the one hand and the rest on the other, which are sub-grouped into (Inor, Ezha, Gyeto, Gura, Gumer, and Chaha) and (Kistane, Dobbi, Mesqan, and Muher). Note that Chaha and Gumer seem more closely related in phonetic features than either is to Gura, but this is not the case in the NJ analysis (Figure 4), which does not assume equal rates of evolution.

The Neighbor-joining analysis based on phonetic distance grouped the Gurage languages into two: Dobbi and Kistane (the North Gurage group) together with Dumi Gurage (Zay, Silt’e, and Wolane) on the one hand, and all the West Gurage languages (Hetzron, 1977) on the other. Mesqan is a sister to Muher, and both are linked by a node to the Central and Peripheral West Gurage language varieties.

The groupings in Figure 3 and Figure 4 differ slightly in how they sub-group the language varieties. This is basically because UPGMA assumes equal rates of evolution whereas Neighbor-joining does not. What is more, cluster analysis is a relatively unstable method, in that small changes in the distance matrix can result in relatively large changes in the clustering. To solve this problem we used fuzzy clustering, which contaminates (varies) the original distance matrix with a small amount of random noise (Nerbonne et al., 2011); this is done several times, so that the languages are clustered again and again.

2.1.3 Fuzzy clustering

Fuzzy clustering with varying noise clusters the language varieties several times, and the clusters that are repeatedly recovered under added noise are the stable groupings (Nerbonne et al., 2011, p.11). Fuzzy clustering also shows, as a percentage, how often the language varieties cluster together. The relative similarity of the language varieties is also shown with colors, as in Figure 5.
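The procedure described above (perturb the distance matrix with random noise, re-cluster, and count how often each group recurs) can be sketched as follows. This is an illustrative sketch under stated assumptions, not Gabmap's implementation: the function names, noise level, and number of runs are hypothetical, and all distances are invented except the Silt’e-Wolane value (1 minus the 79% similarity reported in this article).

```python
import random
from itertools import combinations


def cluster_groups(names, dist):
    """Average-linkage clustering; returns every group formed by a merge."""
    groups = [frozenset([name]) for name in names]
    found = set()

    def avg(g1, g2):
        # Mean distance over all cross-group pairs of varieties.
        total = sum(dist[frozenset((a, b))] for a in g1 for b in g2)
        return total / (len(g1) * len(g2))

    while len(groups) > 1:
        g1, g2 = min(combinations(groups, 2), key=lambda p: avg(*p))
        merged = g1 | g2
        groups = [g for g in groups if g not in (g1, g2)] + [merged]
        found.add(merged)
    return found


def noisy_support(names, dist, runs=200, noise=0.05, seed=0):
    """Percentage of noisy re-clusterings in which each group appears."""
    rng = random.Random(seed)
    counts = {}
    for _ in range(runs):
        noisy = {pair: max(0.0, value + rng.uniform(-noise, noise))
                 for pair, value in dist.items()}
        for group in cluster_groups(names, noisy):
            counts[group] = counts.get(group, 0) + 1
    return {group: 100 * count / runs for group, count in counts.items()}


# Toy distances: only Silt'e-Wolane (0.21) comes from the text; the rest are hypothetical.
names = ["Silt'e", "Wolane", "Zay", "Chaha", "Gura"]
raw = {
    ("Silt'e", "Wolane"): 0.21, ("Chaha", "Gura"): 0.05,
    ("Zay", "Silt'e"): 0.40, ("Zay", "Wolane"): 0.41,
    ("Zay", "Chaha"): 0.43, ("Zay", "Gura"): 0.44,
    ("Silt'e", "Chaha"): 0.50, ("Silt'e", "Gura"): 0.50,
    ("Wolane", "Chaha"): 0.50, ("Wolane", "Gura"): 0.50,
}
dist = {frozenset(pair): value for pair, value in raw.items()}
support = noisy_support(names, dist)
for group, pct in sorted(support.items(), key=lambda item: -item[1]):
    print(sorted(group), f"{pct:.0f}%")
```

In this toy setup the tight pairs are recovered in every run, whereas a variety whose distances to two groups are close (here the hypothetical Zay values) receives less than full support. That is the kind of information the percentages on a fuzzy dendrogram convey.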

The fuzzy clustering separated Dumi Gurage and Gunnӓn Gurage with 100% accuracy, similar to the UPGMA clustering (Figure 3) but different from NJ (Figure 4), which grouped Dumi Gurage with part of Gunnӓn Gurage, namely North Gurage. The Dumi Gurage varieties (Silt’e, Wolane, and Zay) are grouped together with 100% accuracy, but Zay differs relatively from the other two varieties. Kistane and Dobbi separate from the other Gunnӓn Gurage varieties relatively early, yet these two varieties are related to Muher and Mesqan. Inor, which is grouped with the Central West Gurage (CWG) language varieties, is relatively separated from them (at a distance of about 0.2 on the dendrogram); Gyeto then splits from the rest of the CWG varieties (Ezha, Gumer, Gura, and Chaha). Chaha and Gura seem to be almost the same variety, as the accuracy with which they are treated as two groups is only 72% and the distance between them is nearly zero.

Figure 6 shows the fuzzy cluster grouping with multidimensional Gabmap analysis, in which the geographical location of the languages and their linguistic similarity are correlated. Languages with close similarities are painted with similar colors. Relative differences within similar groups are shown by slightly varying the colors (for example, 1 and 5 relative to 15; and 12 relative to 7, 8, 9, 10, and 11).

The language areas on the map more or less correspond to the administrative organization of the Gurage Zone, except for Zay (15), which is in Oromiya Regional State and not adjacent to the Gurage area, and Silt’e (1), which was part of the Gurage Zone until it became a separate zone in 2001 with its administrative city at Worabe. Mesqan (2) and Dobbi (3) are within the same district, the Mesqanina Mareko district; recently, another district of Mesqan was established. Kistane (4) was named Sodo district, with its administrative city of Bui, but it was recently divided into two districts. The present map assumes the undivided Sodo and Mesqan districts. Wolane (5), Muher (6), Ezha (7), Gumer (8), and Gyeto (9) are independent districts with their administrative cities Mehal-Amba, Hawariyat, Agenna, Arekit, and Quante, respectively. Gura (10) and Chaha (11) are located within the Chaha district, with its administrative city Emdibir. Inor (12) and Ener (13) were administered together as one district. Recently, however, the two were divided into two districts: the main land of Inor, with its administrative city of Gunchire, and Ener, a peripheral part of Inor (Meger), which constitutes a separate district with its administrative capital at Mike. Our map shows Inor only as it was before the split. Endegagn (14) is a district in the Gurage Zone with its administrative town of Dinkula.

2.1.4 Linguistic and Geographical Distance

Linguistic difference is observed as one travels relatively far from one local area to another. Language variation is observed within 25 km and is higher at a distance of about 100 km (Figure 7). Linguistic similarity within localities is variable, as the red zigzag lines on the graph demonstrate.

Some language varieties deviate sharply within a short distance (for example between linguistic distances of 0.2 and 0.3 which is within 25 km). The similarity is highly unstable between 0.3-0.4 of linguistic distance and 40-80 km of geographical distance.
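The relationship between geographical and linguistic distance can be summarized with a correlation coefficient and a fitted line. The sketch below is a minimal illustration only: the function names are hypothetical, the (km, linguistic distance) pairs are invented to resemble the ranges mentioned above and are not the study's data, and an ordinary least-squares line on raw distances is an assumption rather than the model that Gabmap fits.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var_x
    return slope, mean_y - slope * mean_x


# Hypothetical pairs of (geographical distance in km, linguistic distance).
km = [10, 25, 40, 80, 100]
ling = [0.05, 0.20, 0.30, 0.40, 0.50]

r = pearson(km, ling)
slope, intercept = fit_line(km, ling)
print(f"r = {r:.2f}, linguistic distance = {slope:.4f} * km + {intercept:.3f}")
```

A high positive r on real data would indicate that linguistic distance grows with geographical distance, while scatter around the fitted line corresponds to the unstable zone described above.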

2.1.5 Morphological Affixes

As some of the previous studies based their classification on morphological affixes, we have included the major morphemes attached to nouns and verbs. We have omitted the morphemes which are similar across the language varieties. The n- or –n, and v- or –v in the table (Table-4) refer to a noun and a verb, respectively to which affixes are added.

Table 4: Morphological affixes of Gurage varieties

	Definite	Plural	Accusative	Dative	Genitive	Abstract	Verbal	Instrument	MV marker	DF-tense	IF-tense
Silt'e	n-i	n-ʧa	ji-n	lә-n	lә-n	n-nәt	v-ot	mә-n-ja	V-	V-	V-
Wolane	n-i	n-ʧa	ji-n	lә-n	lә-n	n-nәt	v-ot	mә-n-ja	V-	V-	V-
Zay	n-i	n-ʧa	n-ne	lә-n	lә-n	n-nәt	v-ot	mә-n-ja	V-	V-	V-
Gura	n-hut	n-	jә-n	jә-n	jә-n	n-nәt	v-ot	mә-n-ja	V-m	V-te	V-ʃә
Gyeto	n-hut	n-	jә-n	jә-n	jә-n	n-nәt	v-ot	mә-n-ja	V-m	V-te	V-ʃә
Ezha	n-we	n-	jә-n	jә-n	jә-n	n-nәt	wә-v	wә-n-ja	V-m	V-te	V-ʃә
Gumer	n-hut	n-	jә-n	jә-n	jә-n	n-nәt	v-ot	mә-n-ja	V-m	V-te	V-ʃә
Chaha	n-hut	n-	jә-n	jә-n	jә-n	n-nәt	v-ot	mә-n-ja	V-m	V-te	V-ʃә
Inor	n-hud	n-	ә-n	ә-n	ә-n	n-nәd	ә-v	mә-n-ja	V- ̴	V-k^we	V-se
Ener	n-hud	n-	ә-n	ә-n	ә-n	n-nәd	ә-v	mә-n-ja	V- ̴	V-k^we	V-se
Endegagn	n-hud	n-	ә-n	ә-n	ә-n	n-nәd	ә-v	mә-n-ja	V- ̴	V-k^we	V-se
Mesqan	n-i	n-oʧʧ	jә-n	jә-n	jә-n	n-nәt	wә-v	mә-n-ja	V-m	V-	V-
Muher	n-we	n-	jә-n	jә-n	jә-n	n-nәt	wә-v	wә-n-ja	V-nn/m	V-	V-
Dobbi	n-i	n-oʧʧ	jә-n	jә-n	jә-n	n-nәt	wә-v	wә-n-ja	V-tt	V-	V-
Kistane	n-i	n-oʧʧ	jә-n	jә-n	jә-n	n-nәt	wә-v	wo-n-ja	V-tt	V-	V-

Based on definiteness, there are three groups: n-i groups (Silt’e, Wolane, Zay, Mesqan, Dobbi, and Kistane); n-we group (Muher and Ezha); and n-hut(d) groups (Gura, Chaha, Gumer, Gyeto, Inor, Ener and Endegagn). The last group can be sub-grouped as n-hut (Gura, Chaha, Gumer, and Gyeto ) and n-hud (Inor, Ener, and Endegagn).
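The grouping by definite marker can be derived mechanically from the Definite column of Table 4. The following minimal sketch does so; the marker values are copied from Table 4, while the function name `group_by_marker` is a hypothetical helper introduced for illustration.

```python
from collections import defaultdict

# Definite markers as listed in the Definite column of Table 4.
definite = {
    "Silt'e": "n-i", "Wolane": "n-i", "Zay": "n-i",
    "Gura": "n-hut", "Gyeto": "n-hut", "Ezha": "n-we",
    "Gumer": "n-hut", "Chaha": "n-hut",
    "Inor": "n-hud", "Ener": "n-hud", "Endegagn": "n-hud",
    "Mesqan": "n-i", "Muher": "n-we", "Dobbi": "n-i", "Kistane": "n-i",
}


def group_by_marker(markers):
    """Map each affix form to the list of varieties that use it."""
    groups = defaultdict(list)
    for variety, marker in markers.items():
        groups[marker].append(variety)
    return dict(groups)


groups = group_by_marker(definite)
for marker, varieties in groups.items():
    print(marker, "->", ", ".join(varieties))
```

The same helper can be applied to any other column of Table 4 (for example the accusative or main verb markers) to check the groupings discussed in this section.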

There are several ways to show the plural, including reduplication, affixation, sound relocation, and syntactic means. We show here only plurals marked with affixes. There are three groups based on plural formation. Kistane (Wakjira, 2010, p. 21), Mesqan (Shafi, 2015, p. 40), and Dobbi (Teklemichael, 2002, p. 14) have {-oʧʧ}, with the alternative {-aʧʧ} in Kistane. Silt’e (Oda, 2017, p. 87), Wolane (Meyer, 2010, p. 136), and Zay (Meyer, 2006, p. 19) have {-ʧ(ʧ)a}. The other varieties do not have plural-marking affixes but use other mechanisms if the plural has to be shown (Völlmin, 2017, pp. 204-206 for Gumer; Adigeh, 2015, pp. 86-87 for Endegagn; Habte, 2020, pp. 205-206 for Ener).

There are four groups based on accusative markers: the {ji-} group (Silt’e and Wolane); the {-nɛ} group (Zay); the {jә-} group (Gura, Chaha, Ezha, Gumer, Gyeto, Muher, Mesqan, Dobbi, and Kistane); and the {ә-} group (Inor, Ener, and Endegagn). The dative and genitive case markers group the language varieties into three: the {lә-} group (Silt’e, Wolane, and Zay); the {jә-} group (Gura, Chaha, Ezha, Gumer, Gyeto, Muher, Mesqan, Dobbi, and Kistane); and the {ә-} group (Inor, Ener, and Endegagn). Based on abstract noun derivation, there are only two groups: the {-nәt} group (Silt’e, Wolane, Zay, Gura, Chaha, Ezha, Gumer, Gyeto, Muher, Mesqan, Dobbi, and Kistane) and the {-nәd} group (Inor, Ener, and Endegagn). Verbal noun derivation groups the language varieties into three: the {-ot} group (Silt’e, Wolane, Zay, Gura, Gumer, Chaha, and Gyeto); the {wә-} group (Ezha, Mesqan, Muher, Dobbi, and Kistane); and the {ә-} group (Inor, Ener, and Endegagn). Instrument noun derivation shows two groupings: the {mә…ja} group (Silt’e, Wolane, Zay, Gura, Gumer, Chaha, Gyeto, and Mesqan) and the {wә(o)…ja} group (Ezha, Muher, Dobbi, and Kistane). There are several sub-groups based on the main verb marker: those not marking the main verb (Silt’e, Wolane, and Zay); the {-m} group (Gura, Gumer, Chaha, Ezha, Gyeto, and Mesqan); the group with {-m} drop and a nasal trace ( ̴ ) (Inor, Ener, and Endegagn); the {-nn/-u} group (Muher); and the {-tt} group (Kistane and Dobbi). Definite future marking groups the languages into three: those not marking the definite future (Silt’e, Wolane, Zay, Mesqan, Muher, Dobbi, and Kistane); the {-te} group (Gura, Gumer, Ezha, Chaha, and Gyeto); and the {-k^we} group (Inor, Ener, and Endegagn). Similarly, indefinite future marking shows three groupings: those not marking the indefinite future (Silt’e, Wolane, Zay, Mesqan, Muher, Dobbi, and Kistane); the {-ʃә} group (Gura, Gumer, Ezha, Chaha, and Gyeto); and the {-se} group (Inor, Ener, and Endegagn).

As the morphological affixes are either largely shared by several groups (for example, definiteness, verbal noun derivation, and abstract noun derivation) or specific to a few varieties (for example, the main verb marker and the definite and indefinite future markers), it is quite difficult to classify the languages with morphological evidence. However, these affixes can help to distinguish some of the sub-groups that are already classified with phonetic distance.
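The affix groupings described above can be tabulated and compared pairwise. The following is a minimal sketch, not the procedure used in this study: it transcribes the case, derivation, and verbal markers for which all 15 varieties are listed, and counts how many markers two varieties share. The list names (`DUMI`, `CENTRAL_WEST`, and so on) and the functions `profile` and `shared` are hypothetical conveniences introduced only for this illustration. Instrument noun derivation is left out because no form is given for Inor, Ener, and Endegagn.

```python
# Shorthand lists, used only to avoid retyping variety names (hypothetical names).
DUMI = ["Silt'e", "Wolane", "Zay"]
CENTRAL_WEST = ["Gura", "Chaha", "Ezha", "Gumer", "Gyeto"]
PERIPHERAL_WEST = ["Inor", "Ener", "Endegagn"]
NORTH = ["Muher", "Mesqan", "Dobbi", "Kistane"]
ALL = DUMI + CENTRAL_WEST + PERIPHERAL_WEST + NORTH

# feature -> {affix: varieties}, transcribed from the prose description.
FEATURES = {
    "accusative": {"ji-": ["Silt'e", "Wolane"], "-nɛ": ["Zay"],
                   "jә-": CENTRAL_WEST + NORTH, "ә-": PERIPHERAL_WEST},
    "dative_genitive": {"lә-": DUMI, "jә-": CENTRAL_WEST + NORTH,
                        "ә-": PERIPHERAL_WEST},
    "abstract_noun": {"-nәt": DUMI + CENTRAL_WEST + NORTH,
                      "-nәd": PERIPHERAL_WEST},
    "verbal_noun": {"-ot": DUMI + ["Gura", "Gumer", "Chaha", "Gyeto"],
                    "wә-": ["Ezha", "Mesqan", "Muher", "Dobbi", "Kistane"],
                    "ә-": PERIPHERAL_WEST},
    "main_verb": {"unmarked": DUMI, "-m": CENTRAL_WEST + ["Mesqan"],
                  "-m drop, nasal trace": PERIPHERAL_WEST,
                  "-nn/-u": ["Muher"], "-tt": ["Kistane", "Dobbi"]},
    "definite_future": {"unmarked": DUMI + NORTH, "-te": CENTRAL_WEST,
                        "-k^we": PERIPHERAL_WEST},
    "indefinite_future": {"unmarked": DUMI + NORTH, "-ʃә": CENTRAL_WEST,
                          "-se": PERIPHERAL_WEST},
}

# Sanity check: every variety is listed exactly once under every feature.
for feature, groups in FEATURES.items():
    listed = sorted(v for members in groups.values() for v in members)
    assert listed == sorted(ALL), feature


def profile(variety):
    """Return {feature: affix} for one variety."""
    return {feature: affix
            for feature, groups in FEATURES.items()
            for affix, members in groups.items()
            if variety in members}


def shared(a, b):
    """Count the features on which varieties a and b have the same marker."""
    pa, pb = profile(a), profile(b)
    return sum(pa[f] == pb[f] for f in FEATURES)


if __name__ == "__main__":
    for pair in [("Inor", "Ener"), ("Inor", "Chaha"),
                 ("Mesqan", "Kistane"), ("Mesqan", "Chaha")]:
        print(pair, shared(*pair), "of", len(FEATURES))
```

A raw count like this weights every marker equally, which is a simplifying assumption; it is meant only to make the overlap between varieties easy to inspect.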

2.2.1 Discussion

Gurage language classification has been unstable and ever-changing (Feleke, 2021) because of the methodologies followed and the types of data used. Unlike previous works, the present study classifies the Gurage language varieties as a standalone set. We have focused on phonetic distance because it yields reliable and stable results even with 100 or 200 lexical items (Nerbonne et al., 2011).
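To make the notion of phonetic distance over a word list concrete, the sketch below computes a length-normalized Levenshtein (edit) distance per gloss and averages it over the glosses two varieties have in common. This is an illustration under stated assumptions, not necessarily the computation performed by Cog or Gabmap: the normalization by the longer word, the function names, and the two tiny word lists are all hypothetical, and the strings are invented placeholders rather than Gurage data.

```python
def levenshtein(a, b):
    """Plain edit distance: insertions, deletions, substitutions cost 1 each."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution or match
        prev = cur
    return prev[-1]


def normalized_distance(a, b):
    """Edit distance divided by the length of the longer form (assumption)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0


def variety_distance(words_a, words_b):
    """Mean normalized distance over the glosses present in both word lists."""
    common = words_a.keys() & words_b.keys()
    if not common:
        raise ValueError("the two word lists share no glosses")
    total = sum(normalized_distance(words_a[g], words_b[g]) for g in common)
    return total / len(common)


if __name__ == "__main__":
    # Invented placeholder transcriptions, NOT real Gurage forms.
    variety_x = {"gloss1": "tapa", "gloss2": "kimo"}
    variety_y = {"gloss1": "tapo", "gloss2": "kimo"}
    print(variety_distance(variety_x, variety_y))
```

Treating each character as one segment is a simplification; transcriptions with digraphs or diacritics would first need to be split into segments.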

The present finding is most similar to Hetzron (1972), except that it groups Mesqan with the North Gurage varieties Kistane, Dobbi, and Muher. It is also similar to Feleke (2021) in grouping the Gurage languages into Dumi Gurage (Silt’e, Wolane, and Zay) and Gunnӓn Gurage (all the rest). Within Gunnӓn Gurage, however, Feleke (2021) groups Muher with West Gurage (unlike the present finding and Hetzron, 1972); Inor, Endegagn, and Gyeto form one group, which is a sister to the North Gurage languages (Kistane and Dobbi) and the West Gurage languages (Muher, Mesqan, Gumer, Gura, Ezha, and Chaha) (cf. Feleke, 2021, p. 8). Demeke's (2001) work is similar to the present finding in grouping Mesqan with North Gurage and Inor with the Central West Gurage languages, but it differs in grouping Muher with the West Gurage languages, as Feleke (2021) does.

The position of Mesqan and Muher still seems to be a challenge, though the present finding tends to group them with North Gurage. They share many features with both North Gurage and West Gurage. Geographically, they lie in a contact area between North and West Gurage (cf. Figure 5 and Table 4). Another challenge is that Inor is grouped with the Central West Gurage languages in the present study and in Demeke (2001), but with the Peripheral West Gurage languages, Endegagn and Ener, in the other works (Ahland, 2010; Hetzron, 1972; Feleke, 2021). However, based on the morphological evidence (for example, definiteness n-hud, the case marker {ә-}, the abstract noun derivation morpheme {-nәd}, the definite future marker {-k^we}, and the indefinite future marker {-se}), Inor can best be grouped with Endegagn and Ener, as depicted in the Neighbor-joining analysis (Figure 4) and in previous studies (Ahland, 2010; Hetzron, 1972; Feleke, 2021).

2.2.2 Conclusion

Based on the results of phonetic distance and morphological affixes, we can draw the following conclusions:

The Gurage languages constitute two major groups: Dumi Gurage (Silt’e, Wolane, and Zay), and Gunnӓn Gurage (all the rest of the Gurage varieties).
Zay is relatively aberrant within the Dumi group, whereas Silt’e and Wolane are closely related.
Inor is closer to the Central West Gurage varieties (Gura, Chaha, Gumer, Ezha, and Gyeto) phonetically (Figures 3 and 5, but not Figure 4); however, it fits better with Ener and Endegagn based on morphological features (Table 4).
Kistane, Dobbi, Muher, and Mesqan belong to the same group, but they can be sub-grouped into Kistane-Dobbi and Muher-Mesqan groups.

With these conclusions, we provide the classification of Gurage varieties as follows:

We have used lexical, geographical, and morphological criteria to name the groups in the classification. The lexical items are Dumi and Gunnӓn ‘head’, and Nәgda and Bazәna ‘guest’. Dumi Gurage replaces the East Gurage of previous studies. The geographical terms are West Gurage and North Gurage. The morphemes are te-ʃә- (written without morpheme boundaries in Figure 8) and k^we-se-, which group the three tense groups into two based on the definite future and indefinite future markers, respectively.
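The resulting classification can also be written down as a nested structure. The sketch below is only a convenience representation of the grouping stated in the abstract and conclusions; the variable name `CLASSIFICATION` and the helper `path_to` are hypothetical and introduced here for illustration.

```python
# Nested dicts are intermediate groups; lists hold the varieties of a leaf group.
CLASSIFICATION = {
    "Dumi Gurage": ["Zay", "Silt'e", "Wolane"],
    "Gunnӓn Gurage": {
        "West Gurage": {
            "Teʃә-group": ["Chaha", "Gura", "Gumer", "Ezha", "Gyeto"],
            "K^wese-group": ["Inor", "Ener", "Endegagn"],
        },
        "North Gurage": {
            "Nәgda-group": ["Kistane", "Dobbi"],
            "Bazәna-group": ["Mesqan", "Muher"],
        },
    },
}


def path_to(variety, tree=CLASSIFICATION, trail=()):
    """Return the list of group names leading to a variety, or None if absent."""
    for name, sub in tree.items():
        if isinstance(sub, dict):
            found = path_to(variety, sub, trail + (name,))
            if found is not None:
                return found
        elif variety in sub:
            return list(trail + (name,))
    return None


if __name__ == "__main__":
    for variety in ["Zay", "Inor", "Mesqan"]:
        print(variety, "->", " > ".join(path_to(variety)))
```

Looking up a variety returns its full lineage, so the sub-grouping of any two varieties can be compared by comparing their paths.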

I declare that this work is my original work and has not been published elsewhere. All works cited are duly acknowledged.

Acknowledgments

I would like to thank my informants for their precious time and willingness to take part in the interviews. I would also like to express my heartfelt thanks to Dr. Asfaw Tadesse for his support in preparing reference points for Gabmap and making a Gurage map with Google Maps.

Conflict of interest:

There is no conflict of interest, financial or non-financial, related to this article.

Funding:

The study was funded by Linguistic Capacity Building: Tools for the Inclusive Development of Ethiopia, which has already been phased out.

Author’s Contribution:

The author collected data, analyzed, interpreted, and wrote the report.

Adigeh, Y. (2015). Descriptive Grammar of Endegagn. Unpublished Ph. D. thesis submitted to Department of Linguistics. Addis Ababa University.
Ahland, M. B. (2010). Language Death in Mesmes: A Sociolinguistic and Historical-Comparative Examination of a Disappearing Ethiopian Semitic Language. Dallas, Texas: SIL International.
Bender, L. M. (1966). Notes on Lexical Correlations in Some Ethiopian Languages. Journal of Ethiopian Studies, 4, 5-16.
Bender, L. M. (1971). The languages of Ethiopia: a new lexicostatistic classification and some problems of diffusion. Anthropol. Ling, 13 (5), 165-288.
Cohen, M. (1931). Études d'éthiopien méridional. Paris: P. Geuthner.
Demeke, G. A. (2001). The Ethio-Semitic languages (Re-examining the classification). J. Ethiop. Stud, 34 (2), 57-93.
Faber, A. (1997). Genetic subgrouping of the Semitic languages. Semit. Lang, 15, 3-15.
Feleke, T. L., Gooskens, C., & Rabanus, S. (2020). Mapping the dimensions of linguistic distance: A study on South Ethiosemitic languages. Lingua, 243, [102893]. https://doi.org/10.1016/j.lingua.2020.102893
Feleke, T. L. (2021). Ethiosemitic languages: Classifications and classification determinants. Ampersand 8, 100074.
Fellman, J. (1993). Gurage: The last straw. The Journal of Modern African Studies, 31 (4), 673-674. doi:10.1017/S0022278X00012313.
Fellman, J. (1996). Lines on the classification of Ethiopian-Semitic. Studies in African Linguistics, 25(2), 205-206.
Goldenberg, G. (1968). Kɨtaniɲɲa: Studies in a Northern Gurage Language of Christians. Orientalia Suecana, 17, 61-102.
Goldenberg, G. (1977). The Semitic languages of Ethiopia and their classification. Bull. Sch. Orient Afr. Stud, 40 (3), 461–507.
Goldenberg, G. (2013). Semitic languages: Features, structures, relations, processes. Oxford: Oxford University Press.
Gutt, E. A. (1980). Intelligibility and Interlingual-comprehension among Selected Gurage Speech Varieties. Journal of Ethiopian Studies, 14, 57-85.
Habte, W. (2020). Descriptive Grammar of Ennar (Peripheral Western Gurage). Unpublished Ph. D. Thesis submitted to Department of Linguistics. Addis Ababa University.
Hetzron, R. (1972). Ethiopian Semitic: Studies in Classification, No. 2. Manchester: Manchester University Press.
Hetzron, R. (1977). The Gunnӓn-Gurage Languages, vol. 12. Istituto Universitario Orientale, Naples.
Hetzron, R. (1997). Outer South Ethiopic. In R. Hetzron (ed.), The Semitic Languages, pp. 535-549. London: Routledge.
Hudson, G. (2000). Ethiopian Semitic archaic heterogeneity. 14th International Conference of Ethiopian Studies. Addis Ababa: Addis Ababa University.
Hudson, G. (2002). Ethiopian Semitic Archaic Heterogeneity. Proceedings of 14th International Conference of Ethiopian Studies, pp. 1765-1786. Addis Ababa: Addis Ababa University, Institute of Ethiopian Studies.
Hudson, G. (2013). Northeast African Semitic: Lexical Comparisons and Analysis. Harrassowitz: Wiesbaden.
Kitchen, A., Ehret, C., Assefa, S., Mulligan, C. J. (2009). Bayesian phylogenetic analysis of Semitic languages identifies an early bronze age origin of Semitic in the near east. Proc. R. Soc. Lond. B Biol. Sci., 276, 2703-2710.
Leslau, W. (1952). Influence of Sidamo on the Ethiopic Languages of Gurage. Language, 28(1): 63-81.
Leslau, W. (1960). Sketches in Ethiopic Classification. Atti del Convegno Internazionale di Studi Etiopici, Accademia Nazionale dei Lincei, Rendiconti, 48, 89-107.
Leslau, W. (1969). Toward a classification of the Gurage dialects. J. Semit. Stud, 14 (1), 96-109.
Menuta, F. (2014). Inherent Intelligibility among Guragina Varieties. Journal of Science and Development, 2(1),73-114, Hawassa University.
Menuta, F. (2015). Intergroup Communication Among Gurage: A Study in Intelligibility, Inter-lingual Comprehension and Accommodation. Saarbrücken: Lambert Academic Publishing.
Meyer, R. (2006). Wolane: Descriptive Grammar of an East Gurage Language (Ethiosemitic). Köln: Köpp Verlag.
Meyer, R. (2010). Nominal Number Marking in Wolane. Aethiopica, 13, 135-151.
Murtonen, A. (1967). Early Semitic, a Diachronic Inquiry into the Relationship of Ethiopic to the Other So-called South-East Semitic Languages. Leiden: E.J. Brill.
Nerbonne, J., Colen, R., Gooskens, C., Kleiweg, P., & Leinonen, T. (2011). Gabmap: a web application for dialectology. Dialectologia: Revista electrònica, 2 (2), 65–89.
Nurga, A. S. (2021). Language contact and its effects on language use of the Gurage varieties of Muher. In D. Ado, A. W. Gelagay and J. J. Bondi (eds.), Grammatical and Sociolinguistic Aspects of Ethiopian Languages (pp. 65-90). Amsterdam/Philadelphia: John Benjamins Publishing Company.
Oda, O. (2017). Grammar of Silt’e. Council of Nationalities, SNNPRS, Branna Printing Enterprise.
Polotsky, H. J. (1949). Review of Leslau: Gafat Documents. Journal of the American Oriental Society, 69, 36-41.
Shafi, O. (2015). Descriptive Grammar of Mesqan. Unpublished Ph.D. thesis, Addis Ababa University, School of Graduate Studies.
Tadesse, M. (2009). Hazo: A Political History of the People in Gedebano, Gutazer, Wolane and Agemjay Localities of the Gurageland-Ethiopia (to 1991AD). Addis Ababa: Masters Publishers.
Teklemichael, K. (2002). The Morphology of Goggot. Unpublished MA thesis. Addis Ababa University, Department of Linguistics.
Ullendorff, E. (1960). The Ethiopians: An Introduction to Country and People. London: Oxford University Press.
Völlmin, S. (2017). Towards a Grammar of Gumer: Phonology and Morphology of a Western Gurage Variety. Unpublished Ph.D. thesis presented to the Faculty of Arts and Social Sciences of the University of Zurich.
Wakjira, B. (2010). Morphology and Verb Construction Types of Kistaniniya. Trondheim: Norwegian University of Science and Technology (NTNU).

Appendix A is not available with this version
