Our study showed that individuals with and without T1D from a highly admixed population in Maranhão, a northeastern state of Brazil, have a higher European ancestry. Furthermore, these individuals have a higher percentage of African and Native American ancestry than other Brazilian populations. Concerning the Y chromosome, the most frequent were the ones with a European origin, mainly represented by the haplogroup R1b. The present study also shows that the DRB1 * 03: 01~ DQA1 * 05: 01~ DQB1 * 02: 01 haplotype was the most frequent in individuals with T1D, being the most prevalent risk alleles following DRB1 *03 and DRB1 *04. These data are like other studies performed in Brazil, as well in other countries. Moreover, an important finding was the relationship between European Y chromosome with DRB1 *03 and DRB1 *04 homozygous and DRB1*03/ DRB1*04 heterozygous genotypes in the T1D individuals. The above-mentioned data emphasized that although being an admixed population, Brazilian people still have a great influence from European autosomal and Y chromosome ancestry. Besides, even though the Brazilian population has European, Native American, and African as ancestries’ roots, the process of miscegenation could be quite different among the different regions, highlighting the importance to carry out studies in different states of a continental country, like Brazil.
The population from the state of Maranhão, located in the Northeast region of Brazil, is also composed by the miscegenation between European, African, and Native American populations10,25. This fact was noted in our study through the analysis of genomic ancestry. However, the percentages of miscegenation between these ethnicities are quite different in each Brazilian region, depending on the specific colonization process and the geographical area10,24,36. Through our analyses we found that, as in all Brazilian regions, European ancestry was the largest contributor, but in our population, it approached 50% in both groups (T1D and controls), differing from the weighted average of 68.1% found in the Brazilian population in a systematic review study conducted in 2019 24. We also obtained a similar percentage between African and Native American ancestry (around 25% each), which again differs from the Brazilian average of 19.6% African and 11.6% Native American 24. A possible explanation for this difference is due to the identification of three asymmetric mating models in Maranhão. In almost all Brazilian populations, an asymmetric mating pattern usually occurs, preferably between European men and Native American or African women 10. However, in Afro-descendant communities in Maranhão and the Amazon, another pattern of asymmetric mating was observed, occurring between African men and Native American women 22. Still, in Maranhão, the Guajajaras American Native also maintained contact with the Brazilian population, which is already a mixed race and, with African slaves, Guajajaras men mating with African or mixed women being more common 23.
When performing the analysis of ancestry-specific principal components analysis (ASPCA) with the panel of the HGDP-CEPH 31, we observed that the samples from Maranhão are grouped closer to the Europeans. When compared to a database of a healthy Brazilian population from all geographic regions of Brazil 10, they are closer to Native Americans. These findings corroborate the aforementioned facts.
Historically, the highest incidence of T1D occurs in whites of European ancestry 2,37. In our study, we observed a predominance of similar European autosomal ancestry in the T1D and control groups (47.3 and 48.5, respectively), which was different from that found in a large Brazilian analysis carried out by Gomes et al, where there was a higher percentage of European ancestry in the T1D individuals than in the control group (67.8 and 56.3, respectively) 38, which was in agreement with another study (77 and 71; respectively) also in Brazil, conducted only in the state of São Paulo39. This finding in our sample can be explained by the increasing incidence of T1D in ethnic minorities, as has been observed in the USA 37. However, despite the great miscegenation in our population, all T1D individuals had at least 35% European ancestry.
In the colonization process of Maranhão, there were some differences with the rest of Brazil, mainly in the origin of the Europeans involved 18,19,20. In 1535, the Portuguese arrived in the lands of Maranhão and met the Native American people 18. In 1607, the French people landed on the island, and in 1615, they handed over the São Luís fortress to the Portuguese 18,19, remaining in their domain until 1641, when it was invaded by the Dutch, expelled 2 years later 18,20. The first historical records of the entry of slaves in Maranhão dates from around 1655 and ends in 1831. It is estimated that about 187,000 African slaves joined and that in 1822, they corresponded to 50% of the population of Maranhão. These Africans were mainly from Guinea-Bissau, Togo, Benin, Nigeria, and Angola and to a lesser extent Senegal, Gambia, Guinea, Alto-Volta, Ghana, Congo, and the archipelagos of Cape Verde and São Tomé and Príncipe 21.
Despite the presence of these three ethnic groups in Maranhão, the male contribution to the miscegenation process was predominantly European in all regions of Brazil 10,29, which is confirmed in the study of our population with and without T1D, which had a predominance of the European Y chromosome, with the R1b haplogroup being the most frequent.
The R1b has a high frequency in western Europe, including in the original countries of the people of Maranhão, with percentages reaching approximately 57 in Portugal 40 and in the Netherlands41, and 68.7 in France42. In the T1D group, it was 39.24% and 54.55% in the control group. To clarify the different sub-haplogroups of R1b, with the possibility of being more specific in their phylogeography and specific origin in each of these countries, we should use Y-SNPs 29, which was not possible in the present study.
The second most frequent haplogroup in the T1D group was E1b1b (17.72%) and in the control group, it was E1b1a (9.79%). The haplogroup E is seen in Africa, Europe, and the Middle East and includes several subhaplogroups with different distributions on these continents. Some sublineages are from sub-Saharan Africa, such as E1b1a. Other E1b1b subhaplogroups have a similar frequency in Africa and Europe (E1b1b-M78), with a high prevalence in North Africa, and in the Iberian Peninsula (E1b1b- M81) and in West Asia and Europe (E1b1b-M123) 43. We believe that this higher frequency of E1b1b is due to the Portuguese influence in our population, since in a study conducted by Martiniano et al., E1b1b had the third-highest frequency (12.0%) in the studied Portuguese population40.
The haplogroup Q exhibits Asian descendants and has established itself in the Americas 44, being almost restricted to the Native American population and currently uncommon in the admixed Brazilian population. In a study carried out with the population of the different regions of Brazil, only 3.1% of the Y chromosomes belonging to the Q were found 29. In our sample, we obtained it in the T1D group (6.33%) and in the control group (2.80%). This greater contribution of Native Americans in our study may be due to the diversity of asymmetric miscegenation models found in Maranhão, as detailed above.
The analysis of autosomal ancestry and Y chromosome ancestry in our population detected an important European influence. Concerning these latter facts, we have performed a hypothesis relating both ancestries with the most important worldwide genetic susceptibility marker to diabetes type 1: the HLA system.
The DRB1*03 and DRB1*04 alleles are the most frequent risk alleles in individuals with T1D 45, especially in European populations1. As expected, our results showed that when comparing our T1D group, the CRsr and regional REDOME controls, the DRB1 * 03 and DRB1 * 04 alleles showed an odds ratio of risk for association with T1D, with DRB1 * 04 (30.26%), DRB1 * 03 (29.93%) and the DR3 / DR4 heterozygous genotype (26.32%) as the most frequents. In a large analysis of the T1D population in all Brazilian regions, a result similar to ours was found 46. It is reported that approximately 30% of individuals with T1D have DR3/DR4 in heterozygosity 6,47, corroborating our findings.
In Europeans, the highest risk for the disease is associated with the DRB1*04:01/02/04/05~DQA1*03~DQB1*03:02 (DR4-DQ8) haplotypes and with DRB1*03:01~DQA1*05:01~DQB1*02:01 (DR3-DQ2) 1,6,7. In our study, the most frequent haplotype was DR3- DQ2 (5.63%), and the most frequent HLA genotype was DRB1*03:01~DQA1*05:01~DQB1*02: 01 (DR3-DQ2) homozygote (7.24%). We still found that the most frequent DQA1 alleles were 05:01 (29.14%), and 03:01 (26.82%), and the most frequent DQB1 alleles were 03:02 (32.57%), 02:01 (26.32%), compatible with the fact that these are the most frequent DQA1 and DQB1 alleles associated with T1D 4. In a study performed by Santos et al., it was observed that in Brazilian individuals with T1D, the same haplotype, DRB1*03:01~DQA1*05:01~DQB1*02:01 (DR3-DQ2), was the most frequently found in this subpopulation of individuals46.
When we analyzed the relationship of the DRB1 * 03 and DRB1 * 04 alleles with the CRsr, we noticed its greater frequency in self-reported white and brown individuals. Some studies have suggested that there is no good correlation between self-reported color and ancestry in Brazilian individuals 12,48, but in our analysis, we found that in both groups (T1D and Control), European ancestry was predominant among the self-reported white and brown people (P <0.05). In another Brazilian survey, HLA-DRB1* 03 and DRB1 * 04 were also more prevalent in the self-reported white people 46.
The Y chromosome polymorphisms allow discrimination between individuals within a population and shorter biogeographical inference of their paternal ancestry 15,49. To the best of our knowledge, this is the first study relating the ancestry of the paternal lineage (Y chromosome) with the genotyping of HLA class II in T1D individuals.
We observed that when the Y chromosome was Native American (haplogroup Q), DRB1 * 01, DRB1 * 16, DQA1 01:01, and DQB1 05:01 were the most frequent, differing from the other groups. In Native American populations supposedly without non-Amerindian gene flow, only five HLA-DRB1 allele strains are commonly observed (DRB1 * 04, DRB1 * 08, DRB1 * 09, DRB1 * 14, and DRB1 * 16). Arnaiz-Villena et al. described that HLA genes do not confer susceptibility to T1D in Native Americans unless if these people are mixed with Europeans. According to this same author, only one case of T1D in an Amerindian individual without a European mixture was reported 50. We consider that the low casuistry of Native American Y chromosomes and the presence of a significant percentage of European ancestry in our sample do not allow us to infer the influence of the HLA-DRB1, DQA1, and DQB1 alleles according to the Native American Y chromosome ancestry.
When we analyzed the European Y, we found that the most frequent alleles were DRB1 * 03 and DRB1 * 04; DQA1* 03:01 and 05:01; and DQB1* 02:01 and 03:02. The DRB1* 03 or DRB1 * 04 homozygous genotype detected only in European Y and the high frequency of European Y in the DRB1*03/ DRB1* 04 heterozygous genotype suggest the association of these alleles, known to be the most frequent in the European population 1, with a European ancestry of the Y chromosome in our sample. However, these results should be interpreted with caution, since African and Native American Y sample sizes were small in the present study. Thus, further studies should be conducted with large admixed sample sizes to clarify those possible associations.
Our study has some limitations. The color/race of the family members was informed by the participants, since the individual's perception of his/her color/race may be different from a third family member; moreover, we can not exclude that social factors may also affect CRsr. However, we did not have access to the self-reported information of many family members, because they are geographically distant or deceased, especially grandparents and great-grandparents. Also, when performing the analysis of ancestry-specific principal component analysis (ASPCA) of the autosomal ancestry of the panel of 46 INDEL markers, we did not obtain a similar database from the Portuguese population, which is not included in the HGDP-CEPH 31 for comparison, being a limiting factor, as it is one of the most important origins of the Brazilian population. We also consider a limitation of the fact that we used a second control/BMD group from the REDOME database; nonetheless, this was pair-matched by region and CRsr. Although information of Y haplogroup was not available in the REDOME, we may suggest, due to the large interquartile range of autosomal ancestry in selected individuals from REDOME CRsr brown and white, that they share the HLA alleles of similar Y haplogroups.
The strength of our study is the more specific assessment of the ancestry of individuals with T1D in a highly miscegenated population, using genomic and Y chromosome markers. We consider that the use of healthy individuals in control groups is also an important factor in our analysis. Besides, we believe it is the first study correlating patrilineal ancestry with HLA class II analysis in T1D individuals.
In conclusion, our study demonstrated that individuals with and without T1D in Maranhão have European origin as their largest component, and African and Native American percentage higher than other Brazilian populations. The European patrilineal origin was evidenced by the higher frequency of the R1b haplogroup. The predominance of the HLA DRB1 * 03 and DRB1 * 04 alleles conferring greater risk in our population and being more frequently related to the ancestry of the European Y chromosome, suggests that in our population, the risk of T1D may have been transmitted by European ancestors during our miscegenation process. However, the Y sample sizes of Africans and Native Americans were small, and further research should be conducted with large mixed sample sizes to clarify this possible association. To fill the knowledge gap of the ancestry and genetic origin of T1D in an admixed population like the Brazilian one, further studies with other ancestry markers, in conjunction with the analysis of HLA class II, in T1D individuals and controls need to be addressed.