Crop genetic diversity is a component of agro-biodiversity, with high value for nutrition and adaptation to biotic and abiotic stresses (Zimmerer et al. 2019), particularly in the context of global changes (Pironon et al. 2019). It contributes to render farming systems more stable, robust, and sustainable. On the other hand, the development, dissemination and adoption of improved cultivars is a pathway to increase crop productivity and align on market demands. A deep knowledge of certified cultivars at genomic level, as well as crop genetic diversity actually grown locally, are important information to take into account for plant diversity management and crop improvement.
Rice is rapidly becoming a staple food in the African diet. In West Africa, average annual production is 10.098.106 tons (milled equivalent, 2009–2019 period, Soullier et al. 2020), and average annual growth of production (2009–2019) is higher than 10% (Soullier et al. 2020). However, it still does not meet the demand, the imports representing 46% of rice consumption on average annually over the 2009–2019 period (Soullier et al. 2020), so that the efforts to increase rice production in a sustainable manner have to be pursued.
Two rice species are cultivated in West Africa: the African rice Oryza glaberrima Steud., and the Asian rice Oryza sativa L. Asian rice is nowadays by far the more cultivated in West African farming fields, with O. glaberrima being restricted to small areas, grown in places that are unsuitable for O. sativa, maintained mainly for social reasons (Linares 2002). African rice is known for its resistance to biotic and abiotic stresses and was consequently included to O. sativa genetic improvement programs, led by AfricaRice (Africa Rice Center) at the continental scale, to create improved cultivars named NERICA (New Rice for Africa), and some of the cultivars named ARICA (Advanced Rice for Africa) (Sarla and Swamy 2005; Ndjiondjop et al. 2018a).
A small number of studies aimed at describing rice genetics in West Africa, some of them being limited in terms of number of genetic markers (< 100), (Chen et al. 2017; Luther et al. 2017; Yelome et al. 2018). However, Ndjiondjop et al. (2018a) used a panel of 27,560 SNPs to characterize 330 widely used rice genotypes in Africa, including NERICA and ARICA cultivars, and found two highly differentiated genetic groups corresponding to the different rice growing systems: lowland (primarily indica and lowland NERICAs) and upland (japonica and upland NERICAs). In addition, Diop et al. (2020) analyzed a set of 280 samples (landraces from Casamance/Senegal and other cultivars from West Africa in general) by genotyping-by-sequencing (GBS) approach. Analyzed samples belonged to four genetic subpopulations: O. glaberrima, O. sativa japonica, O. sativa indica Group 1, and O. sativa indica Group 2, with high genetic diversity within the majority group O. sativa indica.
Focusing on Burkina Faso in particular, research projects aiming to describe the genetic diversity from rice landraces began with a prospection by Moussa Sié in 1983–1984 all over the country (Sié et al. 1998; Sié et al. 1999). Then, a new sampling was performed in 2008 by Honoré Kam in 59 villages from four regions in western Burkina Faso, including 47 O. glaberrima samples among the 330 collected accessions, with a characterization of agro-morphological diversity (Kam et al. 2017a) and genetic diversity using 23 microsatellites markers (Kam et al. 2017b). As expected, considering that upland ecology is not very common in Burkina Faso, only 11 accessions were assigned to O. sativa japonica genetic group; the indica group was consequently by far the most abundant and could be subdivided into three genetic groups (Kam et al. 2017b).
Considering the recent rise in areas cultivated in rice in Burkina Faso (three-fold between 2006 and 2016, FAO), and the intensification of rice cultivation (Demont et al. 2013), it is likely that the rice genetic diversity in the fields is rapidly evolving as well, rendering important to get a more recent picture of rice genetic diversity in this country.
Among the four rice production systems existing in West Africa (rainfed upland, rainfed lowland, irrigated and mangrove swamps, AfricaRice 2011; Katic et al. 2013), two are important in Burkina Faso. First, rainfed lowland rice represents the largest rice growing area in the country (67% between 1984 and 2009, MAHRH 2011). Second, irrigated rice represents less than 30% of harvest areas (FAO), but constitutes more than half of the total paddy rice produced (MAHRH 2011). Rice is grown in Burkina Faso over all 13 regions but most of the production comes from Hauts Bassins, Centre-Est, Boucle du Mouhoun and Cascades regions (DGPER 2010; MAHRH 2011).
Because rice genomics is important for agrobiodiversity characterization in an objective of rice self-sufficiency (McCouch et al. 2013), it is timely to provide an update on the genetic characterization of rice cultivated in Burkina Faso. To this purpose, we selected two sets of samples for genotyping: cultivars certified in Burkina Faso or West Africa and samples from farmer’s fields located in six study sites in western Burkina Faso, in order to address the following questions: (1) What is the genetic diversity of rice from Burkina Faso compared to worldwide rice diversity ? (2) What is the genetic diversity and differentiation of rice actually grown by the smallholder rice farmers in irrigated and rainfed lowland sites in western Burkina Faso ? (3) What is the correspondence between certified cultivars, genetic assignation of field samples and cultivar’s names as known by the farmers ?
We selected 27 certified cultivars from INERA Farako-Bâ, to represent the diversity of certified cultivars including some already adopted by the farmers in Burkina Faso, and recent elite varieties from African breeding programs. The information related to each of them appears in Table 1.
Table 1
List of registered cultivars from INERA/ Burkina Faso analyzed in this study.
Cultivar
|
Synonym
|
Origin
|
Introduction Date
|
Group
|
FKR02
|
GAMBIAKA
|
Gambie
|
1970
|
indica
|
FKR04
|
SINTANE DIOFOR
|
Senegal (Casamance)
|
1960
|
-
|
FKR14
|
4418
|
Inde
|
1976
|
indica
|
FKR16
|
4456
|
Inde
|
1976
|
indica
|
FKR18
|
SC 27
|
Burkina Faso
|
1980
|
indica
|
FKR21
|
ITA 257
|
IITA/Nigéria
|
1987
|
japonica
|
FKR33
|
1195-5-2
|
Burkina Faso
|
1982
|
japonica
|
FKR34
|
RP 1125-1526-2
|
Inde
|
1984
|
indica
|
FKR42
|
IR64
|
IRRI/Philippines
|
1989
|
indica
|
FKR45N
|
WAB880-1-38
NERICA 12
|
AfricaRice
|
1999
|
NERICA (upland)
|
FKR55
|
WAB450-I-BL-1-736-HB
NERICA8
|
AfricaRice/INERA
|
-
|
NERICA (upland)
|
FKR56N
|
WSA 161-B-9-3
NERICA L-41
|
AfricaRice
|
1999
|
NERICA L (lowland)
|
FKR59
|
WAB9984
|
AfricaRice
|
2009
|
japonica
|
FKR60N
|
WAS122-IDSA-1
NERICA L-20
|
AfricaRice
|
1999
|
NERICA L (lowland)
|
FKR61
|
WAB C1 65
|
AfricaRice
|
2009
|
japonica
|
FKR62N
|
WAS 122-IDSA-1
NERICA L-19
|
AfricaRice
|
1999
|
NERICA L (lowland)
|
FKR64
|
TS2
|
Taiwan/DGPV/INERA
|
-
|
indica
|
FKR66
|
WAT1046-B43
|
AfricaRice
|
2010
|
indica
|
FKR70
|
IR 75-884-12-12
|
AfricaRice
|
2009
|
indica
|
FKR74
|
WAB 2094-WAC2-TGR2-B
ARICA1
|
AfricaRice/INERA
|
-
|
ARICA (lowland)
|
FKR76
|
F6-36
|
DGPV/INERA
|
2012
|
indica
|
FKR78
|
F6-41
|
DGPV/INERA
|
2012
|
indica
|
FKR80
|
F6-49
|
DGPV/INERA
|
2012
|
indica
|
FKR84
|
ORYLUX 6
|
AfricaRice/INERA
|
-
|
indica
|
NERICA4
|
WAB450-I-B-P-91-HB
|
AfricaRice/INERA
|
-
|
NERICA (upland)
|
SAHEL177
|
WAS 197-B-6-3-11
|
AfricaRice/Saint-Louis
|
-
|
indica
|
SAHEL328
|
WAS 197-B-4-1-5
|
ISRA/AfricaRice
|
-
|
indica
|
For each cultivar, are indicated potential synonym, the country or organism of origin, the date of introduction in Burkina Faso and the genetic group, based on a priori knowledge. Most of the cultivar’s names begins with ‘FKR’ for ‘Farako-Bâ Riz’.
|
In addition, we took advantage of a sampling previously performed in six sites in western Burkina Faso (Barro et al. 2021). These six sites are located in three geographical zones (Bama, Banzon and Karfiguela), each zone comprising one irrigated area and a neighboring rainfed lowland (Fig. 1a and https://dataverse.ird.fr/dataset.xhtml?persistentId=doi:10.23708/8FDWIE). The present study focused on the 50 fields studied in 2018 (7–11 fields per site, 8,33 on average, see Table 2). A map of the studied fields is presented in Fig. 1a. Each sample used in this study corresponded to one leaf per field, and was collected between September and December 2018. In 40 out of these 50 fields (80%), we interviewed the farmer and asked for the rice cultivars grown. We also performed farmer’s interviews in the same sites in the two previous years (2016–2017) and subsequent year (2019) (Barro et al. 2021). A synthesis of obtained responses (annual frequencies of cultivars grown over the four-year period) are reported in Fig. 1b.
Table 2
Obtained data for each of the study site: characteristics, GPS coordinates, number of analyzed samples and gene diversity
Zone
|
Site
|
Rice production system
|
Latitude
|
Longitude
|
N samples
|
Gene Diversity
|
Bama
|
Badala
|
RL
|
11.368
|
-4.373
|
7
|
0.125
|
0.117
|
Bama
|
IR
|
11.390
|
-4.410
|
9
|
0.131
|
Banzon
|
Senzon
|
RL
|
11.288
|
-4.829
|
7
|
0.113
|
0.124
|
Banzon
|
IR
|
11.335
|
-4.796
|
11
|
0.108
|
Karfiguela
|
Tengrela
|
RL
|
10.648
|
-4.838
|
7
|
0.155
|
0.132
|
Karfiguela
|
IR
|
10.678
|
-4.813
|
9
|
0.130
|
Average
|
|
|
|
|
|
0.131
|
0.124
|
Global
|
|
|
|
|
50
|
0.137
|
Wet lab work (both DNA extraction and SNP genotyping) was performed at the Genotyping Services Lab at the International Rice Research Institute (IRRI). DNA fingerprinting approach used the Illumina Infinium rice 6K chip (C6AIR) (Thomson 2014), a set of SNP designed to characterize the diversity within Oryza sativa species. This chip was already used in rice diversity studies, for example to characterize rice samples from Bangladesh (Kretzschmar et al. 2018). Genotyping data table was provided by the genotyping platform and used for subsequent analysis (raw data available at https://github.com/Africrop/divrice_bf).
In order to place the rice diversity from Burkina Faso in the global context of Asian rice diversity, we downloaded the 29mio SNP datasets from the 3K genome data available at https://snp-seek.irri.org/. Obtained PLINK binary files were converted to VCF using PLINK software v1.9 (www.cog-genomics.org/plink/1.9/) (Chang et al. 2015), enabling the keep-allele-order option. SNPs positions corresponding to the C6AIR were then extracted from the VCF file using bcftools v1.9 (Danecek et al. 2021), then imported within the R software v4.1.0 (R Core Team 2018), as well as the genotyping table from accessions from Burkina Faso. Datasets from the chip genotyping and from the 3K genome data were merged prior to apply genomic filters. In order to keep best-quality SNPs, we applied the following filters SNPwise: less than 15% of missing data considering only the accessions from Burkina Faso, less than 10% missing data considering the whole dataset and an additional filter on heterozygosity which get rid of positions with more than 45% heterozygosity. We ended up with a final dataset including 5,247 SNPs.
We first made a Principal Component Analysis (PCA, Fig. 2) using LEA3.1 R package (Gain and François 2021), graphical display was made using ggplot2 R package v3.3.5 (Wickham 2016). Datasets were converted to the genind format from the adegenet R package v2.1.3 (Jombart 2008) and analyzed both with adegenet and hierfstat R package v0.5-7 (Goudet and Jombart 2020). A Discriminant Analyses on Principal Components (DAPC) was performed, using the 3K diversity groups as reference to assign accessions from Burkina Faso to these groups (Fig. S2). We then restrained our analysis on the accessions from this study (samples from Burkina Faso) and first computed a genetic tree within these samples (Fig. 3). Genetic distances between the accessions were computed using the dist.gene function and the resulting Neighbor-Joining tree was computed using the ape R package v5.5 (Paradis and Schliep 2019). Graphical representation was made using the “fan” option of the ggtree R package v3.1.2 (Yu 2020). A PCA was then computed considering only the field genotypes (with or without a peculiar accession, Fig. 3 and Fig. S1) with the dudi.pca function of ade4 R package v1.7-17 (Dray and Dufour 2007). Finally basic population genetics descriptive statistics (gene diversity and populations pairwise Fst) considering different levels of hierarchy were computed using hierfstat (Table 2 and Table 3).
Table 3
Pairwise FST matrix – Zone and type of riziculture level
|
Bama_RL
|
Bama_IR
|
Banzon_RL
|
Banzon_IR
|
Karfiguela_RL
|
Karfiguela_IR
|
Bama_RL
|
NA
|
|
|
|
|
|
Bama_IR
|
0.001848587
|
NA
|
|
|
|
|
Banzon_RL
|
0.012288212
|
-0.019369126
|
NA
|
|
|
|
Banzon_IR
|
0.056308688
|
-0.012856699
|
-0.01936913
|
NA
|
|
|
Karfiguela_RL
|
0.294933952
|
0.253058369
|
0.28337828
|
0.32875531
|
NA
|
|
Karfiguela_IR
|
0.031039162
|
0.011808917
|
0.04078993
|
0.10737341
|
0.26147627
|
NA
|
The PCA analysis including both this study’s samples and the 3K rice genomes (Fig. 2), as well as the DAPC analysis (Fig. S2), showed that the vast majority of the samples corresponds to indica group. This was expected as none of the samples came from upland growing systems, where japonica are generally found (Ndjiondjop et al. 2018a). Also, Diop et al. (2020) found that indica was the most widely cultivated type of lowland rice in West Africa; with very few of their samples revealed as japonica.
On the other hand, seven reference cultivars (FKR45N, FKR59, FKR33, FKR21, FKR61, NERICA4, NERICA8, see Fig. 2, Fig. 3) were attributed to the japonica group. These cultivars were known to be japonica or Nerica upland (Table 1, Fig. 3) so that this result is congruent with expectations. The japonica group do not contain any of the analyzed samples from farmer’s fields (Fig. 2, Fig. 3). The cultivar FKR04, a cultivar introduced from Casamance (Senegal) in 1960 belonged to admix (Table 1, Fig. 2, Fig. 3). Finally, one field sample, from the field labelled ‘TG02’, belong to Aus group (Fig. 2, Fig. 3). The Aus genetic group does not seem to be common in West Africa in general, as it does not appear in the two studies cited previously (Diop et al. 2020; Ndjiondjop et al. 2018a), whereas Sié et al. (1999) reported the presence of two Aus rice samples in a previous sampling performed in Burkina Faso.
Global gene diversity estimated in the 27 certified cultivars from Burkina Faso was 0.282. It represented various genetic groups (indica, japonica and admix, Fig. 2). Within-group diversity was also apparent as we noticed the certified cultivars from Burkina Faso are not so closed from each other’s within indica and japonica diversity groups (Fig. 2). We argue however that the diversity used for the certified cultivars in Burkina Faso could still be enlarged by mobilizing more genetic diversity of the rice worldwide germplasm.
Global gene diversity estimated in the 50 analyzed field samples from Burkina Faso was 0.137, and within-site genetic diversity was the highest in Tengrela site (0.132; Table 2). On the other hand, Banzon irrigated site presented the lowest genetic diversity (0.108; Table 2). Tengrela was also involved in all highest pairwise genetic differentiation values (Table 3, Table S1). Highest between site genetic differentiation was between Tengrela rainfed lowland and Banzon irrigated perimeter (FST = 0.328 [0.311–0.347]). Such a specificity of this site of Tengrela, compared to others study sites, was also evidenced in a previous study on agricultural practices, showing that rice was mostly grown by women, for self-consumption only, with low frequency of chemical fertilization but often manure from household waste (Barro et al. 2021).
Tengrela was also the only site among the six where only landraces were grown (no use of certified cultivars, Fig. 1b, Barro et al. 2021) and where a sample was attributed to Aus group (Fig. 2, Fig. S1). The farmer named its cultivar ‘Samperema’. We note that the six other samples from Tengrela (whose names include ETP, Bedankaki, Bandakadi / Debale, Tchombiais) are also differentiated from the samples found in the five other sites (Fig. 3, and see the point aspect in Fig. 4 and Fig. S1). They could derive from hybridization between the locally grown landrace belonging to the Aus diversity group and introduced indica varieties, as suggested by their affinity in the PCA with TG02 (Fig. S1).
In terms of rice growing systems, we note that the samples from irrigated areas (in blue, Fig. 3) and rainfed lowlands (in orange, Fig. 3) do not specifically differ from each other, with the exception of Tengrela village, as previously mentioned (TG samples in Fig. 3). Rainfed lowlands considered as a whole (three sites) had a higher gene diversity (0.149) compared to irrigated areas (0.125). The genetic differentiation between all rainfed lowlands fields and all irrigated areas was estimated to FST = 0.030 [0.027–0.034]. This is likely due to the peculiarity of Tengrela site, as we note that the Fst obtained between rainfed lowlands and irrigated areas from the geographic zones of Banzon and Bama both do not differ from zero (Table 3 and Fig. S1).
In the phylogenetic tree (Fig. 3), as well as PCA analyses (Fig. 4 and Fig. S1), we also note that many field samples (from five sites: Senzon, Banzon, Karfiguela, Badala, Bama) are identical to each other’s and to the reference cultivar FKR64 (commonly named TS2 in Burkina Faso, see Table 1). This cultivar, originating from Taïwan (Table 1), was frequently mentioned by the farmers from all sites, except in the peculiar site of Tengrela (Fig. 1b). Consequently, genetic data and farmer’s responses were in agreement in that this rice cultivar is the most frequently grown in the studied sites from western Burkina Faso. However, at the field level, the genetic assignation of samples did not always correspond to the cultivar given names according to farmers’ interviews (see the color of labels in Fig. 4 and Fig. S1). This likely reflects the dynamics of rice genetic diversity in farmer’s field and illustrates that genetic pool is not fixed but still evolving.
Our results offer a picture of rice genetic diversity in six sites from lowland and irrigated areas of western Burkina Faso. We confirmed that indica rice is by far the most frequently grown, but we identified a sample from the Aus genetic group. We globally did not notice major differences between lowland and irrigated areas, except for the lowland site of Tengrela, where rice is grown traditionally (low input) by women using traditional landraces. We confirmed the predominance of one certified cultivar in the five other sites.
Further research is required to encompass rice agrobiodiversity actually grown in this country and in West Africa in general. As a perspective of this study, we first propose to include more certified varieties and to extend the geographic areas to cover all important rice production areas from Burkina Faso. For instance, it would be interesting to study samples from other regions in Burkina Faso, such as the Boucle du Mouhoun, that was shown to be the most diversified in a previous study (Kam et al. 2017a). In addition, it could have been interesting to include rainfed upland rice fields, although this rice production system is minor in Burkina Faso (10% of the rice land area and 5% of national rice production, MAHRH 2011). Third, deciphering potential within-field rice genetic diversity is also an interesting research question for future work, that was not addressed in our study where only one plant sample from each field was analyzed (but see Gouda et al. 2020).
We also documented the genetic diversity of 27 certified cultivars, including indica, japonica and Nerica. This may offer the perspective (not straightforward though) to try to design easy-to-use genetic markers (see Ndjiondjop et al. 2018b) for markers discriminant for rice species) useful for quality control and seed certification. Finally, we argue for more studies combining rice genetic diversity with human and social science to understand further the rationale behind rice farmer’s seed choice. This would allow understanding the apparent discrepancies between genetic assignations and naming of the cultivars in the fields, and more importantly getting useful information to design suitable strategies for rice genetic diversity management.
Indeed, while genetic improvement is very important to increase yield and fight poverty and food insecurity (Arouna et al. 2017), it is also critical to preserve agrobiodiversity and to include landraces, especially the preferred by farmers and consumers, in breeding programs and dissemination projects. This is especially pertinent in the context of global changes because farmer rice varieties in West Africa were shown to be robust / tolerant to sub-optimal conditions (Mokuwa et al. 2013).