Herein, single-cell transcriptomics using a lineage tracing modality was used to examine gene expression in murine corneolimbal epithelial cells. Several novel genes were uncovered that potentially identify putative stem/progenitor cells in cluster 3, one of which, namely Gas1, was validated using qPCR, flow cytometry and immunostaining to ascertain spatial and temporal expression patterns across the ocular surface. This gene is upregulated in muscle, neural and skin stem cells (references please), highlighting its wide-ranging potential to mark a multitude of progenitor cells, including those with the limbus. Additionally, this gene codes for a surface protein that could be used to tether antibodies for cell purification to study their biology and produce higher quality, efficacious SC grafts for patients with LSCD.
Bulk RNA sequencing has been performed for over two decades on many cell types. However, this technique lacks the resolution necessary to identify rare cells or those with genes expressed at low levels [34]. First performed in 2009 on mouse blastocysts [35], scRNA-seq has been widely used to study cell populations using a variety of methods (reviewed in [36] and [37]) and has even been explored, but not used, in diagnostics [38]. Several platforms are available to analyze the transcriptome of single cells, including SmartSeq-2 (utilized herein), 10X Chromium (10X Genomics, Pleasanton, CA), massively parallel RNA single-cell sequencing (MARS-seq) [39], the Fluidigm C1 systems SmartSeq [40] and mRNA Seq HT [37]. While Smart-seq v4 was recently used [41], the most common modality to evaluate the corneal epithelium has been 10X, likely due to its low cost, ability to analyze thousands of cells in a single reaction, and ease of sample preparation [13, 25, 42]. However, this approach precludes pre-selecting cells for downstream analyses, potentially omitting rare populations and thus suffering from selection bias, and the inability to detect more than 500–1000 genes/single cell [37]. The SmartSeq-2 technique we employed is a plate-based method that captures full-length transcripts [43], with the added capacity to produce ~ 1,000,000 reads/cell and the ability to amplify low-abundance genes [44]. A study that compared SmartSeq-2 and 10X with the same peripheral blood sample, discovered that while the former only captured ~ 10% of the number of cells compared to 10X, it produced 10-fold more reads/cell [44]. Nonetheless, Smartseq-2 suffers from inefficient reading of transcripts > 4 kb in length, preferentially amplifies abundant transcripts, and is limited by the number of cells that can be analyzed [45, 46]. As the majority of previous investigations on corneolimbal epithelia engaged either bulk sequencing [47–49] or 10X [13, 14, 25], notwithstanding its limitations, we employed, for the first time, Smartseq-2 to provide an alternative approach in an attempt to resolve an issue that continues to plague the field.
For the current investigation, potential marker genes for LESCs were considered based on two criteria. Firstly, they were expressed in cluster 3 (Fig. 2) which was deemed to harbor SCs [25–28], and secondly, the protein product of these genes was expressed on the cell surface. This prerequisite feature would facilitate downstream antibody tethering by either FACS or MACS [50] for isolating populations that contain more SCs, investigating their biology and function, and generating better quality, longer-lasting grafts for patients with LSCD. Certainly, it has been demonstrated that clinical success in patients with LSCD receiving a SC transplant is associated with grafts that comprise > 3% p63+ cells [51, 52]. With this in mind, several genes were identified, primarily residing in cluster 3 (Fig. 2). These genes included those involved in regulating embryonic (Zfp296) [53], mesenchymal (Fosb) [54], hematopoietic (Cdc20) [55], or cancer-associated (Cdca7) [56] SCs, although due to their nuclear or cytoplasmic localization, they were not pursued. Another gene of interest that was identified in cluster 3 was Bambi (Fig. 2). This gene has been identified in a subpopulation of mesenchymal [57] stem cells, and codes for a TGFβ pseudo receptor that acts as a signaling antagonist [58]. However, previous investigations have determined it to be a dispensable gene, as embryonic deletion does not affect fertility nor does it produce a discernible phenotype in mice [59].
Our approach also uncovered several genes that have been deemed by others as marking corneal epithelial progenitor cells. For example, Id1, a transcription factor that regulates bone morphogenetic protein (BMP) is associated with SC quiescence [60] and was expressed in cells from cluster 3. However, it has also been detected in immune cells [3, 61], thereby questioning its specificity or exclusivity for LESCs. Notably, both K14 [32, 62] and K15 [63] have been proposed as LESC markers, although they are also expressed by neighboring conjunctival epithelia [63, 64]. Whilst the Gpha2 protein was found to be expressed in the limbus by multiple investigators [13, 14, 65], our study detected the gene to be expressed in both Confetti+ and Confetti− cells (data not shown) and in all clusters, suggesting it may not be as robust a marker as previously thought. Moreover, there are conflicting reports on the differentiation status of cells that express Gpha2, some describing them as being more mature, i.e., akin to an early TAC [14, 66], while others suggest they represent quiescent SCs [3, 13]. Similar to previous scRNA-seq investigations [42, 67, 68], LESC markers ABCB5 [69] and ABCG2 [70, 71] were not detected as highly expressed genes. Either they were not expressed in our data set (ABCB5), or they were found in multiple clusters (ABCG2) and were thus ruled out as they could not be used to accurately identify cells in cluster 3 (Fig. 1).
Reassuringly, the recently discovered LESC marker gene Lrig1 [25] was predominantly expressed in cluster 3, although it displayed low-level expression in clusters 0 and 1 (Fig. 2e). Importantly, this gene was not expressed in the neighboring conjunctival cluster 2. Numerous other limbal markers have also been identified in the conjunctiva, including K15 and K19 [63], ΔNp63 [72], and ABCG2 [73], suggesting they are not exclusive to cells in this region. Lrig1 was therefore used as a reference to confirm the limbal localization of novel genes (Figs. 3–5) and our scRNA-seq (Fig. 2) implies it identifies both stem/progenitor cells including TACs. Using qPCR, Lrig1 was predominantly expressed in limbal epithelia, and flow cytometry identified a rare population of Lrig1+ corneolimbal epithelial cells (Fig. 3). Furthermore, the Lrig1 protein was confined to the limbus and peripheral cornea, highlighting its expression in both the stem/progenitor cells and their early progeny (Figs. 3–5) [25]. Interestingly, both K14 and Lrig1 have been identified as progenitor cell markers in other organs such as skin [74] and gut [75]. These observations highlight that there may be a set of common markers that are potentially conserved in different, but related tissues.
We focused on Gas1 as this gene was predominantly found in cluster 3 and its protein product expressed on the cell surface (Fig. 2). To the best of our knowledge, this is the first report disclosing its expression in the normal mammalian cornea. Gas1 encodes a glycosylphosphatidylinositol (GPI)-anchored cell-surface protein [20, 76, 77] that has been described as a tumor suppressor [78, 79] due to its ability to negatively regulate the cell cycle. Interestingly, Gas1 modulates the growth and proliferation of muscle [21], neural [22], renal [80], and hair follicle [81] SCs. Relevant to the current topic, Gas1 plays a crucial role in eye development, with mutations in this gene resulting in micro-ophthalmia due to dysregulated cell proliferation in retinal pigmented epithelium and neural retina [19], and severe craniofacial deformations from dysregulated Sonic Hedgehog signaling [77].
We found Gas1 expression in cluster 3, which we identified as harboring putative LESCs, and flow cytometry confirmed that cells expressing this marker are indeed scarce, representing 1–3% of the total population (Fig. 3). Others have demonstrated a similar proportion of putative LESCs using label-retention [82, 83]. Furthermore, the stable limbal expression of Gas1 during aging, concurrent with K14 expression (Fig. 5), supports the proposition that at least some of these cells are bona fide corneal progenitors. However, some of these cells are also likely to be TACs, given that cells within the peripheral cornea also expressed this marker (Figs. 4 and 5). A key feature regarding SC markers is that they are expressed throughout life. Adult SCs appear to decline in both number and functional capacity during aging [84], as seen in neural [85], mesenchymal [86], hematopoietic [87] and muscle [88] compartments. This phenomenon is also apparent in both human [89–91] and mouse [92–94] corneas. Immunoreactivity for Gas1 in mouse corneas remained stable during ageing, suggesting that the limbus and its resident LESCs are relatively stable throughout life (from 20 to 60-weeks). While this provides reassurance that Gas1 is marking a long-lived population within the limbus, further functional experiments need to be performed to confirm this proposition.
Finally, Gas1 expression was found to be increased following a central corneal epithelial injury (Fig. 6), similar to other markers such as K15 [63], p63 and ABCG2 [95]. This likely reflects the function of the gene or protein being examined. For example, ABCG2 has anti-oxidative properties and is likely upregulated following injury to protect the proliferating cells from injury-induced oxidative stress [96]. Additionally, p63, and the ΔNp63α isoform in particular, facilitates the differentiation of corneal/limbal progenitor cells to TACs [97], a crucial element of the wound-healing process. Gas1 has specific roles in cellular growth arrest, and its upregulation during the post-injury period is likely to inhibit cellular proliferation and hyperplasia of the central cornea [98]. An interesting observation from the current investigation was that Gas1 was only upregulated in young (not old) mice following injury (Fig. 6). This data hints at an age-related decline in the ability to generate functional tissue following injury, a phenomenon observed previously [33]. While our group [32] and others [99] have observed a decline in LESC clones over age, this does not necessarily represent a decline in SC number, as neutral drift may reduce clone size over time [100]. However, while the number of SCs may remain stable during aging, our data represent a potential decrease in SC function in aged mice [93]. Interestingly, in humans, despite the reduction in limbal niche structures [89], no discernible decrease in SC markers was found in cultured cells, again demonstrating reduced function rather than the number of LESC during aging.