Blood collection and PBMC isolation
The samples from both COVID-19 patients and healthy donors were collected in accordance with the Declaration of Helsinki after approval by the institutional review boards. Each participant signed informed consent. All COVID-19 patients were tested positive for SARS- CoV-2 using reverse transcriptase chain reaction (RT-PCR) from an upper respiratory tract (nose/throat) swab test in accredited laboratories. Peripheral blood was collected in ethylenediaminetetraacetic acid (EDTA) tubes following subsequent isolation of PBMCs using Ficoll-Paque density centrifugation according to standard protocol. PBMCs were suspended in fetal bovine serum (FCS) with 10% dimethyl sulfoxide (DMSO) and stored in liquid nitrogen.
HLA typing of study participants
PBMCs isolated from COVID-19 patients were thawed and washed with RPMI 1640 supplemented with 10% FCS, 1% Penicillin-Streptomycin solution and Benzonase nuclease (Merck-Millipore, 2500 U/mL), resuspended and incubated at 37C for 30 minutes. For the healthy donor samples, DNA was isolated directly from whole blood. PBMCs were counted and up to 1,000,000 cells were aliquoted for subsequent DNA isolation. DNA was isolated using the DNeasy Blood & Tissue Kit (Qiagen, cat. #69506) according to manufacturer’s protocol. HLA typing was done using next-generation sequencing according to the manufacturer's protocol (GenDx).
SARS-CoV-2 epitope selection and peptide synthesis
Fifty SARS-CoV-2 epitope-HLA combinations were selected for each of the top 10 most prevalent HLA alleles in Italy. The selection was primarily based on SARS-CoV-2 epitopes that had the highest predicted binding affinity to the MHC according to NetMHCpan-4.022, as well as receiving a prediction score higher than 0.5 using NetChop-3.123. The SARS-CoV-2 proteome was obtained from UniProt (Proteome ID: UP000464024). Predicted 9-11 mer epitopes from the following open reading frames (ORFs) of the SARS-CoV-2 proteome were included in the analysis: ORF 1ab, 3a, 6, 7a, 7b, 8, 9b, 10, 14, envelope (E), membrane (M), nucleoprotein (N) and spike (S) protein. In addition, SARS-CoV-2 epitopes that were predicted to be most immunogenic by the science community were included for analysis (Table S1)48-51.
A total of 438 unique peptides were synthesized by the Chemical Biology group, Leiden University Medical Centre.
Generation of UV-cleavable pHLA monomers
The UV-cleavable peptides were synthesized in-house as described previously52. Recombinant HLA-A*01:01, A*02:01, A*03:01, A*11:01, A*24:02, B*07:02, B*08:01, B*15:01, B*18:01 and B*51:01 heavy chains and human beta-2 microglobulin (B2M) were produced in Escherichia coli and isolated from resulting inclusion bodies53. MHC class I refolding reactions and purification by gel filtration HPLC were performed, and HLA-A and B heavy chains and B2M were refolded in the presence of UV-cleavable peptides (Table S5) following subsequent biotinylation as described previously54.
Generation of fluorescent pHLA multimers
MHC complexes were loaded with the selected SARS-CoV-2 peptides via UV-induced ligand exchange52,55. In brief, pHLA complexes with UV-sensitive peptide were subjected to 254/366 nM UV light for 1h at 4˚C in the presence of a rescue peptide. The following amounts of 14 different fluorescent streptavidin conjugates were added to 10 ml of pHLA monomer (100 mg/ml): 1 ml of SA-BB790 (BD, custom), 1 ml of SA-BB630 (BD, custom), 1 ml of SA-APC- R700 (BD, 565144), 0.6 ml of SA-APC (Invitrogen, S868), 1 ml of SA-BV750 (BD, custom), 2 ml of SA-BV650 (BD, 563855), 2 ml of SA-BV605 (BD, 563260), 2 ml of SA-BV480 (BD, 564876), 2 ml of SA-BV421 (BD, 563259), 1 ml of SA-BUV615 (BD, 613013), 1.5 ml of SA- BUV563 (BD, 565765), 2 ml of SA-BUV395 (BD, 564176), 1.25 ml of SA-BV711 (BD, 563262) and 0.9 ml of SA-PE (Invitrogen, S866). For each pHLA monomer, conjugation was performed with two of these fluorochromes resulting in up to 75 dual fluorescent color codes. Subsequently, milk (1% w/v, Sigma) was added to block and capture unspecific peptide binding residues, and fluorescently labelled pHLA multimers were incubated for 30 min on ice. Finally, D-biotin (26.3 mM, Sigma) in PBS and NaN3 (0.02% w/v) was added to block residual binding sites.
Combinatorial encoding of pHLA multimers and surface marker staining
PBMCs were thawed and washed with RPMI 1640 supplemented with 10% human serum, 1% Penicillin-Streptomycin solution and Benzonase nuclease (Merck-Millipore, 2500 U/mL), resuspended and incubated at 37C for 30 minutes. The following amounts of fluorescently labelled pHLA multimers were used to stain CD8 T cells: 1 ml of SA-BB790-pHLA, SA-BB630-pHLA, SA-APC-R700-pHLA, SA-BV750-pHLA, SA-BV650-pHLA, SA-BV605- pHLA, SA-BV480-pHLA, SA-BV421-pHLA, SA-BUV615-pHLA, SA-BUV563-pHLA, SA- BUV395-pHLA, SA-BV711-pHLA, SA-PE-pHLA and 2 ml of SA-APC-pHLA. The cells were stained in Brilliant Staining Buffer Plus (BD, 563794) according to manufacturer’s protocol. Final staining volume was 100 up to 194 ml depending on the amount of fluorescent pHLA multimers for each individual sample. Cells were incubated for 15 min at 37 °C. Subsequently cells were stained with 2 ml of a(nti)CD8-BUV805 (BD, 564912), 1 ml of aCD4- APC-H7 (BD, 641398), aCD14-APC-H7 (BD, 560180), aCD16-APC-H7 (BD, 560195), aCD19-APC-H7 (BD, 560252), a2B4-FITC (BD, 550815), aTim-3-BV786 (BD, 742857), aPD1-BUV737 (BD, 612791), aNKG2A-PE-Cy7 (Beckman, B10246) and 0.5 ml of LIVE/DEAD Fixable IR Dead Cell Stain Kit (Invitrogen, L10119) and incubated on ice for 20 min. Samples were analyzed on the BD FACSymphony A5.
Identification of antigen-specific CD8 T cell responses
Analysis of antigen-specific CD8 T cell responses was carried out without prior knowledge about clinical patient characteristics to avoid experimental bias. The following gating strategy was applied to identify CD8+ T cells: (i) selection of live (IRDye low-dim) single-cell lymphocytes [forward scatter (FSC)-W/H low, side scatter (SSC)-W/H low, FSC/SSC-A], (ii) selection of anti-CD8+ and ‘dump’ (anti-CD4, anti-CD14, anti-CD16, anti-CD19) negative cells. Antigen-specific CD8 T cell responses that were positive for two none of the other pHLA multimer channels were identified using Boolean gating. The full gating strategy used on the BD FACSymphony A5 is shown in Fig. S1. Cut-off values for the definition of positive responses were ≥ 0.005% of total CD8 T cells and ≥ 5 events. A minimum of 1,000 CD8 T cells were acquired per sample. To reduce researcher-bias caused by manual gating, only positive responses that were confirmed by three independent people were defined as real. Data was analysed using either the BD FACSDiva v.8.0.1 or the FlowJo 10.6.2 software. To monitor the reproducibility of the assay system, reference samples with up to 10 CD8 T cell responses present at varying frequencies were included in each analysis.
Peptide stimulation of SARS-CoV-2-specific CD8 T cells
PBMCs were thawed and washed with RPMI 1640 supplemented with 10% human serum, 1% Penicillin-Streptomycin solution and Benzonase nuclease (Merck-Millipore, 2500 U/mL), resuspended and incubated at 4C for 60 minutes. Cells were washed and cultured at 37C for 12 hours in 96-wells U-bottom plates at 1x105 PBMCs per well in the presence of GolgiPlug (1:1000, BD, 555029), and either equimolar amounts of DMSO alone (negative control), or phorbol 12-myristate 13-acetate (50 ng/ml) and Ionomycin (1 μg/ml) (positive control) or SARS-CoV-2 peptides (2 μg/ml of each peptide). Cells were washed, resuspended in 50 ml FACS buffer and stained for 20 min on ice with 1 ml of aCD8-BUV805 (BD, 564912), 0.5 ml of aCD4-APC-H7 (BD, 641398), aCD14-APC-H7 (BD, 560180), aCD16-APC-H7 (BD, 560195), aCD19-APC-H7 (BD, 560252), aNKG2A-PE-Cy7 (Beckman, B10246), 0.125 ml of aHLA-DR-BUV661 (BD, 565074) and 1 ml of CD137-PE/Dazzle-594 (BioLegend, 309825).
After incubation, cells were washed with PBS and resuspended in 50 ml PBS and 0.125 ml LIVE/DEAD Fixable IR Dead Cell Stain Kit (Invitrogen, L10119). After a 10 min incubation on ice, cells were washed, fixed and permeabilized using the Foxp3 Transcription Factor Staining Buffer Set (eBioscience, 00-5523-00) according to manufacturer’s protocol. Intracellular cytokines were stained for 20 min on ice in 50 ml staining volume with 1 ml of aIFNγ-APC (BD, 554702), aTNFα-FITC (BD, 554512), aIL17-PE (BioLegend, 512306) and 0.25 ml of aIL2-BV750 (BD, 566361). Cells were washed twice with FACS buffer after staining subsequently measured on the BD FACSymphony A5. The following gating strategy was applied to identify CD8+ T cells: (i) selection of live (IRDye low-dim) single-cell lymphocytes [forward scatter (FSC)-W/H low, side scatter (SSC)-W/H low, FSC/SSC-A], (ii) selection of anti-CD8+ and ‘dump’ (anti-CD4, anti-CD14, anti-CD16, anti-CD19) negative cells. The gates applied for the identification of TNFα, IFNγ, IL-2, IL-17, CD137, HLA-DR and NKG2A positive cells of total CD8+ T cells were defined based on the cells cultured with DMSO for each individual. Reactivity and activation upon peptide stimulation was defined based on a fold-change of ≥ 1.5 in expression of the cytokine and activation markers included in the analysis.
Flow cytometer settings
The following 21-color instrument settings were used on the BD FACSymphony A5: blue laser (488 nm at 200 mW): FITC, 530/30BP, 505LP; BB630, 600LP, 610/20BP; BB790, 750LP, 780/60BP. Red laser (637 nm at 140 mW): APC, 670/30BP, APC-R700, 690LP, 630/45BP, IRDye and APC-H7, 750LP, 780/60BP. Violet laser (405 nm at 100 mW): BV421, 420LP, 431/28BP; BV480, 455LP, 470/20BP; BV605, 565LP, 605/40BP; BV650, 635LP, 661/11BP; BV711, 711/85, 685; BV750, 735LP, 750/30BP, BV786, 780/60BP, 750LP. UV laser (355 nm at 75 mW): BUV395, 379/28BP, BUV563, 550LP, 580/20BP; BUV615, 600LP, 615/20BP; BUV661, 630LP, 670/25BP; BUV737, 735/44BP, 770LP; BUV805, 770LP, 819/44BP.
Yellow-green laser (561 nm at 150 mW): PE, 586/15BP; PE/Dazzle-594, 600LP, 610/20BP; 2.PE-Cy7, 750LP, 780/60BP. Appropriate compensation controls were included in each analysis.
Statistical analysis
Differences in magnitude of identified CD8 T cell responses stratified based on antigen source or recognized epitope were assessed using the non-parametric Mann-Whitney U-test. Ordinary one-way ANOVA test was used to assess differences in binding affinity of predicted epitopes restricted to different HLA alleles. The data cut-off for all analyses was 21 May 2020. Statistical analysis was performed using Excel 16.36 and PRISM 8 (Version 8.4.0).
Sorting of SARS-CoV-2 specific CD8 T cells
PBMCs were thawed and washed with cold RPMI 1640 supplemented with 10% human serum, 1% Penicillin-Streptomycin solution and Benzonase nuclease (Merck-Millipore, 2500 U/mL), resuspended and incubated on ice for 60 minutes. SARS-CoV-2-specific CD8 T cells were stained with TotalSeqTM-streptavidin-labelled (Biolegend, 405271, 405273, 405275, 405277) pHLA multimers on ice for 30 min in a final staining volume of 100ul. Subsequently cells were stained with 2 ml of aCD8-BV421 (BD, 562428), 1 ml of aCD4-FITC (BD, 345768), aCD14- FITC (BD, 345784), aCD16-FITC (BD, 335035), aCD19-FITC (BD, 345776), 0.5 ml of LIVE/DEAD Fixable IR Dead Cell Stain Kit (Invitrogen, L10119), and TotalSeqTM-C anti- human hashtag antibodies (BioLegend, 394661, 394665, 394667, 394669, 394673) on ice for 20 min. Stained cells from 5 patients were pooled, washed and sorted on the FACSAria Fusion into 0.04% bovine serum albumin (BSA, w/v) in PBS at 4C. Forward and side scatter settings were used to select for lymphocytes and to exclude doublets. Viable CD8 T cells were identified and sorted based on low LIVE/DEAD IRdye staining and high CD8 expression.
Single-cell RNA-seq and TCR-seq capturing, library construction and sequencing
The single-cell suspension was split in two samples that were successively loaded onto a Chromium Single Cell Chip (10x Genomics) according to the manufacturer’s protocol for co- encapsulation with barcoded Gel Beads at a capture rate of 1200 and 960 individual cells, respectively. The following 10x genomics kits were used to produce the Gel Bead-In Emulsions (GEMs) and the resulting sequence libraries (Gene expression library, Feature Barcode library, TCR library) according to manufacturer’s protocol: Chromium Next GEM Single Cell 5’ Library and Gel Bead kit v1.1 (10x Genomics, PN-1000167), Chromium Next Gem Single Cell V(D)J Reagent Kit v1.1 (10x Genomic, PN-1000165), Chromium Single Cell 5’ Library Construction Kit (10x Genomics, PN-1000020), Chromium Single Cell 5’ Feature Barcide Library Kit (10x Genomics, PN1000080) and Chromium Next GEM Chip G Single Cell kit (10x Genomics, PN-100127). The three libraries were combined in relative fractions of 0.785, 0.085 and 0.130 in order to generate sufficient reads per cell for each type of library. The final library pool was sequenced on a NextSeq Mid Flowcell, with 150 cycle chemistry kit in paired-end fashion 26-8-130bp.
Single-cell RNA-seq data analysis
The Cell Ranger Software Suite (v.3.1.0) was used to perform sample de-multiplexing, barcode processing and single-cell 5’ unique molecular identifier (UMI) counting. Single-cell RNA- seq data analysis was performed with Scanpy 1.5.156. Data was analyzed with Python 3.7.6. pandas 1.0.1, and NumPy 1.18.1 were used for data manipulation, and Seaborn 0.10.0 and Matplotlib 3.1.3 were used for plotting. The following criteria were applied to each cell of all five patients: gene count between 200 and 2500, mitochondrial gene percentage <0.25 and ribosomal gene percentage >0.2. Data was then normalized to depth 10 000, and ln(1+x) was calculated. After filtering and normalization number of counts per cell and percentage of mitochondrial genes were regressed out from the data using scanpy.pp.regress_out. Data was subsequently scaled with scanpy.pp.scale using default parameters. PCA was computed on highly variable genes. A neighborhood graph of observations was computed with 50 principal components and n_neighbors=10. UMAP plots were plotted using scaled data. Louvain clustering was performed with scanpy.tl.louvain with default parameters. Marker genes were found using scanpy.tl.rank_genes_groups on the non-scaled data (use_raw=True) with t-test. Marker genes were filtered based on a minimum fold change of >2 or <0.5 and maximum Benjamini-Hochberg FDR value of 0.05. Gene ontology analysis using selected differentially expressed gene sets was performed using DAVID Bioinformatics Resources 6.8 using default settings (https://david.ncifcrf.gov/home.jsp).
TCR V(D)J sequencing and analysis
Full length TCR V(D)J segments were enriched from amplified cDNA from 5’ libraries via PCR amplification using the Chromium Single-Cell V(D)J Enrichment kit (10x Genomic, PN- 1000005) according to the manufacturer’s protocol (10x Genomics). TCR sequences for each single T cell were assembled by Cell Ranger vdj pipeline (v.3.1.0), leading to the identification of CDR3 sequences and the re-arranged TCR gene. TCR repertoire analysis was performed with Scirpy 0.3. TCR diversity and TCR clonal size were estimated using scirpy.tl.alpha_diversity and ir.pl.clonal_expansion (performing the normalization), respectively. V(D)J gene usage was estimated with scirpy.pl.vdj_usage. Abundance of particular TRB V segments was estimated with scirpy.pl.group_abundance, performing the normalization. Multiple sequence alignment was performed with T-Coffee Expresso using the default settings57.