Belgian patient and control cohorts
Members of the Belgian BELNEU consortium recruited patients at neurological centers associated with university or general hospitals in Belgium [26, 27]. The LBD cohort comprised 844 LBD patients with mean age at onset age (AAO) of 62.9 ± 11.8 years (31.5% female; 20.0% with positive familial history). In the LBD cohort, 233 patients were diagnosed with DLB (mean AAO 70.2 ± 10.2 years, range 34-88; 33.0% females; 24.0% with positive familial history), of which 159 patients (68.2%) had a clinical diagnosis and 74 patients (31.8%) a neuropathological diagnosis of DLB. The remaining 611 patients in the LBD cohort had a clinical diagnosis of PD (mean AAO 60.6 ± 11.3 years, range 24-88; 31.1% females and 18.5% with positive familial history). Patients had a clinical examination by a neurologist and neuroimaging. We collected medical history of patients and family members. A positive family history of disease was given if at least one first-degree relative was affected. DLB patients were diagnosed in accordance with the established criteria for possible, probable or pathological DLB [7, 28], and PD patients according to the NINDS diagnostic criteria for PD [29]. The PD cohort was genetically profiled for the 5 major PD genes (SNCA, LRRK2, PARK2, PINK1 and PARK7) by means of Sanger sequencing for simple mutations and multiplex amplicon quantification (MAQ, Agilent, Multiplicom, Niel, Belgium), quantitative real-time PCR or multiplex ligation-dependent probe amplification (MLPA) [30] for copy number variants [31].
A geographically matched control cohort consisted of 664 individuals with a mean age at inclusion (AAI) of 72.0 ± 9.4 years (range 34-88) and comprised 60.5% females. Control individuals were recruited among healthy partners of patients visiting a memory clinic, and negative for neurological or psychiatric antecedents or neurological complaints, or community-recruited individuals scoring >25 on a Montreal Cognitive Assessment (MoCA) [32] with a negative individual or familial history of neurodegenerative or psychiatric diseases.
Ethical assurances
The ethics committee of the University Hospital Antwerp and University of Antwerp approved the clinical and genetic research protocols. Collection of biological samples was in accordance with the written informed consent signed by the participant and/or their legal guardian.
Whole genome sequencing
Short-read paired-end WGS of two affected siblings of family A (Fig. 1a), subsequent read alignment to the human reference genome (GRCh37/hg19) and base and variant calling were performed by Complete GenomicsTM Inc [33]. To annotate and select genetic variants in the WGS data we used GenomeComb [34]. High quality variants were selected based on a sequence coverage of at least 20x, a variant call score (Complete Genomics Inc.) of ≥60 dB, i.e. a probability of ≥106:1 describing the likelihood of the variant call compared to the second most likely call, and genomic location outside repeat regions marked as simple repeats or micro satellites by RepeatMasker v3.0 [35]. Novel or rare (minor allele frequency (MAF) < 1%) variants in the 1000 Genome Project database [36] and below 25% in our in-house next generation sequencing database of Belgian patients with distinct neurological disorders), non-synonymous coding and splice site variants were selected. Finally, we focused on variants in line with autosomal recessive inheritance, i.e. homozygous or compound heterozygous variants. We slected homozygous variants and multiple compound heterozygous variants in the same gene.
Targeted resequencing of VPS13C
We performed PCR amplification of all 86 coding exons and flanking splice sites of VPS13C by amplicon-target PCR amplification (MASTR technology; Agilent). Amplicons were uniquely tagged; based on the Nextera XT shotgun library preparation protocol (Illumina), containing sample-specific indices [37]. Libraries (n=384) were pooled and sequenced in one run on the MiSeq platform using the MiSeq V3 chemistry, generating paired-end sequence reads of 300 nucleotides (Illumina). After sample de-multiplexing, we mapped sequence reads using the Burrows-Wheeler Aligner (BWA) [38, 39] to a mini-genome, combining the target sequences extracted from the human genome, reference sequence hg19. Sequence variants were called using GATKv3.5 HaplotypeCaller [40, 41] and variants annotated using GenomeComb [34]. We used the Combined Annotation Dependent Depletion (CADD_Phred) score, to predict the deleterious impact of non-synonymous variants in VPS13C, whereby a score above 20 represents the top 1% most deleterious variants in the genome [42]. Alternative splicing of VPS13C produces four different transcripts, ubiquitously expressed. Coding variants are numbered relative to the translation initiation codon in the largest VPS13C transcript (GenBank Accession Number NM_020821.2. Amino acid changes, numbered according to the largest VPS13C isoform (GenPept Accession Number NP_065872.1). Sequencing reads, visualized with the Integrative Genomics Viewer (IGV) [43] using BAM files of individual samples.
Sanger sequencing
VPS13C exons 7-8, 27, 37-38, 41, 46, 54, 60-61, 70-73, 76-77 and 80, <85% 20X covered with the MASTR assay, were analyzed with Sanger sequencing and rare (MAF <1%) non-synonymous coding and splice site variants, identified by targeted resequencing, were validated and genotyped in available family members with Sanger sequencing. Primers were designed using the online Primer3 software [44]. Target regions were PCR amplified from genomic DNA and subsequently Sanger sequencing using the BigDye® Terminator Cycle Sequencing kit v3.1 (Applied Biosystems) on an ABI3730 automated sequencer (Applied Biosystems). Sanger sequences were analyzed using Seqman (DNASTAR) and NovoSNP software [45].
Allele-specific PCR to determine allele-phase configuration
We used allele-specific PCR amplification to determine cis/trans configuration of two VPS13C variants present in the same exon. For each mutation, an allele-specific and a wild-type primer were designed in combination with a general second primer using the online Primer3 software [44] to amplify both the wild-type and mutant allele separately (Table S8, ). The amplicon containing the location of the second mutation purified and Sanger sequenced (as described above).
Haplotype sharing analysis
For haplotype sharing, we selected 11 polymorphic STR markers surrounding VPS13C at chromosome 15q21 for genotyping: D15S1008, D15S198, D15S155, chr15:62080540-62080584, chr15:62440295-62440338, D15S1036, D15S997, D15S159, D15S993 and D15S1507. The STR markers were PCR amplified using fluorescently labelled primers and size-separated using GeneScan 500 Liz Size Standard (Applied Biosystems) on an ABI3730xl DNA Analyzer (Applied Biosystems). Local Genotype Viewer, used to score fragment lengths.
In silico prediction of splice-site variants
For evaluation of the of splice site variants in silico we used five splicing prediction programs (SpliceSiteFinder-like, MaxEntScan, NNSPLICE and GeneSplicer) integrated in Alamut Visual version 2.11.0 (Interactive Biosoftware).
Cell culture
Lymphoblast cells, immortalized by Epstein Barr virus transformation of lymphocytes collected from whole blood on lithium heparin according to standard procedures [46, 47], were cultured in Roswell Park Memorial Institute 1640 medium (RPMI 1640; Life Technologies), supplemented with 15% fetal calf serum (Sigma Aldrich), 2 mM L-glutamine (Life Technologies) and 500 U/500 µg penicillin/streptomycin (Life Technologies).
Human cervical carcinoma (HeLa) cells were cultured in Modified Eagles medium (MEM; Life Technologies), supplemented with 10% fetal calf serum (Sigma Aldrich) and 500 U/500 µg penicillin/streptomycin (Life Technologies). Human neuroblastoma cells (SH-SY5Y) were cultured in Modified Eagles medium (MEM; Life Technologies) supplemented with 10% fetal calf serum (Sigma Aldrich), 1% non-essential amino acids (Life Technologies), 2 mM glutamine (Life Technologies) and 500 U/500 µg penicillin/streptomycin (Life Technologies).
Long-read cDNA sequencing
Total RNA was isolated and purified from lymphoblast cells using the RNeasy Mini Kit as recommended by the manufacturer (Qiagen), and subsequently treated with DNase (Turbo DNase Kit; Ambion). Polyadenylated RNA was enriched from total RNA samples using the Poly(A) RNA Selection Kit (Lexogen; M039100). First strand synthesis, performed on 50ng Poly A+ RNA by means of the SuperScript IV First-Strand Synthesis System using VN primer (VNP), which anneals to poly A-tails, and Template Switching Oligonucleotides (TSO) (Table S8). Both oligonucleotides have sequence tags for subsequent second strand generation and PCR amplification using a Forward and Reverse primer (Table S8). cDNA amplification of VPS13C fragments containing the double variants was performed using LongAmp® Taq DNA Polymerase (New England Biolabs) using primers designed with the online Primer3 software (Table S8, ) [44]. Native barcoding and adapter ligation were carried out using the Native Barcoding Expansion 1-12 (EXP-NBD104; Oxford Nanopore Technologies, ONT) in conjunction with the Ligation Sequencing Kit (SQK-LSK109; ONT).
VPS13C amplicons from 11 individual samples were pooled equimolar and sequenced on a MinION, using a single FLO-MIN106 flow cell (ONT). In total 900Mb of data was generated (~650K reads). Base calling and barcode de-multiplexing of the raw data was performed with Guppy (v.3.2.2). Further analysis was performed with a pipeline integrated in GenomeComb [34]. Alignment to the hg38 reference sequence [48] was performed with minimap2 using the splicedhq preset [49]. Samtools was used for the conversion to BAM and sorting [50], which enabled visualization with IGV [43]. Single nucleotide variant calling and haplotyping was performed using longshot (v0.4.0) [51].
Splice-site variant analysis on cDNA
Lymphoblast cells were treated with 100 μg/ml cycloheximide (Sigma-Aldrich BVBA) or equal amounts of dimethyl sulfoxide (Fisher Scientific) for 4 h. The RNeasy procedure (RNeasy Mini Kit; Qiagen) was used to isolate and purify total RNA from lymphoblast cells as recommended by the manufacturer. Subsequently, total RNA was treated with DNase (Turbo DNase Kit; Ambion). cDNA synthesis of total RNA was performed primed with random hexamer primers using the SuperScript® III First-Strand Synthesis System for RT-PCR (Invitrogen), followed by a RNase H digestion to remove the RNA template from the cDNA:RNA hybrid molecule. To investigate the effect of VPS13C c.4166-8C>A on splicing, primers were designed using the online Primer3 software (Table S8, ) [44]. Exon skipping of exon 38 due to VPS13C c.4166-8C>A was investigated with a F primer located in exon 37 and a R primer located in exon 39. VPS13C cDNA was amplified in a total volume of 15 µl. Fragments were analyzed on a 2 % agarose gel, and sized using TrackIt™ 100 bp DNA Ladder (Invitrogen). Additionally, the VPS13C amplicons were purified and Sanger sequenced (as described above).
Quantitative RT-PCR
Total RNA was isolated from lymphoblast cells, using the Ribopure kit (Ambion) or the Magtration Reagent MagDEA RNA 100 kit (Precision System Science), and was treated with DNase (Turbo DNase Kit; Ambion). First-strand cDNA was synthesized with the SuperScript III First-Strand Synthesis System (Invitrogen) utilizing random hexamer and oligo dT primers. PCR was carried out for 40 cycles with a VPS13C-specific forward and reverse primer (Table S8), using the Power SYBR Green PCR Master Mix (Life Technologies) on an ABI Viia™7 Real-Time PCR System (Applied Biosystems). Relative expression levels were quantified against two housekeeping genes, TBP and hYWAZ, using qbase software (Biogazelle, Ghent, Belgium).
Generation of VPS13C antibody
The cDNA encoding the amino acids 3621-3742 of human VPS13C, isoform 2A (NM_020821), was PCR amplified, using primers carrying the restriction sites for NheI and XhoI (Table S8, ), and subsequently cloned into a pET28a vector (Novagen), in frame with an N-terminal 6xHis tag (pET28-6xHis-VPS13C.2). The construct was transformed into BL21(DE3) competent E. coli (New England Biolabs) and expressed for 3h, at 28°C, to allow proper folding of the protein. Expressed recombinant protein was purified with Ni-NTA resin (Qiagen) followed by gel filtration chromatography using an ÄKTA™ pure system equipped with a HiLoad 26/600 Superdex 75 pg column (GE Healthcare). Purity of recombinant protein was assessed by Coomassie Brilliant blue staining. New Zealand white rabbits were immunized with different amounts of purified recombinant protein, and the serum was subsequently tested by Western blot for VPS13C immunoreactivity, using VPS13C knockout HeLa cell extracts as a negative control (Fig. S12). Polyclonal antibodies from the sera displaying the highest titer were affinity purified against the recombinant protein immobilized on NHS-activated Sepharose® 4 Fast Flow (GE Healthcare) according to the manufacturer’s instructions. Antibodies were eluted from the column with 100mM Glycine pH 2.7 and immediately neutralized with 1M Tris-HCl pH 9.0. The eluted fraction was dialyzed against PBS for 48h, with three daily buffer changes. Finally, purified antibodies were concentrated to 1 mg/mL using a Vivaspin 15R, 30 000 MWCO hydrosart (Sartorius AG), and frozen in aliquots containing 50% glycerol.
Western blotting
Protein analysis performed in lymphoblast cells and brain lysates of patient carriers of >1 VPS13C variants and of family A. Cells and brain tissue were lysed in modified radioimmune precipitation buffer (RIPA: 1% sodium dodecyl sulfate, 150 mM NaCl, 0.5% Na-Doc, 1% NP-40, 50 mM Tris-HCL; pH, 8.0) supplemented with protease and phosphatase inhibitor (2x Complete Protease inhibitor cocktail and 1x Phosphostop phosphatase inhibitor cocktail; both from Roche). Lysates were sonicated on ice, cleared at 20,000 g for 15 min at 4°C, and supernatants were collected for immunoblotting. Protein concentrations were measured using a BCA Protein Assay Kit (Pierce™; Thermo Fisher Scientific). Equal amounts of proteins were separated on a 3-8% Tris-Acetate gel (Life Technologies) and electro-blotted onto a polyvinylidene difluoride membrane (Hybond P; Amersham Biosciences). Membranes were blocked in 5% skimmed milk in PBS and probed with primary antibodies against VPS13C (polyclonal rabbit anti-human VPS13C (1:1,000; Novus Biologicals; NBP1-94043), polyclonal rabbit anti-human VPS13C (1:1,000; Novus Biologicals; NBP1-94044) or polyclonal rabbit anti-human (1:2,000; generated in house as described above)) and GAPDH (monoclonal mouse anti-human GAPDH (1:20,000; Genetex; GTX627408)). Immunodetection was performed with specific secondary antibodies conjugated to horse-radish peroxidase in combination with the ECL prime chemiluminescent detection system (GE Healthcare) or the WesternBright Sirius detection system (Isogen Life Science). The band intensities were determined by quantifying the mean pixel grey values in a rectangular region of interest using Image Quant TL software (GE Healthcare Life Sciences) and subsequently normalized to GAPDH.
Preparation of cDNA constructs
The codon optimized coding sequence of wild type (WT) human VPS13C was purchased in a Gateway® compatible pDONR vector from GeneArtTM Gene synthesis (pDONR-VPS13CWT; Life Technologies). The VPS13C missense variants p.Trp395Cys and p.Ala444Pro were introduced into the pDONR-VPS13CWT vector with QuickChange Site-Directed Mutagenesis. Sequence verified mutant entry clones were subcloned into the Gateway® pcDNA™-DEST40 Vector with a C-terminal V5-6x His tag (Thermo Fisher Scientific) or into an in-house developed Gateway®-compatible pCR3 Vector with a C-terminal EmGFP-tag. Sequences were verified by DNA sequencing and primers used for cloning are listed in Table S8 ().
Transfections
HeLa or SH-SY5Y cells were transfected using either X-tremeGENE™ 9 DNA Transfection Reagents (Sigma Aldrich), Lipofectamine LTX plus (Invitrogen), both according to manufacturer’s protocol, or Polyethylenimine (PEI) according to an in house optimized protocol. Briefly, cells were seeded in a 6-well plate at 2.5x105 cells per well, 24 hours before transfection. On the day of transfection, cell medium was replaced by medium without antibiotics. 1.44µg plasmid DNA was diluted in 115µl Opti-MEM (Life-Technologies) and in parallel, 7.21µl PEI was diluted 115µl Opti-MEM. The diluted PEI was added to the DNA and mixed gently by vortexing. After 10 min of incubation at room temperature, the solution was added to the cells. All cells were evaluated 48h post-transfection.
Immunohistochemistry and live cell labeling
HeLa or SH-SY5Y cells were grown on 12 mm glass coverslips (Fisher Scientific), fixed for 20 min with 4% paraformaldehyde (PFA; Laborimpex) in PBS at room temperature and washed three times in PBS. The cells were permeabilized in 0.25% Triton X-100 for 10 min, washed three times in PBS and blocked for 1h in 5% BSA (Merck) with normal donkey serum (1:500; Merck). Cells were then incubated overnight at 4°C with one or a combination of the following antibodies: monoclonal mouse anti-V5 tag (1:400; Life Technologies; R960-25), polyclonal goat anti-V5 tag (1:2,000; Abcam; ab9137), polyclonal rabbit anti-human Giantin (Golgi; 1:10,000; Covance; 924302), polyclonal rabbit anti-human TGN46 (Trans-Golgi Network; 1:800; Abcam; ab50595), monoclonal mouse anti-human PDI (endoplasmic reticulum (ER); 1:100; Abcam; ab2792), monoclonal mouse anti-human LAMP1 (lysosomal-associated membrane protein 1; 1:200; Developmental Studies Hybridoma Bank; H4A3), polyclonal rabbit anti-human Rab7 (late endosomes; 1:200; Sigma Aldrich; R4779) and monoclonal mouse anti-human CD63 (early endosomes; 1:250; Abcam, ab8219). To visualize the primary antibodies, the cells were incubated with secondary antibodies conjugated to Alexa Fluor 488, Alexa Fluor 594 or Alexa Fluor 647 (all 1:500; Thermo Fisher/Life Technologies) for 1h at room temperature.
For live cell imaging, transfected HeLa cells were plated on a standard 35 mm glass bottom dish (MatTek Corporation, P35G-1.5-14-C) and incubated for 30 min at 37°C with growth medium containing 50nM LysoTracker® probe (Thermo Fisher). After incubation, the medium was replaced by FluoroBrite DMEM (Life Technologies).
Image acquisition and analysis
Images were taken with a Zeiss LSM700 confocal microscope using either a 63x/1.40 Plan-Apochromat or a 40x/1.30 Plan-Neofluar objective. Filters, dichroics and scanning modes were set to exclude crosstalk between the different fluorescence channels, pixel sizes were set according to the Nyquist sampling theorem and z-stacks comprising entire cells, acquired at optimal step sizes. To visualize co-localization of VPS13C with different organelle markers, Fiji software generated line-intensity plot profiles [52]. For quantification of VPS13C mislocalization, two researchers who were blind for the genotype visually scored cells.
Neuropathology
Brains of patients P1 and P3, fixed in formalin for 13 weeks and 5 weeks, respectively. Samples, taken from the frontal cortex areas 4, 6, 8, 9, 10, 11, 12, 24 and 46, the superior temporal cortex, hippocampus and amygdala, parietal and occipital cortex, the thalamus, neostriatum, pallidum, mesencephalon, pons, medulla oblongata and cerebellum. Cytological stains included Cresyl-Violet, Hematoxylin-Eosin, and Klüver-Barrera as myelin stain. Immunohistochemistry, performed with antibodies against β-amyloid (4G8), hyperphosphorylated tau (AT8), ubiquitin, TDP-43, FUS, p62 and α-synuclein.
Statistics
Burden and variance-component tests implemented in the optimized sequence kernel association test (SKAT-O) provided in the R package SKAT v2.0.0, used to investigate association of single VPS13C variants with MAF ≤ 1% and PD. First, power calculation was performed within the SKAT framework using a logistic test for dichotomous traits (target sequence: 12,941bp, causal variant percentage = 20%, negative variant percentage = 20%, Maximal OR = 5). Under these conditions, the total sample cohort required to reach 80% power with a 0.05 significance level is at least 1050 individuals. Our patient and control cohort consisted of 1508 individuals and met the requirements. Adjustment to SKAT-O applied taken the small sample size (< 2000 individuals). Gender was included as covariate. We considered a two-sided p value < 0.05 significant. To investigate association between bi-allelic VPS13C mutations and PD, we compared statistically the mutation frequencies between the patient and control group using Fisher’s exact statistics, and we considered a p-value of < 0.05 significant. Data are represented as the average ± standard deviation of a minimum of 3 independent experiments. For the description of the statistical significance of differences, the Multiple Comparisons of a one-way ANOVA using the GraphPad Prism V7.01 software calculated P-values. Values were considered significant if * 0.01< P < 0.05; **0.001 < P < 0.01; *** 0.0001 <P < 0.001; ****P < 0.0001.