Human iPSCs Maintenance and Neuronal Differentiation
The neuronal cells were derived from the WTC-11 human induced pluripotent stem cell (iPSC), some with the addition of an inactive gRNA library for downstream experiments. The iPSCs were obtained from Genome Engineering and Stem Cell Center at Washington University in St. Louis, propagated in mTeSR plus media (100–0276, StemCell Technologies, Vancouver, Canada), and seeded onto 6-well plates coated with Matrigel (354277, Corning, Corning, USA), diluted 1:6 in DMEM/F12 media (11320033, Thermo Fisher, Waltham, USA). The iPSCs were maintained, expanded, and incubated at 37°C and 5% CO2. The media was replaced every day and cells were passaged once they reached 80% confluency. Cells were rinsed with Dulbecco’s Phosphate-Buffered Saline (DPBS, 14190144, Thermo Fisher) and treated with 1mL ReLeSR (05872, StemCell Technologies) at room temperature for one minute. ReLeSR was then aspirated, and cells were incubated in a 37°C incubator for 5 minutes before being collected and split at a 1:10 dilution in a new plate.
Upon reaching 80% confluency, iPSCs were differentiated into motor neurons as per the protocol outlined by Du et al [26]. In brief, the natural developmental signaling factors were recapitulated with small molecules including Ascorbic acid (72132, StemCell technologies), CHIR99021 (72054, StemCell Technologies), DMH-1 (73634, StemCell Technologies), SB-431542 (72234, StemCell Technologies), Retinoic acid (R2625, Sigma, St. Louis, USA), Purmorphamine (72204, StemCell Technologies), ROCK inhibitor (Y-27632, 72304, StemCell technologies), and Compound E (73952, StemCell Technologies) using the protocol26. Additionally, the differentiation media also contained N-2 (17502001, Thermo Fisher) and B-27 (A3582801, Thermo Fisher) supplements. iPSCs underwent differentiation into motor neurons across four distinct stages, incorporating varying time and concentrations of the various small molecules (Fig. S1).
The induction of neuroepithelial cells was initiated by introducing stage 1 differentiation media composed of 50% neural basal (21103049, Thermo Fisher), 50% DMEM/F12, 0.5X N2, 0.5XxB2, 0.1mM Ascorbic acid, 1X Glutamax (35050061, Thermo Fisher), 1X 0.1% Penicillin/Streptomycin (15140122, Thermo Fisher), 2 µM DMH-1, 2 µM SB-431542, and 3 µM CHIR99021. On day 7, the cells were then transitioned to stage 2 media, introducing two additional small molecules, namely, 0.5 µM Purmorphamine and 0.1 µM Retinoic acid, while reducing CHIR99021’s concentration from 3 µM to 1 µM. This stage induces OLIG2 + expression, initiating the cells' motor neuron progenitor (MNP) identity. On day 4 of being on stage 2 media, about 1 million cells were frozen down per aliquot for later experiments (see below, raft plating and imaging).
Between each stage, cells were dissociated with TrypLE (354277, Gibco, Gaithersburg, MD, USA) and re-plated. Each time the cells underwent the differentiation process and required dissociation; they were treated with 10mM ROCK inhibitor (72304, StemCell Technologies) for 24 hours following the passage. For dissociation, cells were washed with DPBS, 1X TrypLE was introduced, and the vessel was incubated at 37°C for 3–5 minutes. The cells were then centrifuged at 200g for 5 minutes before resuspending in appropriate media. It is worth mentioning that while resuspending the cell pellet, several triturations were performed to break the cell clusters into single cells.
Raft Plating and Imaging Motor Neurons
MNPs were seeded on two different kinds of raft plates, a single reservoir and a quad reservoir, each containing 100x100µm microrafts (Cell MicroSystems, Inc., NC, USA). To prepare the plates for cell seeding, the plates were washed with 1 mL/well and 5 mL/well (for quad and single raft plates, respectively) of 1xPBS (10010023, Corning) and incubated at 37°C for 5 minutes. These washes were done three times to remove the glucose layer from the raft plates. The raft plates were then coated with 4µg ⁄mL Poly-d-lysine (PDL, P0899, Sigma) and incubated overnight at 37°C. The next day, the raft plates were rinsed three times with molecular grade water (46-000-CV, Corning) and coated with 5µg/mL mouse laminin (23017015, Fisher Scientific) for similar overnight incubation. Both PDL and Laminin solutions were made with molecular grade water.
MNPs frozen on stage 2 day 4 were thawed, centrifuged at 200g for five minutes, resuspended in 1mL stage 2 media, and subsequently plated on a Matrigel-coated 6-well plate. The cells were then grown for an additional 3 days on stage 2 media. On the 14th day since the start of the differentiation process, the cells were nucleofected with Cas9 plasmid and plated on stage 4 media consisting of 50% neural basal media, 50% DMEM/F12, 0.5X N2, 0.5X B27, 0.1mM Ascorbic acid, 1X Glutamax, 1X 0.1% Penicillin/Streptomycin, 0.1 µM Purmorphamine, and 0.5 µM Retinoic acid. Approximately 2 million cells were nucleofected in a large cuvette per reaction using Lonza’s P3 primary cell 4D-nucleofector kit (V4XP-3024, Lonza, Basel, Switzerland). The cells were allowed to recover from the transfection for 3 days while inducing MNX + motor neuronal cell fate. In this manuscript, Cas9 plasmid was not added, but this detail is included for more clarity on the overall timing.
On day 18 of differentiation, the cells were seeded on raft plates. For quad and single raft plates, 80,000 cells/well and 320,000 cells/well were seeded respectively. Cell counts for raft plating was performed using a hemocytometer (with higher accuracy than automated cell counters). The next day, media was changed to stage 5 media, which consisted of same small molecules as of stage 4 media but with the addition of 0.1 µM Compound E. Notably, the protocol mentions stage 3, an optional MNP expansion step, which we skip. Full media changes are conducted every day during stages 1, 2, and 4, and a half media change every other day when on stage 5.
Motor neurons were stained and imaged 48-hours after the introduction of stage 5 media. A master mix of the staining media was prepared, consisting of 4.05 µM Hoechst 33342 (H3570, Thermo Fisher), and 1 µM Tubulin Tracker Deep Red (T34076, Thermo Fisher). For staining, the media was changed by gently removing from the plates and adding 500µL/well and 2mL/well of stage 5 media to the quad and single plates, respectively. Following the application of the stain, the plates were then incubated at 37°C for 30 minutes, after which the stain media was removed and three washes with stage 5 media in the same volume as the stain media were performed. For imaging, the plates were positioned on a Cell Microsystems plate adapter and imaged on Molecular Device’s INCell 6500HS Confocal microscope at 20x with 0.45 NA. The live cell chamber was used, set to 5% CO2 and 37°C. The imaging protocol included 40,000 rafts / 1,368 fields and 25,600 rafts / 960 fields in the single and quad plates, respectively, with 12% field of view overlap. Exposure times for Hoechst (405 nm), Tubulin Tracker Deep Red (642 nm), and brightfield all averaged less than 100ms.
Immunofluorescence to Confirm Motor Neuron Identity
To determine the iPSC's differentiation proficiency, cells were analyzed with immunofluorescent staining at different stages during the differentiation process. First, the cells were fixed with 4% paraformaldehyde (PFA, 158127, Sigma Aldrich) for 15 minutes and washed three times with 1X PBS. Next, cells were permeabilized using 0.5% Triton-X 100 (93443, Sigma Aldrich) for 15 minutes and blocked with 3% Bovine Serum Albumin (BSA, A9418-10, Sigma- Aldrich,) for 1 hour. Both primary and secondary antibodies were diluted with BSA, placed on the cells and incubated for 1 hour. The primary antibodies that were tested included 1:75 SOX1 (AF3369, R&D Systems), 1:500 Olig2 rIgG (AB9610, Sigma), 1:100 NKX2.2 mIgG (74.5A5, DSHB), 1:75 MNX1 mIgG (81.5C10, DSHB), 1:70 ISL1 mIgG (40.2D6, DSHB), 1:300 CHAT gIgG (AB144P, Sigma-Aldrich), and 1:100 MAP2 rIgG (MAB3418-25UG, Thermo Scientific). The secondary antibodies that were used included 1:1000 donkey anti-mouse (A-31571, Thermo Fisher), 1:1000 donkey anti-rabbit (ab150075, Thermo Fisher), and 1:200 donkey anti-goat (A-21432, Thermo Fisher). Through confocal microscopy (INCell Analyzer 6500), the cells were then imaged, and cell tracing and fluorescence intensity extraction was done using IN Carta Image Analysis Software (Molecular Devices).
Passaging and Maintenance of U20S Cells
In some experiments we utilized a human osteosarcoma cancer cell line, U2OS cells (HTB-96, ATCC). The U2OS cell line engineered to express doxycycline inducible Cas9 was acquired from GESC@MGI WashU. The U2OS cells were cultured in McCoy's 5A Modified Medium (16600082, Gibco) supplemented with 10% tetracycline-free fetal bovine serum (FBS, PS-FB3, Peak Bio, Pleasanton, CA, USA). Cells were grown in either T75 or T150 tissue culture flasks and were passaged at 90% confluency. Trypsinization consisted of placing 0.25% Trypsin-EDTA 1x (25200056, Gibco) on the cells for 5 minutes at 37°C, followed by collection of cell suspensions and centrifugation for 3 minutes at 1200 RPM.
U2OS Cell Imaging
The U2OS cells were plated on raft plates, either a single plate containing 40,000 microrafts or a quad plate containing 25,600 microrafts (Cell Microsystems). 48,000 U2OS cells were seeded in a single raft plate, while 12,000 U2OS cells were seeded per well in the quad plates. Cells were plated 48 hours prior to staining with 4.05 µM Hoechst, and 0.5 µM MitoTracker Green (M7514, Thermo Fisher) for 30 minutes at 37oC. Like neuronal imaging, U2OS imaging consisted of placing the plates in a Cell MicroSystems plate adapter and imaging on the Molecular Devices INCell 6500HS Confocal microscope at 20x with 0.45 NA. The live cell chamber was used, set to 5% CO2 and 37°C. In total, 40,000 and 25,600 fields were imaged in the single and quad plates respectively, with 12% field of view overlap. Exposure times for Hoechst (405 nm) and MitoTracker Green (488 nm) averaged < 100ms.
Training Image Annotation and Sorting
Some of the general details pertaining to image annotation via FIVTools are listed in and illustrated in Figure S2. To ensure comprehensive testing, a separate set of 11,308 images including all three wavelengths (brightfield, tubulin, and Hoechst) was selected, distinct from the 12,689 images utilized for training. The 16-bit multichannel images of single 20x fields-of-view were loaded directly into FIVTools. If the cells were plated on microrafts, then raft calibration was performed (which connected the plate coordinates to the raft coordinates). For 96-well plate experiments, nuclei segmentation was performed first, to identify the raft-sized regions around each nucleus. Image-display parameters were then adjusted to optimize the contrast for both nuclear and accessory channels. The neuronal classifiers were trained with nuclei in the blue channel and tubulin in the red channel. U2OS classifiers were trained with nuclei in the blue channel and mitochondria in the green channel. It is worth noting that these channels could be swapped as needed. Initially, half a dozen experienced scientists manually annotated each of the scanned raft images in a well by pressing a number key (which designated the class index, initially arbitrary), then clicking on a raft to assign a class label (see Fig. 2 for the list of labels). These labels could be checked using the “Check Annotation” feature in the program, enabling the user to quickly visualize groups of images from different classes. This functionality provided flexibility, allowing users to modify annotations at any point in time. Next, all the annotated regions were exported with the adjusted parameters, so that the images were placed in a sub-folder with the name of the class. These images were exported as RGB 8-bit per channel bitmaps with a width and height of 128x128 pixels. The number of images per class is listed in Table S1.
Machine Learning Training
We used Keras/Tensorflow 2.10 to train the CNNs. The workflow was executed in a Jupyter notebook inside of Visual Studio Code, using a python kernel in Anaconda. See “TF_NB_MainCells01_1.ipynb”, then navigate to the code section “Raft CNN – Classifiers” > “Directory Defines Classes” > “RGB 8-bit per Channel”. First, images were loaded into a TF dataset, cached, and prefetched (see Code Availability). Each classifier was generated using “neural architecture search”, by randomizing various parameters and then inferring on additional images. The following parameters were randomly assigned: CNN kernel size (2–4), CNN initial number of filters (18–36), CNN depth (max(1, log(image width, kernel size))), and filter increase rate (1.25 to 1.75). The number of dense layers after convolution started with a random size that was no more than half of the flattened size. In addition, the dense layer dropout was randomized, regardless of the presence of batch normalization in the dense layers or residuals in the CNN layers. Additionally, the decision to use single loss or multi loss for each class was also randomly turned on and off.
Images were resized, rescaled between 0 and 1, then fed through the convolutional layers or the transfer learning layers. Early stopping was used if the loss stopped decreasing, usually with a patience set at ten percent of the number of epochs, which was usually set to be around 200. Batch size was 64, and training:test split was 80:20. As soon as each training finished, the script exported several evaluation metrics, including the best loss and accuracy as well as inference on the full dataset. Then the thread was reset, a new random set of parameters was created, and the process was repeated. Over a hundred individual classifiers were automatically and independently trained in each round.
Most of the manuscript did not use transfer learning with pre-trained layers, but this was implemented for one specific experiment (Fig. 6). In that case, a variety of transfer layers used included Keras’s built in convnext.ConvNeXtTiny, efficientnet_v2.EfficientNetV2B0, inception_v3.InceptionV3, MobileNetV3Small, nasnet.NASNetMobile, ResNet50, and xception.Xception instead of the CNNs mentioned above.
Evaluation and Analysis
In addition to assessing the loss and the accuracy of each classifier, we also treated the fully inferred output as two distinct classes to calculate additional metrics. Given that both the classifiers, neuronal and U2OS, were designed to identify one ‘ideal’ class, we minimized the labels to represent the preferred class (1 Neuron or 1 Cell) vs all the other classes. Afterwards, we computed the AUC for sensitivity/specificity and the Max F1 score for precision/recall. Figures and results were visually represented using either the software Spotfire (Tibco, California) or with the Matplotlib library in Python.