Towards Clinical Subtypes in Schizophrenia: Integrating Cognitive, Functional, and Digital Phenotyping Assessments

doi:10.21203/rs.3.rs-4258332/v1

Download PDF

Article

Towards Clinical Subtypes in Schizophrenia: Integrating Cognitive, Functional, and Digital Phenotyping Assessments

https://doi.org/10.21203/rs.3.rs-4258332/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Heterogeneity in the clinical presentation of schizophrenia impairs both proper and preventative care. The digital phenotyping data gathered from an international multi-site cohort study in people with schizophrenia (SZ) offers a novel opportunity to explore clinically meaningful subtypes in the context of clinical, functional, and cognitive data. Using a set of behavioral features derived from smartphone digital phenotyping, clinical assessment of symptoms including PANSS, clinical assessment of cognition with BACS, and clinical assessment of functioning with the social functioning assessments over the target period of twelve months, we found that the international cohort of 74 patients were categorized into three well-defined clusters that suggest clinically actionable targets from differential correlations in each. Namely, the identified clusters seemed to share phenotypic traits with the affective psychosis with more severe symptomatic presentation, a non-affective SZ with functional impairment, and a higher functioning non-affective SZ cluster. Partial correlation analysis further highlighted the emergence of different features per cluster, where anxiety symptoms were most notable for one group, whereas psychotic symptoms were most notable for the other two. Importantly, we showcase an analysis pipeline that transparently addresses challenges of missing data and potential skew so that this research methodology can be applied to future prospective validation studies. This study hopes to build a foundation for future digital phenotyping clustering work by scaling up to new sites, and populations to uncover the nature and extent of heterogeneity in schizophrenia.

Health sciences/Biomarkers/Prognostic markers

Health sciences/Diseases/Psychiatric disorders/Schizophrenia

Health sciences/Biomarkers/Prognostic markers

Health sciences/Diseases/Psychiatric disorders/Schizophrenia

Subtyping mental illnesses is critical to proper diagnosis and treatment [1], yet the ideal subtyping of psychotic spectrum illness remains unrealized. Advances in subtyping have been concomitant with new methods to better assess clinical phenotypes. From Kraeplin’s use of longitudinal note cards, over a century ago, that created the first modern subtyping for schizophrenia [2] to modern machine learning driven classifications [3], new tools can help discern novel subtypes. This paper explores the potential of one such new approach, digital phenotyping, to identify novel subtypes through the use of longitudinal smartphone surveys and sensors.

Digital phenotyping, defined as the moment-by-moment quantification of the individual-level human phenotype in-situ using data from smartphones and other personal digital devices. As one example, prior studies have longitudinally monitored and utilized personal smartphone metrics related to changes in sleep duration, home time, social interactions, attention, and symptom surveys to predict when a person who has experienced first-episode psychosis may be at elevated risk of relapse [4]. Digital phenotyping thus enables greater precision to measure target behavioral phenotypes, making it a promising approach for subtyping schizophrenia.

Digital phenotyping methods also allow for assessments via ecological momentary assessments and surveying participants about symptoms. A 2024 systematic review and meta-analysis of ecological momentary assessments studies in psychosis reported [5] that while there is still no consensus on the validity of EMA prompts or their frequency of use in research studies, this approach can be successfully employed across a wide range of groups of people with schizophrenia. There is already impressive literature using ecological momentary assessments to assess social functioning and social cognition in schizophrenia [6], which underscores the potential of this approach to minimize retrospective bias. Given the many use cases, there have already been efforts to use ecological momentary assessments to derive novel clusters for people with schizophrenia and related illnesses [7–8].

Combining both the ecological momentary assessments as well as sensor data from digital phenotyping presents a novel approach for better assessing relevant self reported states and related behaviors. Prior efforts at subtyping schizophrenia can inform efforts around using digital phenotyping for this task in two ways. First they suggest the importance of multimodal data streams. No single sensor or related feature (e.g. hometime derived from GPS) is likely to be predictive or informative in isolation [9]. Numerous reviews and research on clustering have demonstrated that successful approaches often combine data across different domains such as those represented in the National Institute of Mental Health's Research Domain Criteria (NIMH's RDoC). Research by Bell et al. in 2013 offers a compelling case for the value of social cognition as yet another factor to consider in subtyping of schizophrenia. Second, past research highlights the necessity for transparent and reproducible data processing pipelines for digital phenotyping data [10–11]. Despite a plethora of proposed subtypes across all of mental health, one common challenge is the difficulty to reliably replicate subtyping results, due to scalability issue and representativeness of the clinical population, in a clinically actionable manner able to guide clinical care.

Clustering of any mental illness is only as productive as the data used for that clustering, and the validity and reliability of digital phenotyping signals remains unclear. While many studies have shown this method is acceptable and feasible to use with people with schizophrenia [12–13], there have been few efforts at replication. Numerous recent review papers have highlighted that challenges to methodological rigor are the primary barrier in advancing digital phenotyping research and its clinical potential [14–15]. But others suggest the potential of digital phenotyping derived metrics of sleep, mobility, and phone use (social) to inform transdiagnostic processes underlying many mental health conditions [16].

Digital phenotyping clustering results thus first need to be shared in a manner that can enable replicable science and offer a pathway towards clinical translation. Thus this paper focuses on those methods to ensure a transparent validation approach. and to create a rigorous analysis pipeline. This is especially critical in digital phenotyping where despite an increasing amount of research, few studies or resulting digital biomarkers have ever been replicated let alone validated. Approaching digital phenotyping subtyping through this more rigorous approach, we also opt to explore a dataset that reflects the full potential and limitations of the method. Namely, we examine a multisite international sample of people with schizophrenia and the United States and India to highlight the scalability of digital phenotyping, but also address the challenges of missing data inherent to this approach. This paper thus presents pilot results in the context of a novel pipeline for expanding digital phenotyping clustering work to new sites, populations, and even illnesses.

Subjects

Statistical analysis was performed on data from the Smartphone Health Assessment for Relapse Prevention (SHARP) study, for which seventy-six participants with schizophrenia were recruited. The SHARP study was conducted in three sites, Beth Israel Deaconess Medical Center at Boston, MA, USA; The Sangath Bhopal Hub with the All India Institute for Medical Sciences (AIIMS) in Bhopal, India; and the National Institute of Mental Health and Neuroscience (NIMHANS) in Bangalore, India. Participants in Bangalore and Bhopal were recruited from outpatient psychiatric and psychological services at their respective institutions. All participants were required to be in active treatment, be diagnosed with a psychotic spectrum disorder (confirmed by a clinician using DSM-5 criteria), and have access to a smartphone with access to cellular service or wifi. To ensure technology compatibility, participants went through an initial one-week trial period of passive data collection, with participants offered an alternative device (Samsung Galaxy M31) if they experienced multiple days of no passive data during the trial period. At the BIDMC site participants were deemed ineligible if their device did not pass the trial period and participants had no alternative.

Protocol

Each participant was enrolled in the study for a target period of twelve months, for 13 visits. Each month participants engaged in an hour long visit (in person at Bhopal and Bangalore, virtual at BIDMC) with a clinical research assistant. During each visit participants completed the Positive and Negative Syndrome Scale for Schizophrenia (PANSS) [17]. Participants were then asked to completed the following surveys in the next 24 hours via redcap: The Patient Health Questionnaire-9 (PHQ-9) [18], Generalized Anxiety Disorder-7 (GAD-7)[19], Social Functioning Scale (SFS)[20], Pittsburgh Sleep Quality Index (PSQI)[21], and other scales which are not analyzed in this study. At the intake visit, 6 month, and 12 month visit participants were also asked to complete the Brief Assessment of Cognition in Schizophrenia (BACS) [22].

Passive and active data were collected using mindLAMP, an open-source smartphone application developed by the Division of Digital Psychiatry at Beth Israel Deaconess Medical Center [23]. For the active data, participants were prompted twice a day to complete two of the following six surveys, which include PHQ-9, GAD-7, ‘Sleep’, ‘Sociability’, ‘Psychosis’, and a medication adherence assessment.

The passive data analyzed in this study were collected on a daily basis. While numerous passive data features were obtained, this analysis limits the number of passive data features to avoid multiple comparisons of derived features and focuses on interpretable digital data streams. This includes the amount of time participants spent at their home (home time), the amount of time they spent using their phone screens (screen duration), and how much they moved around to different locations throughout their day (entropy). Home time and screen duration were analyzed in hours, while entropy was analyzed on a scale of 0 to 1. Hometime and entropy were both derived from raw GPS data, and screen duration was derived from the raw device state sensor data.

At BIDMC participants were compensated $50 for visits 1, 7, and 13, and $20 for the remaining visits, for a potential maximum of $350. At Bhopal and Bangalore participants were paid between 500 to 2000 rupees for each visit, with the compensation depending on reimbursement for travel expenses. Across all sites, no compensation was provided for app engagement, or for the volume of passive data.

Brief Assessment of Cognition in Schizophrenia (BACS)

The Brief Assessment of Cognition in Schizophrenia (BACS) is a set of tests that evaluate cognitive areas often affected in schizophrenia; verbal memory, working memory, motor speed, attention, executive functions, and verbal fluency. These areas are key because they tend to be significantly challenged in schizophrenia and are linked to the disease's prognosis. The BACS is designed to be mobile and user-friendly for a wide range of health professionals, including nurses, psychiatrists, neurologists, social workers, and other mental health providers. It can be completed in approximately 30 minutes, requires minimal time for scoring, and doesn't demand extensive training to administer. Further details about each cognitive area and its method of assessment can be found below.

Verbal Memory

Verbal memory is assessed through a list learning task, in which participants are asked to remember as many of a list of 15 words as possible, assessed five times for a potential total score of 75 (score being the number of words recalled per trial).

Working Memory

Working memory is assessed through the digit sequencing task, in which participants are given increasingly large clusters of numbers, and asked to tell the assessment administrator the numbers in order from lowest to highest. The assessment is scored based on the number of correct responses(0–28), and the longest sequence recalled without errors (0–8).

Motor Speed

Motor speed is assessed through the token motor task. Participants are asked to place as many of 100 plastic tokens into a container as possible, with a time limit of sixty seconds, and a restriction of only being able to place two tokens at a time. Participants are scored on the number of tokens correctly placed in the container (100). Due to limitations on in-person visits during the study administration (which occurred in 2021), participants at the BIDMC site were not assessed for motor functioning.

Verbal Fluency: Verbal fluency was measured by two assessments: category instances and controlled oral word association test. In the former, participants are asked to name as many words in a single category (such as ‘tools’) in sixty seconds as possible. In the latter, participants are given two rounds of sixty seconds to name as many words beginning with a letter (different letter per round), such as ‘F’, as possible. For both assessments, the final score is the number of words generated.

Attention and Speed of Information Processing: Attention and speed of information processing was assessed through the symbol coding task, in which participants were shown symbol-number match pairs (ex: 9: ∞), and asked to write the numbers corresponding to a list of symbols. Scores were taken out of potential total numerals that could be matched (0-110).

Executive Functions

Executive functions were measured with the tower of london task. Over twenty trials, participants were shown two mismatching pictures, each with a configuration of the three balls of different colors arranged on three pegs, and asked to state how many times the balls must be moved in one image for the color arrangement to match. Scoring was based on the number of correct responses (0–22). The assessment ceased if participants responded incorrectly five consecutive times, and if participants responded correctly to all twenty assessments, were given two more trials.

Social Functioning Scale (SFS)

The SFS is among the most widely utilized methods for measuring social functioning in people with schizophrenia. It has been shown to demonstrate strong reliability and validity. The SFS is made up of seven subscales, the titles, and descriptions of which are included below. The SFS Composite score is the mean of the subscale scores.

The independence - competence subscale measures how much assistance people need to perform day-to-day tasks and responsibilities. Tasks include using public transport, looking after personal hygiene, and doing laundry.

The employment sub-scale measures factors about a person’s employment and daily routines.

Theinterpersonal functioning subscale measures the size of a person’s social circle and asks about how comfortable people are in their interactions with friends and relatives, as well as in groups of people.

The independence - performance subscale measures how often people complete day-to-day tasks, such as washing dishes, budgeting, and preparing meals.

The prosocial activities subscale measures how often individuals take part in social activities. Examples of activities include playing sports, visiting relatives, or attending parties.

The recreation sub-scale measures how often people take part in recreational activities that can be performed either alone or with other people. Activities include sewing, cooking, or watching television.

The social engagement (also known as withdrawal) sub-scale measures the person’s social tendencies, including how much time they spend alone or outside of the home, and their likelihood of engaging in conversation.

Positive and Negative Syndrome Scale in Schizophrenia (PANSS)

The PANSS is a well established, widely used, and well validated assessment of symptom severity in schizophrenia, designed to account for the heterogeneity of symptom presentation for schizophrenia spectrum disorders. It is composed of 30 items, broken into 3 domains: positive symptoms, negative symptoms, and general psychopathology. The PANSS is scored through the summation of different items across the different scales, such that the positive and negative scales have a range of 7–49, while the general psychopathology scale has a range of 16–112. The positive symptoms subscale measures the presence of symptoms superimposed on one’s mental status, such as hallucinations, disordered thinking, and paranoia. The negative symptoms sub-scale measures deficits in existing psychological processes, rather than novel ones such as in positive symptoms. As such, the sub-scale assesses anhedonia, and social reclusion.

Data Processing

Each of the 76 participants had available a different number of passive and active data (ranging from 2–13 samples per participant, with mean of 6.46 samples per participant) samples across the study period. Across all 76 participants for 27 features, SFS, PANSS, and clinical assessments (PSQI, GAD, PHQ9) were fully available, whereas EMA Assessments (Social, Psychosis, Mood, and Anxiety), passive sensor features (Hometime, Entropy, and Screen Duration), and the BACS cognitive assessments had varying levels of missingness. Furthermore, BACS scores were only available at most three times (intake visit, 6 month, and 12 month) per participant which meant that the data sampling frequency was also different across features. Hence, in order to observe as many features while retaining the original sample size, we decided to probe the central tendency of individuals by taking the mean across the available data per individual for each variable. After averaging, each of the EMA assessments had 21 participants missing for Mood, 25 for Psychosis, 22 for Social, and 20 for Anxiety. Passive sensor-derived features of Hometime, Entropy, and Screen Duration had 14, 17, and 17 participants missing respectively. BACS sub-scores for Tower, VM, Digit, and Fluency each had one participant missing, while 8 participants were missing BACS: Symbols, and 27 participants were missing BACS: Motor. While there were no site-specific pattern of missingness for all other features, due to COVID regulations at the time of data collection, in-person motor assessments were unavailable for the 25 participants from Boston, accounting for the relatively high level of missingness specifically for BACS: Motor. In terms of percentage of data availability per feature, 15 of the 28 features had complete data, 4 features had 98.7%, and the rest of the 9 features had varying amounts of availability between 64.5–89.5% (Fig. 1A).

Multiple Imputation by Chained Equation (MICE) Imputation

Prerequisite to performing dimension reduction methods, such as PCA, is to have a complete dataset. The nature of the missingness was assumed to be random (MAR) and hence we decided to impute the dataset by applying the Multiple Imputation by Chained Equation (MICE). MICE has been noted to be a useful tool in psychiatric research to garner insights from datasets that inevitably contain missingness. Mice package from statsmodels.imputation was used to perform MICE imputation on the dataset for each of the 27 variables. After performing imputation, in order to check whether the imputed dataset was appropriately resembling the original dataset, we performed the Solmogorov-Smirnov (KS) test to compare the similarity of the original and imputed variable’s distribution (Fig. 1B). Across all 27 variables, the imputed distributions exhibited no significant difference compared to the original variable’s distribution. Based on this validation, all subsequent analyses were performed using the imputed dataset.

Accounting for Skewness

To ensure validity of subsequent analyses, we checked whether each of the feature variables exhibited adequate level of normality by quantifying the skewness. We found that 16 of the 27 features had skewness beyond the range of (-0.5,0.5), which is typically the range of minimal skew. For right-tailed (positive) skew, we performed log-transformation, whereas those within the range of (-0.5,0.5) were kept untransformed. Notably, while two features (BACS: Tower, and SFS: Competence) had notably high left-tailed (negative) skew, we kept these variables untransformed as incorporating different types of transformation can obscure the original dataset’s patterns of behavior. The post-transform skewness was much more contained within the (-0.5,0.5) range with only 3 out of 27 features with skewness beyond the (-1.0,1.0) range which are considered high level of skewness (Fig. 2A). We further visualize and test for normality via the QQ plot in Fig. 2B. We also note that while log-transformed features are now interpreted based on percentage change rather than an absolute change, we can still compare the relative strength of features and their associations without compromising the underlying structure of the data.

Max-Min Normalization and Outlier Rejection

After accounting for skewness via feature transformation, we performed a max-min normalization so that all features were scaled to be within the range of (0,1). Finally, applying Tukey’s method, we identified data points outside 3*IQR rejected the outliers. This resulted in rejecting 2 participants with the final set of 74 participants for subsequent analyses. Finally, using this filtered/normalized dataset, an average BACS and SFS scores (‘BACS Composite’, ‘SFS Composite’) were calculated per participant to yield a total of 29 features for further analyses.

Principal Component Analysis and K-means Clustering

We investigated whether the 74 participants could be classified naturally into different subtypes based on their features. We performed PCA analysis using the sklearn.decomposition PCA package and plotted the contribution of explained variance from each principal component (Fig. 3A). We found that the Elbow method found an inflection point for the cumulative explained variance after the first nine principal components, which accounted for 80.4% of the overall variance from the dataset. Hence, we decided to use the first 9 principal components to perform dimensional reduction and proceeded to investigate naturally occurring clusters using the k-means clustering algorithm. We used three metrics to identify the best-fit number of clusters to classify our dataset - Silhouette score, Elbow Method, and the Davies-Bouldin Index. Elbow method identified the biggest slope change around k = 3, which indicated that k = 3 was optimal. Davies-Bouldin Index found the smallest index for k = 3, further substantiating the validity of using k = 3. Finally, the Silhouette Score was also the highest at k = 3, which guided our decision to model our clusters with k = 3 (Fig. 3B-C).

Cluster-wise Mean and Correlation across Features

For k = 3, cluster-wise mean and SEM were quantified for each of the 29 variables. 25 participants were classified under cluster A, 17 participants were classified under cluster B, and 32 participants were classified under cluster C. We ran one-way ANOVA to compare significant differences across the three clusters (Fig. 4A). The features can also be categorized into the macro-dimensions of EMA, Clinical Symptom Assessments, Passive Sensor Data, BACS Cognitive Assessment, and SFS Social Functioning Assessments. In addition to the mean and SEM, we computed probability density function via the kernel density estimation (kde) for each of the clusterwise features (Fig. 4B). This figure further showcases the distinct distributional patterns of features across the three clusters.

Partial Correlation Calculation

Partial correlation measures the degree of association between two variables, with the effect of a set of controlling variables removed. When determining the numerical relationship between two variables of interest, using their correlation coefficient will give misleading results if there are other confounding variables that are numerically related to both variables of interest. This misleading information can be avoided by controlling for the confounding variable, which is done by computing the partial correlation coefficient. In order to focus on the features that were least redundant, for partial correlational analyses, we decided to omit all PANSS-related features other than PANSS Total, all BACS subdomain scores other than the BACS Composite, and all SFS subdomain scores other than the SFS Composite scores. This yielded a total of 13 features instead of 29, which we use to focus on independent sources of input for the correlational explorations. Based on the partial correlation at the significance level of p = 0.05, we found 5 significant partial correlations for cluster A, 6 for cluster B, and 5 for cluster C (Fig. 5A-C).

Difference of Partial Correlations via the Fisher r-to-Z transformation

Using the Fisher r-to-z transformation, we calculate a value of z that can be applied to assess the significance of the difference between two partial correlation coefficients (ra and rb) found in two clusters of comparison. If ra is greater than rb, the resulting value of z will have a positive sign; if ra is smaller than rb, the sign of z will be negative. Based on the two-tailed Z-test, we noted the significant Z-score difference (at p = 0.05) on each of the cluster-to-cluster difference heatmap (Fig. 6).

Cluster-wise Mean: EMA (Psychosis, Mood, Anxiety, Social) - Fig. 4A

The three clusters exhibited similar levels of mean for EMA: Psychosis, Mood, and Anxiety (NS). However, for the EMA: Social, red group exhibited statistically greater mean than the rest of the two clusters who had similar outcomes.

Cluster-wise Mean: Clinical Symptom Assessments (PHQ9, GAD, PSQI) - Fig. 4A

The three clusters presented graded levels of mean for PHQ9, GAD and PSQI. Specifically, the blue group exhibited the lowest clinical symptoms for all three surveys, and the yellow group exhibited moderate clinical symptoms, and finally, the red group exhibited the highest amount of clinical symptoms (all statistically significant at p < 0.0001).

Cluster-wise Mean: Passive Sensor Data - Fig. 4A

Red group exhibited the highest Screen Duration, Entropy, and Hometime. Meanwhile, the yellow and the blue groups showcased similar levels of Screen Duration and Hometime, while yellow had greater Entropy than blue. In all of these three passive features, we find significant differences across clusters (Screen Duration p < 0.05; Entropy p < 0.01; Hometime; p < 0.001).

Cluster-wise Mean: PANSS Score - Fig. 4A

Red group showed a two-fold or greater score in PANSS Total, PANSS General, and PANSS Negative compared to the yellow or blue group, but a marginally greater score in PANSS Positive. Albeit a much smaller effect, the yellow group had greater PANSS scores than the blue group in all of these features. There was no significant difference across clusters in PANSS Positive, but significant differences across clusters in all other PANSS Scores (p < 0.001).

Cluster-wise Mean: BACS Cognitive Assessment - Fig. 4A

Three groups varied in different subdomains of BACS Cognitive assessments. The red group had the highest BACS: Digit, Fluency, VM, and Tower score, whereas the blue had the highest Motor scores. For the Symbols subdomain, the red and blue groups were similarly higher than the yellow. The BACS Composite score is the average of the subdomain scores and showed that the red had the highest composite score, followed by the blue and the yellow. All subdomains and composite scores had significant difference across clusters (Motor and VM p < 0.01; all else p < 0.0001).

Cluster-wise Mean: SFS Social Functioning Assessment - Fig. 4A

SFS Composite score is the average of the different subscales of SFS score. We find that the blue had the highest composite SFS score, and red a close second, followed by the yellow group which had lower general SFS score (p < 0.0001). Blue also had the highest Employment, Performance, Competence, Engagement, Prosocial activities, showcasing robust social functioning in multiple domains. Red had the highest Recreation, and Interpersonal Behavior scores. SFS Prosocial activities found no significant difference across clusters but all other subdomains and composite score yielded a significant difference across clusters (p < 0.0001).

Cluster-wise distribution of features - Fig. 4B

In addition to quantifying the mean and SEM, we observed the distributional patterns of each of the features for the three clusters using the probability density function (pdf) via the kernel density estimation (kde). We found that while mean and SEM can well summarize the general central tendency as well as the variability around the mean, pdf can showcase potential differences in modality and range of values as well. While EMA features exhibited generally similar distributional patterns for all three clusters, SFS subdomain scores were particularly distributed distinctively across the three clusters.

Significant Partial Correlations: Yellow Group (Cluster A) - Fig. 5

There were 5 significant partial correlations found for the Yellow group. We found a positive partial correlation between EMA Mood vs EMA Anxiety (pcorr = 0.57, p = 0.03227), and between PHQ9 vs GAD (pcorr = 0.79, p = 0.0008). Meanwhile, we found negative partial correlations for EMA Psychosis vs Entropy (pcorr = -0.61, p = 0.021), EMA Psychosis vs Hometime (pcorr = -0.64, p = 0.014), and Hometime vs Entropy (pcorr = -0.74, p = 0.00229).

Significant Partial Correlations: Red Group (Cluster B) - Fig. 5

There were 6 significant partial correlations found for the Red group. We found a positive partial correlation between EMA Social vs PANSS Total (pcorr = 0.87, p = 0.0235), and PHQ9 vs BACS Composite (pcorr = 0.82, p = 0.0466). Meanwhile, we found negative partial correlations for EMA Anxiety vs Entropy (pcorr = -0.90, p = 0.0142), EMA Anxiety vs PANSS Total (pcorr = -0.84, p = 0.03770), EMA Anxiety vs BACS Composite (pcorr = -0.82, p = 0.04575), and Entropy vs BACS Composite (pcorr = -0.82, p = 0.04735).

Significant Partial Correlations: Blue Group (Cluster C) - Fig. 5

There were 5 significant partial correlations found for the Blue group. We found a positive partial correlation between EMA Psychosis vs EMA Social (pcorr = 0.66, p = 0.00108), EMA Psychosis vs PANSS Total (pcorr = 0.48, p = 0.028), and EMA Mood vs SFS Composite (pcorr = 0.45, p = 0.03981). Meanwhile, we found negative partial correlations for EMA Mood vs Screen Duration (pcorr = -0.46, p = 0.0341), and PANSS Total vs BACS Composite (pcorr = -0.47, p = 0.03257).

Significant Difference in Partial Correlations between Groups - Fig. 6

Significant differences in partial correlations between pairwise comparison of clusters were illustrated as a heatmap. Cluster A vs B found 20 significant differences, 9 showcasing more positive partial correlations for A over B, and 11 showcasing more positive partial correlations for B over A. Cluster B vs C also found 20 significant differences, 8 showcasing more positive partial correlations for B over C, and 12 showcasing more positive partial correlations for C over B. Meanwhile, Cluster A vs C only found 9 significant differences, 6 showcasing more positive partial correlations for A over C, and 3 showcasing more positive partial correlations for C over A.

In this study, we explore the application of scalable smartphone-based digital phenotyping towards improving the reliability and replicability of schizophrenia subtyping. To address heterogeneity in the phenotypes of schizophrenia, we collect a precise but diverse set of data, including clinical symptoms, smartphone-based behavioral metrics, and functional outcomes. This data provides a window into the differential associations of behaviors, symptoms, and functioning in schizophrenia that are not captured by more traditional methods. In this dataset, we identify three distinct clusters of patients, possibly reflecting clinical subtypes. We demonstrate clear differences in mean feature values and partial correlations between these subgroups. Each identified cluster displayed unique properties across multiple dimensions of measurement, though identifying them as true clinical subtypes requires more investigation and subsequent validation.

The clusters identified in this study demonstrate clear differences across behavioral, functional, and symptoms measures, demonstrating the potential of this approach to capture the variable expressivity of schizophrenia phenotypes. This method improves upon previous methods of digitally phenotyping schizophrenia. Generally, previous digital phenotyping studies have reported small pairwise correlations between both self-reported and clinically assessed symptoms [24–26]. Our clustering approach mitigates this problem. Instead of comparing static variables, we target dynamic markers linking symptoms, behaviors, and functioning. This method improves the validity and strength of identified correlations, as indicated by the observed differential reactivity between clusters (Table 2). Digital phenotyping patterns may thus vary based on subtypes, and subtype-specific labeling may better reflect more individual experiences of illness. Clustering results offer a productive example of the utility of the smartphone-based digital phenotyping method and how new hypotheses can be tested surrounding the intersection between clinical, cognitive, EMA, and behavioral data.

Differences in cluster phenotypes provide preliminary insight into potential moderators, mediators, and covariates of clinical presentation. Unsurprisingly, the clustering algorithm primarily categorizes participants according to the severity of clinical symptoms reflected in the PANSS, PSQI, PHQ-9, and GAD-7, as shown in Figure 4A. The red group presented with severe anxiety, depression, sleep dysfunction, and psychosis (PANSS), along with more screen time and greater mobility entropy and hometime. The red group also reported greater degrees of cognition and social functioning. This is unexpected, given that classical symptomatology indicates impairment in social and cognitive functioning is carried by severe clinical symptoms of schizophrenia. The yellow group was characterized by moderate affective/psychosis symptoms but had the lowest cognition and social functioning, implying functional impairment and a poor prognosis. The blue group was the highest-functioning group across all clusters, with high social functioning and comparable cognition capacity with the red group, but displayed fewer affective and psychotic symptoms. While further validation is needed, these clusters broadly seem to represent subtypes of affective SZ (red), non-affective SZ with poor prognosis (yellow), and non-affective SZ with good prognosis (blue).

While correlational analysis does not permit discussion of causation, our approach of using partial correlations to isolate independent feature-to-feature associations may provide insight for researchers and clinicians to provide personalized interventions revolving around a multitude of behavioral features. Our approach also explains why generalizations of schizophrenia applied across entire populations may fail or even cause harm. Results from the partial correlation and difference map across clusters consistently suggested that the red group was best characterized by both affective and psychosis symptoms (Figure 5), where EMA Anxiety, EMA Psychosis, PHQ9, and PANSS Total particularly differentiated the red group from the yellow and blue groups (Figure 6). Meanwhile, the yellow and blue groups were both primarily characterized by psychosis (Figure 5), but surprisingly differed most based on the patterns of entropy (Figure 6). Interestingly, the yellow group presented a negative association between EMA psychosis and hometime, which seems in disagreement with a 2022 digital phenotyping study that found “mood and psychosis were both significantly more elevated when at home” in certain subgroups of the SZ population. However, we also find that EMA psychosis was also negatively associated with mobility entropy, which may suggest psychotic symptoms are relieved by mobility within or near the home environment (Figure 5, Table 2). Hence, the finding from the 2022 study that psychosis was elevated at home may be confounded by the presence of different levels of mobility entropy for the study cohorts [27].

It is also notable that the difference map between the red vs. blue, or the red vs. yellow (20 total significant differences of partial correlations for each respective comparison), yielded two-fold greater pairwise feature associative differences compared to the blue vs. yellow groups (9 total significant differences of partial correlations). This finding is not only consistent with the general phenotypic mean trends of features across clusters but also provides a secondary lens through which clinicians and researchers can make distinct action plans for treatment. Given this result, examining trends in other feature variables, namely behavioral data from digital phenotyping, functional data from the SFS, and cognitive data from the BACS, may provide insight into the relationship between classical symptoms and these other clinically relevant factors. While the potential of such a multivariate analysis to offer a more accurate characterization of schizophrenia is already well known, past methods have often focused on less scalable modalities like neuroimaging, genetics, or EEG [28]. Our results provide different, scalable insights and more novel targets for clinical and research efforts.

Our methods require replicable and generalizable research as next steps. Although this study does not make clinical predictions, schizophrenia research in general involves an oversaturation of prediction models [29, 30] with little replication. On the contrary, the mindLAMP app used to capture this data has already been used by numerous independent teams [31] and is currently used in the AMP-SZ study of clinical high risk for psychosis patients across over 40 global sites with digital phenotyping data already shared in the NIH National Data Archive [32]. Further, all code used to process the digital phenotypes, including imputing missingness and adjusting for skew, is publicly shared in CORTEX, found at docs.lamp.digital/data_science/intro. Thus, it is feasible to conduct a validation study with the same feature variables that can be separately clustered and compared to this study by calculating a measure of similarity, such as a rand index, between clustering results.

Our results are relevant in the context of prior clustering work. The utility of clustering to improve mental health symptom detection from digital phenotyping data [33] has already been shown to improve predictive performance in depression [34]. A 2021 pilot study clustering smartphone digital phenotyping and MRI data across people with schizophrenia or Alzheimer's disease reported three clusters, one of which expressed high degrees of communication and low mobility, and another of which reported reduced social interactions [35]. These pilot results agree with our results suggesting mobility and social aspects as differential markers of clusters. Another 2022 paper applied clustering methods to assess the risk of relapse using data from the 2014 Cross Check digital phenotyping study [36]. This study also finds utility in mobility metrics. However, the differential partial correlations found in our work (Figures 4–6) suggest the need for a complementary approach building off this impressive earlier work.

The limitations of this method include the need for a larger sample size. Clusters of fewer than 30 participants are more prone to overfitting. Validating the derived clusters can involve fitting clusters via another dataset, as previously mentioned. Additionally, many of our features are engineered from raw sensor data. Different teams and software packages may calculate the feature variables differently. For example, our data analysis package determines time spent at home (hometime) by clustering GPS data into distinct, significant locations. The method used to determine a participant’s home can vary across different clustering methods; other methods might not use clustering at all. Further, as with any digital phenotyping analysis, differences in mobile device type across participants result in different raw data, which may influence the engineered features. Lastly, the nature of clustering obscures the association between participant clusters and mean feature variables. It remains unclear whether differences in behavioral features are a product of different subtypes or whether these behaviors themselves influence clinical symptoms.

In conclusion, we demonstrate an ability to identify three distinct clusters in schizophrenia patients and employ a new but rapidly expanding approach to a multisite and international sample population. We created a novel, reusable, and accessible data processing pipeline to address common challenges in digital phenotyping such as high skew and missingness. While our results need to be replicated in larger samples, the preliminary findings suggest some environmental factors (such as hometime and mobility entropy) in addition to social factors (such as screen time) may differentially interact with classically assessed psychosis, mood, and anxiety symptoms. Like any comparable approach, these results require validation. To that end, we keep our data collection software entirely open source and available to other research groups for protocol replication. These results thus underscore the feasibility of capturing multimodal digital phenotyping assessments in schizophrenia that have the potential to suggest clinically relevant clusters and treatment targets.

CONFLICT OF INTEREST:

JT is a scientific advisor of Precision Mental Wellness. JT is PI of an investigator initiated grant from Otsuka Pharmaceuticals. The other authors report any conflicts of interest.

ACKNOWLEDGMENTS:

The original dataset used in this work was collected under the support of the Wellcome Trust UK (grant no. 215843/Z/19/Z).

Gouse BM, Weinberg JM, Brown HE. Risk Stratification to Reduce Excess Mortality in Early Psychosis. JAMA Network Open. 2024;7(3):e240623. 3
Berrios, G. E., & Hauser, R. (1988). The early development of Kraepelin's ideas on classification: a conceptual history. Psychological medicine, 18(4), 813–821.
Chekroud, A. M., Bondar, J., Delgadillo, J., Doherty, G., Wasil, A., Fokkema, M., … Choi, K. (2021). The promise of machine learning in predicting treatment outcomes in psychiatry. World Psychiatry, 20(2), 154–170.
Cohen, A., Naslund, J. A., Chang, S., Nagendra, S., Bhan, A., Rozatkar, A., … Torous, J. (2023). Relapse prediction in schizophrenia with smartphone digital phenotyping during COVID-19: a prospective, three-site, two-country, longitudinal study. Schizophrenia, 9(1), 6.
Bell IH, Eisner E, Allan S, Cartner S, Torous J, Bucci S, Thomas N. Methodological characteristics and feasibility of ecological momentary assessment studies in psychosis: a systematic review and meta-analysis. Schizophrenia Bulletin. 2024;50(2):238–65.
Durand D, Strassnig MT, Moore RC, Depp CA, Ackerman RA, Pinkham AE, Harvey PD. Self-reported social functioning and social cognition in schizophrenia and bipolar disorder: using ecological momentary assessment to identify the origin of bias. Schizophrenia research. 2021;230:17–23.
Wenzel J, Dreschke N, Hanssen E, Rosen M, Ilankovic A, Kambeitz J, Fett AK, Kambeitz-Ilankovic L. Ecological momentary assessment (EMA) combined with unsupervised machine learning shows sensitivity to identify individuals in potential need for psychiatric assessment. European Archives of Psychiatry and Clinical Neuroscience. 2023 Sep 16:1–1.
van Genugten CR, Schuurmans J, Hoogendoorn AW, Araya R, Andersson G, Baños RM, Berger T, Botella C, Cerga Pashoja A, Cieslak R, Ebert DD. A Data-Driven Clustering Method for Discovering Profiles in the Dynamics of Major Depressive Disorder Using a Smartphone-Based Ecological Momentary Assessment of Mood. Frontiers in psychiatry. 2022;13:755809.
Lane E, D’Arcey J, Kidd S, Onyeaka H, Alon N, Joshi D, Torous J. Digital Phenotyping in Adults with Schizophrenia: A Narrative Review. Current Psychiatry Reports. 2023;25(11):699–706.
Currey D, Torous J. Digital phenotyping correlations in larger mental health samples: analysis and replication. BJPsych Open. 2022;8(4):e106.
Cohen AS, Schwartz E, Le TP, Cowan T, Kirkpatrick B, Raugh IM, Strauss GP. Digital phenotyping of negative symptoms: the relationship to clinician ratings. Schizophrenia Bulletin. 2021;47(1):44–53.
Buck B, Munson J, Chander A, Wang W, Brenner CJ, Campbell AT, Ben-Zeev D. The relationship between appraisals of auditory verbal hallucinations and real-time affect and social functioning. Schizophrenia Research. 2022;250:112–9.
Moran EK, Shapiro M, Culbreth AJ, Nepal S, Ben-Zeev D, Campbell A, Barch DM. Loneliness in the Daily Lives of People With Mood and Psychotic Disorders. Schizophrenia Bulletin. 2024 Mar 1:sbae022.
Baryshnikov I, Rosenström T, Isometsä E. Predicting a short-term change of suicidal ideation in inpatients with depression: An ecological momentary assessment. Journal of affective disorders. 2024 Jan 15.
Bufano P, Laurino M, Said S, Tognetti A, Menicucci D. Digital Phenotyping for Monitoring Mental Disorders: Systematic Review. Journal of Medical Internet Research. 2023;25:e46778.
Walsh AE, Naughton G, Sharpe T, Zajkowska Z, Malys M, van Heerden A, Mondelli V. A collaborative realist review of remote measurement technologies for depression in young people. Nature Human Behaviour. 2024 Jan 15:1–3.
Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophrenia Bulletin. 1987;3(2):261–276.
Kroenke K, Spitzer RL, & Williams JB. The PHQ-9: validity of a brief depression severity measure. Journal of general internal medicine. 2001;6(9): 606–613.
Spitzer RL, Kroenke K, Williams JB, Löwe B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Archives of internal medicine. 2006;166(10), 1092–1097.
Bosc M, Dubini A, Polin V. Development and validation of a social functioning scale, the Social Adaptation Self-evaluation Scale. European Neuropsychopharmacology., 1990; (1), S57-S70.
Buysse DJ, Reynolds III CF, Monk TH, Berman S R, Kupfer, DJ. The Pittsburgh Sleep Quality Index: a new instrument for psychiatric practice and research. Psychiatry research. 1989;28(2):193–213.
Keefe, R. et al. The Brief Assessment of Cognition in Schizophrenia: reliability, sensitivity, and comparison with a standard neurocognitive battery. Schizophr. Res. 68, 283–297 (2004).
Bilden R, Currey D, Vaidyam A, Patel S, Meyer A, Scheuer L, et al. BIDMCDigitalPsychiatry/LAMP-platform: release 2023.2.15. 2023, Zenodo. http://dx.doi.org/10.5281/zenodo.7643628
Moura I, Teles A, Viana D, Marques J, Coutinho L, Silva F. Digital phenotyping of mental health using multimodal sensing of multiple situations of interest: A systematic literature review. Journal of Biomedical Informatics. 2023;138:104278.
Mendes JP, Moura IR, Van de Ven P, Viana D, Silva FJ, Coutinho LR, Teixeira S, Rodrigues JJ, Teles AS. Sensing apps and public data sets for digital phenotyping of mental health: systematic review. Journal of medical Internet research. 2022;24(2):e28735.
Henson P, Torous J. Feasibility and correlations of smartphone meta-data toward dynamic understanding of depression and suicide risk in schizophrenia. International journal of methods in psychiatric research. 2020;29(2):e1825.
Ranjan T, Melcher J, Keshavan M, Smith M, Torous J. Longitudinal symptom changes and association with home time in people with schizophrenia: an observational digital phenotyping study. Schizophrenia Research. 2022;243:64–9.
Moser DA, Doucet GE, Lee WH, et al. Multivariate Associations Among Behavioral, Clinical, and Multimodal Imaging Phenotypes in Patients With Psychosis. JAMA Psychiatry. 2018;75(4):386–395. doi:10.1001/jamapsychiatry.2017.4741
Arshi, B., Smits, L. J., Wynants, L., Cowley, L. E., Reeve, K., & Rijnhart, E. (2023). Number of publications on new clinical prediction models: a systematic literature search.
Adam M. Chekroud et al., Illusory generalizability of clinical prediction models. Science 383,164–167(2024). DOI:10.1126/science.adg8538
Bilden R, Torous J. Global collaboration around digital mental health: the LAMP consortium. Journal of Technology in Behavioral Science. 2022;7(2):227–33.
Wannan CM, Nelson B, Addington J, Allott K, Anticevic A, Arango C, Baker JT, Bearden CE, Billah T, Bouix S, Broome MR. Accelerating Medicines Partnership® Schizophrenia (AMP® SCZ): Rationale and Study Design of the Largest Global Prospective Cohort Study of Clinical High Risk for Psychosis. Schizophrenia Bulletin. 2024 Mar 7:sbae011.
Ameko MK, Cai L, Boukhechba M, Daros A, Chow PI, Teachman BA, Gerber MS, Barnes LE. Cluster-based approach to improve affect recognition from passively sensed data. In 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI) 2018 Mar 4 (pp. 434–437). IEEE.
Currey D, Torous J. Digital phenotyping data to predict symptom improvement and mental health app personalization in college students: prospective validation of a predictive model. Journal of Medical Internet Research. 2023;25:e39258.
Passive digital phenotyping: Objective quantification of human behaviour through smartphones. [Thesis fully internal (DIV), University of Groningen]. University of Groningen.
Zhou J, Lamichhane B, Ben-Zeev D, Campbell A, Sano A. Predicting Psychotic Relapse in Schizophrenia With Mobile Sensor Data: Routine Cluster Analysis. JMIR mHealth and uHealth. 2022;10(4):e31006.

Demographic Variable	All Sites (n=76)	BIDMC (n=26)	Bhopal (n=25)	Banglore (n=25)
Age	Mean = 32.9	Mean = 38.0	Mean = 29.4	Mean = 31.2
Gender
Male	53.9% (n=41)	34.6% (n=9)	64.0% (n=16)	64.0% (n=16)
Female	43.4% (n=33)	57.7% (n=15)	36.0% (n=9)	36.0% (n=9)
Other	2.6% (n=2)	7.7% (n=2)	0.0% (n=0)	0.0% (n=0)
Race
White/Caucasian	21.1% (n=16)	61.5% (n=16)	0.0% (n=0)	0.0% (n=0)
Black/African-American	6.5% (n=5)	19.2% (n=5)	0.0% (n=0)	0.0% (n=0)
Asian	65.8% (n=50)	0.0% (n=0)	100.0% (n=25)	100.0% (n=25)
Multiracial or Other	5.3% (n=4)	15.4% (n=4)	0.0% (n=0)	0.0% (n=0)
Missing	1.3% (n=1)	3.8% (n=1)	0.0% (n=0)	0.0% (n=0)
Education
Some high school or less	10.5% (n=8)	0.0% (n=0)	24.0% (n=6)	8.0% (n=2)
High school/some college or less	56.5% (n=43)	65.3% (n=17)	72.0% (n=18)	32.0% (n=8)
University or postgraduate degree	31.6% (n=24)	30.8% (n=8)	4.0% (n=1)	60.0% (n=15)
Missing	1.3% (n=1)	3.8% (n=1)	0.0% (n=0)	0.0% (n=0)

Table 1. Demographics (prior to outlier rejection)

Cluster Group	Feature 1	Feature 2	Partial correlation (r)	Significance (p-val)
Group A	EMA Mood	EMA Anxiety	0.57	0.03227
Group A	EMA Psychosis	Hometime	-0.64	0.014
Group A	EMA Psychosis	Entropy	-0.61	0.021
Group A	Hometime	Entropy	-0.74	0.00229
Group A	PHQ9	GAD	0.79	0.0008
Group B	EMA Social	PANSS Total	0.87	0.0235
Group B	PHQ9	BACS Composite	0.82	0.0466
Group B	EMA Anxiety	Entropy	-0.90	0.0142
Group B	EMA Anxiety	PANSS Total	-0.84	0.03770
Group B	EMA Anxiety	BACS Composite	-0.82	0.04575
Group B	Entropy	BACS Composite	-0.82	0.04735
Group C	EMA Psychosis	EMA Social	0.66	0.00108
Group C	EMA Psychosis	PANSS Total	0.48	0.028
Group C	EMA Mood	SFS Composite	0.45	0.03981
Group C	EMA Mood	Screen Duration	-0.46	0.0341
Group C	PANSS Total	BACS Composite	-0.47	0.03257

Table 2. Significant partial correlations

Feature 1	Feature 2	Group 1	Partial correlation (Group 1)	P-value (Group 1)	Group 2	Partial correlation (Group 2)	P-value (Group 2)	Z score (Diff)	P-value (Diff)
EMA Social	PANSS Total	A	0.0994	0.7354	B	0.8721	0.0235	-3.6328	0.0003
EMA Psychosis	SFS Composite	A	0.096	0.744	B	0.6696	0.1457	-2.0877	0.0368
Entropy	PANSS Total	A	0.3998	0.1566	B	-0.6032	0.205	3.2806	0.001
Hometime	SFS Composite	A	0.2362	0.4162	B	-0.6994	0.1219	3.2379	0.0012
PANSS Total	SFS Composite	A	-0.3148	0.273	B	0.6334	0.177	-3.1381	0.0017
Entropy	BACS Composite	A	0.2826	0.3276	B	-0.8166	0.0474	4.2036	0
Screen Duration	SFS Composite	A	0.1024	0.7277	B	0.6936	0.1264	-2.2001	0.0278
PHQ9	PANSS Total	A	-0.1935	0.5074	B	0.5493	0.2589	-2.3793	0.0173
PHQ9	BACS Composite	A	-0.1635	0.5765	B	0.818	0.0467	-3.8484	0.0001
EMA Psychosis	Hometime	A	-0.6365	0.0144	B	0.4621	0.3561	-3.6628	0.0002
EMA Psychosis	PANSS Total	A	0.4594	0.0984	B	-0.5148	0.2961	3.1172	0.0018
EMA Mood	EMA Anxiety	A	0.5728	0.0323	B	-0.0898	0.8657	2.1695	0.03
EMA Psychosis	Entropy	A	-0.6079	0.0211	B	0.0269	0.9597	-2.1427	0.0321
PHQ9	Entropy	A	0.0474	0.8723	B	0.7283	0.1007	-2.5674	0.0102
EMA Social	SFS Composite	A	0.3227	0.2605	B	-0.7994	0.0563	4.1872	0
EMA Social	Screen Duration	A	-0.3196	0.2653	B	0.6638	0.1506	-3.3075	0.0009
EMA Anxiety	PHQ9	A	-0.2366	0.4154	B	0.7241	0.1037	-3.385	0.0007
EMA Anxiety	PANSS Total	A	0.1679	0.5662	B	-0.837	0.0377	4.0379	0.0001
EMA Anxiety	BACS Composite	A	0.0721	0.8064	B	-0.8199	0.0457	3.5939	0.0003
EMA Anxiety	Entropy	A	-0.0385	0.8959	B	-0.9011	0.0142	4.2099	0

Table 3-1. Significant partial correlations difference (Group A vs B)

Feature 1	Feature 2	Group 1	Partial correlation (Group 1)	P-value (Group 1)	Group 2	Partial correlation (Group 2)	P-value (Group 2)	Z score (Diff)	P-value (Diff)
EMA Anxiety	Entropy	B	-0.9011	0.0142	C	0.2413	0.2921	-5.2974	0
EMA Anxiety	PANSS Total	B	-0.837	0.0377	C	-0.0642	0.7821	-3.5234	0.0004
EMA Social	Screen Duration	B	0.6638	0.1506	C	-0.297	0.1911	3.3976	0.0007
EMA Social	PANSS Total	B	0.8721	0.0235	C	-0.2995	0.1872	5.0721	0
EMA Social	SFS Composite	B	-0.7994	0.0563	C	0.221	0.3357	-4.0609	0
EMA Anxiety	PHQ9	B	0.7241	0.1037	C	0.1391	0.5476	2.3848	0.0171
Entropy	BACS Composite	B	-0.8166	0.0474	C	-0.3111	0.1698	-2.5346	0.0113
Hometime	SFS Composite	B	-0.6994	0.1219	C	-0.0166	0.9432	-2.6107	0.009
PANSS Total	SFS Composite	B	0.6334	0.177	C	-0.0401	0.863	2.4188	0.0156
PSQI	Entropy	B	-0.4409	0.3815	C	0.1948	0.3974	-2.0611	0.0393
PSQI	PANSS Total	B	-0.4689	0.3482	C	0.4136	0.0624	-2.9147	0.0036
Screen Duration	PANSS Total	B	-0.5796	0.228	C	0.0717	0.7575	-2.2542	0.0242
PHQ9	BACS Composite	B	0.818	0.0467	C	0.1387	0.5487	3.1068	0.0019
PHQ9	Entropy	B	0.7283	0.1007	C	-0.1977	0.3904	3.4582	0.0005
PHQ9	Hometime	B	0.3997	0.4324	C	-0.23	0.3158	2.0204	0.0433
EMA Anxiety	BACS Composite	B	-0.8199	0.0457	C	-0.1893	0.4111	-2.9646	0.003
EMA Social	PHQ9	B	-0.3913	0.443	C	0.2718	0.2333	-2.1267	0.0334
EMA Mood	Screen Duration	B	0.1888	0.7202	C	-0.4639	0.0342	2.1304	0.0331
EMA Psychosis	SFS Composite	B	0.6696	0.1457	C	-0.1779	0.4405	3.0416	0.0024
EMA Psychosis	PANSS Total	B	-0.5148	0.2961	C	0.4791	0.028	-3.3523	0.0008

Table 3-2. Significant partial correlations difference (Group B vs C)

Feature 1	Feature 2	Group 1	Partial correlation (Group 1)	P-value (Group 1)	Group 2	Partial correlation (Group 2)	P-value (Group 2)	Z score (Diff)	P-value (Diff)
EMA Psychosis	Entropy	A	-0.6079	0.0211	C	-0.116	0.6166	-2.0838	0.0372
EMA Psychosis	Hometime	A	-0.6365	0.0144	C	-0.1016	0.6611	-2.2998	0.0215
Entropy	SFS Composite	A	0.2535	0.3819	C	-0.2955	0.1934	1.9939	0.0462
Entropy	Hometime	A	-0.7438	0.0023	C	-0.3208	0.1562	-2.2152	0.0267
Entropy	PANSS Total	A	0.3998	0.1566	C	-0.2241	0.3288	2.3039	0.0212
Entropy	BACS Composite	A	0.2826	0.3276	C	-0.3111	0.1698	2.1656	0.0303
PHQ9	GAD	A	0.7853	0.0009	C	0.431	0.0511	2.1145	0.0345
EMA Mood	EMA Social	A	0.2227	0.4442	C	-0.4197	0.0582	2.3832	0.0172
EMA Mood	Screen Duration	A	0.2854	0.3225	C	-0.4639	0.0342	2.8149	0.0049

Table 3-3. Significant partial correlations difference (Group A vs C)

Survey Type	Survey Questions for EMA: Anxiety	Value Range (0 - 21)
Anxiety	Today I feel anxious	0 (Not at all) - 3 (Nearly All the Time)
Anxiety	Today I cannot stop worrying	0 (Not at all) - 3 (Nearly All the Time)
Anxiety	Today I am worrying too much about different things	0 (Not at all) - 3 (Nearly All the Time)
Anxiety	Today I have trouble relaxing	0 (Not at all) - 3 (Nearly All the Time)
Anxiety	Today I feel so restless it's hard to sit still	0 (Not at all) - 3 (Nearly All the Time)
Anxiety	Today I am easily annoyed or irritable	0 (Not at all) - 3 (Nearly All the Time)
Anxiety	Today I feel afraid something awful might happen	0 (Not at all) - 3 (Nearly All the Time)
Survey Type	Survey Questions for EMA: Mood	Value Range (0 - 27)
Mood	Today I feel little interest or pleasure	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I feel depressed	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I had trouble sleeping	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I feel tired or have little energy	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I have a poor appetite or am overeating	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I feel bad about myself or that I have let others down	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I have trouble focusing or concentrating	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I feel too slow or too restless	0 (Not at all) - 3 (Nearly All the Time)
Mood	Today I have thoughts of self-harm	0 (Not at all) - 3 (Nearly All the Time)
Survey Type	Survey Questions for for EMA: Social	Value Range (0 - 12)
Social	Today during the daytime I have gone outside my home*	3 (Not at all) - 0 (Nearly All the Time)
Social	Today I preferred to spend time alone	0 (Not at all) - 3 (Nearly All the Time)
Social	Today I had arguments with other people	0 (Not at all) - 3 (Nearly All the Time)
Social	Today I felt uneasy with groups of people	0 (Not at all) - 3 (Nearly All the Time)
Survey Type	Survey Questions for EMA: Psychosis	Value Range (0 - 15)
Psychosis	Today I have heard voices or saw things others cannot	0 (Not at all) - 3 (Nearly All the Time)
Psychosis	Today I have thought racing through my head	0 (Not at all) - 3 (Nearly All the Time)
Psychosis	Today I feel I have special powers	0 (Not at all) - 3 (Nearly All the Time)
Psychosis	Today I feel people are watching me	0 (Not at all) - 3 (Nearly All the Time)
Psychosis	Today I feel people are against me	0 (Not at all) - 3 (Nearly All the Time)

Table 4. Questionnaires used for Ecological Momentary Assessment (EMA)

JT is a scientific advisor of Precision Mental Wellness. JT is PI of an investigator initiated grant from Otsuka Pharmaceuticals. The other authors report any conflicts of interest.

All sites received ethics approval from respective their Institutional Review Boards (IRBs): Beth Israel Deaconess Medical Center, Sangath IRB and All India Institute of Medical Sciences Bhopal Institutional Human Ethics Committee (IHEC), and the National Institute for Mental Health and Neurosciences IHEC. All participants provided written informed consent.

Download PDF

Version 1

posted

You are reading this latest preprint version

Towards Clinical Subtypes in Schizophrenia: Integrating Cognitive, Functional, and Digital Phenotyping Assessments

Status:

Version 1

Abstract

Figures

Introduction

Methods

Subjects

Protocol

Brief Assessment of Cognition in Schizophrenia (BACS)

Social Functioning Scale (SFS)

Positive and Negative Syndrome Scale in Schizophrenia (PANSS)

Data Processing

Multiple Imputation by Chained Equation (MICE) Imputation

Accounting for Skewness

Max-Min Normalization and Outlier Rejection

Principal Component Analysis and K-means Clustering

Cluster-wise Mean and Correlation across Features

Partial Correlation Calculation

Difference of Partial Correlations via the Fisher r-to-Z transformation

Results

Cluster-wise Mean: EMA (Psychosis, Mood, Anxiety, Social) - Fig. 4A

Cluster-wise Mean: Clinical Symptom Assessments (PHQ9, GAD, PSQI) - Fig. 4A

Cluster-wise Mean: Passive Sensor Data - Fig. 4A

Cluster-wise Mean: PANSS Score - Fig. 4A

Cluster-wise Mean: BACS Cognitive Assessment - Fig. 4A

Cluster-wise Mean: SFS Social Functioning Assessment - Fig. 4A

Cluster-wise distribution of features - Fig. 4B

Significant Partial Correlations: Yellow Group (Cluster A) - Fig. 5

Significant Partial Correlations: Red Group (Cluster B) - Fig. 5

Significant Partial Correlations: Blue Group (Cluster C) - Fig. 5

Significant Difference in Partial Correlations between Groups - Fig. 6

Discussion

Declarations

CONFLICT OF INTEREST:

ACKNOWLEDGMENTS:

References

Tables

Additional Declarations

Status:

Version 1