Insights into neurobiological mechanism of probabilistic decision-making impairments in schizophrenia from Akt1 and PV interneurons in mice

doi:10.21203/rs.3.rs-4648573/v1

Download PDF

Article

Insights into neurobiological mechanism of probabilistic decision-making impairments in schizophrenia from Akt1 and PV interneurons in mice

https://doi.org/10.21203/rs.3.rs-4648573/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Schizophrenia, a debilitating disorder with genetic and neurobiological underpinnings, often manifests cognitive deficits, including impaired decision-making. Utilizing Akt1 heterozygous mutant (HET) mice as a model, which mimic schizophrenia due to AKT1’s implication as a susceptibility gene, we investigated the involvement of Akt1 and its neural mechanisms influencing strategic decision-making to identify potential therapeutic targets for schizophrenia-associated cognitive impairments. In six experiments, we first revealed that lesions targeting the dorsomedial striatum (DMS) significantly impacted performance in a mouse version of the two-choice probabilistic decision-making task, surpassing effects observed in other striatal subregions. Behavioral assessments in HET mice unveiled notable disturbances, including reduced accumulated trials to reach criteria, diminished ratio of lose-stay behavior, elevated learning rates, and decreased choice consistency in reinforcement learning models. Moreover, we found a strong correlation between DMS local field potential power and choice behavior, particularly evident in no-reward conditions. The behavioral abnormalities observed in HET mice were restored when the DMS was chemogenetically inhibited, while their locomotor activity remained unaffected. Furthermore, RNAseq analysis and immunohistochemistry uncovered reduced expression of striatal parvalbumin (PV) interneurons in HET mice. Targeted lesioning of PV interneurons in the DMS of wild-type mice resulted in behavioral alterations mirroring those in HET mice. In summary, our findings suggest that Akt1 deficiency-induced downregulation of PV expression alters neural oscillations in the DMS, influencing choice strategies, especially in no-reward conditions during probabilistic decision-making. These results underscore the crucial involvement of AKT1 and PV interneurons in modulating strategic decision-making, with particular relevance to the understanding of schizophrenia.

Biological sciences/Neuroscience

Biological sciences/Psychology

Biological sciences/Physiology

Schizophrenia

cognitive dysfunction

Akt1

the dorsomedial striatum

parvalbumin interneuron

probabilistic decision making

Schizophrenia is a chronic, severe mental illness that affects how a person thinks, feels, and behaves. Common symptoms associated with schizophrenia include positive, negative, and cognitive symptoms. Cognitive dysfunction is a core feature of schizophrenia that predicts functional outcome and treatment adherence (1, 2). Accumulating evidence indicates that cognitive deficits are present in adolescents at risk for schizophrenia and in untreated first-episode schizophrenic patients (3). However, the neural basis of cognitive deficits in schizophrenia remains unclear and current antipsychotics have demonstrated limited efficacy and reliability in treating cognitive impairments (4).

Decision making is a fundamental cognitive function involving intricate interactions among a distributed network of brain circuits. Patients with schizophrenia show different behavioral patterns and worse performance than healthy controls in many decision-making tasks, such as in Iowa gambling task (5, 6), probabilistic stimulus selection task (7), and probabilistic reversal learning (8). Similar results were also reported in schizophrenic patients with psychosis by our group in a two-choice probabilistic task (9). By fitting a reinforcement-learning model to behavioral data, these patients appear to update their reward values faster and have a lower degree of choice perseveration than their controls.

Genetic studies highlight the involvement of susceptibility genes, including AKT1 (protein kinase Bα), in schizophrenia pathogenesis (10–12). AKT1, a key signaling intermediate downstream from the dopamine D2 receptor, is the best-established target of antipsychotic drugs, and the AKT1-GSK3 signaling cascade is important for the expression of dopamine-associated behaviors (13, 14). Studies of schizophrenia postmortem brain tissue (10, 15), Akt1-deficient mice (16–18), and functional neuroimaging in humans (19) further support the biological function of AKT1 and its role in schizophrenia susceptibility. Intriguingly, Akt1-deficient mice exhibited altered neural properties of striatal medium spiny neurons and methamphetamine-induced alteration of striatal activity (17). These mice also displayed heightened reward prediction error (RPE, the discrepancy between expected and actual rewards) and updated reward information more rapidly in a two-choice dynamic foraging T-maze (20).

The striatum has been proposed to contribute to reward learning and decision-making. Lesions of the dorsal striatum impaired working memory, attention, and cognitive control (21). In a reward-based decision-making task, subjects need to integrate information of feedback to estimate the causal relationship between the action and the result and maximize rewards (22). The activity of mesostriatal dopamine neurons had been shown to signify the RPE (23). The dorsal striatal neurons in rats showed the RPE coding like that of dopamine neurons during a probabilistic Pavlovian conditioning task (24). Selective lesions of the dorsomedial striatum (DMS) impaired serial spatial reversal learning in rats (25). Neither lesions of nucleus accumbens core/shell nor dorsolateral striatum affected the behavioral performance in reversal learning. These results suggest the involvement of the dorsal striatum in reversal learning. Further clarification is required concerning the precise role of the striatum in complex decision-making scenarios where there is no obvious correct choice, and its implications for AKT1 and cognitive impairments related to schizophrenia.

In this study, a series of six experiments were conducted to investigate various aspects of decision-making in a mouse model of schizophrenia. Experiment 1 focused on the effects of different striatal subregions using lesion and model fitting techniques in a two-choice probabilistic task (2C task, Fig. 1A). Experiment 2 employed Akt1 heterozygous mutant (Akt1^+/− or HET) mice, targeting AKT1's role as a susceptibility gene to examine choice behaviors and model fitting compared to wild-type (WT) littermate controls within the 2C task paradigm. Experiment 3 involved simultaneous recording of local field potentials (LFP) in the dorsomedial striatum (DMS) during behavioral tasks to correlate neural activity with performance. Experiment 4 used chemogenetic inhibition in HET mice and WT controls to establish a causal link between DMS activity and behavioral responses. Experiment 5 employed RNA sequencing (RNA-seq) and immunohistochemistry in the striatum to identify differential gene expressions influencing decision-making regulation between HET and WT mice. Finally, Experiment 6 selectively lesioned parvalbumin (PV) interneurons in the DMS to investigate their role in modulating decision-making processes and their impact on behavioral outcomes in the 2C task paradigm.

Animal

Male C57BL/6 mice were obtained from the Animal Center of National Taiwan University (NTU) School of Medicine. Akt1 heterozygous (HET) male mice and their wild-type (WT) littermates were bred from Akt1 HET pairs. PV-Cre (Jax-008069) and GAD-cre (Jax-010802) mice were used to study specific role of cell types in the 2C task. All mice were on a C57BL/6 background and genotyped via PCR of tail DNA. They were housed individually with ad libitum food and water, starting experiments at 2–3 months old. Mice were handled and weighed daily for one week before experiments. All animal procedures adhered to protocols approved by the Animal Care and Use Committees at NTU.

Two-choice probabilistic task (2C task)

The 2C task was adapted from a dynamic foraging task used previously in humans and mice (9, 20, 48). Shown in Fig. 1A, this task featured a two-alternative forced-choice paradigm with one lever offering high-rate rewards and the other low-rate rewards, conducted daily over 45-minute sessions. Trial counts and choice outcomes were recorded using The Graphic State 4.2.03 software from Coulbourn Instruments. The experimental protocol included shaping, surgery and recovery, reshaping, and testing phases, detailed below.

The shaping and reshaping phases

Animals underwent food (or water in Experiment 1) restriction to 85% of their original body weight and locomotor activity assessment before the shaping phase. Each shaping stage lasted 45 minutes, with mice advancing to the next stage daily upon meeting criteria.

The surgery and recovery phase

Experiments 1, 3, 4, and 6 included surgery and recovery. Under isoflurane anesthesia (1.5%), mice underwent stereotaxic surgery with skull burr holes drilled. Procedures included neurotoxin microinjection, electrode implantation, or viral microinjection as dictated by experimental conditions. Post-surgery, spontaneous locomotor activity was assessed in an open field using EthoVision video tracking (Noldus Information Technology).

For Experiment 1, lesions targeted the dorsomedial striatum (DMS; AP, 0.5 mm; ML, ± 1.5 mm; DV, -3.0 mm), dorsolateral striatum (DLS; AP, 0.5 mm; ML, ± 2.5 mm; DV, -3.0 mm), or nucleus accumbens (NA; AP, 1.8 mm; ML, ± 1.1 mm; DV, -4.7 mm) with NMDA solution infusion via Hamilton syringe. Post-operative analgesics were administered for 7 days.

In Experiment 3, electrode implants targeted the DMS region (AP, 0.5 mm; ML, ± 1.5 mm; DV, -3.0 mm) using a 4-electrode array. Electrodes were secured with dental cement and analgesics were provided for 7 days. Mice fully recovered before entering the reshaping phase in all experiments.

For viral microinjection surgery (Experiments 4 and 6), each mouse underwent bilateral microinjection of a virus mix targeting the DMS (AP, 0.5 mm; ML, ± 1.5 mm; DV, -3.0 mm; 0.6 µL per site). The virus mix consisted of AAV with Cre-inducible Gi-coupled human M4 muscarinic receptor (AAV-hsyn-DIO-hM4D(Gi)-mCherry, NTU AAV core) and AAV with Cre expression driven by CMV promoter (AAV-CMV-Cre, NTU AAV core) in a 1:1 ratio. AAV groups received the full virus mix, while sham groups received AAV-CMV-Cre only, matched in volume to the virus mix. For PV-Cre mice, AAV8 injections contained Cre-inducible expression of diphtheria toxin A (AAV-mCherry-FLEx-DTA, UNC vector core). Mice remained in their home cage for 3 weeks post-surgery to allow for full virus expression and recovery before entering the reshaping phase of the 2C task.

The testing phase: Following the shaping phase (or reshaping phase in Experiments 1, 3, 4, and 6), mice entered the testing phase, aiming to achieve specific reward rates: 60%-20% for sucrose water in Experiment 1 and 80%-20% for food pellets in other experiments (Fig. 1A). Each 45-minute daily session comprised 3 to 6 blocks (each block with 10 trials). Sessions began with house and food magazine lights illuminating. A nose-poke initiated a trial, extinguishing the food magazine light. A 5-second fixed inter-trial interval (ITI) preceded insertion of stimulus-response levers. After the ITI, two levers were presented, and mice pressed one. Each press led to a reward or no-reward outcome, followed by food magazine illumination. Trials ended when the reward was collected or after a 5-second wait post-nose-poke. Mice learned through trial and error to identify the high reward rate lever. Completion criteria required achieving ≥ 70% accuracy in lever choice across three consecutive blocks, with an average accuracy > 75%. Mice had 2 weeks to meet these criteria; failure resulted in data exclusion.

The analysis of choice strategy in the 2C task

Trial-by-trial choice data from all mice in the testing phase of the 2C task were recorded and analyzed for accumulated trials and choice strategy. The analysis of choice strategy encompassed four distinct strategies: win-stay, win-shift, lose-stay, and lose-shift. The ratio of each choice strategy was computed using a custom R code. The ratio for each choice strategy was determined by dividing the number of occurrences of the specific strategy by the total accumulated trials.

Fitting a reinforcement learning model to behavioral data in the 2C task

To explore the mechanism governing RPE (reward prediction error)-driven choice behavior, we selectively applied a reinforcement model to fit trial-by-trial behavioral data from mice engaged in the 2C task. Model fitting was performed using Rstan and hBayesDM R packages with custom code. Hierarchical Bayesian modeling with the MCMC algorithm estimated parameters from trial-by-trial choice data. Differences in parameters among mice were compared using posterior distribution values from the Bayesian estimation.

We applied a modified Q-learning model to examine how reward prediction error (RPE) affects and updates expectations. The model separates the learning rate (α) into α_rew for rewarding results and α_nor for no-reward results, determining the update speed of expected values. The model equations are as follows:

Qc (t) = Qc (t − 1) + α _rew δ (t − 1) + α _nor δ (t − 1)

δ (t − 1) = Rc (t − 1) – Qc (t − 1)

Here, α_nor is set to 0 on reward trials, and α_rew is set to 0 on no-reward trials.

To characterize how the choice tendency is guided by the updated expectation, we assumed that the probability of choosing the previously selected lever, P c(t), was determined by the Boltzmann exploration, represented in a logistic form assigning a weight to each action:

Pc (t) = e^(βQc )/(e^(βQc ) + e^(βQnc ) )

Here, the parameter β denotes the choice consistency (choice perseveration or exploration/exploitation) parameter, describing the tendency to make actions guided by expected reward values.

For MCMC analysis, both α_rew and α_nor were assigned a non-informative beta distribution (β (1.2, 1.2)) between 0 and 1 for the prior. A Gaussian prior between 0 and 10 was assigned to β.

In vivo electrophysiological recording of the DMS

Measuring local field potentials (LFPs): In Experiment 3, LFPs in the DMS were recorded during the 2C task. Event time points were imported into MATLAB for ERP analysis. Normalized LFPs were segmented into − 1 to 1-second epochs around each event: (1) Trial initiation (nose-poke to start), (2) Lever press (choice-making), and (3) Outcome (entering the food magazine for reward or no reward). This segmentation facilitated ERP component extraction for decision-making analysis.

Histological verification of electrode placement

After behavioral testing, mice were euthanized, and electrode positions marked by passing current (10 µA, 30 sec) to create iron deposits, visualized with potassium ferrocyanide.

Inhibition of the DMS of Akt1 HET mice during the 2C task

To investigate the causal relationship between the DMS neuronal activity and reward-related decision-making behavior, we employed chemogenetic modulation to directly inhibit the activity of the DMS in the 2C task. Adult male HET and WT mice (90–100 days old, n = 4–5 per group) were used in Experiment 4. Following virus mixture injection (AAV-hsyn-DIO-hM4D(Gi)-mCherry + AAV-CMV-cre), mice received clozapine N-oxide (CNO, 5 mg/kg, i.p.) 30 minutes before testing. Freshly prepared CNO in 1% DMSO saline was used. After meeting criteria, mice underwent 2-day CNO-off sessions to mitigate chronic injection effects (49).

RNA sequencing (RNA-seq) and validation

RNA Sample Collection

Left or right striatum was dissected from male HET and WT mice (90–100 days old, n = 4 each) in Experiment 5. RNA was extracted using Trizol (Thermal Fisher) and QIAamp RNeasy Mini Kit (QIAGEN). Samples were quantified by Qsep100 Capillary gel electrophoresis (RQN > 8.0), Nanodrop 2000 (260/280 ratio between 1.8 ~ 2.0, 260/230 > 2.0), and Qubit 3 Fluorometer (RNA concentration). Only high-quality RNA was used for RNA sequencing.

RNA-Seq Library Construction and Sequencing

Poly-A enriched libraries were prepared using the SureSelect Strand Specific RNA Library Prep Kit (Integrated Science) and sequenced on the Illumina Miniseq system with an eight-base index for sample identification.

Analysis for RNA-Seq Data

Raw read quality was assessed with FastQC (Babraham Bioinoformatics) and mapped using STAR 2.7.6a (mapping rates > 98%). Mapped reads were aligned to the Mus musculus genome GRCm38 with Gencode vM25 annotation. Alignment quality was checked by RSecQC, and gene expression levels were quantified by featureCounts as transcript per million. Differential expression analysis and volcano plots were generated using limma in R.

Gene selection and primer design: Target and reference genes were selected based on differential expression (top 10 by p-value), significant fold changes (log2FC > 2), and associations with schizophrenia, parvalbumin (PV) expression, or Akt1 function. Notable genes included Akt1, PV, GAD67, Calr, Ascl1, and Cldn5, with Gapdh as the reference gene. Primers were designed using PrimerQuest (Integrated NDA Technologies). Following primers were used in this experiment. Akt1: Forward- TCGTGTGGCAGGATGTGTAT; Reverse- ACCTGGTGTCAGTCTCAGAGG. Gapdh: Forward-TGTGTCCGTCGT GGATCTGA; Reverse- CCTGCTTCACCACCTTCTTGA. Gad67: Forward- CACA GGTCACCCTCGATTTTT; Reverse- ACCATCCAACGATCTCTCTCATC. Pvalb: Forward-ATCAAGAAGGCGATAGGAGCC; Reverse- GGCCAGAAGCGTCTTTG TT. Calretinin: Forward- TTTCAGGGTATGAAGCTGACCTC; Reverse-TGACACT CTTCCTGTAGGTGGTG. Cldn5: Forward-GCAAGGTGTATGAATCTGTGCT; Reverse- GTCAAGGTAACAAAGAGTGCCA. Ascl1: Forward- TTGAACTCTATG GCGGGTTC; Reverse- CAAAGTCCATTCCCAGGAGA.

Reverse Transcriptome-Quantitative Real-time PCR (RT-qPCR): RNA was extracted as mentioned above, and cDNA synthesized using LunaScript RT SuperMix Kit (#E3010, New England Biolabs). For qPCR, 0.5 µl of cDNA was used in a 10 µl reaction with SYBR Green I-based Luna Universal qPCR Master Mix (#M3003, New England Biolabs), and Applied Biosystems StepOne qPCR machine. Threshold cycles (CT) were calculated, and relative expression determined using the ΔΔCT algorithm: ΔΔCT = (CT_A – CT_ref) − (CT_B – CT_ref); Relative expression = 2^(-ΔΔCT).

Immunohistochemistry

Immunohistochemistry labeled PV interneurons on 40 µm brain sections with antibody (1:250; Synaptic Systems). Neuronal density in the DMS was measured using NIH ImageJ.

Selective lesioning PV interneurons in the DMS

For investigating the causal relationship between the DMS PV interneurons and the reward-related decision-making behavior, we selectively lesioned DMS PV interneurons by the virus-expressed diphtheria toxin A (DTA) in PV-cre mice before the 2C task. Adult male PV-cre mice and their WT littermates (90–100 days old) were used (n = 8–11, per group) in Experiment 6. The experimental schedule followed the previously described protocol, with virus injection (AAV-mCherry-FLEx-DTA) for virus expression occurring 3 weeks before the task.

Data analyses and statistics

Data are presented as mean ± SEM. Behavioral data were analyzed using Student's t-test or one-way ANOVA for genotypic differences, and Mann-Whitney U test for choice strategy ratios. Effect sizes were measured by Cohen’s d (≥ 0.8, large effect) and rank-biserial r (maximum = 1). Pearson correlation evaluated relationships between behavioral data and neural oscillation power. Data with misplaced injections or electrodes were excluded. The two-sample Kolmogorov-Smirnov test was employed to reveal genotypic/group differences in the distribution of model parameters of the reinforcement learning model. A p-value below 0.05 was considered statistically significant.

Experiment 1: Lesions in the DMS notably affected performance in the 2C task more than other striatal subregions.

In Experiment 1, mice with dorsomedial striatum (DMS), dorsolateral striatum (DLS), and nucleus accumbens (NA) lesions were used to evaluate striatal subregion roles in the 2C task (Fig 1A). Compared to the sham group, selective excitotoxic lesions of different subregions of the striatum had no significant effect on their locomotor activity in the open field before and after brain surgery, except in the NA lesion condition (t(9) = 2.602, p < 0.05, Cohen's d = 1.735 > 0.8). As depicted in Fig 1B, our behavioral data indicated that DMS lesions significantly impaired behavioral performance in the 2C task compared to the sham group (t(14) = 2.003, p < 0.05, Cohen's d = 1.071 > 0.8). Conversely, lesions of the DLS and NA demonstrated no effect on accumulated trials compared to the sham group (both p > 0.05). No significant group difference in choice strategy was found among the three subregions (Fig 1C). Furthermore, two-sample Kolmogorov-Smirnov test revealed that DMS lesions caused more pronounced differences in reinforcement learning model parameters compared to lesions in the other two striatal subregions. As shown in Fig 1D, mice with lesions of the DMS had a significantly lower reward learning rate (α_rew, D(40000) = 0.3459, p < 0.05), no-reward learning rate (α_nor, D(40000) = 0.7985, p < 0.05), and higher choice consistency (β, D(40000) = 0.5552, p < 0.05) compared to their sham controls in our reward-no-reward model based analysis.

Experiment 2: Akt1-deficient (HET) mice displayed aberrant behaviors in the 2C task compared to their WT littermates.

HET mice were employed as a schizophrenia model in Experiments 2 and 3 to investigate reward-based probabilistic decision making, choice strategy, and neural activity in the 2C task. No significant genotypic effect was observed in their locomotor activity (data not shown). Compared to WT controls, HET mice required fewer trials to reach learning criteria (t(16) = 3.695, p < 0.01, Cohen's d = 1.848 > 0.8; Fig 2A), and exhibited decreased lose-stay strategy (t(16) = 2.536, p < 0.05, Cohen's d = 1.268 > 0.8; Fig 2B). The trial-by-trial choice behavioral data were further fitted with a reinforcement learning model to estimate α_rew, α_nor, and β. The Kolmogorov–Smirnov test revealed genotypic differences in the probability of the posterior distributions of parameters. HET mice exhibited a lower α_rew(D(40000) = 0.6483, p < 0.05), a higher α_nor (D(40000) = 0.8708, p < 0.05), and a lower β (D(40000) = 0.4189, p < 0.05; Fig 2C) compared to their WT littermates.

Experiment 3: The power of local field potential in the DMS is highly correlated with no-reward condition

In Experiment 3, neural activity in the DMS of both WT and HET mice correlated with their choice behaviors, depicted in Figs 2D and 2E, respectively. Implantation of recording electrodes did not alter their open field test locomotor activity before or after brain surgery (data not shown). In the no-reward condition, significant correlations were found between theta and gamma powers in the DMS and accumulated trials (Fig 2F: theta: r(18) = 0.4, gamma: r(18) = 0.6618; both p < 0.05). However, in the reward condition (Fig 2G), there were no significant correlations between theta or gamma power and accumulated trials. These findings highlight the critical role of theta and gamma powers in the DMS in guiding choice behavior during the two-choice probabilistic task, particularly when a reward is expected but not received.

Further analysis revealed correlations between DMS local field potential power and parameters of the reinforcement learning model, detailed in Table 1. Notably, in the no-reward condition, theta and gamma powers were significantly correlated with the no-reward learning rate in both WT and HET mice (all p < 0.05). A marginal correlation was observed between DMS local field potential power and choice consistency in HET mice, but not in WT mice. In contrast, in the reward condition, no significant correlations were found between DMS local field potential power and reinforcement learning model parameters. These results underscore a robust link between DMS local field potential power and choice behavior specifically under conditions where an anticipated reward is not received during the 2C task.

Experiment 4: Observed behavioral abnormalities in HET mice were restored by DMS inhibition.

Compared to WT controls, HET mice showed significantly higher DMS gamma power during habituation (t(16) = 2.34, p < 0.05, Cohen's d = 1.17 > 0.8; Fig 3A). To establish causality, DREADDs were used to inhibit DMS activity in HET mice during decision-making (Fig 3B). There were no significant differences in open field test locomotor activity before or after microinjection surgery (Fig 3C). Chemogenetic inhibition of DMS activity with CNO in HET mice significantly increased accumulated trials in the 2C task compared to HET vehicle-treated controls (U(5, 4) = 0.50, p < 0.05, rank-biserial r = 0.95; Fig 3D). Despite the small sample size, the effect size index (rank-biserial r) indicates a substantial effect, reaching a maximum value of 1. Although a trend was observed between HET-control and WT groups (U(4, 5) = 4, p = 0.057, rank-biserial r = 0.6), no significant difference was found between WT and HET-CNO groups (U(4, 4) = 5, p = 0.2429, rank-biserial r = 0.38). Additionally, HET-CNO mice exhibited an increased lose-stay strategy compared to HET controls (U(6, 4) = 2, p < 0.05, rank-biserial r = 0.83; Fig 3E). Model-based analysis (Fig 3F) showed that CNO-treated HET mice had decreased α_nor and increased β compared to HET controls, indicating that inhibiting DMS activity affected choice behavior and strategy in these HET mice.

Experiment 5: RNA-seq analysis and immunohistochemistry data highlight unique expression of striatal parvalbumin (PV) interneurons in HET mice.

RNA sequencing (RNA-seq) analysis was performed on striatal samples from 4 WT and 4 HET mice, followed by validation using RT-qPCR and immunohistochemistry. Principal components analysis (PCA) depicted the distribution of gene expression across samples (Fig 4A). Volcano plot analysis identified top differentially expressed genes, with Akt1 and parvalbumin (Parvb/PV) among the notable up and down-regulated genes (Fig 4B). Detailed results of the RNA-seq analysis, including fold changes, are provided in the supplementary table.

RT-qPCR confirmed decreased expression of Akt1 (t(7) = 5.532, p < 0.01, Cohen's d = 4.182 > 0.8) and parvalbumin (t(7) = 1.925, p < 0.05, Cohen's d = 1.455 > 0.8) in HET mice compared to WT controls. No significant differences were observed in the expression of glutamic acid decarboxylase 67 (GAD67) and calretinin (Calr) between genotypes (Fig 4C). Additionally, downregulated expression of Ascl1, a key transcription factor in striatal neurogenesis, was validated (t(7) = 2.815, p < 0.05, Cohen's d = 2.128 > 0.8). Immunohistochemistry further confirmed reduced PV interneuron expression in the DMS of HET mice compared to WT controls (t(9) = 3.406, p < 0.01, Cohen's d = 2.271 > 0.8; Fig 4D), consistent with the gene expression findings.

Experiment 6: Selective lesion of PV interneurons in the DMS of DTA-treated PV-Cre mice disrupted choice behaviors as observed in HET mice.

In Experiment 6, we specifically lesioned PV interneurons in the DMS via diphtheria toxin A (DTA) in PV-Cre mice to ensure the causal effect between reduced expression of PV interneurons and choice behavior in the 2C task. As depicted in Fig 5A, the region-specific expression of DTA resulted in a reduced density of PV-positive interneurons in the DMS (t(12) = 4.843, p < 0.01, Cohen's d = 2.796 > 0.8). The expression of DTA did not affect spontaneous locomotor activity in PV-Cre mice during habituation in the testing chamber before or after the treatment (Fig 5B). However, compared to the control group, selective lesion of PV interneurons in the DMS led to a significant reduction of accumulated trials in the acquisition of the 2C task (t(16) = 2.179, p < 0.05, Cohen's d = 1.09 > 0.8; Fig 5C). The analysis of choice strategy further revealed a decreased "lose-shift" strategy in DTA-treated PV-Cre mice (U(8, 11) = 14, z = -2.435, p < 0.01, rank-biserial r = 0.68; Fig 5D). Similar to HET mice, our trial-by-trial model fitting data revealed that DTA-treated PV-Cre mice exhibited a higher α_rew (D(40000) = 0.9937, p < 0.05), a higher α_nor (D(40000) = 0.9474, p < 0.05), and a lower β (D(40000) = 0.9305, p < 0.05) than the sham controls did (Fig 5E). In contrast, selective chemogenetic inhibition of GABAergic interneurons in the DMS of GAD2-cre mice has no significant effect on accumulated trials, choice strategies, and model parameters (except the choice consistency (ß)) in the 2C task (data not shown).

This study encompasses six experiments using the Akt1 mutant mouse model, revealing impaired behavioral performances and altered choice strategies in the 2C task, particularly evident in the absence of rewards. Integration of in vivo local field potential recordings and chemogenetic data highlights the critical role of the DMS in abnormal decision-making behaviors. RNA-seq analysis, immunohistochemistry, and selective lesioning of PV interneurons in the DMS further support these findings. Together, these results underscore the pivotal roles of Akt1 and PV interneurons in regulating choice strategies in reward-based decision-making, especially under no-reward conditions. This study identifies Akt1's influence on probabilistic decision-making strategies within the DMS and PV interneurons in a schizophrenia mouse model, representing a significant discovery. The findings suggest targeting PV interneurons in the DMS as a potential therapeutic approach for addressing cognitive deficits in schizophrenia.

PV interneurons have emerged as a significant focus in understanding cognitive functions, particularly in schizophrenia. Postmortem studies consistently reveal reduced PV interneuron numbers in schizophrenia patients (26–28), with NMDAR hypofunction in these cells proposed as a potential underlying mechanism (29). Additionally, PV interneurons are critical for generating gamma oscillations (30–80 Hz), crucial for cognitive processes (30–34). Studies in Huntington’s disease models suggest that striatal PV interneurons play a crucial role in learning by providing local inhibitory input to striosomes (35). Our findings align with these insights, emphasizing that altered brain oscillations due to decreased PV interneurons in the striatum significantly impact cognitive functions. Moreover, fast-spiking PV interneurons in the striatum tightly regulate medium spiny neurons (MSNs), balancing firing between direct (D1) and indirect (D2) pathway neurons (36). Optogenetic and chemogenetic studies further indicate that PV interneurons modulate striatal output, enhancing early learning and action selection (37, 38). Despite greater abundance in the lateral than medial striatum in rodents (39, 40), PV interneurons exert stronger regulatory control over DMS efferents compared to other GABAergic interneurons (41). Additionally, PV interneurons in medial and lateral striatum exhibit intrinsic excitability differences (42), potentially influencing reward-related decision-making in distinct dorsal striatal subregions. Our findings support the critical role of striatal PV interneurons in choice strategy and probabilistic decision-making, relevant to cognitive deficits observed in schizophrenia. Consistent with this, we previously reported that schizophrenic patients with psychosis exhibit rapid reward value updating (high learning rate, α) and reduced choice perseveration (low β) in a two-choice task (9). These behavioral observations may reflect alterations in striatal PV interneurons. Our task and analyses are adaptable to both human and mouse models, providing a valuable and translational method to investigate reward-related learning and decision-making in basic and clinical research contexts.

Previous studies have highlighted Akt1's role in modulating reward learning and prediction error in mice (20), as well as its involvement in methamphetamine-induced psychosis and striatal neuronal activity (17). Given Akt1's association with dopamine-related behaviors and schizophrenia, comparing our findings on Akt1 and PV across experiments in this study is crucial. In Experiment 2, HET mice showed significant changes in behavioral performance (accumulated trials), choice strategy (reduced lose-stay strategy in no-reward conditions), and model parameters (increased α_nor and decreased β). Similar outcomes were observed in Experiment 6 with selective PV interneuron lesions in PV-Cre mice treated with DTA, including reduced accumulated trials, decreased lose-shift strategy in no-reward conditions, and altered α_nor and β compared to controls. Intriguingly, Experiment 5 revealed reduced striatal PV interneurons and decreased Akt1, parvalbumin, and Ascl1 expressions in HET mice using immunohistochemistry and RNA-seq. Ascl1 is crucial for interneuron development (43, 44), and Akt1 inhibition reduces Ascl1-induced PV-neuron differentiation in cell culture (16). Thus, chronic PV interneuron deficiency in HET mice may stem from compensatory mechanisms involving Ascl1 due to long-term Akt1 knockdown during brain development.

In contrast to Experiment 2 with HET mice, Experiment 4 revealed contrasting behavioral outcomes. Chemogenetic inhibition of the dorsomedial striatum (DMS) in CNO-treated Akt1 HET mice reversed behavioral abnormalities observed in untreated HET controls. Specifically, treated mice showed increased lose-stay strategy under no-reward conditions, decreased α_nor, and increased β compared to controls, aligning with Experiment 3's indication of DMS's pivotal role in choice behavior, especially without rewards. The striatum, a key basal ganglia structure influencing motor function and reward learning, is predominantly composed of GABAergic medium spiny neurons (MSNs) (45). Akt1-deficient mice exhibit reduced cumulative miniature IPSC amplitudes in striatal MSNs and altered activity following methamphetamine exposure (17). Our findings suggest that DMS chemogenetic inhibition temporarily disinhibited striatal GABAergic MSN activity, restoring theta power and gamma waves. Future studies could employ in vivo electrophysiology to investigate neural oscillation changes in Akt1 mutant mice's DMS. PV interneurons in the DMS receive glutamatergic input from the cingulate cortex, distinct from the dorsolateral striatum (DLS) (42). These PV neurons locally inhibit striosomal activity (35), crucial for the GPi-LHb circuit, implicated in anti-reward processing (46, 47). Therefore, future research may benefit from multi-site electrophysiological recordings to track dynamics in the striatum-GPi-LHb circuit during decision-making.

In conclusion, this study establishes a strong correlation between altered behavioral responses, shifts in choice strategies, and variations in model parameters observed in HET mice during the 2C task. The pivotal role of PV interneurons in the DMS in regulating strategic decision-making, particularly evident in no-reward conditions, suggests their potential as therapeutic targets for addressing cognitive impairments in schizophrenia. Despite the limited sample size in some experiments, the observed effects and effect sizes remain substantial and consistent. Our findings pave the way for future investigations, providing deeper insights into the neural circuits underlying strategic decision-making and cognitive dysfunctions in schizophrenia. Specifically, conducting multi-site electrophysiological recordings to scrutinize the striatum-globus pallidus internus-lateral habenula (striatum-GPi-LHb) circuit during decision-making could offer valuable avenues for further exploration. Overall, our findings illuminate Akt1's role in precisely modulating probabilistic decision-making strategies within specific brain regions and cell types in a mouse model of schizophrenia, marking a significant advance in our understanding of schizophrenia and the development of treatments, particularly for cognitive deficits.

Acknowledgments

This research was supported by MOST/NSTC grant numbers 112-2321-B-002 -022, 111-2423-H-002 -009, 110-2410-H-002-235-MY3 and 109-2410-H-002-087-MY3 and by grant support from National Taiwan University and National Taiwan University Hospital.

Disclosures

The authors declare no financial or non-financial interests.

Bowie CR, Harvey PD. Cognitive deficits and functional outcome in schizophrenia. Neuropsychiatr Dis Treat. 2006 Dec;2(4):531–6.
Koychev I, Joyce D, Barkus E, Ettinger U, Schmechtig A, Dourish CT, et al. Cognitive and oculomotor performance in subjects with low and high schizotypy: implications for translational drug development studies. Transl Psychiatry. 2016 May 17;6(5):e811–e811.
Mohamed S, Paulsen JS, O’Leary D, Arndt S, Andreasen N. Generalized Cognitive Deficits in Schizophrenia: A Study of First-Episode Patients. Arch Gen Psychiatry. 1999 Aug 1;56(8):749.
Baldez DP, Biazus TB, Rabelo-da-Ponte FD, Nogaro GP, Martins DS, Kunz M, et al. The effect of antipsychotics on the cognitive performance of individuals with psychotic disorders: Network meta-analyses of randomized controlled trials. Neurosci Biobehav Rev. 2021 Jul;126:265–75.
Shurman B, Horan WP, Nuechterlein KH. Schizophrenia patients demonstrate a distinctive pattern of decision-making impairment on the Iowa Gambling Task. Schizophr Res. 2005 Jan;72(2–3):215–24.
Saperia S, Da Silva S, Siddiqui I, Agid O, Daskalakis ZJ, Ravindran A, et al. Reward-driven decision-making impairments in schizophrenia. Schizophr Res. 2019 Apr;206:277–83.
Waltz JA, Frank MJ, Robinson BM, Gold JM. Selective Reinforcement Learning Deficits in Schizophrenia Support Predictions from Computational Models of Striatal-Cortical Dysfunction. Biol Psychiatry. 2007 Oct;62(7):756–64.
Reddy LF, Waltz JA, Green MF, Wynn JK, Horan WP. Probabilistic Reversal Learning in Schizophrenia: Stability of Deficits and Potential Causal Mechanisms. Schizophr Bull. 2016 Jul;42(4):942–51.
Li CT, Lai WS, Liu CM, Hsu YF. Inferring reward prediction errors in patients with schizophrenia: a dynamic reward task for reinforcement learning. Front Psychol [Internet]. 2014 Nov 11 [cited 2023 Sep 18];5. Available from: http://journal.frontiersin.org/article/10.3389/fpsyg.2014.01282/abstract
Emamian ES, Hall D, Birnbaum MJ, Karayiorgou M, Gogos JA. Convergent evidence for impaired AKT1-GSK3β signaling in schizophrenia. Nat Genet. 2004 Feb;36(2):131–7.
Schwab SG, Hoefgen B, Hanses C, Hassenbach MB, Albus M, Lerer B, et al. Further Evidence for Association of Variants in the AKT1 Gene with Schizophrenia in a Sample of European Sib-Pair Families. Biol Psychiatry. 2005 Sep;58(6):446–50.
Mathur A, Law MH, Megson IL, Shaw DJ, Wei J. Genetic association of the AKT1 gene with schizophrenia in a British population. Psychiatr Genet. 2010 Jun;20(3):118–22.
Beaulieu JM, Gainetdinov RR, Caron MG. The Akt–GSK-3 signaling cascade in the actions of dopamine. Trends Pharmacol Sci. 2007 Apr;28(4):166–72.
Beaulieu JM, Sotnikova TD, Yao WD, Kockeritz L, Woodgett JR, Gainetdinov RR, et al. Lithium antagonizes dopamine-dependent behaviors mediated by an AKT/glycogen synthase kinase 3 signaling cascade. Proc Natl Acad Sci. 2004 Apr 6;101(14):5099–104.
Zhao Z, Ksiezak-Reding H, Riggio S, Haroutunian V, Pasinetti GM. Insulin receptor deficits in schizophrenia and in cellular and animal models of insulin receptor dysfunction. Schizophr Res. 2006 May;84(1):1–14.
Chang CY, Chen YW, Wang TW, Lai WS. Akting up in the GABA hypothesis of schizophrenia: Akt1 deficiency modulates GABAergic functions and hippocampus-dependent functions. Sci Rep. 2016 Sep 12;6(1):33095.
Chen YW, Kao HY, Min MY, Lai WS. A Sex- and Region-Specific Role of Akt1 in the Modulation of Methamphetamine-Induced Hyperlocomotion and Striatal Neuronal Activity: Implications in Schizophrenia and Methamphetamine-Induced Psychosis. Schizophr Bull. 2014 Mar;40(2):388–98.
Luo DZ, Chang CY, Huang TR, Studer V, Wang TW, Lai WS. Lithium for schizophrenia: supporting evidence from a 12-year, nationwide health insurance database and from Akt1-deficient mouse and cellular models. Sci Rep. 2020 Jan 20;10(1):647.
Tan HY, Nicodemus KK, Chen Q, Li Z, Brooke JK, Honea R, et al. Genetic variation in AKT1 is linked to dopamine-associated prefrontal cortical structure and function in humans. J Clin Invest. 2008 May 1;JCI34725.
Chen YC, Chen YW, Hsu YF, Chang WT, Hsiao CK, Min MY, et al. Akt1 deficiency modulates reward learning and reward prediction error in mice. Genes Brain Behav. 2012 Mar;11(2):157–69.
Chudasama Y, Robbins TW. Functions of frontostriatal systems in cognition: Comparative neuropsychopharmacological studies in rats, monkeys and humans. Biol Psychol. 2006 Jul;73(1):19–38.
Schultz W. Dopamine reward prediction error coding. Dialogues Clin Neurosci. 2016 Mar 31;18(1):23–32.
Schultz W, Dayan P, Montague PR. A Neural Substrate of Prediction and Reward. Science. 1997 Mar 14;275(5306):1593–9.
Oyama K, Hernádi I, Iijima T, Tsutsui KI. Reward Prediction Error Coding in Dorsal Striatal Neurons. J Neurosci. 2010 Aug 25;30(34):11447–57.
Castañé A, Theobald DEH, Robbins TW. Selective lesions of the dorsomedial striatum impair serial spatial reversal learning in rats. Behav Brain Res. 2010 Jun;210(1):74–83.
Benes F. GABAergic Interneurons Implications for Understanding Schizophrenia and Bipolar Disorder. Neuropsychopharmacology. 2001 Jul;25(1):1–27.
Volk DW, Edelson JR, Lewis DA. Altered expression of developmental regulators of parvalbumin and somatostatin neurons in the prefrontal cortex in schizophrenia. Schizophr Res. 2016 Nov;177(1–3):3–9.
Pantazopoulos H, Wiseman JT, Markota M, Ehrenfeld L, Berretta S. Decreased Numbers of Somatostatin-Expressing Neurons in the Amygdala of Subjects With Bipolar Disorder or Schizophrenia: Relationship to Circadian Rhythms. Biol Psychiatry. 2017 Mar;81(6):536–47.
Gonzalez-Burgos G, Lewis DA. NMDA Receptor Hypofunction, Parvalbumin-Positive Neurons, and Cortical Gamma Oscillations in Schizophrenia. Schizophr Bull. 2012 Sep 1;38(5):950–7.
Bragin A, Jando G, Nadasdy Z, Hetke J, Wise K, Buzsaki G. Gamma (40-100 Hz) oscillation in the hippocampus of the behaving rat. J Neurosci. 1995 Jan 1;15(1):47–60.
Penttonen M, Kamondi A, Acsády L, Buzsáki G. Gamma frequency oscillation in the hippocampus of the rat: intracellular analysis in vivo. Eur J Neurosci. 1998 Feb;10(2):718–28.
Fuchs EC, Zivkovic AR, Cunningham MO, Middleton S, LeBeau FEN, Bannerman DM, et al. Recruitment of Parvalbumin-Positive Interneurons Determines Hippocampal Function and Associated Behavior. Neuron. 2007 Feb;53(4):591–604.
Sohal VS, Zhang F, Yizhar O, Deisseroth K. Parvalbumin neurons and gamma rhythms enhance cortical circuit performance. Nature. 2009 Jun;459(7247):698–702.
Başar E, Başar-Eroglu C, Karakaş S, Schürmann M. Gamma, alpha, delta, and theta oscillations govern cognitive processes. Int J Psychophysiol. 2001 Jan;39(2–3):241–8.
Friedman A, Hueske E, Drammis SM, Toro Arana SE, Nelson ED, Carter CW, et al. Striosomes Mediate Value-Based Learning Vulnerable in Age and a Huntington’s Disease Model. Cell. 2020 Nov;183(4):918-934.e49.
Damodaran S, Evans RC, Blackwell KT. Synchronized firing of fast-spiking interneurons is critical to maintain balanced firing between direct and indirect pathway neurons of the striatum. J Neurophysiol. 2014 Feb 15;111(4):836–48.
Lee K, Holley SM, Shobe JL, Chong NC, Cepeda C, Levine MS, et al. Parvalbumin Interneurons Modulate Striatal Output and Enhance Performance during Associative Learning. Neuron. 2017 Mar;93(6):1451-1463.e4.
Gage GJ, Stoetzner CR, Wiltschko AB, Berke JD. Selective Activation of Striatal Fast-Spiking Interneurons during Choice Execution. Neuron. 2010 Aug;67(3):466–79.
Kita H, Kosaka T, Heizmann CW. Parvalbumin-immunoreactive neurons in the rat neostriatum: a light and electron microscopic study. Brain Res. 1990 Dec;536(1–2):1–15.
Todtenkopf MS, Stellar JR, Williams EA, Zahm DS. Differential distribution of parvalbumin immunoreactive neurons in the striatum of cocaine sensitized rats. Neuroscience. 2004 Jan;127(1):35–42.
Fino E, Vandecasteele M, Perez S, Saudou F, Venance L. Region-specific and state-dependent action of striatal GABAergic interneurons. Nat Commun. 2018 Aug 21;9(1):3339.
Monteiro P, Barak B, Zhou Y, McRae R, Rodrigues D, Wickersham IR, et al. Dichotomous parvalbumin interneuron populations in dorsolateral and dorsomedial striatum. J Physiol. 2018 Aug;596(16):3695–707.
Dong Z yong, Pei Z, Wang Y ling, Li Z, Khan A, Meng X ting. Ascl1 Regulates Electric Field-Induced Neuronal Differentiation Through PI3K/Akt Pathway. Neuroscience. 2019 Apr;404:141–52.
Yang J, Yang X, Tang K. Interneuron development and dysfunction. FEBS J. 2022 Apr;289(8):2318–36.
Lanciego JL, Luquin N, Obeso JA. Functional Neuroanatomy of the Basal Ganglia. Cold Spring Harb Perspect Med. 2012 Dec 1;2(12):a009621–a009621.
Hong S, Amemori S, Chung E, Gibson DJ, Amemori K ichi, Graybiel AM. Predominant Striatal Input to the Lateral Habenula in Macaques Comes from Striosomes. Curr Biol. 2019 Jan;29(1):51-61.e5.
Hu H, Cui Y, Yang Y. Circuits and functions of the lateral habenula in health and in disease. Nat Rev Neurosci. 2020 May;21(5):277–95.
Liu HH, Liu CM, Hsieh MH, Chien YL, Hsu YF, Lai WS. Dysregulated affective arousal regulates reward-based decision making in patients with schizophrenia: an integrated study. NPJ Schizophrenia, 2022 8(1), 1–10.
Carvalho Poyraz F, Holzner E, Bailey MR, Meszaros J, Kenney L, Kheirbek MA, et al. Decreasing Striatopallidal Pathway Function Enhances Motivation by Energizing the Initiation of Goal-Directed Action. J Neurosci. 2016 Jun 1;36(22):5988–6001.

Table 1 is available in the Supplementary Files section

The authors have declared there is NO conflict of interest to disclose

Download PDF

Reviewer #2 agreed at journal
10 Aug, 2024
Review #1 received at journal
25 Jul, 2024
Reviewer #1 agreed at journal
03 Jul, 2024
Reviewers invited by journal
02 Jul, 2024
Submission checks completed at journal
01 Jul, 2024
First submitted to journal
28 Jun, 2024
Unknown event
28 Jun, 2024
Editor assigned by journal
27 Jun, 2024

You are reading this latest preprint version

Insights into neurobiological mechanism of probabilistic decision-making impairments in schizophrenia from Akt1 and PV interneurons in mice

Status:

Version 1

Abstract

Introduction

Materials and methods

Animal

Two-choice probabilistic task (2C task)

The analysis of choice strategy in the 2C task

Fitting a reinforcement learning model to behavioral data in the 2C task

δ (t − 1) = Rc (t − 1) – Qc (t − 1)

Pc (t) = e^(βQc )/(e^(βQc ) + e^(βQnc ) )

Inhibition of the DMS of Akt1 HET mice during the 2C task

RNA sequencing (RNA-seq) and validation

Immunohistochemistry

Selective lesioning PV interneurons in the DMS

Data analyses and statistics

Results

Experiment 3: The power of local field potential in the DMS is highly correlated with no-reward condition

Experiment 4: Observed behavioral abnormalities in HET mice were restored by DMS inhibition.

Experiment 5: RNA-seq analysis and immunohistochemistry data highlight unique expression of striatal parvalbumin (PV) interneurons in HET mice.

Experiment 6: Selective lesion of PV interneurons in the DMS of DTA-treated PV-Cre mice disrupted choice behaviors as observed in HET mice.

Discussion

Declarations

References

Table

Additional Declarations

Supplementary Files

Status:

Version 1