Subjects
Male (n = 896) and female (n = 898) heterogeneous stock rats (HS; NMcwiWfsm:HS; RRID # RRID:RGD_13673907)74 tested in this study were part of the NIDA Center for GWAS in Outbred Rats (Principal Investigator, Dr. Abraham Palmer) and obtained from colonies maintained by Dr. Leah Solberg Woods [2015–2018: Medical College of Wisconsin: males n = 815, females n = 816; 2019: Wake Forest University: males n = 81; females: n = 82]. Experiments were conducted in batches of approximately 100 rats (4- to 5-weeks of age) at 3- to 4-month intervals. Rats were quarantined for 1- to 2-weeks upon arrival to University at Buffalo before being transferred to colony housing. The colony room was maintained at a constant temperature (22 ± 1°C), humidity range (~ 55% ± 5%), and lights were on a reverse cycle (lights on from 19:00 to 07:00).
Rats were housed in same-sex pairs in plastic cages (42 × 22 × 19 cm) lined with bedding (Aspen Shavings). Prior to the start of the experiment, rats (n = 1590) were first tested on four behavioral tasks (social reinforcement, locomotor response to novelty, light reinforcement, and choice reaction time; data are not reported here). A subset of animals (n = 204) were not tested on the social reinforcement test because it was introduced after the first two batches had already been tested. At the onset of data collection, the mean (± SEM) age of the rats was postnatal day 136.58 ± 0.29.
Behavioral testing was conducted 6 days/week (Monday through Saturday) during the dark phase of the light-dark cycle between the hours of 08:30 and 12:30. Food (Teklad Laboratory Diet #8604) was available ad libitum in the home cages. Access to water was restricted to 30 min immediately following testing on Monday through Friday. At the end of the testing on Saturday, animals were given free access to water until approximately 12:30 h on Sunday (approximately 20 hr prior to testing on Monday).
This study was conducted in accordance with protocols approved by the Institutional Animal Care and Use Committee at the University at Buffalo, and animals were treated in compliance with the Guide for the Care and Use of Laboratory Animals, and the study is reported in accordance with the ARRIVE guidelines 75.
Apparatus
Testing occurred in 24 locally constructed operant chambers (24 × 22 × 20 cm; Fig. 1) housed in sound-attenuating cabinets (Model # 3000000187, Coleman, Wichita, KS), which were previously described in detail 76. Briefly, the chambers had stainless-steel rod floors, aluminum back and side walls, and a Plexiglas front wall and top. Each test chamber had three snout poke receptacles (4 cm in diameter) located in the back and side walls. Infrared photobeam detectors, located 1 cm from the snout poke receptacle entrance, were used to record snout pokes. Three stimulus lights were located above each snout poke receptacle, and a fourth light was located in the ceiling of the test chamber. A Sonalert tone generator (SC628EJR; Mallory Sonalert Products, Indianapolis, IN) mounted on the right wall provided a pulsed 1.9-kHz tone. Acrylic dishes were located inside the left and rear snout poke receptacles and were connected to Tygon tubing, which delivered precise amounts of water from 60 ml syringes mounted on two single-speed syringe pumps (3.33 rpm, PHM-100; Med Associates, St. Albans, VT) external to the sound-attenuating cabinet.
The apparatus was controlled by MED-PC IV software (RRID:SCR_012156) with 1 ms temporal resolution running on computers with Microsoft Windows operating systems. Equipment was tested before test sessions and following any session in which a rat earned fewer than 30 reinforcers.
Sequential Patch Depletion Procedure
Behavior was measured using a sequential patch depletion procedure that was previously described (Fig. 2a)26,77. Briefly, during this task, water-restricted rats were offered a concurrent choice between two water “patches” (left- or back-wall snout poke receptacles) and could elect to “stay” in the current patch or “leave” for an alternative patch at any time.
A snout poke in the receptacle (i.e., entering the patch) resulted in immediate delivery of 150 µl water. If rats made a choice to “stay” in this patch, successively smaller amounts of water were delivered. To simulate patch depletion, the volume of water for successive “stay” choices was reduced 20% for each reinforcer presentation. For example, the first reward volume was 150 µl of water, the second reward was 120 µl, and the third reward was 96 µl, etc. with each reward delivery separated by a minimum of 4 s (Fig. 2a and 2b). Once in a patch, water was available according to a modified Fixed Interval (FI) 4-s schedule; each water delivery was followed by a 4-s interval during which the reinforcer was unavailable. However, successive “stay” choices were achieved in one of two ways: rats could emit a snout poke to the same receptacle as the previous response following the 4-s interval (traditional FI contingency) or the rat could remain with its snout in the receptacle for the duration of the 4-s interval (analogous to a Fixed Time or FT schedule).
If a rat made the choice to “leave” a patch by poking its snout in the alternative receptacle (“alternative patch”), a changeover delay (COD) was imposed to simulate “travel” cost78. During the COD, reinforcers were not available at either location regardless of responding. When the rat left a patch to travel to the alternative patch, the abandoned patch was replenished (the volume of the water was reset to 150 µl for the first reinforced response on returning to that patch).
When a patch change occurred with a 0-s COD, the stimulus light above the abandoned patch was extinguished, and the stimulus light above the newly poked receptacle was illuminated simultaneously with the delivery of 150 µl of water. For CODs > 0 s (6, 12, 18, or 24 s), the stimulus light above the abandoned receptacle was extinguished and a 1.9-kHz tone was pulsed for the duration of the COD. At the end of the delay, the pulsed tone was turned off and the stimulus light above the newly poked receptacle was illuminated. The first snout poke into the new location after the onset of the stimulus light resulted in the delivery of 150 µl of water. Sessions lasted for 10 min or until the rat earned a cumulative total of 5 ml of water, whichever occurred first.
The COD was constant within a session but varied between sessions (i.e., days of the week) in the following sequence (Monday through Saturday): 0, 0, 6, 12, 18, and 24 s, with the first session of the week being excluded from data analysis (i.e., the first 0 s session). This 6-day cycle was repeated four times for a total of 24 test sessions. Data from the last two cycles of sessions for each COD delay were averaged for data analysis.
Dependent Measures
There were five primary dependent measures: rate of water reinforcement, number of patch changes, average time in a patch, average water volume rejected by leaving the patch (“rejection volume”/“indifference point”), and the deviation from optimality. The first and final patches, including associated patch changes, of each session were not included in these calculations. These patches were excluded because, in the case of the first patch, rats had not yet experienced the COD, and the last patch resulted in session termination due to the session ending and so not accurately representing rats’ choice behavior. Rate of reinforcement was calculated by summing the volume of water earned and dividing that by time required to earn those reinforcers. Number of patch changes was the number of times rats chose to switch to the alternative receptacle (“leave” choices) averaged over the last two sessions. Time in patch was the mean duration the rat stayed at one receptacle before leaving for the other. The rejection volume/indifference point was defined as the mean amount of water (µL) available at the abandoned patch when the rat switched to the alternative snout poke location, e.g., if the rat had earned 120 µl before leaving, the leaving volume would be the next volume scheduled for delivery, 96 µl in this example. The optimal rejection volume, based on the MVT, was operationally defined as the volume of water that maximized the average reward volume rate across all patches including the COD travel time. This was calculated as the cumulative rate of return from the patch (µL/s) for each reward, considering the COD length (travel time to the patch) and the 4 s between successive rewards (Fig. 2c). For example, when the COD was 6 s, the cumulative rate of return for the first reward in a patch was 150 µL divided by 6 s (25 µL/s). For the second reward, it was [150 + 120] µL divided by [6 + 4] s (27 µL/s). For the third reward, it was [150 + 120 + 96] µL divided by [6 + 4 + 4] s (26.14 µL/s). In this instance, an optimal strategy predicts leaving after two rewards are obtained and the rejection volume is 120 µL. The percent deviation from the optimal rejection volume (referred to as percent volume deviation) was calculated as follows: (optimal rejection volume − observed rejection volume)/optimal rejection volume x 100. Positive values indicate rats overharvested and stayed in the patch when the volume of the collected reinforcers had dropped below the value required to maximize, whereas negative values indicate the rat left when the volume was greater than the optimal volume. The same approach was used to calculate the percent deviation from optimal time in patch (referred to as percent time deviation). Negative values indicate the observed time in the patch was longer than optimal, and positive values indicate the observed time in the patch was less than optimal.
Statistical Analysis
Statistical analyses used SPSS Statistics software (IBM, Armonk, NY). Descriptive statistics indicated that the distributions of dependent measures were normal (skewness < |1|), so parametric statistics were used throughout.
To examine the effects of COD travel time on patch leaving, we used a mixed factor analysis of variance (ANOVA), with delay as the within-subject factor (0, 6, 12, 18, and 24 s) and sex as the between-subject factor (male and female), in conjunction with post-hoc comparisons using Fisher’s least significant difference (LSD) test for within-subject comparisons and independent samples t-tests with Bonferroni corrections for between-subject comparisons.
To examine whether analyses applied to traditional DD tasks was possible, hyperbolic equations were fitted to each rat’s rejection volume (indifference points) based on that described by Mazur 79, using GraphPad Prism (GraphPad Software Inc., San Diego, CA):
V = bA/(1 + kD)
where V indicates the rejected volume of the diminishing reinforcer when the rat left the current patch for the alternative patch in µl, A represents the amount of water from the alternative patch (150 µL), and D represents the delay to receiving the 150-µL reinforcer (COD of 0, 2, 4, 8, 16, or 24 s). The bias parameter, b, was calculated such that the product of b and A equaled each animal’s indifference point at a 0-s delay80. The discount parameter (k) is an index for the rate of discounting or overall sensitivity to delayed reinforcers, such as the first reinforcer in an alternative patch. In DD tasks, larger values of k indicate steeper discount functions, stronger aversion to delayed reinforcers, more rapid devaluation of reinforcer value by delay, and thus greater impulsive choice. Here, k indicates higher relative levels of overharvesting and a preference for the smaller, sooner rewards available in the current patch over traveling to an alternative patch. The normalized area under the curve (AUC) of the discount function was calculated, which summarizes the influence of delay length on the choice to remain at a patch location. The AUC measure provides a simple measure of overharvesting/discounting that is not tied to a particular discount function81. Smaller AUC values indicate higher levels of overharvesting. The k, b, and AUC values were analyzed using independent samples t-tests, with sex as the between-subject factor.
Composite scores for all variables were calculated by the sum of the variable across all delays. These composite scores were used to calculate Pearson’s correlation coefficients. To assess the difference in the strength of their association with DD and reward maximization, correlation coefficients were compared using Fisher r-to-z transformations29,30, which is recommended for comparing correlation coefficients from the same sample with one variable in common82. For all statistical tests, a p < .05 was used as the alpha criterion.
Table 1
Pearson Correlation Matrices for Male and Female Rats
| r value | |
AUC | VD | TD | TIP | PC | WR | |
Males (n = 896) | |
k | −0.76** | 0.56** | −0.05 | 0.32** | −0.30** | 0.36** | |
AUC | | −0.56** | −0.27** | −0.50** | 0.45** | 0.09** | |
VD | | | −0.79** | −0.86** | −0.90** | 0.05 | |
TD | | | | −0.92** | 0.87** | 0.37** | |
TIP | | | | | −0.89** | −0.33** | |
PC | | | | | | 0.20** | |
| Females (n = 898) |
k | −0.73** | 0.64** | −0.13** | 0.43** | −0.35** | 0.26** | |
AUC | | −0.59** | 0.29** | −0.52** | 0.47** | 0.18** | |
VD | | | −0.75** | 0.86** | −0.88** | 0.03 | |
TD | | | | −0.89** | 0.88** | 0.36** | |
TIP | | | | | −0.90** | −0.30** | |
PC | | | | | | 0.26** | |
Note. k, free parameter discounting index; AUC, area under the curve for indifference points; VD, volume deviation; TD, time deviation; TIP, time in patch; PC, number of patch changes; WR, water rate of reinforcement *Correlation is significant at the 0.05 level. ** is significant at the 0.01 level (two-tailed analysis).
Table 2
z-Scores for Relative Strengths of Correlations between Delay Discounting and Reward Maximization Metrics in Male and Female Rats
Comparison | z-score |
TIP | PC | WR |
Males |
k vs. VD | 42.97** | 30.33** | 10.14** |
k vs. TD | 39.58** | −32.81** | -0.23 |
AUC vs. VD | 12.58** | 33.10** | 0.67 |
AUC vs. TD | 19.49** | −15.90** | −5.52** |
| Females |
k vs. VD | −23.43** | 28.63** | 8.20** |
k vs. TD | 37.44** | −34.65** | −2.16* |
AUC vs. VD | −31.36** | 31.63** | 2.54* |
AUC vs. TD | 18.80** | -19.13** | −4.76** |
Note. TIP, time in patch; PC, number of patch changes; WR, water rate of reinforcement; k, free parameter impulsivity index; VD, volume deviation; TD, time deviation; AUC, area under the curve for indifference points; Significant difference between correlation coefficients using the z-test procedure (*p < 0.05; **p < 0.01) outlined by Meng et al. (1992) and the R Cocur package (Diedenhofen & Musch, 2015).