The agony of choice: Comparing abundance estimates from multiple N-mixture model variants for a dataset of reptile observations

doi:10.21203/rs.3.rs-4676496/v1

Download PDF

Research Article

The agony of choice: Comparing abundance estimates from multiple N-mixture model variants for a dataset of reptile observations

https://doi.org/10.21203/rs.3.rs-4676496/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Ecological surveys rarely achieve perfect detection of target species, and failure to account for imperfect detection can produce erroneous estimates of abundance. N-mixture models account for variation in detectability by separating the observation process from the ecological process that determines true site-level abundance, making these models theoretically well-suited to studies of inconspicuous species, such as reptiles. Multiple N-mixture model variants have been published, but little is known about their ability to provide ecologically realistic abundance estimates from real-world observation data. Given their novelty and potential for wider use, studies that help users decide which variant to use in a particular case would be valuable. If different, yet data-appropriate N-mixture model variants provide substantially incongruent abundance estimates for the same dataset, then their uncritical use in ecology is problematic. Using a dataset of reptile observations from south-eastern Zimbabwe, we compare the estimates of five N-mixture model variants. For each species, we assess congruence between the site-level abundance estimates of each variant. We then use a novel metric to assess the performance of each model variant based on the precision and ecological feasibility of its abundance estimates, accounting for goodness-of-fit. We find that model variant pairs were rarely congruent in their abundance estimates, and that model performance varies significantly according to species occupancy and detection probability. We provide a framework for the application of multiple N-mixture model variants in faunal ecology to guide analytical decision-making.

Ecological modelling

model selection

population monitoring

herpetology

time-to-detection

detection/non-detection

It is now widely accepted that ecological surveys rarely, if ever, achieve perfect detection of the target species (Gu and Swihart 2004; Mazerolle et al 2007; Kellner and Swihart 2014; Dénes et al 2015). Failure to account for the imperfect detectability of species can yield biased abundance estimates that mask significant trends and produce inaccurate predictions of species distributions, with profound implications for conservation and management (Royle et al 2005; Kéry and Schmidt 2008; Tingley and Beissinger 2013; Lahoz-Monfort et al 2014; Karenyi et al 2018). Occupancy and N-mixture models have improved our ability to accommodate imperfect detection by separating the observation process from the ecological process determining true site-level abundance (MacKenzie et al 2002; Royle 2004; Kellner and Swihart 2014; Halstead et al 2021). These models account for the fact that individuals may be unavailable for detection due to variation in detectability, rather than genuine absence from a site (MacKenzie et al 2002). This makes them theoretically well suited to studies of inconspicuous species with naturally low detection probabilities, such as reptiles (Mazerolle et al 2007; Ward et al 2017; Ficetola et al 2018a).

The poor detectability of many species is a central limitation to the advance of reptile ecology (Mazerolle et al 2007; Steen 2010; Durso et al 2011). While detection probabilities of reptiles vary across species, habitats, and survey methods, they are routinely low (e.g., < 0.01–0.17 in Steen et al 2012; 0.03–0.46 in Durso et al 2011). N-mixture models provide estimates of latent abundance (N) and are thus more directly suited to population monitoring than occupancy models (Royle 2004). With 21% of reptile species currently listed as threatened (Cox et al 2022), the implementation of effective survey methods and robust analyses is crucial for population monitoring. Given their ability to accommodate variation in detectability, several N-mixture models have been used to evaluate spatial and temporal trends in the relative abundance of reptile species. Many of these studies have incorporated Royle’s (2004) static N-mixture model for spatially replicated count data, with Poisson, zero-inflated Poisson (ZIP), or negative binomial (NB) distributions (e.g., Couturier et al 2013; Doré et al 2013; Buckland et al 2014; Ward et al 2017; Angeli et al 2018; Ficetola et al 2018a; Duchesne et al 2023). The Royle-Nichols (RN) model (Royle and Nichols 2003), which exploits the relationship between abundance and detection probability to provide parameter estimates from detection/non-detection (DND) data, has also been successfully implemented (e.g., Ariefundy et al 2013; Erb et al 2015; Hu et al 2019). The time-to-detection (TTD) N-mixture model estimates abundance and detection probability as a function of the time taken to encounter the first individual in a survey (Strebel et al 2021). To our knowledge, a TTD approach has not been trialled for reptiles, but its theoretical efficiency and performance in studies on birds (Henry et al 2020) and amphibians (Halstead et al 2018, 2021) is promising.

Despite their potential for wider use, little is known about the capacity for N-mixture models to provide robust abundance estimates for species with very low detection probabilities (i.e., < 0.1). The utility of these estimates in studies of cryptic reptiles has been questioned (Steen 2010; Steen et al 2012), and covariate combinations selected using established model ranking criteria often provide ecologically unrealistic abundance estimates (Koetke et al 2024), especially for rare and specialised species (Joseph et al 2009). What is more, the fact that a single survey can yield multiple types of data (e.g., detection/non-detection and count data) further complicates choice of model variant. Although goodness-of-fit is an important measure of the reliability of N-mixture models that might be used to choose a model variant, and functions have been developed that allow users to test the goodness-of-fit of individual model variants (MacKenzie and Bailey 2004; Knape et al 2018), these are not compatible across all variants due to underlying differences in model formulation. Moreover, due to their relative novelty, critical comparisons of multiple N-mixture model variants are rare (Dénes et al 2015), both in general and especially for species with low detection probabilities. As such, studies that help users decide which variant to use in their case would be valuable. If different, yet data-appropriate N-mixture model variants provide substantially incongruent abundance estimates for the same dataset, then their uncritical use in ecology is problematic.

Here, we compare the performance of five N-mixture model variants by sampling and modelling the abundance of reptiles in south-eastern Zimbabwe. We assess (i) whether different N-mixture model variants provide ecologically realistic abundance estimates of African savanna reptiles; (ii) whether the abundance estimates of the different N-mixture model variants are congruent; and (iii) whether model performance varies according to underlying ecological attributes (i.e., detectability) of the species in question, thus restricting suitability of particular variants to species with particular attributes.

Data handling

All data processing, model fitting, analysis, and plotting was conducted in R (v. 4.3.2, R Core Team 2023), using functions in the following packages: unmarked (v. 1.2.5, Fiske and Chandler 2011 and Kellner et al 2023), AICcmodavg (v. 2.3.3, Mazerolle 2023), ggplot2 (v. 3.4.4, Wickham 2016), and GGally (v. 2.2.1, Schloerke et al 2024). The map in Fig. 1 was created in QGIS (v. 3.10.7, QGIS Development Team 2020).

Study area

Our study took place on Malilangwe Wildlife Reserve (MWR), Chiredzi District, Masvingo Province, Zimbabwe. The reserve spans 46 730 ha of largely intact savanna habitat with a history of cattle ranching and cotton cultivation, and is now managed for ecotourism and conservation. Megafauna are abundant on MWR. The climate of the reserve is characterised by a cool, dry period from April to August, a hot, dry period from September to October, and a hot, wet period from November to March. Mean minimum monthly temperature ranges from 13.4°C in July to 23.7°C in December, while mean maximum monthly temperature ranges from 23.2°C in June to 33.9°C in November (Clegg and O’Connor, 2012). The reserve receives a mean annual precipitation of 560 mm, occurring predominantly during the hot summer months (Clegg and O’Connor, 2012).

Site selection

We selected 21 survey sites separated across seven structurally diverse vegetation communities on MWR (Fig. 1). These habitats span a precipitation gradient and reflect both natural and anthropogenic variation in habitat structure, described and mapped by Clegg and O’Connor (2012). We selected vegetation types 2, 3, 4, 9, 10, 18, and 22 for our surveys (see Clegg and O’Connor 2012 for a detailed description of each). The poor detectability of African savanna reptiles, the diverse reptile assemblage and high local habitat heterogeneity on MWR combine to constitute an ideal study system.

Rasters of the vegetation types were generalised in TerrSet 2020 (v. 19.0.6, Clark Labs 2021) such that only contiguous patches at least 20 ha in area remained. Roads were rasterised and a 300 m buffer was applied to account for site accessibility. The 21 required sites were plotted randomly within the intersection of the accessibility buffer and the generalised habitat patches in QGIS, with a distance of at least 250 m between sites to ensure spatial independence of detections. Each point represented the north-west corner of a one-hectare (100 x 100 m) survey block.

Visual encounter surveys

Between 7 November and 3 December 2022, we actively searched each site five times for reptiles. Each survey lasted 60 minutes and involved KMvW and an assistant walking around the block, searching for active reptiles and reptiles hiding under rocks, logs, or other debris. Moved cover objects were immediately replaced as found, and a minimum period of 48 hours was left between visits to the same site to avoid disturbance. A waypoint was noted at each reptile detection using a GPS logging smartphone app (BasicAirData GPS Logger, v. 3.1.2), including the location, time, species, and number of individuals observed. We took note of physical features, as well as location and behaviour, to avoid double-counting individuals within surveys. We assumed that reptile populations were closed within the survey period. To account for variation in detectability, survey order was randomised such that the start times of each of the five surveys per site differed by at least 60 minutes. Surveys were conducted between 07:00 and 17:00 under all weather conditions except heavy rain.

Covariate measurements

We measured site and observational covariates to account for variance in our models. We identified 17 habitat variables (bare ground cover, leaf litter cover, woody debris cover, herbaceous cover, maximum herb height, soil depth, soil sand content, soil clay content, soil silt content, total outcrop cover, total canopy cover, total shrub cover, Vachellia tortilis canopy dominance, Vachellia tortilis shrub dominance, scaled tree count, maximum tree height, and maximum shrub height). We consolidated these by calculating the Pearson’s correlation coefficient (r) for each pairwise comparison. Strongly correlated variables (|r| > 0.7) were interpreted as reflecting shared ecological processes, and the variable with the greatest perceived relevance to the study of habitat occupancy was retained while the other was discarded. We retained 13 variables. Variables and measurement methods are summarised in Supplementary Table S1.

Weather data were gathered for each survey in order to quantify four observational covariates for each reptile detection. Ambient temperature and relative humidity were recorded every half hour by a wireless personal weather station (Davis Vantage Pro2 Plus), installed at Headquarters on MWR (Fig. 1). Site-level precipitation was interpolated from daily readings of 22 rain gauges installed across the property, using a triangulated irregular network (TIN) in QGIS. As surveys were not conducted in heavy rain, cumulative 48-hour precipitation was selected as the default precipitation covariate. Time-of-day was measured as the duration in minutes between sunrise and each reptile observation. However, because ambient temperature was autocorrelated with relative humidity (|r| = 0.83–0.86) and time-of-day (|r| = 0.67–0.70), we excluded the latter two covariates.

Model fitting

We fitted the time-to-detection (TTD) N-mixture model (Strebel et al 2021), Royle-Nichols (RN) N-mixture model (Royle and Nichols 2003), and three variants (Poisson, zero-inflated Poisson [ZIP], and negative binomial [NB] distributions) of Royle’s (2004) N-mixture model for all species detected in at least five surveys. Covariates were centred and scaled prior to modelling. Ambient temperature and cumulative 48-hour precipitation were included as observational covariates for all species. In general, for each species, we selected a combination of site covariates that we perceived to be ecologically relevant and modelled these with and without observational covariates (see Supplementary Tables S2a–f for models and a per-species justification on covariate selection). Null models were also included for each species. We then ranked models by the Akaike information criterion corrected for small sample sizes (AIC_c, Hurvich and Tsai 1995). Models with ΔAIC_c ≤ 2 relative to the most parsimonious model (i.e., lowest AIC_c) were selected for abundance estimation (cf. Burnham and Anderson 2004). We tested goodness-of-fit (GOF) of the RN, Poisson, ZIP, and NB models using the MacKenzie-Bailey GOF test (MacKenzie and Bailey 2004) with 1 000 parametric bootstrap samples. Where multiple models fell within the averaging criterion (ΔAIC_c ≤ 2), we tested GOF of the global model. We accounted for overdispersion by multiplying the standard error (SE) of the abundance estimates by the square root of the dispersion parameter (ĉ) obtained from the GOF test. No GOF test is currently available for the TTD N-mixture model, a limitation that we address in the discussion.

Model comparisons

(i) Ecologically reasonable abundance estimates

We recorded the site-level abundance estimates of each model for each species (Supp. Table S4). We compared these to reasonable abundance estimates drawn from available literature (e.g., Jacobsen 1989, Branch 1998; Meiri 2024), expert opinion, and extensive personal field observations. As species abundance is unlikely to be similar across occupied sites, we noted a range of plausible estimates. Where published population density estimates were not available, we first referred to estimates from related species. For most species, however, the limits of the range of ecologically reasonable abundance estimates were informed by interspecific variation in adult body size, which broadly correlates with home range size and dispersal ability, and, thus, population density (Perry and Garland 2002; Doherty et al 2019). We also noted the expected number of truly unoccupied sites (i.e., sites where expected abundance = 0), inferred from current knowledge on habitat requirements of the individual species. Per Joseph et al (2009), we visualised the ecological feasibility of the five N-mixture model variants by producing histograms of the abundance estimates for a representative selection of species, showing proximity to expected values (Fig. 2).

(ii) Congruence

We assessed congruence of the N-mixture models by conducting paired t-tests of the log₁₀-transformed site-level abundance estimates for each species. Pairs of models were considered incongruent if the test rejected a null hypothesis that the mean difference between estimates was equal to zero (see Supplementary Table S3 for test results). We recorded r, r², and the equation of the fitted line to illustrate proportionality between paired model estimates per species (Supplementary Table S4).

(iii) Performance in the context of detectability

Because the five selected N-mixture model variants are not all comparable by existing model ranking methods or goodness-of-fit tests, we developed a simple comparative metric to assess model performance, taking ecological feasibility and model fit into account:

$$\text{P}\text{e}\text{r}\text{f}\text{o}\text{r}\text{m}\text{a}\text{n}\text{c}\text{e}=\left|1-\frac{\begin{array}{c}no.reasonable abundance\\ estimates | present\end{array}}{\begin{array}{c}|no. estimated presences\\ -no. expected absences|\end{array}}\right|+ \frac{\stackrel{-}{\text{S}\text{E} | \text{p}\text{r}\text{e}\text{s}\text{e}\text{n}\text{t}}}{\stackrel{-}{\text{a}\text{b}\text{u}\text{n}\text{d}\text{a}\text{n}\text{c}\text{e} | \text{p}\text{r}\text{e}\text{s}\text{e}\text{n}\text{t}}}$$

In the first component of the metric, the numerator represents the proportion of predicted occupied sites with ecologically realistic abundance estimates. If the model matched our expected number of occupied sites (the denominator) and provided realistic abundance estimates at all of those sites, then the fraction would equal |1|. In the second component of the metric, the fraction represents the mean standard error (SE) of the abundance estimates at all assumed occupied sites divided by the mean abundance at all assumed occupied sites. As SE is inflated for overdispersed models (i.e., ĉ > 1), this component penalises the model for imprecise abundance estimates. Thus, a perfect model would have the first component of the metric equal 0 and the second component equal some very small number, such that model performance decreases as the score increases. A model that provides unrealistic abundance estimates or predicts an incorrect number of occupied sites would cause the first component of the metric to increase. A model with high SE (either inherently or after accounting for overdispersion) would cause the second component of the metric to increase. However, the metric allows models to exceed the expected number of occupied sites, provided that the abundance estimates at these sites are ecologically realistic.

To assess the relationship between performance and detectability, we obtained mean occupancy (ψ) and detection probability (p) estimates for each species by fitting the MacKenzie et al (2002) occupancy model. For these models, we selected the same combination of covariates identified in the N-mixture model ranking process. We tested whether N-mixture model performance scores varied significantly according to model variant or species detectability (ψ and p), using the Kruskal-Wallis H test and Mann-Whitney U test, respectively. Additionally, we noted the behaviour of each model variant in relation to the individual components of the metric.

We recorded 1 340 observations of 25 reptile species, of which 16 were observed in at least five surveys. We obtained at least one acceptable model (well-fitted and/or with reasonable, non-null abundance estimates) for the nine most frequently detected species, all of which were lizards (Afroedura transvaalica, Gerrhosaurus flavigularis, Hemidactylus tasmani, Lygodactylus capensis, Panaspis maculicollis, Platysaurus intermedius rhodesianus, Trachylepis damarana, Trachylepis margaritifera, and Trachylepis striata). Covariate combinations for each of these nine species (selected by AIC_c) did not vary substantially between variants, but observational covariates were rarely included in the best models. Models for the remaining seven species either did not converge, were very poorly fitted, or provided erroneous abundance estimates (e.g., maximum abundance ≤ 0.5, or abundance < SE when adjusted for ĉ).

The abundance estimates from different N-mixture model variants were rarely congruent (Supplementary Table S3). While we could not assess goodness-of-fit of the TTD variant, and thus could not penalise it for possible overdispersion, its estimates were congruent with those from the RN variant in seven out of nine species, and the RN variant was typically well fitted. Similarly, the Poisson and ZIP variants were also congruent for seven out of nine species, but both tended to be poorly fitted. Although the NB variant was often well fitted (and more parsimonious than the other two count variants), it very rarely provided congruent abundance estimates. The tendency for pairs of model variants to be congruent varied somewhat according to the detectability of the modelled species (Fig. 3). Abundance estimates from all variants were most congruent for A. transvaalica, but we attribute this to a high frequency of very low estimates. Estimates for two of the most widespread species (P. maculicollis, and T. striata) were also largely congruent (six out of nine and five out of nine variant pairs, respectively) and were more ecologically realistic.

We detected substantial variation in the ability of the five model variants to produce ecologically reasonable abundance estimates. This variation was primarily interspecific, but in some cases reflected species-by-model variation (Fig. 2). While model performance scores varied significantly according to occupancy (W = 1900, p < < 0.05) and detection probability (W = 1225, p < < 0.05), it did not vary significantly between model variants (X² = 3.37, df = 4, p > 0.05). Across variants, performance tended to increase with species detectability, but was poor for species with very high p (Fig. 4).

Few models accurately predicted the number of true absences (as inferred from known habitat requirements of each species), except for two inconspicuous habitat specialists (A. transvaalica and H. tasmani). The TTD variant frequently overestimated absence, while the Poisson, ZIP, and NB variants tended to overestimate presence. The predicted number of occupied sites was lower than expected in G. flavigularis and T. damarana, implying that these species were more localised than we expected. This was reflected in a high frequency of predicted absences for these two species by the TTD, RN, Poisson, and ZIP model variants.

The TTD and RN variants tended to underestimate abundance in all species, although the RN variant overestimated absences less frequently than the TTD variant. The Poisson and ZIP variants tended to overestimate the presence of habitat specialists, although the resulting abundance estimates were often within a reasonable range. The NB variant tended to overestimate abundance, sometimes severely. In all model variants, the most ecologically realistic abundance estimates were obtained for moderately conspicuous habitat generalists (L. capensis and T. striata), while estimates for highly conspicuous habitat specialists (P. intermedius rhodesianus and T. margaritifera) were surprisingly poor.

Our results confirm that N-mixture models can provide ecologically reasonable abundance estimates for some reptile species, but that choice of model variant can have important implications for estimates of abundance. We found that overall congruence between model variants was rare, although there were exceptions (e.g., TTD-RN and Poisson-ZIP variant pairs were generally congruent in their abundance estimates). Our results show that N-mixture model performance across variants increased significantly with species detectability, although models performed very poorly for the two most conspicuous species (i.e., highest p). Importantly, we demonstrate that individual aspects of model performance varied between model variants, such that blanket application of a single model variant to a suite of ecologically diverse species has the potential to introduce artefactual variation in abundance estimates.

The five N-mixture model variants we tested performed best for widespread, fairly easy-to-find species, being well-fitted and providing congruent abundance estimates. For species with lower occupancy and detection probability, we tended to observe considerable variation in the performance of each model variant, suggesting that specific variants may provide substantially better abundance estimates for these species. However, the relative performance of individual model variants was inconsistent between ecologically similar species. What is more, some models that ranked highly according to our performance metric were either poorly fitted or provided erroneous abundance estimates (e.g., excessive presence estimates of habitat specialists). We attribute this to the rare instances where presence estimates were higher than expected but models were well fitted and generated reasonable abundance estimates, and thus were not penalised. As such, a priori matching of species and N-mixture model variants based on individual performance criteria is difficult, highlighting the risks of using a single variant to estimate abundance. Nonetheless, we observed a number of interactions between model variant, aspects of model performance, and species detectability that are important for users to consider.

Within variants, models selected by AIC_c ranking were generally well-fitted and included ecologically relevant site covariates. However, parsimony did not necessarily imply that the models provided ecologically reasonable abundance estimates. Due to underlying differences in model variant formulation, we cannot comment on differences in parsimony between TTD, RN, and count variants. But, within the count variants, which are directly comparable by AIC_c, the NB variant often provided unreasonably high abundance estimates despite frequently ranking as the most parsimonious count variant. Despite that fact, the NB variant performed well for some species with moderate to high occupancy and detection probabilities (Fig. X), providing reasonable abundance estimates for G. flavigularis, L. capensis, and T. damarana. However, the tendency for the NB variant to severely overestimate abundance, especially for rare and inconspicuous species and with few survey replicates, has been widely observed (Joseph et al 2009; Couturier et al 2013; Dennis et al 2015; Kéry 2018). Thus, we reaffirm that additional measures, such as interrogation of the ecological feasibility of estimates, should be incorporated in N-mixture model selection and interpretation (cf. Joseph et al 2009; Koetke et al 2024).

The nine lizard species we modelled occur widely in Zimbabwe and their ecology is comparatively well known (Branch 1998; Howard and Hailey 1999; Jacobsen and Broadley 2000, Pietersen et al 2021; Stander 2023). However, literature on their respective population densities is sparse (Meiri 2024). While we were able to deduce an informed range of ecologically feasible abundance estimates, the upper limit of this range is disputable. As such, congruence between multiple N-mixture model variants suggested that our ranges were valid, but well-fitted models with abundance estimates above our expected upper limit were still informative. For example, we observed the best performance scores in the models for P. maculicollis, an inconspicuous but widespread species. The TTD, Poisson, ZIP, and NB model variants predicted an upper abundance limit of approximately 71–77 individuals per hectare, with most sites predicted to have 13–14 individuals. This constitutes a higher-than-expected maximum, but one which may be ecologically reasonable, given the habitat heterogeneity within the study area. Had we only selected the RN variant (which was well-fitted but scored lowest according to our performance metric) to estimate abundance of this species, we would have concluded that most sites host around three individuals, which is unlikely given the diminutive body size of this species. This demonstrates how a comparison of multiple N-mixture model variants can improve our confidence in abundance estimates when prior knowledge of species density is lacking, as is typical for reptiles.

Comparing the outputs of multiple variants also aids in determining whether or not the study species are amenable to N-mixture modelling in the first place. Estimating the abundance of conspicuous habitat specialists (P. intermedius rhodesianus and T. margaritifera) proved to be problematic for all of the N-mixture model variants we tested. These species were encountered in every survey at sites where they were present. As such, they demonstrated zero heterogeneity in detection probability. This situation renders the RN variant useless, as there is little heterogeneity from which to generate an abundance estimate, apart from the dichotomy of occupied versus unoccupied sites. In our study, the RN variant provided few reasonable estimates for these species, and even estimated abundances of zero at sites where the species are known to be present. Detection times were consistently short enough to cause a similar effect in the TTD variant as well. While the three count variants produced some reasonable abundance estimates for these species, they demonstrated lack of fit (ĉ = 0 or ĉ >> 1) and erroneously predicted that the species were present at most sites. As such, it appears that there are critical natural parameter values at which N-mixture models tend to become unreliable. A traditional capture-mark-recapture (CMR) design may be a more appropriate method of obtaining abundance estimates of reliably detectable species. For such species, we recommend that efforts be directed at increasing capture efficiency, rather than computing the influence of variation in detection probability (which may be marginal).

We share the experience of other herpetologists (e.g., Steen 2010, Steen et al 2012) in that we were unable to fit reliable models for reptiles with very low detection probabilities (p < ~ 0.25). For the seven species which were not included in the model comparison exercise, we attribute poor models either to low detection rates (i.e., insufficient to fit a non-null model) or limitations in accurately measuring relevant covariates. The latter issue arose for enigmatic species (e.g., Mochlus sundevallii) but may also explain the poor performance of models for conspicuous habitat specialists, where sites appeared structurally heterogeneous although we observed little heterogeneity in detectability. Additionally, while we observed four snake species in our surveys, only one (Psammophis subtaeniatus) was encountered more than once. As this species is a highly mobile habitat generalist, we were unable to construct acceptable models for it. We reaffirm that obtaining sufficient detections and capturing relevant environmental data are central challenges to snake ecology.

The selection and accurate measurement of ecologically relevant covariates is both critical and a common challenge in N-mixture modelling (Angeli et al 2018; Ficetola et al 2018b). The fact that observational covariates were rarely included in the best performing models either suggests that remote sensing of the climatic variables was inadequate, or that climatic variation within the survey period was insufficient to explain heterogeneity in detection probability. It is also possible that moving cover and inspecting rock cracks during our surveys reduced the significance of these observational covariates, as some reptiles were detected when they were inactive (notably, nocturnal species such as A. transvaalica and H. tasmani). As the ecology of many reptile species is poorly known (Tingley et al 2016; Meiri 2024), dependencies on particular habitat variables may not be obvious, and it is unclear what measurement scale is required to generate reliable abundance estimates. Within a reptile community, species/individuals are likely to have very different body sizes and dispersal capabilities and are thus affected by environmental conditions at different scales (Doherty et al 2019). This creates a ‘Catch-22’, as N-mixture models have been cited as a promising tool for studying secretive species hampered by low detection probabilities, yet we lack the prior knowledge required to select valid covariates. Parameter estimates from N-mixture models are further limited by how accurately we can capture heterogeneity in detection across sites and between surveys (Barker et al, 2017; Goldstein and de Valpine 2022), but we may have no notion of whether these estimates are realistic or not.

How, then, can we best apply N-mixture models in reptile community ecology? While goodness-of-fit is an empirically crucial measure of model performance (Knape et al 2018), our results show that reliance on other, individual aspects of N-mixture model performance (parsimony, precision and ecological feasibility of estimates, and perceived relevance of covariates) may have serious implications in determining the acceptance or rejection of abundance estimates. Comparing multiple model variants eases our reliance on these factors in interpreting model outputs. It is unlikely that a single methodological framework will ever be appropriate for all members of a reptile community (Foster et al 2012), and the same applies to N-mixture models (Ficetola et al 2018b). When data on behavioural ecology and population density are lacking, model comparison may also allow us to infer reasonable site-level abundance estimates from a range of possible values. In such a situation, incongruence between model estimates is informative rather than problematic.

While different N-mixture model variants may indeed provide substantially incongruent abundance estimates, this does not invalidate their value in ecology. Rather, our results indicate that individual model variants are likely suitable to different species and different datasets, but this suitability is not inherently obvious. As the field of N-mixture modelling is growing in popularity and complexity, we recommend that researchers continue to compare multiple model variants to account for differences in model performance associated with species detectability. This, in turn, may inform future studies on ecologically similar species by providing context that is relevant to model selection and interpretation. In our experience, this is achievable at little extra cost to standard site occupancy surveys, simply requiring additional time spent on data processing and analysis. Comparing multiple N-mixture model variants may also indicate whether abundance estimates are sufficiently consistent to be used in population monitoring. Even if absolute abundance is not estimable by N-mixture models, relative abundance may still be a viable tool for interpreting ecological processes governing the distribution of species (Barker et al 2017; Goldstein and de Valpine 2022). If a single model variant is chosen for this purpose, then its congruence with other variants can be quantified and reported as demonstrated in this study. For enigmatic species, this comparative approach brings us a step closer towards identifying and understanding ecologically relevant covariates, convening on reasonable abundance estimates.

Finally, comparing the outputs of multiple model variants aids in determining whether an N-mixture approach is appropriate to studying the species at hand or not. Given that N-mixture models are cost-effective in terms of data collection, an alternative methodology, such as CMR, could be applied in the same system. We acknowledge the limitations of our data and predict that improved knowledge on species detectability and ecology, particularly with regards to appropriate covariate selection and measurement, may indeed allow users to match species with appropriate N-mixture model variants or alternative analyses a priori.

Filter species by sample size: Although N-mixture models are theoretically applicable to rare species, detections must be sufficiently high to fit sound models. In our case, we were only able to model species with a detection rate greater than 0.1 (i.e., detected in at least 11/105 surveys). Filtering out species with very low detection rates reduces uncertainty in interpreting covariate effects on abundance and detection probability.

Fit and select suites of appropriate models for comparison: Fit data-appropriate N-mixture model variants to the filtered dataset. For each species, select the most parsimonious model within each variant, provided it is reasonably well-fitted (i.e., accounting for mild overdispersion does not yield very poor parameter estimates) and includes ecologically reasonable covariates. As it is not possible to test whether multiple null models are statistically congruent, they must be ignored unless there is sufficient ecological evidence to support the validity of their estimates (e.g., in the case of wide-ranging generalists that occupy large home ranges).

Test for congruence: Conduct paired t-tests of the log₁₀-transformed site-level abundance estimates for each species. If all variant pairs are congruent in their abundance estimates, and if these estimates align with expected values based on existing ecological knowledge of the study species, then the estimates are well-supported and may be accepted. If the variants are incongruent, then interrogate their respective performance in terms of ecological realism and goodness-of-fit.

Choose the best performing model: If the abundance estimates from multiple N-mixture model variants are significantly incongruent for the same species, then retain the most ecologically realistic and well-fitted model(s). Recall that goodness-of-fit strongly influences our confidence in the model’s estimates, and the precision of our parameter estimates is scaled accordingly.

Review: The outcome of the model comparison exercise may reveal that the sampled species are not amenable to N-mixture modelling. Therefore, if the performance of all model variants is poor, either gather more data or attempt an alternative analysis. For species with very high detection probabilities, resampling the population using a CMR protocol may be worth the effort.

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Funding

This work was supported by the Malilangwe Trust.

Author Contribution

Both authors contributed to the study conception and design. KMvW conducted the fieldwork, analysed the data, and wrote the initial manuscript. Both authors discussed the results and contributed to the final manuscript.

Acknowledgement

We thank the Malilangwe Trust for funding and facilitating the fieldwork that yielded the dataset analysed in this article. We thank Dr Bruce Clegg for his valuable insight into the ecology and history of Malilangwe Wildlife Reserve, and for his assistance in site selection and mapping, and coordinating fieldwork. We thank Ben Tsuvuka for his vital assistance in conducting the surveys.

Data Availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Angeli NF, Lundgren IF, Pollock CG, Hillis‐Starr ZM, Fitzgerald LA (2018) Dispersal and population state of an endangered island lizard following a conservation translocation. Ecol Appl 28(2):336–347. https://doi.org/10.1002/eap.1650
Ariefiandy, A, Purwandana, D, Seno, A, Chrismiawati, M, Ciofi, C, and Jessop, T S (2014) Evaluation of three field monitoring-density estimation protocols and their relevance to Komodo dragon conservation. Biodivers Conserv 23:2473–2490. https://doi.org/10.1007/s10531-014-0733-3
Barker RJ, Schofield MR, Link WA, Sauer JR (2017) On the reliability of N‐mixture models for count data. Biometrics 74(1):369–377. https://doi.org/10.1111/biom.12734
Bornand CN, Kéry M, Bueche L, Fischer M (2014) Hide‐and‐seek in vegetation: time‐to‐detection is an efficient design for estimating detectability and occurrence. Methods Ecol Evol 5(5):433–442. https://doi.org/10.1111/2041-210X.12171
Branch WR (1998) Field Guide to Snakes and Other Reptiles of Southern Africa. Struik, Cape Town.
Buckland S, Cole NC, Aguirre-Gutierrez J, Gallagher LE, Henshaw SM, Besnard A, Tucker RM, Bachraz V, Ruhomaun K, Harris S (2014) Ecological effects of the invasive giant Madagascar day gecko on endemic Mauritian geckos: applications of binomial-mixture and species distribution models. PLoS One, 9(4):e88798. https://doi.org/10.1371/journal.pone.0088798
Burnham KP, Anderson DR (2004) Multimodel inference: understanding AIC and BIC in model selection. Sociol Method Res 33(2):261–304. https://doi.org/10.1177/0049124104268644
Clegg BW, O'Connor TG (2012) The vegetation of Malilangwe wildlife reserve, south-eastern Zimbabwe. Afr J Range For Sci 29(3):109–131. https://doi.org/10.2989/10220119.2012.744352
Couturier T, Cheylan M, Bertolero A, Astruc G, Besnard A (2013) Estimating abundance and population trends when detection is low and highly variable: a comparison of three methods for the Hermann's tortoise. J Wildlife Manage 77(3):454–462. https://doi.org/10.1002/jwmg.499
Cox N, Young BE, Bowles P et al (2022) A global reptile assessment highlights shared conservation needs of tetrapods. Nature, 605(7909):285–290. https://doi.org/10.1038/s41586-022-04664-7
Dénes FV, Silveira LF, Beissinger SR (2015) Estimating abundance of unmarked animal populations: accounting for imperfect detection and other sources of zero inflation. Methods Ecol Evol 6(5):543–556. https://doi.org/10.1111/2041-210X.12333
Dennis EB, Morgan BJT, Ridout MS (2015) Computational aspects of N‐mixture models. Biometrics, 71(1):237–246. https://doi.org/10.1111/biom.12246
Doherty TS, Fist CN, Driscoll DA (2019) Animal movement varies with resource availability, landscape configuration and body size: a conceptual model and empirical example. Landscape Ecol 34:603–614. https://doi.org/10.1007/s10980-019-00795-x
Doré F, Grillet P, Thirion JM, Besnard A, Cheylan M (2011) Implementation of a long-term monitoring program of the ocellated lizard (Timon lepidus) population on Oleron Island. Amphibia-Reptilia, 32(2), 159-166. https://doi.org/10.1163/017353710X551381
Duchesne T, Rault PA, Quistinic P, Dufrêne M, Lourdais O (2023) Combining forest exploitation and heathland biodiversity: Edges structure drives microclimates quality and reptile abundance in a coniferous plantation. Forest Ecol Manag 544:121188. https://doi.org/10.1016/j.foreco.2023.121188
Durso AM, Willson JD, Winne CT (2011) Needles in haystacks: estimating detection probability and occupancy of rare and cryptic snakes. Biol Conserv 144(5):1508–1515. https://doi.org/10.1016/j.biocon.2011.01.020
Erb LA, Willey LL, Johnson LM, Hines JE, Cook RP (2015) Detecting long‐term population trends for an elusive reptile species. J Wildlife Manage 79(7):1062–1071. https://doi.org/10.1002/jwmg.921
Foster MS, McDiarmid RW, Chernoff N. (2012) Studying Reptile Diversity. In: McDiarmid RW, Foster MS, Guyer C, Gibbons JW, Chernoff N (eds) Reptile Biodiversity: Standard Methods for Inventory and Monitoring. University of California Press, Berkeley, California, USA, pp 3–5
Ficetola GF, Barzaghi B, Melotto A, Muraro M, Lunghi E, Canedoli C, Lo Parrino E, Nanni V, Silva-Rocha I, Urso A, Carretero MA (2018a) N-mixture models reliably estimate the abundance of small vertebrates. Sci Rep 8(1):10357. https://doi.org/10.1038/s41598-018-28432-8
Ficetola GF, Romano A, Salvidio S, Sindaco R (2018b) Optimizing monitoring schemes to detect trends in abundance over broad scales. Anim Conserv 21(3):221–231. https://doi.org/10.1111/acv.12356
Fiske I, Chandler R (2011) Unmarked: an R package for fitting hierarchical models of wildlife occurrence and abundance. J Stat Softw 43:1–23. https://doi.org/10.18637/jss.v043.i10
Goldstein BR, de Valpine P (2022) Comparing N-mixture models and GLMMs for relative abundance estimation in a citizen science dataset. Sci Rep 12(1):12276. https://doi.org/10.1038/s41598-022-16368-z
Gu W, Swihart RK (2004) Absent or undetected? Effects of non-detection of species occurrence on wildlife–habitat models. Biol Conserv 116(2):195–203. https://doi.org/10.1016/S0006-3207(03)00190-3
Halstead BJ, Kleeman PM, Rose JP (2018) Time-to-detection occupancy modeling: An efficient method for analyzing the occurrence of amphibians and reptiles. J Herpetol 52(4):415–424. https://doi.org/10.1670/18-049
Halstead BJ, Rose JP, Kleeman PM (2021) Time‐to‐detection occupancy methods: performance and utility for improving efficiency of surveys. Ecol Appl 31(3):e2267. https://doi.org/10.1002/eap.2267
Henry DA, Lee AT, Altwegg R (2020) Can time‐to‐detection models with fewer survey replicates provide a robust alternative to traditional site‐occupancy models? Methods Ecol Evol 11(5):643–655. https://doi.org/10.1111/2041-210X.13379
Howard KE, Hailey A (1999) Microhabitat separation among diurnal saxicolous lizards in Zimbabwe. J Trop Ecol, 15(3):367–378. https://doi.org/10.1017/S0266467499000887
Hu Y, Gillespie G, Jessop TS (2019) Variable reptile responses to introduced predator control in southern Australia. Wildlife Res 46(1):64–75. https://doi.org/10.1071/WR18047
Hurvich CM, Tsai CL (1995) Model selection for extended quasi-likelihood models in small samples. Biometrics 51(3):1077–1084. https://doi.org/10.2307/2533006
Jacobsen NHG (1989) A Herpetological Survey of the Transvaal. PhD thesis, University of Natal.
Jacobsen NHG, Broadley DG (2000) A new species of Panaspis Cope (Reptilia: Scincidae) from southern Africa. Afr J Herpetol 49(1):61–71. https://doi.org/10.1080/21564574.2000.9650017
Joseph LN, Elkin C, Martin TG, Possingham HP (2009) Modeling abundance using N‐mixture models: the importance of considering ecological mechanisms. Ecol Appl 19(3):631–642. https://doi.org/10.1890/07-2107.1
Kellner KF, Swihart RK (2014) Accounting for imperfect detection in ecology: a quantitative review. PloS One, 9(10):e111436. https://doi.org/10.1371/journal.pone.0111436
Kellner KF, Smith AD, Royle JA, Kéry M, Belant JL, Chandler RB (2023) The unmarked R package: Twelve years of advances in occurrence and abundance modelling in ecology. Methods Ecol Evol 14(6):1408–1415. https://doi.org/10.1111/2041-210X.14123
Kéry M (2018) Identifiability in N‐mixture models: A large‐scale screening test with bird data. Ecology 99(2):281–288. https://doi.org/10.1002/ecy.2093
Kéry M, Schmidt B (2008) Imperfect detection and its consequences for monitoring for conservation. Community Ecol 9(2):207–216. https://doi.org/10.1556/comec.9.2008.2.10
Knape J, Arlt D, Barraquand F, Berg Å, Chevalier M, Pärt T, Ruete A, Żmihorski M (2018) Sensitivity of binomial N‐mixture models to overdispersion: The importance of assessing model fit. Methods Ecol Evol 9(10):2102–2114. https://doi.org/10.1111/2041-210X.13062
Koetke LJ, Hodder DP, Johnson CJ (2024) Using camera traps and N‐mixture models to estimate population abundance: Model selection really matters. Methods Ecol Evol 15(5):900–915. https://doi.org/10.1111/2041-210X.14320
Lahoz‐Monfort JJ, Guillera‐Arroita G, Wintle BA (2014) Imperfect detection impacts the performance of species distribution models. Global Ecol Biogeogr 23(4):504–515. https://doi.org/10.1111/geb.12138
MacKenzie DI, Bailey LL (2004) Assessing the fit of site-occupancy models. J Agr Biol Env St 9:300–318. https://doi.org/10.1198/108571104X3361
MacKenzie DI, Nichols JD, Lachman GB, Droege S, Andrew Royle J, Langtimm CA (2002) Estimating site occupancy rates when detection probabilities are less than one. Ecology 83(8):2248–2255. https://doi.org/10.1890/0012-9658(2002)083[2248:ESORWD]2.0.CO;2
Mazerolle MJ (2023) AICcmodavg: Model selection and multimodel inference based on (Q)AIC(c). R package version 2.3.3, https://cran.r-project.org/package=AICcmodavg
Mazerolle MJ, Bailey LL, Kendall WL, Royle JA, Converse SJ, Nichols JD (2007) Making great leaps forward: accounting for detectability in herpetological field studies. J Herpetol 41(4):672–689. https://doi.org/10.1670/07-061.1
Perry G, Garland Jr T (2002) Lizard home ranges revisited: effects of sex, body size, diet, habitat, and phylogeny. Ecology 83(7):1870–1885. https://doi.org/10.1890/0012-9658(2002)083[1870:LHRREO]2.0.CO;2
Pietersen D, Verburgt L, Davies J (2021) Snakes and other reptiles of Zambia and Malawi. Penguin Random House, South Africa.
QGIS Development Team (2020) QGIS Geographic Information System. Open Source Geospatial Foundation Project. http://qgis.osgeo.org
R Core Team (2023) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.
Reiner F, Brandt M, Tong X et al (2023) Africa tree cover map [Data set]. Zenodo. https://doi.org/10.5281/zenodo.7764460
Royle JA (2004) N-mixture models for estimating population size from spatially replicated counts. Biometrics 60(1):108–115. https://doi.org/10.1111/j.0006-341X.2004.00142.x
Royle JA, Nichols JD (2003) Estimating abundance from repeated presence–absence data or point counts. Ecology 84(3):777–790. https://doi.org/10.1890/0012-9658(2003)084[0777:EAFRPA]2.0.CO;2
Royle JA, Nichols JD, Kéry M (2005) Modelling occurrence and abundance of species when detection is imperfect. Oikos, 110(2):353–359. https://doi.org/10.1111/j.0030-1299.2005.13534.x
Schloerke B, Cook D, Larmarange J, Briatte F, Marbach M, Thoen E, Elberg A, Crowley J (2024) GGally: Extension to 'ggplot2'. R package version 2.2.1, https://CRAN.R-project.org/package=GGally.
Stander RI (2023) The Reptiles of the Limpopo Province and Kruger National Park: Their ecology, behaviour and distribution. Business Print, Pretoria.
Steen DA (2010) Snakes in the grass: secretive natural histories defy both conventional and progressive statistics. Herpetol Conserv Bio 5(2):183–188.
Steen DA, Guyer C, Smith LL (2012) A case study of relative abundance in snakes. In: McDiarmid RW, Foster MS, Guyer C, Gibbons JW, Chernoff N (eds) Reptile Biodiversity: Standard Methods for Inventory and Monitoring. University of California Press, Berkeley, California, USA, pp 287–294
Strebel N, Fiss CJ, Kellner KF, Larkin JL, Kéry M, Cohen J (2021) Estimating abundance based on time‐to‐detection data. Methods Ecol Evol 12(5):909–920. https://doi.org/10.1111/2041-210X.13570
Tingley MW, Beissinger SR (2013) Cryptic loss of montane avian richness and high community turnover over 100 years. Ecology 94(3):598–609. https://doi.org/10.1890/12-0928.1
Ward RJ, Griffiths RA, Wilkinson JW, Cornish N (2017) Optimising monitoring efforts for secretive snakes: a comparison of occupancy and N-mixture models for assessment of population status. Sci Rep 7(1):18074. https://doi.org/10.1038/s41598-017-18343-5
Wickham H (2016) ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag, New York, USA.

No competing interests reported.

vanWykandMaritz2024NmixturemodelcomparisonsSupplementaryMaterials.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

The agony of choice: Comparing abundance estimates from multiple N-mixture model variants for a dataset of reptile observations

Status:

Version 1

Abstract

Figures

Introduction

Materials and methods

Data handling

Study area

Site selection

Visual encounter surveys

Covariate measurements

Model fitting

Model comparisons

(i) Ecologically reasonable abundance estimates

(ii) Congruence

(iii) Performance in the context of detectability

Results

Discussion

User recommendations

Declarations

Competing interests

Funding

Author Contribution

Acknowledgement

Data Availability

References

Additional Declarations

Supplementary Files

Status:

Version 1