Sample acquisition
We sampled a total of lung tissues (n = 195) from birds salvaged throughout New Mexico, USA representing 32 species within 20 families (Supplementary Table 1). The majority were hunter-donated tissues from two subspecies of sandhill cranes (Antigone canadensis, n = 146). The remaining (n = 49) were salvaged after death by private individuals and wildlife rescue organizations and subsequently donated to the Museum of Southwestern Biology (MSB). Lung tissues were frozen on dry ice and stored in -80°C freezers before being archived in vapor phase nitrogen freezers (-196°C) in the MSB Division of Genomic Resources at the University of New Mexico (UNM). We identified cranes to subspecies (A.c. canadensis and A.c. tabida) in the field using several diagnostic morphological parameters including mass, wing chord length, culmen length, and tarsus length. All birds were sexed internally by inspecting gonads during dissection and aged by the presence and size of the bursa of Fabricious (present in young birds); cranes were additionally aged by plumage and iris coloration. Precise GPS coordinates of sampling locations were not available for many salvaged avian samples. To estimate sample location, we calculated the latitude and longitude centroids of the New Mexico county where each bird was recovered using the R package geosphere48. We additionally recorded the date of salvage to assess temporal variation in fungal communities.
Ecological and behavioral variables of avian hosts
Using published data (AVONET49 database and reported estimates50) and expert knowledge, for each species in the dataset we assigned feeding guild, range size in km2, migration distance in km, and mean hand-wing index (a measure of avian flight ability)51 to each species in the dataset. When available, we included the recorded mass in grams of individual birds; when mass was not recorded, we included the mean species mass as reported in the AVONET dataset.
lllumina ITS2 amplicon sequencing and processing
We lyophilized approximately 25 mg of lung tissue for 24 hours, and then performed a CTAB DNA extraction followed by an AMPure bead (Agencourt Bioscience Corporation, Beverly, MA, USA) purification. The ITS2 rRNA region was amplified with the ITS3F and ITS4R primers52,53 in a 30-cycle PCR using the HotStarTaq Plus Master Mix Kit (Qiagen Inc, Cambridge, MA, USA) under the following conditions: 95°C for 5 min, followed by 30 cycles of 95°C for 30 sec, 53°C for 40 sec and 72°C for 1 min, after which a final elongation step at 72°C for 10 min was performed. PCR products were checked on a 2% agarose gel. Based on DNA quality and quantity, 183 samples were multiplexed and pooled in equal proportions based on molecular weight and DNA concentrations. Pooled products were purified using Ampure XP beads (Beckman Coulter Inc, Brea, CA, USA) following manufacturer instructions and were used to create DNA libraries following the Illumina MiSeq DNA library preparation protocol for paired-end reads. Samples were sequenced by MR DNA on an Illumina MiSeq Shallowater, Texas (http://www.mrdnalab.com). The raw reads were submitted to the NCBI Short Read Archive (SRA) under BioProject PRJNA1136563.
We used USEARCH54 v.11.0.667 to merge paired-end reads, remove adapters and primers, and perform quality filtering using a maximum expected error threshold of 1.0. With UNOISE355, we performed error correction (denoised) on unique sequences and clustered them into zero-radius operational taxonomic units (zOTUs), preferred to distinguish between species and strains that would otherwise be grouped together. Initial taxonomic assignment was performed with the SINTAX56 algorithm against the UNITE v.7 database and compared with BLASTn. We removed all non-fungal (Metazoa and Viridiplantae) sequences. We evaluated sufficient sequencing depth using the rarecurve function in vegan57 v.2.6-4.
Assignment of trophic mode
We used FUNGuild58 v.1.1 to assign trophic modes (i.e., pathotroph, saprotroph, or symbiotroph) and functional guilds to fungal zOTUs. Animal-fungal symbionts from each sample were pooled by determining the percent of reads in each sample that belonged to zOTUs whose guild assignments included animal pathogen, animal parasite, and animal symbiotroph.
Identification of core mycobiome
The core mycobiome refers to the common set of fungal zOTUs in a host taxon. To identify the core lung mycobiome of the migratory birds in our dataset, we first transformed the full dataset to proportional abundances and filtered the zOTUs, keeping only those zOTUs that were detected in > 50% of bird samples and with > 0.01% relative abundance. We then ran the function core from the package microbiome59, to obtain a subset of core taxa and used it for downstream co-occurrence analyses.
Alpha and beta diversity
We rarefied the data set using the rarefy_even_depth function in phyloseq60 v.1.42.0 using the sample with the fewest reads as the target sample depth. Rarefaction removed 20 zOTUs leaving 511 zOTUs for downstream diversity analyses. Using the rarefied community data we calculated zOTU richness, Chao1 index, Shannon diversity index, and inverse Simpson index using the function estimate_richness from the phyloseq60 package. We used non-metric multidimensional scaling (NMDS) to explore drivers of diversity and visualize groupings of fungal lung communities into host phylogenetic, demographic and ecological categories. To visually compare lung mycobiome beta diversity and host phylogenetic diversity we first merged fungal reads by host-taxa, repeatedly rarefied at 1,000 reads, calculated the Bray-Curtis dissimilarity index between each sample, and averaged the matrices. We then used the averaged matrix to generate a heatmap and corresponding dendrogram using the function heatmap2 from the package gplots61 v.3.1.3.1. We generated a phylogenetic distance matrix from the maximum clade credibility (MCC) tree (see phylogenetic analyses below) with the cophentic_phylo function from the R package ape62. We plotted the phylogenetic distance matrix below the beta diversity matrix and created a tanglegram to indicate discordance between the phylogenetic tree and the Bray-Curtis dissimilarities. To test for differences in bird fungal assemblages among multiple life history traits and phylogenetic scales, we used Bray–Curtis and PERMANOVA (permutational multivariate analysis of variance) models (Bray-Curtis ~ group, 10,000 permutations) using the adonis2 function in vegan57. If there were differences in multivariate dispersion assessed with the betadispr function from vegan57,an ANOSIM (analysis of similarities) test was used in lieu of a PERMANOVA. We repeated PERMANOVA and ANOSIM tests of community dissimilarity with a subset of fungal taxa containing known animal symbionts and opportunistic pathogens. We investigated the correlation between geographic distance and fungal community similarity using a Mantel correlogram and Mantel test in vegan57. Because similarity is expected to decrease with geographical distance, we plotted distance decay by computing Bray-Curtis between each pair of samples along a spatial gradient.
Co-occurrence analyses
To evaluate microbial associations within the avian mycobiome, we ran co-occurrence analyses on the core fungal zOTUs transformed to proportional abundance and the rarefied subset of animal-symbionts. We calculated Spearman rank abundance correlations and tested for significant correlations using the co_occurrence function from the phylosmith63 package v.1.0.6. We used the co_occurrence_network function from phylosmith63 to plot co-occurrence patterns.
Generalized dissimilarity modeling
We used generalized dissimilarity modeling (GDM) in the R package gdm64 v.1.5.0-9.1 to evaluate fungal-host phylogenetic associations and test the effects of geographic and morphological correlates of dispersal and migration on host fungal community dissimilarity. GDM predicts changes in beta diversity over space, time, and environmental gradients and allows the inclusion of several types of distance matrices as response and predictor variables (e.g. Jaccard, phylogenetic distance). GDMs use linear combinations of I-splines to model changes in predictors across ecological distance with the relative heights of the I-splines64,65. Using rarefied abundance-weighted fungal zOTU tables, we calculated Bray-Curtis dissimilarity matrices as response variables in our GDMs. To address uneven sampling, we ran 1,000 GDMs on the repeatedly rarefied datasets of abundances summed within host species. To examine the effect of host phylogenetic distance on fungal community dissimilarity, we additionally generated a phylogenetic distance matrix from the MCC tree using the cophentic_phylo function from the R package ape62. We tested the effects of five predictor variables: host mass, species hand-wing index, range size (km2), migratory distance (km), and host phylogenetic distance on Bray-Curtis matrices of fungal community dissimilarity. We scaled or log10 transformed each predictor. We extracted and plotted all I-splines regardless of their significance from each of the 1,000 fitted models. We used the gdm.varImp function from the gdm64 package to evaluate model fit and variable importance among all 1,000 models.
Cultured lung fungi
We cultured a portion (25–50 mg) of each lung sample on yeast glucose media (1% yeast extract, 2% glucose, 1.5% agar) with the addition of tetracycline (10 mg/L) and chloramphenicol (50 mg/L). Each plate received 3–5 fragments (approximately 0.25 cm each) of tissue from a single lung. Plates were incubated at room temperature for two days to three months depending on fungal growth rate. Individual yeast colonies or hyphal tips were transferred to fresh media to obtain single-isolate cultures. We sorted isolates based on morphology and chose representative isolates for molecular barcoding of the internal transcribed spacer (ITS) region. We extracted DNA following a standard CTAB procedure with an isoamyl alcohol-chloroform extraction. Subsequently, we amplified the DNA using polymerase chain reaction (PCR) with ITS1-F and ITS452,66. The following PCR cycle was implemented: initial denaturation at 95°C for 10 min, followed by 34 cycles of 95°C for 15 sec, annealing at 52°C for 30 sec, and extension at 72°C for 1 min, with a final extension at 72°C for 5 min. We confirmed products with gel electrophoresis and purified them with ExoSAP-IT (Affymetrix, Santa Clara, CA, USA) following manufacturer recommendations. The entire ITS region was targeted for Sanger sequencing using BigDye Terminator v3.1 (Applied Biosystems, Foster City, CA, USA), and the sequences were identified using BLAST67. If samples within one morphotype varied in taxonomic classification, we sequenced additional fungal cultures for more accurate identification. We identified Fusarium sequences to species complexes by employing FUSARIUM-ID68 v.3.0. We deposited sequences in GenBank under accession numbers PP804255-PP804425 (Supplementary Table 2).
Phylogenetic analyses
We constructed phylogenies for select fungal groups with human health consequences, including Aspergillacaeae, Cryptococcus-like yeasts, Onygenales, and Saccharomycetales. We aligned sequences with mafft69,70 v.7.487 using automatically determined settings and trimmed the resulting alignment with trimal71 v.1.4.1 in automated1 mode. The final alignments were 246 bp for Aspergillaceae, 251 bp for Cryptococcus-like yeasts, 241 bp for Onygenales, and 252 bp for Saccharomycetales. We constructed maximum likelihood trees in IQ-Tree72 v.1.6.12 using the GTR + GAMMA substitution model with 10,000 ultrafast bootstraps. The resulting trees were visualized in ggtree73. Sequences were checked against reference sequences from GenBank as well as compared to those from our previous study of the lung communities of small mammals in the southwestern U.S.8 (Supplementary Table 3).
We tested the level of host phylogenetic signal in alpha diversity indices: zOTU richness, Shannon diversity, inverse Simpson, and Chao1. For each alpha diversity index, we calculated a mean value at the host taxon level from the rarefied dataset as trait inputs for tests of phylogenetic signal. We downloaded a set of 10,000 phylogenetic trees from https://birdtree.org/ with the Hackett backbone74,75 and generated a maximum clade credibility tree using the function maxCladeCred from the R package phangorn76 v.2.5.5. We then pruned the tree to the taxa in our dataset using the keep.tip function in ape62. We tested for phylogenetic signal using Moran’s I, Abouheif’s Cmean, and Pagel’s Lamda tests using the R packages adephylo77 v.1.1–13 and phytools78 v.2.0. We evaluated adjusted P values against a 0.05 significance level from permutation tests (n = 10,000) to identify phylogenetic signal in each alpha diversity metric.