Figures
Abstract
Metabarcoding has become an important tool in the discovery of biodiversity, including fungi, which are the second most speciose group of eukaryotes, with diverse and important ecological roles in terrestrial ecosystems. We have designed and tested new PCR primers that target the D1 variable region of nuclear large subunit (LSU) ribosomal DNA; one set that targets the phylum Ascomycota and another that recovers all other fungal phyla. The primers yield amplicons compatible with the Illumina MiSeq platform, which is cost-effective and has a lower error rate than other high throughput sequencing platforms. The new primer set LSU200A-F/LSU476A-R (Ascomycota) yielded 95–98% of reads of target taxa from environmental samples, and primers LSU200-F/LSU481-R (all other fungi) yielded 72–80% of target reads. Both primer sets have fairly low rates of data loss, and together they cover a wide variety of fungal taxa. We compared our results with these primers by amplifying and sequencing a subset of samples using the previously described ITS3_KYO2/ITS4_KYO3 primers, which amplify the internal transcribed spacer 2 (ITS2) of Ascomycota and Basidiomycota. With approximately equivalent read depth, our LSU primers recovered a greater number and phylogenetic diversity of sequences than the ITS2 primers. For instance, ITS3_KYO2/ITS4_KYO3 primers failed to pick up any members of Eurotiales, Mytilinidiales, Pezizales, Saccharomycetales, or Venturiales within Ascomycota, or members of Exobasidiomycetes, Microbotryomycetes, Pucciniomycetes, or Tremellomycetes within Basidiomycota, which were retrieved in good numbers from the same samples by our LSU primers. Among the OTUs recovered using the LSU primers were 127 genera and 28 species that were not obtained using the ITS2 primers, although the ITS2 primers recovered 10 unique genera and 16 species that were not obtained using either of the LSU primers These features identify the new primer sets developed in this study as useful complements to other universal primers for the study of fungal diversity and community composition.
Citation: Asemaninejad A, Weerasuriya N, Gloor GB, Lindo Z, Thorn RG (2016) New Primers for Discovering Fungal Diversity Using Nuclear Large Ribosomal DNA. PLoS ONE 11(7): e0159043. https://doi.org/10.1371/journal.pone.0159043
Editor: Stefanie Pöggeler, Georg-August-University of Göttingen Institute of Microbiology & Genetics, GERMANY
Received: October 27, 2015; Accepted: June 27, 2016; Published: July 8, 2016
Copyright: © 2016 Asemaninejad et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Raw data can be found at the ENA website under the accession number PRJEB11433. Centroid sequences of all OTUs and occurrence of OTUs across all samples (tag-mapped OTU tables), with RDP and Blast annotations, are available as S3 Table.
Funding: This work was supported in part by Natural Sciences and Engineering Research Council (Canada) Engage Partnership Grant Program EGP 446568-13 to RGT and partner Entomogen, Inc. Parks Canada provided financial and logistical support for fieldwork by M. Bidochka and F. Hunter (Entomogen) in Torngat Mountains National Park. Funding for ZL is from the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grants Program (#418241-2012). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors did not receive any funding from a commercial partner, but instead Entomogen’s expenses inobtaining the samples that they provided to us for analysis were considered “in-kind” matching funding toward the project, which allowed us to obtain the funding from NSERC that paid for the work. This does not alter the authors' adherence to PLOS ONE policies on sharing data and materials.
Abbreviations: BLAST, Basic Local Alignment Search Tool; D1, variable Domain 1; ITS, Internal Transcribed Spacer; ISU, Identical Sequence Units; LSU, Large Subunit; NCBI, National Center for Biotechnology Information; NGS, Next Generation Sequencing; OTU, Operational Taxonomic Unit; PCR, Polymerase Chain Reaction; rDNA, ribosomal DNA
Introduction
Identification and classification of a wide variety of organisms using DNA sequences has helped overcome many limitations of traditional morphological approaches, including providing faster analyses, resolving convergent morphologies, recognition of closely related or sister species, and the ability to identify cryptic organisms from complex or opaque substrates [1,2]. This has been especially true for the assessment of the fungi for which, as the second most speciose eukaryotic group with complex and often cryptic life histories and convergent morphological characteristics, traditional taxonomic approaches have long been problematic [3,4]. Alongside a growing recognition that fungi have an important role in nutrient mineralization and uptake affecting plant productivity and overall ecosystem process [5,6], there has been growing application of genetic markers for the purpose of identification and community composition analyses [7,8].
However, the utility of DNA as a tool to catalogue biodiversity, resolve phylogenies, or explore patterns in ecological communities depends strongly on choice or design of primers for selecting the appropriate genetic markers [9]. Primer sets need to be general enough to match across all members of a broad taxonomic group, while containing mismatches to non-target taxa, and yet yield a gene product variable enough to distinguish taxa at narrow, preferably species-level, resolution [10,11]. Using DNA for species-level identifications (DNA barcoding sensu [12]) typically relies on short, diagnostic DNA sequences, such as the mitochondrial COI gene for metazoan animals [13], the rbcl+matk combination of protein coding genes for plants and green algae [14], or the ribosomal internal transcribed spacer (ITS) region in fungi [15]. However, ongoing examination of the ITS for fungal diversity has revealed that the ITS region is too variable to align over distantly related taxa, and therefore unable to confidently place sequences at the level of family, order or class for which no closely matching reference sequences exist [16]. Moreover, in some genera of filamentous ascomycetes there is little variation in the ITS region, making this domain undesirable for identification or taxonomic analyses at the species level [17–22]. Overall, fungal identification and taxonomic analyses using ITS remain problematic. The D1 variable region of the large ribosomal subunit [23] is an attractive alternative, since it (often together with D2) has proven useful in species-level identification and phylogenetic reconstruction in various fungal groups [24,25].
An ongoing challenge in diversity studies is the pairing of appropriate genetic markers and targeted primer development that works with contemporary technologies. Next Generation Sequencing (NGS) can capture low abundance DNA and provide detailed analyses of relative abundances in microbial communities [26] at a relatively low cost. Among NGS methods, Illumina MiSeq sequencing is the most effective and extensively used technology globally [16] due to its low rate of error and the lowest cost per million bases [27], but requires short diagnostic regions of 300 base pairs to be effective (http://www.illumina.com/systems/sequencing.html). Unfortunately, no primers have been developed to target the D1 region across a wide diversity of fungi with a short amplicon suitable for the Illumina MiSeq platform. Here we introduce and evaluate two primer sets targeting the D1 region of the large subunit (LSU) of ribosomal DNA to evaluate fungal diversity. We demonstrate how these primers can be applied for discovery of major fungal groups to recover a broad range of fungal taxa with confident higher-level placement (family, order, etc.) and potential species or species-group identification.
Materials and Methods
Primer design
Invariate regions representing potential primer sites were first identified in an alignment of 591 sequences of Basidiomycota isolated from soil supplemented with 225 reference sequences from GenBank (National Center for Biotechnology Information [NCBI]) and anchored with the sequence of Saccharomyces cerevisiae (Accession #: J01355; [28]). This alignment, from the 5’ end of LSU to position 650 (primer site LR3, [29]) was created in two halves using SINA v1.2.9 [30] and the two files merged using profile:profile alignment in MUSCLE [31]. The alignment was trimmed to the potential primer sites and amplicon region (bases 200–500), and a neighbor-joining tree recovered essentially the same topology and nearly all species-level OTUs obtained using the full alignment (223 of 382 aligned sites were parsimony informative; data not shown).
The region of LSU between bases 150 and 550 from different fungal groups was used to query GenBank using BLAST [32], separately specifying all orders of Agaricomycetes [33] and other Basidiomycota known from soil. The potential 5’ primer at bases 200–220 was recognized as the reverse complement of ITS6-R, previously recommended for amplification of diverse fungi [34], and was modified slightly for greater coverage of fungi and renamed LSU200-F. At the 3’ end, positions 458–481, which did not represent any known primers, were selected for further investigation (Fig 1). These short sequences were used to query GenBank using BLAST, directed to target taxa (Fungi) and non-target taxa (Eukarya, not Fungi).
A, Approximate location of LSU200-F/LSU481-R and LSU200A-F/LSU476A-R primers in relation to the D1/D2 variable domains within the LSU of Saccharomyces cerevisiae J01355 in relation to ITS1, ITS2, ITS4 from White et al. [35], NS31 from Simon et al. [36], AM1 from Helgason et al. [37], AMV4.5N-F and AMDG-R from Sato et al. [38], AML1 and AML2 from Lee et al. [39], WANDA from Dumbrell et al. [40], ITS3_KYO2 from Toju et al. [41], and ITS7o from Kohout et al. [42]. B, LSU200-F/LSU481-R and LSU200A-F/LSU476A-R alignments made using a custom reference sequence set, aligned using Muscle v 3.8.31 [31] and visualized using CLC Sequence Viewer (http://www.clcbio.com/), with S. cerevisiae J01355 included as positional reference.
The primers referred to as LSU200-F and LSU481-R (Table 1) showed nearly complete identity, particularly at their 3’ ends, towards all Basidiomycota, Glomeromycota, Chytridiomycota, Blastocladiomycota, Kickxellomycotina, Mortierellomycotina, Mucoromycotina, and Zoopagomycotina, but not to Ascomycota, and mismatches to other Eukarya (non-target taxa) were numerous. To optimize Ascomycota primers, a separate alignment of Ascomycota was created by querying Entrez for representatives of all major Ascomycota lineages [43] then using BLAST to obtain similar sequences, resulting in an alignment of 131 sequences. Primer LSU200-F had six sites of mismatch with various Ascomycota which were disregarded as they represented either a single base mismatch in the middle of the sequence (e.g., 208A in Saitoella, 209C in Taphrina) or were mismatches at the 3’ end that occurred only rarely (e.g., 217G in Neolecta and 218C in Taphrina; but see footnotes to Table 1). For other mismatches, bases 203K, 214C, 215R, and 217Y were altered to match the majority of Ascomycota for the new primer LSU200A-F (Table 1).
The reverse primer LSU481-R had two pairs of mismatches to most Ascomycota at bases 468–469 and 474–475, which were corrected, and the revised primer LSU476A-R was moved 5 bases 3’-ward for greater specificity against plants and other non-fungal eukaryotes. Both forward and reverse primers were modified on their 5’ ends to include the following: forward and reverse Illumina MiSeq adaptors to allow sequences to bind to the flow cells, a 4 bp random linker (NNNN) to increase the diversity of the first positions sequenced and so enable the instrument to more easily separate clusters and set appropriate fluorescence levels, and unique 8 nt barcode sequences with an edit distance of at least 4 (https://github.com/ggloor/miseq_bin). Primer layout was as follows:
5′–[Illumina Forward Adaptor][NNNN][Barcode][PCR Primer]–3′.
The unique forward and reverse barcode sequences are a paired-end multiplexing approach that allowed us to combine samples within a single Illumina run and bioinformatically separate them back into samples afterwards [44].
We compared our developed primer sets against another primer set currently used in Illumina sequencing (the ITS3_KYO2/ITS4_KYO3 primers, hereafter referred to as “ITS2 primers”) described by Toju et al. [41], where the forward primer is ITS3_KYO2 (5′–GAT GAA GAA CGY AGY RAA–3′), and the reverse primer is ITS4_KYO3 (5′– CTB TTV CCK CTT CAC TCG –3′). These primers target diverse taxonomic groups within Ascomycota and Basidiomycota. Primers were ordered with an Illumina Adaptor, linker, and unique barcode as described above.
Soil and peat collection
Subarctic soil samples were taken from eight sites within 500 m of the Torngat Mountains Base Camp and Research Station, Kangidluasuk, Labrador, Canada (58.454° N; 62.802° W, elevation 30–200 m a.s.l.). Samples were collected under Parks Canada Research permit TMNP-2012-11533. Soils ranged from mostly loamy (samples 1 to 7) to sandy (sample 8). Boreal peat samples were collected from a poor nutrient Sphagnum-dominated fen at the White River Experimental Research sites near White River, Ontario, Canada (48.354° N; 85.338° W, elevation 450 m a.s.l.), managed by the Ontario Forest Research Institute and the Ontario Ministry of Natural Resources, with permission. Eight large, intact peat cores complete with above-ground herbaceous and shrubby vegetation were collected [45]. Sub-samples of peat (approximately 10 g fresh weight) were obtained from the top (0–5 cm), and bottom (~30 cm) horizon from each core. A list of associated aboveground vegetation for both sampling locations can be found in S1 Table.
DNA extraction
Soil samples (approximately 10 g wwt) were shaken in 100 mL of 1M sodium pyrophosphate (Anachemia) to disrupt soil colloids. Samples were washed through three stacked sieves (1.18 mm, 250 um, 53 um) for 1–2 minutes with dH2O. Organic matter, including mycelia and spores, was collected from the finest sieve and lyophilised for 24 h (Virtis Bench Top 3.5 Freeze Dryer), then ground in liquid nitrogen using a mortar and pestle. DNA was isolated from 1 g of ground sample following the high molecular weight DNA soil protocol (http://www.borealgenomics.com/assets/Aurora-HMW-Soil-App-Note-13-01-09.pdf) using an Aurora electrophoretic nucleic acid separator (Boreal Genomics) [46]. Cartridges were washed and soaked in 10% bleach for at least 30 minutes between each sample.
Peat samples (approximately 25 g wwt) were lyophilised for 48 h, an aliquot ground in liquid nitrogen, and kept frozen until DNA extraction. Total genomic DNA was extracted from 25 mg of each sub-sample of peat moss from each horizon using a Zymo DNA isolation kit (Zymo Research Corporation) to obtain genomic DNA free of PCR-inhibitory phenolic compounds. Genomic DNA was quantified and tested for extraction quality using a nanodrop (Nanodrop 2000, Thermo Scientific) (S1 Fig). Three random high quality DNA extracts (determined by Nanodrop absorbance ratios) for both subarctic soil and upper horizon peat were chosen to be re-sequenced using ITS2 primers to allow for direct comparison of the quality and quantity of sequence data obtained by primers targeting two different regions of the rDNA gene.
PCR protocol
Optimal PCR conditions were determined from DNA extracts of Agaricus bisporus and CIL Plus Mycoactive™ Potting Soil (for LSU200-F/LSU481-R), and of a Harposporium sp. isolate for LSU200A-F /LSU476A-R. LSU PCR reactions were carried out in a total volume of 25 μL, with 0.5–4 μL template DNA, 12.5 μL of Toughmix (Quanta Biosciences), and 1.25 μL each of forward and reverse primers (5 μM). Optimal PCR conditions were 2 min at 94°C, followed by 29 cycles for 30 s at 94°C, 30 s at 62°C (55°C in first cycle followed by the remainder at 62°C for Ascomycota primers), with a final elongation temperature of 72°C for 18 s. ITS2 PCR reactions followed the protocol outlined by Toju et al. [41], scaled to a total volume of 25 μL, using 1 μL template DNA (20 ng/μL for peat and soil samples). Optimal PCR conditions were 3 min at 94°C, followed by 35 cycles for 20 s at 94°C, 30 s at 47°C, 20 s at 72°C, with a final elongation temperature of 72°C for 7 min [41]. PCR products were normalized with a Qubit fluorimeter with the dsDNA HS kit (Life Technologies) and submitted for paired-end MiSeq Illumina sequencing (2 x 300 bp with V3 chemistry) at the London Regional Genomics Centre (Robarts Research Institute, London, Canada).
Bioinformatic analysis
Two MiSeq runs were done for each LSU primer set, and for each soil type, resulting in four multiplexed runs, and separate bioinformatic analyses for each run. All ITS2 samples were run together on a separate multiplexed MiSeq run. Raw FASTQ data was sent through a custom MiSeq data processing pipeline (https://github.com/ggloor/miseq_bin/tree/master).
PANDAseq (https://github.com/neufeld/pandaseq) [47] was used to overlap reads with a minimum overlap length of 30 nt. Any sequences containing ambiguous base calls were removed, as well as any reads that did not perfectly match the primer sequence. Barcode and primer sequences were trimmed prior to clustering. The pipeline grouped the overlapped FASTQ output into identical sequences (ISUs) by identity. The ISUs were checked for chimeras using the UCHIME algorithm [48] and grouped into operational taxonomic units (OTUs) at 97% similarity, using the UCLUST clustering algorithm within USEARCH v 7.0.1090. We chose 97% similarity based on a study that established this as the optimal threshold to translate OTUs into taxonomic species when using single-linkage clustering and the V6 variable region of SSU (16S) rDNA, a region of comparable variability to the D1 divergent domain of LSU rDNA [49]. Therefore, each OTU consists of a centroid sequence—a representative sequence from the most common sequence type within each OTU—around which are clustered reads that are ≥97% similar. Sequence reads that were less than 0.1% of the total abundance in any sample were automatically removed from the sample-mapped OTU table.
Because of the large dataset obtained by NGS, more precise identification of each OTU using a phylogenetic approach and post-hoc analysis is not practical, nor required for ecological comparisons of fungal communities. Instead, general taxonomic classifications were done via the online RDP Classifier [50] using the Fungal LSU Training set 11 as a reference database (https://rdp.cme.msu.edu/classifier/classifier.jsp) for the two LSU primer sets, and the Warcup Fungal ITS Trainset 1 for the ITS2 data (https://rdp.cme.msu.edu/classifier/classifier.jsp). However, centroid sequences with confidence scores lower than 90% were queried using a nucleotide BLAST [51] and re-classified using taxa that scored >80% query coverage and identity within the distance tree of results (since top BLAST hits are not always the closest phylogenetic neighbour [52]).
Raw data can be found at the ENA website under the accession number PRJEB11433. Centroid sequences of all OTUs and occurrence of OTUs across all samples (tag-mapped OTU tables), with RDP and Blast annotations, are available as S2 Table.
Results
Primer performance: target sequence yields
In total, 1 435 400 reads representing 68 670 ISUs were amplified from eight subarctic soil samples using the LSU200-F/LSU481-R primers (Table 2). Of these, 4 827 were flagged as chimeric, leaving 63 843 clusters that were grouped into 2 991 OTUs at 97% similarity. Prior to analysis, 2 681 OTUs with singleton reads were discarded, leaving 310 OTUs (877 244 total reads). There were 188 fungal (target) OTUs identified (60.6% of total retained OTUs; 79.6% of final reads). From peat, many more raw reads were obtained (4 777 383); after quality filters, these clustered to 403 OTUs, of which 122 (30.3%) were identified as fungal, but the fraction of final reads on-target was similar (71.7%). Using the LSU200A-F/LSU476A-R primers, 212 target (Ascomycota) OTUs were recovered from subarctic soils (87.6% of final OTUs, 97.9% of final retained reads), and 89 Ascomycota OTUs from peat (63.5% of final OTUs, 95.3% of final retained reads). The ITS2 primers (ITS3_KYO2/ITS4_KYO3) were used on a subset of soil and peat samples, and yielded 554 358 reads, which after quality filtering, clustered to 73 OTUs, of which 72 were fungal (98.6% of final OTUs, 99.5% of final reads).
The number of raw reads and of on-target reads per sample is dependent in part on the yield of the Illumina run and the number of samples multiplexed per run, but the latter number is useful as a judge of sequencing depth for each primer-sample pairing. The mean number of on-target reads per sample ranged from 38 543 to 167 685; values for the reactions using ITS2 primers were intermediate (Table 2). An informative measure of primer performance is the OT Score, the percent of raw reads per sample retained after quality filtering, and on target. The mean OT Scores for the three primer sets in our tests ranged from 42.3 for the ITS2 primers to 72.9 for the LSU200A-F/LSU476A-R primers with peat samples.
Primer performance: fungal diversity recovered
The diversity of target and non-target OTUs recovered using the new the LSU200-F/LSU481-R and LSU200A-F/LSU476A-R primers from the full set of eight soil and 16 peat samples is available through the European Nucleotide Archive (ENA) under the accession number PRJEB11433. Here, we present the diversity recovered from the subset of samples that were sequenced using both these primers and the ITS2 primers. Using ITS2 primers, the 72 fungal OTUs included 19 of Basidiomycota and 53 of Ascomycota; 34 of these 72 OTUs could be placed with confidence in 29 genera, and 19 OTUs could be identified to species (Table 3). Using the LSU200-F/LSU481-R primers, 158 OTUs could be placed in 79 genera and 27 identified to species, and using LSU200A-F/LSU476A-R primers, 90 OTUs could be identified to 64 genera, and 21 identified to species. Together, the LSU primers recovered 127 genera and 28 species that were not obtained using the ITS2 primers, but the ITS2 primers recovered 10 unique genera and 16 species that were not obtained using either of the LSU primers.
Taxa were included only if the total read count across the three subsamples were >3, and are listed alphabetically by higher taxa (A = Ascomycota, B = Basidiomycota, C = Chytridiomycota, E = Entomophthoromycota, G = Glomeromycota, K = Kickxellomycotina, M = Mucoromycotina, Mi = Microsporidia, Z = Zoopagomycotina). (#) = Number of OTUs identified to that taxon.
There was broad agreement at higher taxonomic levels between the fungal communities detected using the LSU primers and ITS2 primers (S3 Table). The predominant fungi recovered from subarctic soils using LSU200-F/LSU481-R primers included Mortierellomycetes, Agaricales, Thelephorales, and Sebacinales; OTUs recovered using LSU200A-F/LSU476A-R primers were dominated by members of Helotiales, Leotiomycetes, and Chaetothyriales; OTUs recovered using ITS2 primers by Helotiales, unknown Pezizomycotina, Chaetothyriales, Agaricales, Russulales, Sebacinales and Thelephorales. From peat, the OTUs recovered using LSU200-F/LSU481-R primers were dominated by Mortierellomycetes, Agaricales, Atheliales, Cantharellales, and Sebacinales; those recovered using LSU200A-F/LSU476A-R primers by Helotiales, unknown Ascomycota, and Mytilinidiales; and those recovered using ITS2 primers by Helotiales, unknown Leotiomycetes, Agaricales, Chaetothyriales and Sebacinales.
Discussion
The primer sets presented here were designed to provide high efficiency sequence tag information data on fungal diversity in Next Generation Sequencing studies using the Illumina platform. Both primer sets (LSU200-F/LSU481-R and LSU200A-F/LSU476A-R) amplify a ~300 bp region of the rDNA LSU, which provides reliable placements of sequence clusters within the Fungi and, together they yield a broad array of Fungi from chytrids to mushrooms. In addition, the non-target reads obtained using the LSU200-F/LSU481-R primers (approximately 20–30% of retained reads) provide sequence data on a number of important eukaryotic groups that are poorly known in soil [53]: Amoebozoa, Centroheliozoa, Choanoflagellida, Metazoa, Rhizaria and unicellular Streptophyta. We establish the validity and reproducibility of these primers for high-throughput sequencing of environmental samples using replicates of DNA extracted from two types, one of which (peat) was particularly high in humic acid and phenolic compounds.
Including both soil and peat samples, data clean-up resulted in the loss of 25.7% of reads, on average, for primers LSU200-F/LSU481-R, and 26.3%, on average, using the LSU200A-F/LSU476A-R primers. The complement of the data lost to poor quality sequences, chimeras, singleton OTUs, and non-target sequences is the OT Score, the percent of raw sequence reads retained and on-target. The OT Scores for LSU200-F/LSU481-R primers averaged 53.7 across the samples in our study, and 70.6 for LSU200A-F/LSU476A-R primers. These values are higher than the OT Score for the ITS2 primers ITS3_KYO2/ITS4_KYO3 of 42.3 (57.5% data loss to clean-up) in this study, and considerably better than the 78.6% data loss reported for those primers in [54]. In our study, the greatest loss of reads occurred through elimination of singleton OTUs, discarded from our data prior to analysis. Singleton reads are often removed to avoid introducing biases in OTU computation, since poor sequences—reads with greater numbers of errors—are likely to inflate the number of OTUs recovered [55]. In contrast, the inclusion of singletons is unlikely to change observed data trends in ecological comparisons even if they include true observations of rare species [56]. While our data had high loss of OTUs after the removal of singletons, read loss—which is arguably a more important limitation in metagenomic studies [57]—was minimal.
Although the ITS2 primers recovered as many on-target sequences per sample as our new LSU primers in our tests, they yielded far fewer fungal OTUs, representing a much smaller range of higher taxa of fungi than were obtained using our new primers LSU200-F/LSU481-R and LSU200A-F/LSU476A-R. The only other published study using these ITS2 primers for the purpose of biodiversity discovery on DNA extracted from soil samples [58] recovered considerably more total fungal OTUs (3208 OTUs) than our data (72 OTUs). However, Balint et al. [58] analysed considerably more samples (96 versus 6), used two different annealing temperatures (51 and 55°C), and ran three PCR replications for each annealing temperature. This suggests that three replicates of two different annealing temperatures and a larger number of target reads may be required for the ITS2 primers to obtain a similar number OTUs as our new LSU primers [54]. That said, the ITS2 primers, which were designed to recover Ascomycota and Basidiomycota [41], will not yield sequences of chytrids, Glomeromycota, or the zygomycete groups obtained using LSU200-F/LSU481-R, and in our tests they also failed recover important groups of Ascomycota (e.g., Eurotiales, Mytilinidiales, Pezizales, Saccharomycetales, Venturiales) and Basidiomycota (e.g., Exobasidiomycetes, Microbotryomycetes, Pucciniomycetes, and Tremellomycetes) that were recovered by our primers. In contrast, the ITS2 primers did recover two OTUs tentatively identified as Archaeorhizomycetes (Ascomycota), an important group in boreal/montane soils and peat [59] not recovered using LSU200A-F/LSU476A-R. However, as noted in Table 1, improving the match of primer LSU200A-F to Archaeorhizomycetes may be achieved by using the modified forward primer AACKGCGAGTGAAGCRGBM; the reverse primer LSU476A-R already has a perfect match to known Archaeorhizomycetes sequences.
Primer specificity, or the lack of primer universality in recovering broad groups such as the entire kingdom Fungi, has been a persistent issue that has not previously been addressed for biodiversity studies using the Illumina platform. With the exception of LR3/TW13 and LR5/TW14 primers that have been designed based on a highly conserved region within the LSU, all other fungal specific genetic markers and ‘universal’ primers discriminate against various groups of fungi [60]. The ITS1F and other ITS primers (ITS1 and ITS5) have shown biases towards the amplification of fungal groups within the Basidiomycota, while others, particularly ITS2, ITS3 and ITS4, favour Ascomycota [60]. More specific primers such as the ITS4-B, designed to capture basidiomycetes, can amplify the ITS region of a very limited group within the target phylum [61]. Since our new LSU primers provide ample read counts in environmental samples, have low rates of data loss, and cover a breadth of fungal taxa, we suggest that they provide a promising addition to complement and counterpart other universal primers in metagenomic studies.
Supporting Information
S1 Fig. DNA concentrations and comparative absorption values for three soil types.
Black lines indicate the median, red diamonds indicate the mean. (A) DNA concentrations (ng/μL) of three soil types after Aurora HMW DNA Extraction (Subarctic Soil), and Zymogen Soil DNA Extraction Kit (Lower Peat and Upper Peat). (B) 260/280 and (C) 260/230 absorption ratios for the resulting DNA extractions. A ratio value of ~1.8 for 260/280 and 2.0–2.2 for 260/230 values indicate DNA free from most proteins, phenols, or other contaminants.
https://doi.org/10.1371/journal.pone.0159043.s001
(DOCX)
S1 Table. Partial list of vegetation found in association with samples taken from subarctic and peatland soils.
Peat plant species list obtained from [Dieleman et al. [44]] Supplementary Table S1; plants associated with subarctic soil samples identified by Michael Burzynski.
https://doi.org/10.1371/journal.pone.0159043.s002
(DOCX)
S2 Table. Retrieved sequence data from all samples (OTU tag-mapped tables), using primer sets LSU200A-F/LSU476A-R (Ascomycota), LSU200-F/LSU481-R (all other fungi), and ITS3_KYO2/ITS4_KYO3, including RDP and BLASTN annotations.
https://doi.org/10.1371/journal.pone.0159043.s003
(XLSX)
S3 Table. Median relative frequency and range (median; range) of sequences amplified by new LSU primers in comparison to Toju et al. (2012) ITS2 primers.
Relative frequency (%) calculated using total number of reads for each soil type (348 237a, 941 098b, 134 391, and 96 830 reads, respectively; n = 3 for all).
https://doi.org/10.1371/journal.pone.0159043.s004
(DOCX)
Acknowledgments
This work was supported in part by NSERC Engage grant EGP 446568–13 to RGT and partner Entomogen, Inc. Parks Canada provided financial and logistical support for fieldwork by M. Bidochka and F. Hunter (Entomogen) in Torngat Mountains National Park, and Michael Burzysnki (formerly with Parks Canada, Rocky Harbour, Newfoundland) identified plants from photos of the sampling areas in Torngat. Peat samples are provided as part of a larger experiment supported by Dr. James McLaughlin (Ontario Ministry of Natural Resources—Ontario Forest Research Institute), Dr. Brian Branfireun (University of Western Ontario, Biotron), and Dr. Charmaine Dean, Dean of the Faculty of Science, University of Western Ontario. Funding for ZL is from the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grants Program (#418241–2012). The supportive comments, feedback and discussion provided by two anonymous reviewers were greatly appreciated and significantly improved the manuscript.
Author Contributions
Conceived and designed the experiments: RGT GBG ZL. Performed the experiments: AA NW. Analyzed the data: AA NW GBG RGT. Contributed reagents/materials/analysis tools: RGT GBG ZL. Wrote the paper: AA NW GBG ZL RGT.
References
- 1. Atkins SD, Clark IM (2004) Fungal molecular diagnostics: a mini review. Journal of Applied Genetics 45: 3–15. pmid:14960763
- 2. Sun X, Guo L-D (2012) Endophytic fungal diversity: review of traditional and molecular techniques. Mycology 3: 65–76.
- 3. Shenoy BD, Jeewon R, Hyde KD (2007) Impact of DNA sequence-data on the taxonomy of anamorphic fungi. Fungal Diversity 26: 1–54.
- 4. Taylor DL, Hollingsworth TN, McFarland JW, Lennon NJ, Nusbaum C, Ruess RW (2014) A first comprehensive census of fungi in soil reveals both hyperdiversity and fine-scale niche partitioning. Ecological Monographs 84: 3–20.
- 5. Clemmensen KE, Bahr A, Ovaskainen O, Dahlberg A, Ekblad A, Wallander H, et al. (2013) Roots and associated fungi drive long-term carbon sequestration in boreal forest. Science 339: 1615–1618. pmid:23539604
- 6. Treseder KK, Lennon JT (2015) Fungal traits that drive ecosystem dynamics on land. Microbiology and Molecular Biology Reviews 79: 243–262. pmid:25971588
- 7. Fierer N, Ladau J, Clemente JC, Leff JW, Owens SM, Pollard KS, et al. (2013) Reconstructing the microbial diversity and function of pre-agricultural tallgrass prairie soils in the United States. Science 342: 621–624. pmid:24179225
- 8. Tedersoo L, Bahram M, Polme S, Koljalg U, Yorou NS, Wikesundera R, et al. (2014) Fungal biogeography. Global diversity and geography of soil fungi. Science 346: 1256688. pmid:25430773
- 9. Tedersoo L, Bahram M, Põlme S, Anslan S, Riit T, Kõljalg U, et al. (2015) Response to Comment on "Global diversity and geography of soil fungi". Science 349: 936–936.
- 10. Seifert KA, Samson RA, deWaard JR, Houbraken J, Lévesque CA, Moncalvo JM, et al. (2007) Prospects for fungus identification using CO1 DNA barcodes, with Penicillium as a test case. Proceedings of the National Academy of Sciences of the United States of America 104: 3901–3906. pmid:17360450
- 11. Watanabe M, Yonezawa T, Lee Ki, Kumagai S, Sugita‐Konishi Y, Goto K, et al. (2011) Evaluation of genetic markers for identifying isolates of the species of the genus Fusarium. Journal of the Science of Food and Agriculture 91: 2500–2504. pmid:21725977
- 12. Hebert PDN, Cywinska A, Ball SL, deWaard JR (2003) Biological identifications through DNA barcodes. Proceedings of the Royal Society of London Series B: Biological Sciences 270: 313–321. pmid:12614582
- 13. Ratnasingham S, Hebert PDN (2007) BOLD: The Barcode of Life Data System (http://www.barcodinglife.org). Molecular Ecology Notes 7: 355–364. pmid:18784790
- 14. Chase MW, Fay MF (2009) Barcoding of plants and fungi. Science 325: 682–683. pmid:19644072
- 15. Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA, et al. (2012) Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proceedings of the National Academy of Sciences of the United States of America 109: 6241–6246. pmid:22454494
- 16. Lindahl BD, Nilsson RH, Tedersoo L, Abarenkov K, Carlsen T, Kjøller R, et al. (2013) Fungal community analysis by high-throughput sequencing of amplified markers—a user's guide. New Phytologist 199: 288–299. pmid:23534863
- 17. Wang H, Xiao M, Kong F, Chen S, Dou H-T, Sorrell T, et al. (2011) Accurate and practical identification of 20 Fusarium species by seven-locus sequence analysis and reverse line blot hybridization, and an in vitro antifungal susceptibility study. Journal of Clinical Microbiology 49: 1890–1898. pmid:21389150
- 18. Pařenicová L, Skouboe P, Frisvad J, Samson RA, Rossen L, ten Hoor-Suykerbuyk M, et al. (2001) Combined molecular and biochemical approach identifies Aspergillus japonicus and Aspergillus aculeatus as two species. Applied and Environmental Microbiology 67: 521–527. pmid:11157212
- 19. Varga J, Tóth B, Rigó K, Téren J, Hoekstra RF, Kozakiewicz Z (2000) Phylogenetic analysis of Aspergillus section Circumdati based on sequences of the internal transcribed spacer regions and the 5.8 S rRNA gene. Fungal Genetics and Biology 30: 71–80. pmid:10955909
- 20. Lindner DL, Banik MT (2011) Intragenomic variation in the ITS rDNA region obscures phylogenetic relationships and inflates estimates of operational taxonomic units in genus Laetiporus. Mycologia 103: 731–740. pmid:21289107
- 21. Skouboe P, Frisvad JC, Taylor JW, Lauritsen D, Boysen M, Rossen R (1999) Phylogenetic analysis of nucleotide sequences from the ITS region of terverticillate Penicillium species. Mycological Research 103: 873–881.
- 22. Jang J-h, Bae S-E, Jung S, Kim H-y, Ahn I-s, Son H-s (2014) FGMD: database of fungal genetic markers and multigene analysis in fungal classification. The Korean Journal of Public Health 2: 81–89.
- 23. Hassouna N, Michot B, Bachellerie JP (1984) The complete nucleotide sequence of mouse 28S rRNA gene. Implications for the process of size increase of the large subunit rRNA in higher eukaryotes. Nucleic Acids Research 12: 3563–3583. pmid:6328426
- 24. Kurtzman CP, Robnett CJ (1998) Identification and phylogeny of ascomycetous yeasts from analysis of nuclear large subunit (26S) ribosomal DNA partial sequences. Antonie Van Leeuwenhoek 73: 331–371. pmid:9850420
- 25. Moncalvo J-M, Vilgalys R, Redhead SA, Johnson JE, James TY, Aime MC, et al. (2002) One hundred and seventeen clades of euagarics. Molecular Phylogenetics and Evolution 23: 357–400. pmid:12099793
- 26. Hert DG, Fredlake CP, Barron AE (2008) Advantages and limitations of next-generation sequencing technologies: a comparison of electrophoresis and non-electrophoresis methods. Electrophoresis 29: 4618–4626. pmid:19053153
- 27. Luo C, Tsementzi D, Kyrpides N, Read T, Konstantinidis KT (2012) Direct comparisons of Illumina vs. Roche 454 sequencing technologies on the same microbial community DNA sample. PloS ONE 7: e30087. pmid:22347999
- 28. Georgiev OI, Nikolaev N, Hadjiolov AA, Skryabin KG, Zakharyev VM, Bayev AA (1981) The structure of the yeast ribosomal RNA genes. 4. Complete sequence of the 25 S rRNA gene from Saccharomyces cerevisae. Nucleic Acids Research 9: 6953–6958. pmid:6460984
- 29. Vilgalys R, Hester M (1990) Rapid genetic identification and mapping of enzymatically amplified ribosomal DNA from several Cryptococcus species. Journal of Bacteriology 172: 4238–4246. pmid:2376561
- 30. Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig W, Peplies J, et al. (2007) SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Research 35: 7188–7196. pmid:17947321
- 31. Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32: 1792–1797. pmid:15034147
- 32. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research 25: 3389–3402. pmid:9254694
- 33.
Hibbett D, Bauer R, Binder M, Giachini A, Hosaka K, Justo A, et al. (2014) Agaricomycetes. In: McLachlin DJ, Spatafora JW, editors. The Mycota, vol VIIA. Systematics and Evolution, 2nd ed. Berlin: Springer. pp. 373–428.
- 34. Dentinger BTM, Margaritescu S, Moncalvo J-M (2010) Rapid and reliable high-throughput methods of DNA extraction for use in barcoding and molecular systematics of mushrooms. Molecular Ecology Resources 10: 628–633. pmid:21565067
- 35.
White TJ, Bruns T, Lee S, Taylor J (1990) Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. In: Innis M, Gelfand D, Sninsky J, White T, editors. PCR Protocols: a Guide to Methods and Applications. New York: Academic Press, Inc. pp. 315–322.
- 36. Simon L, Lalonde M, Bruns T (1992) Specific amplification of 18S fungal ribosomal genes from vesicular-arbuscular endomycorrhizal fungi colonizing roots. Applied and Environmental Microbiology 58: 291–295. pmid:1339260
- 37. Helgason T, Daniell TJ, Husband R, Fitter AH, Young JPW (1998) Ploughing up the wood-wide web? Nature 394: 431–431. pmid:9697763
- 38. Sato K, Suyama Y, Saito M, Sugawara K (2005) A new primer for discrimination of arbuscular mycorrhizal fungi with polymerase chain reaction-denature gradient gel electrophoresis. Grassland Science 51: 179–181.
- 39. Lee J, Lee S, Young JPW (2008) Improved PCR primers for the detection and identification of arbuscular mycorrhizal fungi. FEMS Microbiology Ecology 65: 339–349. pmid:18631176
- 40. Dumbrell AJ, Ashton PD, Aziz N, Feng G, Nelson M, Dytham C, et al. (2011) Distinct seasonal assemblages of arbuscular mycorrhizal fungi revealed by massively parallel pyrosequencing. New Phytologist 190: 794–804. pmid:21294738
- 41. Toju H, Tanabe AS, Yamamoto S, Sato H (2012) High-coverage ITS primers for the DNA-based identification of ascomycetes and basidiomycetes in environmental samples. PloS One 7: e40863. pmid:22808280
- 42. Kohout P, Sudová R, Janoušková M, Čtvrtlíková M, Hejda M, Pánková H, et al. (2014) Comparison of commonly used primer sets for evaluating arbuscular mycorrhizal fungal communities: Is there a universal solution? Soil Biology and Biochemistry 68: 482–493.
- 43. Schoch CL, Sung G-H, López-Giráldez F, Townsend JP, Miadlikowska J, Hofstetter V, et al. (2009) The Ascomycota tree of life: a phylum-wide phylogeny clarifies the origin and evolution of fundamental reproductive and ecological traits. Systematic Biology 58: 224–239. pmid:20525580
- 44. Gloor GB, Hummelen R, Macklaim JM, Dickson RJ, Fernandes AD, MacPhee R, et al. (2010) Microbiome profiling by illumina sequencing of combinatorial sequence-tagged PCR products. PloS ONE 5: e15406. pmid:21048977
- 45. Dieleman CM, Branfireun BA, McLaughlin JW, Lindo Z (2015) Climate change drives a shift in peatland ecosystem plant community: Implications for ecosystem function and stability. Global Change Biology 21: 388–395. pmid:24957384
- 46. Zhou J, Bruns MA, Tiedje JM (1996) DNA recovery from soils of diverse composition. Applied and Environmental Microbiology 62: 316–322. pmid:8593035
- 47. Masella AP, Bartram AK, Truszkowski JM, Brown DG, Neufeld JD (2012) PANDAseq: paired-end assembler for illumina sequences. BMC Bioinformatics 13: 31. pmid:22333067
- 48. Edgar RC, Haas BJ, Clemente JC, Quince C, Knight R (2011) UCHIME improves sensitivity and speed of chimera detection. Bioinformatics 27: 2194–2200. pmid:21700674
- 49. Huse SM, Welch DM, Morrison HG, Sogin ML (2010) Ironing out the wrinkles in the rare biosphere through improved OTU clustering. Environmental Microbiology 12: 1889–1898. pmid:20236171
- 50. Wang Q, Garrity GM, Tiedje JM, Cole JR (2007) Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Applied and Environmental Microbiology 73: 5261–5267. pmid:17586664
- 51. Edgar RC (2010) Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26: 2460–2461. pmid:20709691
- 52. Koski LB, Golding GB (2001) The closest BLAST hit is often not the nearest neighbor. Journal of Molecular Evolution 52: 540–542. pmid:11443357
- 53. Lawley B, Ripley S, Bridge P, Convey P (2004) Molecular analysis of geographic patterns of eukaryotic diversity in Antarctic soils. Applied and Environmental Microbiology 70: 5963–5972. pmid:15466539
- 54. Schmidt P-A, Bálint M, Greshake B, Bandow C, Römbke J, Schmitt I (2013) Illumina metabarcoding of a soil fungal community. Soil Biology and Biochemistry 65: 128–132.
- 55. Valverde JR, Mellado RP (2013) Analysis of metagenomic data containing high biodiversity levels. PLoS One 8: e58118. pmid:23505458
- 56. Brown SP, Veach AM, Rigdon-Huss AR, Grond K, Lickteig SK, Lothamer K, et al. (2015) Scraping the bottom of the barrel: are rare high throughput sequences artifacts? Fungal Ecology 13: 221–225.
- 57. Smith DP, Peay KG (2014) Sequence depth, not PCR replication, improves ecological inference from next generation DNA sequencing. PLoS One 9: e90234. pmid:24587293
- 58. Balint M, Schmidt PA, Sharma R, Thines M, Schmitt I (2014) An Illumina metabarcoding pipeline for fungi. Ecology and Evolution 4: 2642–2653. pmid:25077016
- 59. Rosling A, Cox F, Cruz-Martinez K, Ihrmark K, Grelet G, Lindahl BD, et al. (2011) Archaeorhizomycetes: unearthing an ancient class of ubiquitous soil fungi. Science: 12: 876–879.
- 60. Bellemain E, Carlsen T, Brochmann C, Coissac E, Taberlet P, Kauserud H (2010) ITS as an environmental DNA barcode for fungi: an in silico approach reveals potential PCR biases. BMC Microbiology 10: 189–189. pmid:20618939
- 61. O'Brien HE, Parrent JL, Jackson JA, Moncalvo J-M, Vilgalys R (2005) Fungal community analysis by large-scale sequencing of environmental samples. Applied and Environmental Microbiology 71: 5544–5550. pmid:16151147