ResearchHub | Open Science Community

How to make more out of community data? A conceptual framework and its implementation as models and software

Otso Ovaskainen et al., Mar 20, 2017

Abstract: Community ecology aims to understand what factors determine the assembly and dynamics of species assemblages at different spatiotemporal scales. To facilitate the integration between conceptual and statistical approaches in community ecology, we propose Hierarchical Modelling of Species Communities (HMSC) as a general, flexible framework for modern analysis of community data. While non‐manipulative data allow for only correlative and not causal inference, this framework facilitates the formulation of data‐driven hypotheses regarding the processes that structure communities. We model environmental filtering by variation and covariation in the responses of individual species to the characteristics of their environment, with potential contingencies on species traits and phylogenetic relationships. We capture biotic assembly rules by species‐to‐species association matrices, which may be estimated at multiple spatial or temporal scales. We operationalise the HMSC framework as a hierarchical Bayesian joint species distribution model, and implement it as R and Matlab packages that enable computationally efficient analyses of large data sets. Armed with this tool, community ecologists can make sense of many types of data, including spatially explicit data and time‐series data. We illustrate the use of this framework through a series of diverse ecological examples.

Ecology

Philosophy

Arthropod Diversity in a Tropical Forest

Yves Basset et al., Dec 13, 2012

Assessing Creepy Crawlies: Arthropods are the most diverse group of terrestrial animal species, yet estimates of the total number of arthropod species have varied widely, especially for tropical forests. Basset et al. (p. 1481, see the cover) now provide more reliable estimates of total arthropod species richness in a tropical rainforest in Panama. Intensive sampling of a half hectare of forest yielded just over 6000 arthropod species. Scaling up this result to the whole forest suggests that the total species diversity lies between 17,000 and 40,000 species.

Ecology

Anthropology

Higher predation risk for insect prey at low latitudes and elevations

Tomas Roslin et al., May 18, 2017

Risky in the tropics: It is well known that diversity increases toward the tropics. Whether this increase translates into differences in interaction rates among species, however, remains unclear. To simplify the problem, Roslin et al. tested for predation rates by using a single approach involving model caterpillars across six continents. Predator attack rates were higher toward the equator, but only for arthropod predators. Science, this issue p. 742

Ecology

Ecology, Evolution, Behavior And Systematics

A comprehensive evaluation of predictive performance of 33 species distribution models at species and community levels

Anna Norberg et al., May 2, 2019

Abstract: A large array of species distribution model (SDM) approaches has been developed for explaining and predicting the occurrences of individual species or species assemblages. Given the wealth of existing models, it is unclear which models perform best for interpolation or extrapolation of existing data sets, particularly when one is concerned with species assemblages. We compared the predictive performance of 33 variants of 15 widely applied and recently emerged SDMs in the context of multispecies data, including both joint SDMs that model multiple species together, and stacked SDMs that model each species individually, combining the predictions afterward. We offer a comprehensive evaluation of these SDM approaches by examining their performance in predicting withheld empirical validation data of different sizes representing five different taxonomic groups, and for prediction tasks related to both interpolation and extrapolation. We measure predictive performance by 12 measures of accuracy, discrimination power, calibration, and precision of predictions, for the biological levels of species occurrence, species richness, and community composition. Our results show large variation among the models in their predictive performance, especially for communities comprising many species that are rare. The results do not reveal any major trade‐offs among measures of model performance; the same models performed generally well in terms of accuracy, discrimination, and calibration, and for the biological levels of individual species, species richness, and community composition. In contrast, the models that gave the most precise predictions were not well calibrated, suggesting that poorly performing models can make overconfident predictions. However, none of the models performed well for all prediction tasks. As a general strategy, we therefore propose that researchers fit a small set of models showing complementary performance, and then apply a cross‐validation procedure involving separate data to establish which of these models performs best for the goal of the study.
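
The strategy proposed in the last sentence of this abstract (fit a few candidate models, then compare them on held-out data) can be sketched roughly as follows. This is a minimal illustration and not the procedure used by Norberg et al.: the two candidate models (`fit_majority`, `fit_threshold`), the single covariate, the toy presence/absence data, and the use of plain accuracy as the score are all assumptions made for the example.

```python
def k_fold_indices(n, k):
    """Split range(n) into k contiguous folds of near-equal size."""
    folds, start = [], 0
    for i in range(k):
        size = n // k + (1 if i < n % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def cross_validated_accuracy(fit, x, y, k=4):
    """Mean held-out accuracy of a model given as fit(x_train, y_train) -> predict."""
    scores = []
    for fold in k_fold_indices(len(x), k):
        held_out = set(fold)
        x_train = [xi for i, xi in enumerate(x) if i not in held_out]
        y_train = [yi for i, yi in enumerate(y) if i not in held_out]
        predict = fit(x_train, y_train)
        hits = sum(predict(x[i]) == y[i] for i in fold)
        scores.append(hits / len(fold))
    return sum(scores) / len(scores)


# Hypothetical candidate 1: always predict the more common class (ties -> presence).
def fit_majority(x_train, y_train):
    majority = int(sum(y_train) * 2 >= len(y_train))
    return lambda xi: majority


# Hypothetical candidate 2: threshold halfway between the mean covariate value
# at presences and at absences (assumes both classes occur in the training data).
def fit_threshold(x_train, y_train):
    present = [xi for xi, yi in zip(x_train, y_train) if yi == 1]
    absent = [xi for xi, yi in zip(x_train, y_train) if yi == 0]
    cut = (sum(present) / len(present) + sum(absent) / len(absent)) / 2
    return lambda xi: int(xi >= cut)


# Toy data: one environmental covariate and presence (1) / absence (0) of one species.
x = [1, 9, 2, 8, 3, 7, 4, 6]
y = [0, 1, 0, 1, 0, 1, 0, 1]

candidates = {"majority": fit_majority, "threshold": fit_threshold}
scores = {name: cross_validated_accuracy(fit, x, y) for name, fit in candidates.items()}
best = max(scores, key=scores.get)
```

In a real study the score would be chosen to match the goal (for example discrimination or calibration rather than raw accuracy), and the folds would be designed to reflect whether the task is interpolation or extrapolation.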

Genetics

Ecology

International scientists formulate a roadmap for insect conservation and recovery

Jeffrey Harvey et al., Jan 6, 2020

Genetics

Ecology

Towards understanding diversity, endemicity and global change vulnerability of soil fungi

Leho Tedersoo et al., Mar 19, 2022

Summary: Fungi play pivotal roles in ecosystem functioning, but little is known about their global patterns of diversity, endemicity, vulnerability to global change drivers and conservation priority areas. We applied the high-resolution PacBio sequencing technique to identify fungi based on a long DNA marker that revealed a high proportion of hitherto unknown fungal taxa. We used a Global Soil Mycobiome consortium dataset to test relative performance of various sequencing depth standardization methods (calculation of residuals, exclusion of singletons, traditional and SRS rarefaction, use of Shannon index of diversity) to find optimal protocols for statistical analyses. Altogether, we used six global surveys to infer these patterns for soil-inhabiting fungi and their functional groups. We found that residuals of log-transformed richness (including singletons) against log-transformed sequencing depth yield significantly better model estimates compared with most other standardization methods. With respect to global patterns, fungal functional groups differed in the patterns of diversity, endemicity and vulnerability to main global change predictors. Unlike α-diversity, endemicity and global-change vulnerability of fungi and most functional groups were greatest in the tropics. Fungi are vulnerable mostly to drought, heat, and land cover change. Fungal conservation areas of highest priority include wetlands and moist tropical ecosystems.
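
The standardization highlighted in this summary (residuals of log-transformed richness against log-transformed sequencing depth) can be illustrated with a short sketch. It assumes a simple ordinary least squares fit with natural logarithms; the summary does not state the exact regression form the authors used, and the function name and sample values below are hypothetical.

```python
import math


def log_richness_residuals(richness, depth):
    """Residuals of log(richness) regressed on log(depth) by ordinary least squares."""
    x = [math.log(d) for d in depth]
    y = [math.log(r) for r in richness]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # A positive residual means more taxa than expected for that sequencing depth.
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


# Hypothetical samples: reads per sample and observed taxon richness (singletons included).
depth = [1000, 2000, 4000, 8000]
richness = [50, 80, 90, 160]
residuals = log_richness_residuals(richness, depth)
```

The residuals, rather than raw richness, would then serve as the depth-corrected diversity measure in downstream models.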

Ecology

Law

Global convergence of dominance and neglect in flying insect diversity

Amrita Srivathsan et al., Aug 4, 2022

Abstract: Most arthropod biodiversity is unknown to science. For this reason, it has been unclear whether insect communities around the world are dominated by the same or different taxa. This question can be answered through standardized sampling of biodiversity followed by estimation of species diversity and community composition with DNA sequences. This approach is here applied to flying insects sampled by 39 Malaise traps placed in five biogeographic regions, eight countries, and numerous habitats (>220,000 specimens belonging to >25,000 species in 463 families). Unexpectedly, we find that 20 insect families account for >50% of local species diversity regardless of continent, climatic region, and habitat type. These consistent differences in family-level dominance explain two-thirds of variation in community composition despite massive levels of species turnover, with most species (>97%) in the top 20 families encountered at a single site only. Alarmingly, the same families that dominate global insect diversity also suffer from extreme taxonomic neglect, with few signs of increasing activity in recent years. Tackling the biodiversity of these “dark taxa” thereby emerges as an urgent priority because the arthropod groups comprising most of the global flying insect diversity are particularly poorly known.

Ecology

Biochemistry

Optimizing insect metabarcoding using replicated mock communities

Elzbieta Iwaszkiewicz‐Eggebrecht et al., Jun 22, 2022

Abstract: Metabarcoding (high-throughput sequencing of marker gene amplicons) has emerged as a promising and cost-effective method for characterizing insect community samples. Yet, the methodology varies greatly among studies and its performance has not been systematically evaluated to date. In particular, it is unclear how accurately metabarcoding can resolve species communities in terms of presence-absence, abundances, and biomass. Here we use mock community experiments and a simple probabilistic model to evaluate the performance of different metabarcoding protocols. Specifically, we ask four questions: (Q1) How consistent are the recovered community profiles across replicate mock communities? (Q2) How does the choice of lysis buffer affect the recovery of the original community? (Q3) How are community estimates affected by differing lysis times and homogenization? (Q4) Is it possible to obtain adequate species abundance estimates through the use of biological spike-ins? We show that estimates are quite variable across community replicates. In general, a mild lysis protocol is better at reconstructing species lists and approximate counts, while homogenization is better at retrieving biomass composition. Tiny insects are more likely to be detected in lysates, while some tough species require homogenization to be detected. Results are less consistent across biological replicates for lysates than for homogenates. Some species are associated with strong PCR amplification bias, which complicates the reconstruction of species counts. Yet, with adequate spike-in data, species abundance can be determined with roughly 40% standard error for homogenates, and with roughly 50% standard error for lysates, under ideal conditions. In the latter case, however, this often requires species-specific reference data, while spike-in data generalizes better across species for homogenates. We conclude that a non-destructive, mild lysis approach shows the highest promise for presence/absence description of the community, while also allowing future morphological or molecular work on the material. However, homogenization protocols perform better for characterizing community composition, in particular in terms of biomass.

Genetics

Ecology

SPIKEPIPE: A metagenomic pipeline for the accurate quantification of eukaryotic species occurrences and abundances using DNA barcodes or mitogenomes

Yinqiu Ji et al., Jan 29, 2019

Abstract: The accurate quantification of eukaryotic species abundances from bulk samples remains a key challenge for community description and environmental biomonitoring. We resolve this challenge by combining shotgun sequencing, mapping to reference DNA barcodes or to mitogenomes, and three correction factors: (1) a percent-coverage threshold to filter out false positives, (2) an internal-standard DNA spike-in to correct for stochasticity during sequencing, and (3) technical replicates to correct for stochasticity across sequencing runs. This pipeline achieves a strikingly high accuracy of intraspecific abundance estimates from samples of known composition (mapping to barcodes R² = 0.93, mitogenomes R² = 0.95) and a high repeatability across environmental-sample replicates (barcodes R² = 0.94, mitogenomes R² = 0.93). As proof of concept, we sequence arthropod samples from the High Arctic systematically collected over 17 years, detecting changes in species richness, abundance, and phenology using either barcodes or mitogenomes. SPIKEPIPE provides cost-efficient and reliable quantification of eukaryotic communities, with direct application to environmental biomonitoring.

Statement of authorship: NMS has been involved in running the BioBasis sampling program for more than twenty years. TR, NMS, DWY, and OO conceived the study and its design. TH led the work in generating all the DNA samples and YJ led the work in assembling and annotating the mitogenomes for the mitochondrial genome reference database. TH led the work in generating the mock communities and bulk samples, with contributions from YJ and JW. YJ and DWY developed the molecular and bioinformatic methods. OO led the modelling of the data. TR and OO wrote the first draft of the manuscript, and all authors contributed substantially to its further improvement.

Data accessibility statement: Should the manuscript be accepted, the data supporting the results will be archived in an appropriate public repository (Dryad), and the data DOI will be included at the end of the article. The bioinformatic and R scripts and associated data tables will also be made available on github.com.
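
The three correction factors named in the abstract can be pictured with a rough sketch. This is not the SPIKEPIPE code: the data layout, the function name `corrected_abundance`, the 0.5 coverage cutoff, the division by spike-in reads, and the plain averaging across replicates are all assumptions chosen to make the idea concrete.

```python
from statistics import mean


def corrected_abundance(replicates, min_coverage=0.5):
    """Combine technical replicates into one spike-in-scaled abundance per species.

    replicates: list of dicts, each with
        'spike_reads': reads mapped to the internal-standard spike-in
        'species': {name: (mapped_reads, fraction_of_reference_covered)}
    min_coverage is a hypothetical cutoff, not a value taken from the paper.
    """
    names = set()
    for rep in replicates:
        names.update(rep["species"])

    result = {}
    for name in sorted(names):
        values = []
        for rep in replicates:
            reads, coverage = rep["species"].get(name, (0, 0.0))
            if coverage < min_coverage:
                # (1) low-coverage hits are treated as false positives
                values.append(0.0)
            else:
                # (2) scale by spike-in reads to make replicates comparable
                values.append(reads / rep["spike_reads"])
        # (3) average over technical replicates
        result[name] = mean(values)
    return result


# Hypothetical input: two technical replicates of one bulk sample.
replicates = [
    {"spike_reads": 100, "species": {"sp_A": (200, 0.9), "sp_B": (10, 0.1)}},
    {"spike_reads": 50, "species": {"sp_A": (150, 0.8), "sp_B": (5, 0.2)}},
]
abundance = corrected_abundance(replicates)
```

Here `sp_B` falls below the assumed coverage cutoff in both replicates and is scored as absent, while `sp_A` gets the mean of its two spike-in-scaled read counts.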

Genetics

Ecology

Establishing insect community composition using metabarcoding of soil samples, and preservative ethanol and homogenate from Malaise trap catches: surprising inconsistencies between methods

Daniel Marquina et al., Apr 2, 2019

Abstract: DNA metabarcoding allows the analysis of insect communities faster and more efficiently than ever before. However, metabarcoding can be conducted through several alternative approaches, and the consistency of results across methods has rarely been studied. We compare the results obtained by DNA metabarcoding of the same communities using two different markers (COI and 16S) and three different sampling methods: homogenized Malaise trap samples (homogenate), preservative ethanol from the same samples, and soil samples. Our results indicate that COI and 16S offer partly complementary information on Malaise trap samples, with each marker detecting a significant number of species not detected by the other. Different sampling methods offer highly divergent estimates of community composition. The community recovered from preservative ethanol of Malaise trap samples is quite distinct from that recovered from homogenate. Small and weakly sclerotized insects tend to be overrepresented in ethanol, with some exceptions that could be related to taxon-specific traits. For soil samples, highly degenerate COI primers pick up large amounts of non-target DNA and only 16S provides adequate analyses of insect diversity. However, even with 16S, very little overlap in MOTU content was found between the trap and the soil samples. Our results demonstrate that no metabarcoding approach is all-comprehensive in itself. For instance, DNA extraction from preservative ethanol is not a valid replacement for destructive bulk extraction but a complement. In future metabarcoding studies, both should ideally be used together to achieve comprehensive representation of the target community.

Genetics

Ecology
