Metagenomics Methods And Protocols Pdf

Metagenomics Of The Microbial Nitrogen Cycle

Author :Diana Marco
ISBN :9781908230607
Genre :Science

The nitrogen (N) cycle is one of the most important nutrient cycles on Earth and many of its steps are performed by microbial organisms. During the cycling process, greenhouse gases are formed, including nitrous oxide and methane. In addition, the use of nitrogen fertilizers increases freshwater nitrate levels, causing pollution and human health problems. A greater knowledge of the microbial communities involved in nitrogen transformations is necessary to understand and counteract nitrogen pollution. Written by renowned researchers specialised in the most relevant and emerging topics in the field, this book provides comprehensive information on the new theoretical, methodological and applied aspects of metagenomics and other 'omics' approaches used to study the microbial N cycle. Recommended for microbiologists, environmental scientists and anyone interested in microbial communities, metagenomics, metatranscriptomics and metaproteomics of the microbial N cycle. This volume provides a thorough account of the contributions of metagenomics to microbial N cycle background theory, reviews state-of-the-art investigative methods and explores new applications in water treatment, agricultural practices and climate change, among others.

Metagenomics

Author :Diana Marco
ISBN :9781910190609
Genre :Science

Metagenomics continues to be one of the most dynamic scientific fields due largely to the development of new and cheaper sequencing technologies. The diversity of habitats explored with metagenomics and other meta-omics techniques has increased exponentially in recent years. The resulting cascade of data has led to a new range of methodological problems and solutions. In this collection of reviews, expert authors describe the cutting-edge and emerging conceptual and methodological tools being employed to deal with current issues in metagenomics. Topics covered include the integration of ecology and metagenomics; the organization, classification, analysis and interpretation of the vast amount of data; the new statistical and bioinformatic techniques; sample extraction and processing techniques; and various applications of metagenomics in specific areas. The volume is essential reading for researchers and students commencing projects in this field, for researchers active in metagenomics areas, and for educators interested in the latest developments. The volume is also of value to anyone involved in biotechnology, bioremediation, biodegradation and environmental microbiology.

Mining Of Microbial Wealth And Metagenomics

Author :Vipin Chandra Kalia
ISBN :9789811057083
Genre :Medical

The existence of living organisms in diverse ecosystems has been the focus of interest to human beings, primarily to obtain insights into the diversity and dynamics of the communities. This book discusses how the advent of novel molecular biology techniques, the latest being the next-generation sequencing technologies, helps to elucidate the identity of novel organisms, including those that are rare. The book highlights the fact that oceans, marine environments, rivers, mountains and the gut are ecosystems with great potential for obtaining bioactive molecules, which can be used in areas such as agriculture, food, medicine, water supplies and bioremediation. It then describes the latest research in metagenomics, a field that allows elucidation of the maximum biodiversity within an ecosystem, without the need to actually grow and culture the organisms. Further, it describes how human-associated microbes are directly responsible for our health and overall wellbeing.

Microbial Nitrogen Cycling Dynamics In Coastal Systems

ISBN :STANFORD:wt558td2044

Human influence on the global nitrogen cycle (e.g., through fertilizer and wastewater runoff) has caused a suite of environmental problems including acidification, loss of biodiversity, increased concentrations of greenhouse gases, and eutrophication. These environmental risks can be lessened by microbial transformations of nitrogen; nitrification converts ammonia to nitrite and nitrate, which can then be lost to the atmosphere as N2 gas via denitrification or anammox. Microbial processes thus determine the fate of excess nitrogen and yet recent discoveries suggest that our understanding of these organisms is deficient. This dissertation focuses on microbial transformations of nitrogen in marine and estuarine systems through laboratory and field studies, using techniques from genomics, microbial ecology, and microbiology. Recent studies revealed that many archaea can oxidize ammonia (AOA; ammonia-oxidizing archaea), in addition to the well-described ammonia-oxidizing bacteria (AOB). Considering that these archaea are among the most abundant organisms on Earth, these findings have necessitated a reevaluation of nitrification to determine the relative contribution of AOA and AOB to overall rates and to determine if previous models of global nitrogen cycling require adjustment to include the AOA. I examined the distribution, diversity, and abundance of AOA and AOB in the San Francisco Bay estuary and found that the region of the estuary with low-salinity and high C:N ratios contained a group of AOA that were both abundant and phylogenetically distinct. In most of the estuary where salinity was high and C:N ratios were low, AOB were more abundant than AOA—despite the fact that AOA outnumber AOB in soils and the ocean, the two end members of an estuary. This study suggested that a combination of environmental factors including carbon, nitrogen, and salinity determine the niche distribution of the two groups of ammonia-oxidizers. 
In order to gain insight into the genetic basis for ammonia oxidation by estuarine AOA, we sequenced the genome of a new genus of AOA from San Francisco Bay using single cell genomics. The genome data revealed that the AOA have genes for both autotrophic and heterotrophic carbon metabolism, unlike the autotrophic AOB. These AOA may be chemotactic and motile based on numerous chemotaxis and motility-associated genes in the genome and electron microscopy evidence of flagella. Physiological studies showed that the AOA grow aerobically but they also oxidize ammonia at low oxygen concentrations and may produce the potent greenhouse gas N2O. Continued cultivation and genomic sequencing of AOA will allow for in-depth studies on the physiological and metabolic potential of this novel group of organisms that will ultimately advance our understanding of the global carbon and nitrogen cycles. Denitrifying bacteria are widespread in coastal and estuarine environments and account for a significant reduction of external nitrogen inputs, thereby diminishing the amount of bioavailable nitrogen and curtailing the harmful effects of nitrogen pollution. I determined the abundance, community structure, biogeochemical activity, and ecology of denitrifiers over space and time in the San Francisco Bay estuary. Salinity, carbon, nitrogen and some metals were important factors for denitrification rates, abundance, and community structure. Overall, this study provided valuable new insights into the microbial ecology of estuarine denitrifying communities and suggested that denitrifiers likely play an important role in nitrogen removal in San Francisco Bay, particularly at high salinity sites.

The Metabolic Pathways And Environmental Controls Of Hydrocarbon Biodegradation In Marine Ecosystems

Author :Joel E. Kostka
ISBN :9782889193462

Biodegradation mediated by indigenous microbial communities is the ultimate fate of the majority of oil hydrocarbon that enters the marine environment. The aim of this Research Topic is to highlight recent advances in our knowledge of the pathways and controls of microbially-catalyzed hydrocarbon degradation in marine ecosystems, with emphasis on the response of microbial communities to the Deepwater Horizon oil spill in the Gulf of Mexico. In this Research Topic, we encouraged original research and reviews on the ecology of hydrocarbon-degrading bacteria, the rates and mechanisms of biodegradation, and the bioremediation of discharged oil under in situ as well as near in situ conditions.

Research On Nitrification And Related Processes

ISBN :0123864909
Genre :Science

The global nitrogen cycle is the one most impacted by mankind. The past decade has changed our view on many aspects of the microbial biogeochemical cycles, including the global nitrogen cycle, which is mainly due to tremendous advances in methods, techniques and approaches. Many novel processes and the molecular inventory and organisms that facilitate them have been discovered only within the last 5 to 10 years, and the process is in progress. Research on Nitrification and Related Processes, Part B provides state-of-the-art updates on methods and protocols dealing with the detection, isolation and characterization of macromolecules and their hosting organisms that facilitate nitrification and related processes in the nitrogen cycle as well as the challenges of doing so in very diverse environments.

Nitrogen Cycling In Bacteria

Author :James W. B. Moir
ISBN :9781904455868
Genre :Science

Microorganisms that convert gaseous nitrogen (N2) to a form suitable for use by living organisms are pivotal for life on Earth. Another set of microbial reactions utilizes the bio-available nitrogen, creating N2 and completing the cycle. This crucial nutrient cycle has long been the subject of extensive research, and recent advances in studying the biochemistry, bioinformatics, cell biology, and physiology of bacterial nitrogen cycling processes, alongside the advent of the omics age, have had a massive impact, enabling us to fully appreciate the sheer diversity of approaches adopted by individual organisms. Research in this area is at a very exciting stage. This timely book provides comprehensive reviews of current nitrogen cycle research and gives a broader perspective on the state of our understanding of this key biogeochemical cycle. With contributions from expert authors from around the world, the topics covered include: the archaean N-cycle; redox complexes N-cycle; organization of respiratory chains in N-cycle processes; Mo-nitrogenase; nitrogen assimilation in bacteria; alternative routes to dinitrogen; nitrite and nitrous oxide reductases; assembly of respiratory proteins; nitric oxide metabolism; denitrification in legume-associated endosymbiotic bacteria; nitrous oxide production in the terrestrial environment; bacterial nitrogen cycling in humans. This book will serve as a valuable reference work for everyone working in this field and will also be of interest to researchers studying symbioses, environmental microbiology, plant metabolism, infection events, and other prokaryote-eukaryote interactions.

Processes In Microbial Ecology

Author :David L. Kirchman
ISBN :9780191624223
Genre :Science

Microbial ecology is the study of interactions among microbes in natural environments and their roles in biogeochemical cycles, food web dynamics, and the evolution of life. Microbes are the most numerous organisms in the biosphere and mediate many critical reactions in elemental cycles and biogeochemical reactions. Because microbes are essential players in the carbon cycle and related processes, microbial ecology is a vital science for understanding the role of the biosphere in global warming and the response of natural ecosystems to climate change. This novel textbook discusses the major processes carried out by viruses, bacteria, fungi, protozoa and other protists - the microbes - in freshwater, marine, and terrestrial ecosystems. It focuses on biogeochemical processes, starting with primary production and the initial fixation of carbon into cellular biomass, before exploring how that carbon is degraded in both oxygen-rich (oxic) and oxygen-deficient (anoxic) environments. These biogeochemical processes are affected by ecological interactions, including competition for limiting nutrients, viral lysis, and predation by various protists in soils and aquatic habitats. The book neatly connects processes occurring at the micron scale to events happening at the global scale, including the carbon cycle and its connection to climate change issues. A final chapter is devoted to symbiosis and other relationships between microbes and larger organisms. Microbes have huge impacts not only on biogeochemical cycles, but also on the ecology and evolution of more complex forms of life, including Homo sapiens.

Microbial Metagenomics Reveals Climate Relevant Subsurface Biogeochemical Processes

ISBN :OCLC:1052084712

Abstract: Microorganisms play key roles in terrestrial system processes, including the turnover of natural organic carbon, such as leaf litter and woody debris that accumulate in soils and subsurface sediments. What has emerged from a series of recent DNA sequencing-based studies is recognition of the enormous variety of little known and previously unknown microorganisms that mediate recycling of these vast stores of buried carbon in subsoil compartments of the terrestrial system. More importantly, the genome resolution achieved in these studies has enabled association of specific members of these microbial communities with carbon compound transformations and other linked biogeochemical processes, such as the nitrogen cycle, that can impact the quality of groundwater, surface water, and atmospheric trace gas concentrations. The emerging view also emphasizes the importance of organism interactions through exchange of metabolic byproducts (e.g., within the carbon, nitrogen, and sulfur cycles) and via symbioses since many novel organisms exhibit restricted metabolic capabilities and an associated extremely small cell size. New, genome-resolved information reshapes our view of subsurface microbial communities and provides critical new inputs for advanced reactive transport models. These inputs are needed for accurate prediction of feedbacks in watershed biogeochemical functioning and their influence on the climate via the fluxes of greenhouse gases, CO2, CH4, and N2O.

Trends: Datasets from subsurface samples can now be resolved into collections of complete or near-complete microbial genomes, yielding information about biogeochemical roles and mechanisms by which surface- and groundwater quality and atmospheric compositions are impacted. Deep sequencing reveals extremely high levels of diversity in both the vadose zone and groundwater. Many novel organisms have an extremely small cell size and small genome size, with restricted metabolic capability. Their growth is likely tightly linked to that of other community members. Genomic analyses suggest that subsurface geochemical processes reflect the functioning of complex communities as opposed to a few dominant species. Newly discovered microorganisms catalyze transformations relevant to greenhouse gases and processing of biologically critical elements.

Understanding Terrestrial Microbial Communities

Author :Christon J. Hurst
ISBN :9783030107772

Published online 2010 Apr 1. doi: 10.1186/1479-7364-4-4-282

Book review

Metagenomics is a young overarching discipline that seeks to understand population dynamics and interactions among vast microbial worlds that have not been revealed by traditional culture methods in the microbiology laboratory. Metagenomics is emerging as an essential scientific discipline: there are far-reaching implications in understanding ecosystem responses to changes in the natural environment, their adaptation to artificial niches resulting from human activities and their role as a source of human disease. There are also substantial opportunities for beneficial applications of metagenomics knowledge, for example in energy production, medicinals and bioremediation.

Advances in instrumentation and computational and molecular tools enable high-throughput gathering and analyses of environmental samples at several levels: sequence information for microbial DNA, RNA and protein, and analysis of metabolic intermediates. As a new field of study, metagenomics integrates large-scale data about molecules to describe the biodiversity and relationships of an enormous microbial 'underground' in their natural environment. The impact of such ambitious inquiry, however, cannot be overstated. It begins by interrogating the microbial world, but metagenomics has the potential to provide a molecular view of interrelationships among all living organisms.

Metagenomics: Theory, Methods, and Applications reviews the field at several levels. Chapter 1 is an overview of how culture-independent metagenomic analysis of environmental DNA sequences of small subunit ribosomal RNA genes increased knowledge about the diversity of all microbial groups. Multiple strand amplification of microbial DNA using Phi29 DNA polymerase, shotgun sequencing and pyrosequencing enabled high-throughput sequencing of microbial DNA. The development of metagenomic bioinformatics tools enabled genome assembly and classification of large-scale sequencing data. These metagenomic approaches have so far expanded bacterial classification into 30 new divisions. Archaea are now represented by 50 distinct phylogenetic groups in both extreme and non-extreme environments. Fungi are now estimated at over one million species, distributed among 49 distinct phylotypes. New eukaryotic microbes in anoxic environments are being identified as well. In addition, metagenomic identification and analysis of environmental viruses are revealing their role in shaping microbial ecosystems. The challenge that follows large-scale classification and genome sequencing of new organisms is to define the composition, structure and temporal connectivity of microbial ecosystems. The combination of several 'omics' approaches and bioinformatics should help model microbial communities in the future.

Chapter 2 complements Chapter 1, and is an in-depth review of meta'omics approaches that measure active genes in microbial populations at the levels of RNA (metatranscriptomics) and protein (metaproteomics) sequences, and their metabolite (metabolomics) intermediates. Metatranscriptomics is powered by high-throughput pyrosequencing of reverse-transcribed mRNA and rRNA. It generates, respectively, gene expression profiles and taxonomic data of microbial communities in the same sample. Mass spectrometric peptide fingerprint analysis and electrospray ionisation peptide-sequencing technologies enable identification and metaproteomic analyses of protein expression profiles. In combination with genomic and gene transcription data of a given microbial ecosystem, metaproteomics provides functional and structural data, and helps to identify metabolic pathways. Finally, metabolomics improves our understanding of ecosystem adaptation to environmental cues. From the application standpoint, however, metabolic pathways may be harnessed for bioremediation or the production of biomolecules for human and industrial use.

Chapter 3 deals with horizontal gene transfer (HGT) among different bacterial species, and the well recognised role of HGT in bacterial evolution. HGT could influence the interpretation of metagenomic data in a given bacterial community. The authors incorporated computational tools and were able to quantitate HGT sequences in different bacterial ecosystems in both the physical environment and the gut of mice and humans. An interesting finding is that the adherence substrate of bacterial flora influences the rate of HGT. These results have potential for the understanding of host-pathogen interactions, particularly pertaining to bacterial biofilm formation in human disease (for example, in cystic fibrosis, and pathogenic adaptation of the intestinal flora in the development of prominent diseases such as diabetes and food-borne allergies).

Chapters 4 and 5 present detailed sampling and computational methods for acquisition and analysis of metagenomic data. Chapters 6-9 are comprehensive examples of metagenomic applications in plant-microbe interactions, bioremediation, identification and generation of bioproducts for medicinal and industrial uses. In particular, the archaeal metagenome is a promising source of basic knowledge about microbial communities in extreme environments and a source of novel genes that could be deployed for biotechnological and medical applications. Importantly, Chapter 10 describes how the human microbial flora is now subject to metagenomic interrogation. Undoubtedly, characterisation of the human microbiome could feed into new basic research into understanding the increased incidence of major human illnesses such as asthma, diabetes, heart disease and allergies that are only partly explained by genetic predisposition.

Lastly, Chapter 11 is a highly readable philosophical journey on how metagenomics shapes long-held arguments about evolutionary processes and the interactions of living organisms at multiple levels. This chapter defends metagenomics as a scientific discipline and its promise in advancing hypothesis-driven research at molecular and organismal levels. Overall, Metagenomics: Theory, Methods, and Applications is a well-written and balanced presentation of an emerging area of research. In terms of nomenclature, it is suggested that the word 'metagenomics' is capitalised as 'Metagenomics' when it is used as an overall subject of inquiry covering several 'omics' approaches, but not capitalised when it specifically refers to the analysis of DNA sequences. Each chapter clearly lays out metagenomics as an evolving discipline, its promise and its limitations. This book will easily find an audience in undergraduate and graduate classrooms, and as a tool of independent research.

Diana Marco, editor. Metagenomics: Theory, methods, and applications. Caister Academic Press, Norfolk, UK; 2010. p. x + 212. (plus colour plates); Hardback; £159/US$310.

Articles from Human Genomics are provided here courtesy of BioMed Central

Microbiomes are ubiquitous and are found in the ocean, the soil, and in/on other living organisms. Changes in the microbiome can impact the health of the environmental niche in which they reside. In order to learn more about these communities, different approaches based on data from multiple omics have been pursued. Metagenomics produces a taxonomical profile of the sample, metatranscriptomics helps us to obtain a functional profile, and metabolomics completes the picture by determining which byproducts are being released into the environment. Although each approach provides valuable information separately, we show that, when combined, they paint a more comprehensive picture. We conclude with a review of network-based approaches as applied to integrative studies, which we believe hold the key to in-depth understanding of microbiomes.

Keywords

microbiome, metagenomics, metatranscriptomics, metabolomics, networks

Communities of microbes are found in diverse environmental niches, such as the ocean, soil, and inside host organisms, including all animals, plants, and lower eukaryotes.¹ These communities show characteristics, such as complexity, diversity, interaction, cooperation, dynamism, generosity, danger, and competition.² In such communities, microbes may compete for nutrients,³ share functional genes through horizontal gene transfer,⁴ produce toxins that can kill other microbes,⁵ produce various metabolites and signaling molecules for sharing and communication,⁶ and combine forces to fight common enemies, such as the host immune system.⁷ In short, the importance of the microbial community stems from the fact that they are critical to the health of the environmental niche in which they reside,⁸ and an imbalance in the community could be harmful.⁹

Traditionally, a microbiome has been defined as a microbial community occupying a reasonably well-defined habitat.¹⁰ One of the most common approaches to studying a microbiome is analyzing its constituent microbial genomes through metagenomics. More recently, this definition has evolved to include not only the microbes and their genomes but also the aggregate of environmental and host factors. The inclusion of the host environment as part of the microbiome significantly expands its implications, with the interactions between the host and its associated microbial community now relevant to understanding the dynamics of the microbiome. For evolutionary and functional studies of the microbiome, modifications in the host environment (eg, a diet shift in the host organism or a compositional change in the environmental matrix under study) now become critical and must be taken into consideration. Coevolution processes can then be identified, providing valuable information to understand the relationship of the microbial community with its host. This apparent conceptual shift is accompanied by the recognition that, in order to achieve a more comprehensive study of microbiomes, metagenomics must be combined with other omic approaches. Many relevant omic approaches have been proposed for microbiome studies. In this article, we discuss metatranscriptomics and metabolomics, which are rapidly becoming critical to microbiome studies.

Metagenomics is the study of the genomes in a microbial community and constitutes the first step to studying the microbiome. As discussed in the “Metagenomics” section, metagenomics comes in different flavors. However, its main purpose is to infer the taxonomic profile of a microbial community. Although whole-metagenome sequencing (WMS) provides a partial glimpse into the functional profile of a microbial community, that profile is better inferred using metatranscriptomics, which involves sequencing the complete (meta)transcriptome of the microbial community. Metatranscriptomics informs us of the genes that are expressed by the community as a whole. With the use of functional annotations of expressed genes, it is possible to infer the functional profile of a community under specific conditions, which are usually dependent on the status of the host. While metagenomics helps address the question “what is the composition of a microbial community under different conditions?”, and metatranscriptomics helps answer the question “what genes are collectively expressed under different conditions?”, the question considered by metabolomics is “what byproducts are produced under different conditions?”. The metabolites released by the microbial community are largely responsible for the health of the environmental niche that they inhabit.

Regardless of whether microbiome studies are biomedical or environmental in their focus, it is clear that the different omic approaches provide invaluable information. However, the best results are obtained by performing integrative studies that involve all available omic datasets.¹¹ While such efforts hold promise, the integration must be done carefully.¹²

As suggested by a variety of different analyses,^13–16 we believe that network-based approaches can lead to a sophisticated in-depth analysis of microbiomes, particularly when applied to integrative studies, and consequently lead to critical insights into the world of microbiomes.
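To make the idea of a network-based analysis concrete, the following is a minimal sketch of one common starting point, a co-occurrence network built from correlations between taxon abundances across samples. The function names, the toy abundance table, and the 0.8 threshold are all assumptions made for illustration and are not taken from the studies cited above. Correlations computed on relative abundances can be misleading, so this is an illustration of the concept only.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant profile carries no co-variation signal
    return cov / sqrt(vx * vy)


def cooccurrence_edges(abundance, threshold=0.8):
    """Return (taxon_a, taxon_b, r) for pairs with |r| >= threshold.

    `abundance` maps a taxon name to its abundance in each sample;
    all lists must have the same sample order. The threshold is an
    arbitrary illustrative choice.
    """
    edges = []
    for a, b in combinations(sorted(abundance), 2):
        r = pearson(abundance[a], abundance[b])
        if abs(r) >= threshold:
            edges.append((a, b, r))
    return edges


# Hypothetical abundances of four taxa across four samples.
abundance = {
    "taxon_A": [10, 20, 30, 40],
    "taxon_B": [5, 10, 15, 20],
    "taxon_C": [40, 30, 20, 10],
    "taxon_D": [7, 3, 9, 4],
}
edges = cooccurrence_edges(abundance)
```

In this toy table, taxa A and B rise together while C falls as they rise, so edges link A, B, and C, with the sign of `r` separating positive from negative associations; taxon D stays unconnected.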

The National Institutes of Health has funded a major initiative that aims to generate resources for a comprehensive characterization of the human microbiome to understand its impact on human health and disease. The first phase, known as the Human Microbiome Project (HMP),¹⁷ focuses on the study of microbial communities that inhabit the bodies of healthy individuals,^18,19 with particular emphasis on nasal, oral, skin, gastrointestinal, and urogenital areas.^17,18,20–23 It is known that the number of microbial cells present in the human body is notably larger than the number of human cells. These bacterial communities play critical roles, such as assisting in the digestion of food, synthesizing necessary vitamins, and aiding the immune system in defending our body from pathogenic invaders.²⁴ Human microbiome studies have revealed strong correlations between changes in microbial community profiles and diseases.^22,25–27 These studies have also shown that the structure of the microbial community is significantly different in five areas of the human body (gut, mouth, airways, urogenital, and skin), and that this seems to be independent of gender, age, and ethnicity.^18,19 All the data and protocols associated with this project are available at the HMP Data Analysis and Coordination Center (DACC).²⁸

The Integrative HMP (iHMP)²⁷ is the second phase of this initiative, going a step further by gathering multiple omic data from both the microbiome and the host. This is part of a longitudinal study with a broader objective of understanding host-microbiome interactions using integrative analyses. Another related initiative focused on the human microbiome is the Metagenomics of the Human Intestinal Tract (MetaHIT) project.²⁹ This project was funded by the European Seventh Framework Programme until 2012. Its goal was to understand the link between the human intestinal microbiota and human health/disease. For this purpose, they focused on two disorders of increasing incidence in Europe: obesity and inflammatory bowel disease. Similarly, the Human Food Project and the American Gut Project³⁰ focus on the gut microbiome with the aim of determining how to acquire a healthy microbiome through food.

The Earth Microbiome Project (EMP) is a remarkable effort started in 2010 to characterize the diversity, distribution, and structure of microbial ecosystems across the planet and has already gathered over 30,000 samples.³¹ Their focus is on diverse ecosystems, including not only the ones within the bodies of humans, animals, and plants but also terrestrial, marine, freshwater, sediment, air, and constructed environments, as well as every intersection of these ecosystems.

The J. Craig Venter Institute's (JCVI) Global Oceanic Sampling (GOS) expeditions and the European Tara Oceans initiatives^32–36 have focused on understanding and cataloging the marine microbiome diversity across the planet. JCVI's vessel, Sorcerer II, has made multiple oceanic expeditions to collect samples from oceans across the globe. Their multistage processing allows them to exploit size differences to separate different groups of microbes, including large microzooplankton and phytoplankton (3–20 μm), picoplankton and large cyanobacteria (0.8–3 μm), prokaryotes and large viruses (0.1–0.8 μm), and viroplankton (below 0.1 μm).

Metagenomics allows us to investigate the composition of a microbial community. Genomic studies consider the genetic material of a specific organism, while metagenomics (meta meaning beyond) refers to studies of genetic material of entire communities of organisms. This process usually involves next-generation sequencing (NGS) after the DNA is extracted from the samples. NGS produces a large volume of data in the form of short reads, from which a microbial community profile or other information can be pieced together just like gathering information from the pieces of a puzzle.

Recently, some authors have argued in favor of a terminological distinction between metagenomics (used to describe a broad comprehensive genomic approach to microbiome profiling) and metataxonomics (which uses amplicons from a targeted marker gene in order to make taxonomic inferences).³⁷ One popular marker gene used in metataxonomic studies is 16S rDNA.^13,38–42 A large number of databases are available for amplicons targeting this region^43–45 to aid in the classification of reads and in building taxonomic profiles of a microbiome. With the advancement of technology, studies have shifted toward shotgun approaches,⁴⁶ such as WMS. As a result, a number of specialized databases with complete reference genomes have been developed.⁴⁷ These databases are then used to construct taxonomic profiles^18,48,49 but are also useful for inferring potential functional profiles for the microbial community based on the collection of genes present in the sample.

A variety of tools and analysis pipelines have been developed to analyze metagenomic data.⁵⁰ Problem-solving environments (PSEs⁵¹) provide user-friendly workbenches to develop flexible scientific analysis pipelines using a menu of available tools. Such workbenches incorporate different ranges of generality. For instance, Galaxy⁵² maximizes generality by providing a framework for genomic analysis while allowing the user to supply tools and file formats for various stages in a pipeline. Galaxy can execute jobs remotely, allows for undoing or repeating of individual steps, and permits inspection of intermediate results but requires considerable computational and storage resources. QIIME⁵³ provides a set of scripts that can be combined to analyze raw microbial DNA samples, including taxonomic classification using marker genes, such as 16S rRNA, and allows flexible pipelines to be constructed. Mothur⁵⁴ was initially designed to target the microbial ecology community but has since been adopted by the human microbiome community as well. It provides an extensible package with functionality accessible through a domain-specific language. Like QIIME, Mothur is also a metataxonomic tool, focusing on marker genes, such as 16S rRNA. Pathoscope⁵⁵ provides a pipeline that can identify bacterial strains present in a series of raw sequences and generate reports of statistics, such as percentages, gene locations, and protein products. Ideally, a PSE should be open source, infinitely extensible, lightweight, and able to accommodate any tool, user, or developer.

As shown in Figure 1, metagenomic analysis pipelines can be divided into three main steps: (1) preprocessing the reads, (2) processing the reads, and (3) downstream analyses.

Figure 1. Generic microbiome analysis pipeline.

The procedures followed in preprocessing and processing of the reads (steps 1 and 2) have become fairly standardized. Hence, we describe them briefly and focus mostly on downstream analysis (“Downstream analyses of metagenomic data” section).

Preprocessing mainly involves removing adapters from reads, filtering reads by quality and length, removing contaminants, identifying and removing any chimeric sequences that may have been generated during polymerase chain reaction (PCR) amplification, and preparing data for subsequent analysis. A survey of some of the popular tools and techniques currently available for this step can be found in Kim et al.⁵⁰
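As a rough illustration of the quality and length filtering mentioned above, the following minimal sketch keeps only reads that pass two thresholds. The `Read` type, the Phred+33 encoding, and the default thresholds are assumptions chosen for illustration; they are not taken from any specific tool surveyed in Kim et al.

```python
from typing import Iterable, Iterator, NamedTuple


class Read(NamedTuple):
    """Hypothetical in-memory representation of one FASTQ record."""
    name: str
    sequence: str
    quality: str  # quality string, assumed to be Phred+33 encoded


def mean_phred(quality: str, offset: int = 33) -> float:
    """Return the mean Phred score of a quality string."""
    if not quality:
        return 0.0
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def filter_reads(reads: Iterable[Read],
                 min_length: int = 50,
                 min_mean_quality: float = 20.0) -> Iterator[Read]:
    """Yield reads that are long enough and of sufficient mean quality."""
    for read in reads:
        if (len(read.sequence) >= min_length
                and mean_phred(read.quality) >= min_mean_quality):
            yield read
```

Adapter trimming, contaminant removal, and chimera detection are deliberately left out of this sketch; in practice these steps rely on dedicated tools.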

After preprocessing of the reads, the next step is to classify each read based on the taxa with the highest probability of being the origin of that read. This step often uses a reference database of relevant microbial genomes and produces a microbial profile usually represented as an abundance matrix with microbial taxa as rows, samples as columns, and values representing the abundance of a taxon in the sample.
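The abundance matrix described above can be sketched as follows, assuming that each read has already been assigned a taxon label by some classifier. The function name and the input layout (a dictionary from sample name to a list of per-read labels) are illustrative assumptions.

```python
from collections import Counter
from typing import Dict, List, Tuple


def abundance_matrix(
    assignments: Dict[str, List[str]]
) -> Tuple[List[str], List[str], List[List[int]]]:
    """Build a taxa-by-samples count matrix from per-read taxon labels.

    Returns (taxa, samples, matrix), where matrix[i][j] is the number of
    reads in samples[j] that were assigned to taxa[i].
    """
    samples = sorted(assignments)
    counts = {s: Counter(labels) for s, labels in assignments.items()}
    taxa = sorted({taxon for c in counts.values() for taxon in c})
    # Counter returns 0 for taxa that were not observed in a sample.
    matrix = [[counts[s][t] for s in samples] for t in taxa]
    return taxa, samples, matrix
```

Rows correspond to taxa and columns to samples, matching the layout described in the text; converting the counts to relative abundances would be a separate normalization step.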

In the case of metataxonomics, reads are frequently grouped (or clustered) prior to assigning a label. Unlike WMS, which produces a lower coverage and may identify thousands of strains per sample, targeted approaches have reads that come from relatively small regions of the genome, making this extra clustering step valuable in lowering errors in the classification. Groups of reads that result from the clustering process and display similarity in sequence and/or composition are inferred to have a common origin and are referred to as operational taxonomic units (OTUs).
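One simple way to picture the clustering step is a greedy, centroid-based scheme, sketched below. This is only an illustration: the identity measure assumes equal-length, pre-aligned amplicons, and the 0.97 threshold is a commonly quoted convention rather than a value stated in this article. Production tools use proper alignment and more careful heuristics.

```python
from typing import List


def identity(a: str, b: str) -> float:
    """Fraction of matching positions; assumes pre-aligned, equal-length reads."""
    if not a or len(a) != len(b):
        return 0.0  # simplification: unequal lengths are treated as dissimilar
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_otus(sequences: List[str], threshold: float = 0.97) -> List[List[str]]:
    """Assign each read to the first centroid it matches, else start a new OTU."""
    centroids: List[str] = []
    clusters: List[List[str]] = []
    for seq in sequences:
        for i, centroid in enumerate(centroids):
            if identity(seq, centroid) >= threshold:
                clusters[i].append(seq)
                break
        else:
            # No existing centroid is similar enough: seq founds a new OTU.
            centroids.append(seq)
            clusters.append([seq])
    return clusters
```

Because the scheme is greedy, the result depends on input order; sorting reads by abundance before clustering is a common mitigation.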

The classification and labeling performed on the reads can be either taxonomy dependent or taxonomy independent. Taxonomy-dependent methods use a database of reference genomes, which has some bias toward data with pathogenic or commercial applications. Methods in this category can be further classified as alignment-based, composition-based, or hybrid. Alignment-based methods usually give the highest accuracy but are limited by the reference database and by the alignment parameters used and are generally computation and memory intensive. Composition-based methods store only compact models instead of the whole genome, requiring fewer computational resources. These methods use features extracted from the genomes (eg, GC percentage and codon or oligonucleotide usage patterns) to build models but have not yet achieved the accuracy of alignment-based approaches. Hybrid approaches offer a compromise between the two. Taxonomy-independent methods, on the other hand, do not require a priori knowledge. Instead, they segregate reads based on properties, such as distance, k-mers, abundance levels, and frequencies. These methods are typically used if the samples are more likely to have microbes that are not documented in the databases. Chen et al.⁵⁶ and Mande et al.⁵⁷ reported an extensive review of popular tools and techniques used for processing 16S reads and for processing WMS reads, respectively.
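The compositional features mentioned above (GC percentage and oligonucleotide usage) are straightforward to compute. The sketch below shows one possible formulation; the function names and the default k of 4 are illustrative assumptions, and no particular classifier's feature set is implied.

```python
from collections import Counter
from typing import Dict


def gc_percentage(sequence: str) -> float:
    """Percentage of G and C bases in a sequence."""
    sequence = sequence.upper()
    if not sequence:
        return 0.0
    return 100.0 * sum(base in "GC" for base in sequence) / len(sequence)


def kmer_frequencies(sequence: str, k: int = 4) -> Dict[str, float]:
    """Relative frequency of each overlapping k-mer (oligonucleotide)."""
    sequence = sequence.upper()
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}
```

A composition-based method would typically train a compact model on such feature vectors for each reference genome and then score reads against those models.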

Accurate classification and labeling are challenging because (a) sequencing technologies produce short reads, (b) for economic reasons the datasets often have low coverage of the genomes in the microbiome, (c) some sequencing technologies have a high percentage of sequencing errors, and (d) the reference genome databases used are not comprehensive, often failing to provide an accurate taxonomic context because of lateral gene transfers between microbial taxa.

Once the reads have been assigned labels or classified as best as possible, downstream analyses attempt to extract useful knowledge from the data. Typical questions addressed in this step include “how diverse are the microbial taxa in the sample?”, “what is the functional profile of the genes present and/or expressed in the microbial community?”, “what microbial taxa are differentially abundant in the samples?”, “what phylogenetic groups, functional and metabolic pathways, orthologous groups of genes, and gene ontology terms are particularly enriched or depleted in the samples?”, and “what microbial groups tend to co-occur or co-avoid in the samples of interest?”. We now review several current tools and techniques for performing downstream analysis.

Richness and diversity are measures that have traditionally been used to characterize a metagenomic sample.^58,59 Richness is a simple count of taxa present in a sample. Diversity refers to a collection of indices and measures (eg, Shannon, Chao, Simpson, and Berger-Parker) that quantify the evenness of the distribution of the abundances of the taxa,⁵⁹ often incorporating distance measures or similarity indices (eg, Jaccard, Sørensen, and Bray-Curtis). Richness and diversity offer measures of complexity of the community but disclose little about interactions within the community, which requires more complex downstream analyses.
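For concreteness, several of the measures named above can be written in a few lines. The sketch below uses common textbook definitions (Shannon entropy with the natural logarithm, the Gini-Simpson form of the Simpson index, and Bray-Curtis dissimilarity); other variants exist, and the choice of variants here is an assumption rather than the convention of any particular tool.

```python
from math import log
from typing import List, Sequence


def _proportions(counts: Sequence[float]) -> List[float]:
    """Relative abundances of the taxa that are actually present."""
    total = sum(counts)
    return [c / total for c in counts if c > 0] if total else []


def richness(counts: Sequence[float]) -> int:
    """Number of taxa with non-zero abundance."""
    return sum(1 for c in counts if c > 0)


def shannon(counts: Sequence[float]) -> float:
    """Shannon index H = -sum(p * ln p)."""
    return -sum(p * log(p) for p in _proportions(counts))


def simpson(counts: Sequence[float]) -> float:
    """Gini-Simpson index 1 - sum(p^2)."""
    return 1.0 - sum(p * p for p in _proportions(counts))


def berger_parker(counts: Sequence[float]) -> float:
    """Proportion of the most abundant taxon."""
    props = _proportions(counts)
    return max(props) if props else 0.0


def bray_curtis(a: Sequence[float], b: Sequence[float]) -> float:
    """Bray-Curtis dissimilarity between two samples over the same taxa."""
    total = sum(a) + sum(b)
    if total == 0:
        return 0.0
    return 1.0 - 2.0 * sum(min(x, y) for x, y in zip(a, b)) / total
```

The first four functions describe a single sample, whereas `bray_curtis` compares two samples whose count vectors list the same taxa in the same order.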

Visualizing taxonomic profiles is a task that has been addressed by several initiatives. Krona,⁶⁰ for example, is a simple and intuitive web-based tool to visualize the taxonomic profile as a pie chart with an embedded hierarchy. In contrast, the Visualization and Analysis of Microbial Population Structure (VAMPS) tool⁶¹ can measure and visualize statistically significant similarities and differences between multiple taxonomic profiles of complex microbial communities.

Integrating additional information in metagenomic analyses is extremely valuable in order to provide improved perspectives of the microbial profiles. Based on this premise, a number of approaches have sought the use of phylogenetic information to enhance the labeling and classification of reads, as is the case with Amphora2,⁶² which performs phylogenetic inference using phylum-specific marker databases. This type of inference can be done algorithmically as well, through edge principal component analysis (PCA) and squash clustering.⁶³ Phymm^64,65 is a software package that classifies sequence fragments into phylogenetic groups using interpolated Markov models. Finally, PPlacer⁶⁶ performs phylogenetic placement using a fixed reference tree and maximum-likelihood inference with distance calculations to indicate uncertainty and can be executed in parallel.

A more significant improvement is possible with the help of functional annotations of the genes to which the reads are mapped.^67,68 Although many analytical metagenomic approaches focus on the composition or structure of the samples, functional profiling is also essential, as it provides insight into the underlying biological processes. Other useful resources for annotation include gene ontology (GO),^69,70 Kyoto Encyclopedia of Genes and Genomes (KEGG),^71,72 and Clusters of Orthologous Groups (COG).^73,74 As a part of the HMP initiative to analyze WMS data, a methodology called HUMAnN⁷⁵ was developed for inferring the functional and metabolic potential of a microbial community.

Alternatively, other existing tools, such as IMG/M,⁷⁶ CAMERA,⁷⁷ METAREP,⁷⁸ MEGAN,⁷⁹ and CoMet,⁸⁰ can also be used to obtain functional profiles of microbiomes. IMG/M, METAREP, and CoMet provide a web-based user interface, while CAMERA aims to offer a state-of-the-art computational structure for high-performance network access and grid computing as a part of a distributed architecture. In contrast, MEGAN is a standalone computer program. METAREP and CoMet annotate the data with GO and KEGG, whereas MEGAN uses the NCBI taxonomy to summarize and order the results obtained after performing BLAST. METAREP also offers the option to annotate the data with taxonomic information, and IMG/M uses BLAST to infer phylogenetic information from the sample. However, IMG/M is more oriented toward protein-related information by annotating the results with resources, such as COG, Pfam, TIGRFAMs, ENZYME, and KEGG. IMG/M was developed by the Joint Genome Institute and contains data from the HMP and the Genomic Encyclopedia of Bacteria and Archaea. CAMERA has been designed for environmental and ecological purposes with the aim of providing new ways of visualizing and interacting with data and was applied to data from GOS. METAREP, on the other hand, was developed at JCVI. It performs statistical tests and multidimensional scaling (MDS) and can also produce graphical summaries, heatmaps, and hierarchical clustering plots. MEGAN uses the lowest common ancestor algorithm to label the reads and has been applied to datasets such as the Sargasso Sea dataset and data from mammoth bone. Finally, CoMet combines open reading frame finding and assignment of protein sequences to Pfam domain families with comparative statistical analysis, providing the user with comprehensive tabular data files and visualizations in the form of hierarchical clustering and MDS. It was applied to 454 data.
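The idea behind lowest common ancestor labeling can be illustrated with a small sketch: when a read matches several reference taxa, it is assigned to the deepest taxonomic node shared by all of them. The code below is a simplified illustration only and is not MEGAN's implementation, which applies additional score and support thresholds. Lineages are assumed to be lists ordered from the root down, and the example lineages are toy values.

```python
from typing import List, Optional, Sequence


def lowest_common_ancestor(lineages: Sequence[List[str]]) -> Optional[str]:
    """Return the deepest taxon shared by all root-to-leaf lineages."""
    if not lineages:
        return None
    lca = None
    # zip stops at the shortest lineage, so ranks are compared level by level.
    for ranks in zip(*lineages):
        if all(rank == ranks[0] for rank in ranks):
            lca = ranks[0]
        else:
            break
    return lca


# Toy lineages for the reference hits of a single read.
hits = [
    ["Bacteria", "Enterobacteriaceae", "Escherichia", "Escherichia coli"],
    ["Bacteria", "Enterobacteriaceae", "Salmonella", "Salmonella enterica"],
]
label = lowest_common_ancestor(hits)
```

In this toy case the read is labeled at the family level, because its hits disagree at the genus level. A read with a single hit is labeled with that hit's most specific taxon.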

Obtaining the functional profile is typically not possible with targeted approaches, since they provide no direct evidence of the functional capabilities of the microbial community. However, the tool Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt) shows how to infer a functional profile of a microbial community directly from taxonomic profiles of marker genes, such as the 16S rDNA, and a database of reference genomes.⁸¹ Their results provide useful insights on uncultivated microbial communities, prior to which only marker gene surveys were available.
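The core arithmetic of this kind of inference can be sketched as a weighted sum: each taxon's marker-gene abundance is converted to an approximate organism count and multiplied by the gene content of its reference genome. The code below is a deliberately simplified illustration and not the PICRUSt algorithm itself, which additionally predicts gene content for taxa without sequenced genomes. All names and the input layout are assumptions.

```python
from typing import Dict, Optional


def predict_functions(
    taxon_abundance: Dict[str, float],
    gene_content: Dict[str, Dict[str, float]],
    marker_copies: Optional[Dict[str, float]] = None,
) -> Dict[str, float]:
    """Estimate gene-family abundances from a taxonomic profile.

    taxon_abundance: marker-gene read counts per taxon.
    gene_content: per-taxon copy numbers of each gene family (from references).
    marker_copies: marker-gene copies per genome (defaults to 1 per taxon).
    """
    marker_copies = marker_copies or {}
    profile: Dict[str, float] = {}
    for taxon, abundance in taxon_abundance.items():
        # Correct for taxa carrying several copies of the marker gene.
        organisms = abundance / marker_copies.get(taxon, 1)
        for gene, copies in gene_content.get(taxon, {}).items():
            profile[gene] = profile.get(gene, 0.0) + organisms * copies
    return profile
```

Taxa missing from `gene_content` contribute nothing in this sketch, which mirrors the general limitation that predictions are only as good as the reference genomes available.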

In summary, metataxonomics helps us to compute the taxonomic profile of a microbial community, while metagenomics helps us to compute the functional profile by focusing on the gene content and using the available functional annotations of the corresponding proteins. While metagenomics is powerful, solely using it to study a microbiome is limited in value. Many experts have confirmed that the percentage of documented bacteria is very low compared to the estimate of bacterial species on our planet.⁸² This may be due partially to the impossibility of culturing complex environments or replicating in the laboratory the real conditions in which the microbiome exists. Either way, the reference databases used to classify and label bacteria are limited to what has been cataloged. Current methods typically either discard reads from undocumented microbes or label them based on the closest documented microbe from the database. Thus, inevitably, results will be based on a biased percentage of bacteria present in the samples, representing the first shortcoming of these methods. Another limitation is that metagenomics cannot reveal dynamic properties, such as the spatiotemporal activity of the community and the impact of the environment on these activities. The only information that can be obtained at a functional level is the potential of the microbiome to display functional properties associated with the presence of genes with no information about their expression levels or lack thereof. The need to monitor gene expression patterns brings us to the topic of our next section, metatranscriptomics.

By focusing on what genes are expressed by the entire microbial community, metatranscriptomics sheds light on the active functional profile of a microbial community.⁸³ The metatranscriptome provides a snapshot of the gene expression in a given sample at a given moment and under specific conditions by capturing the total mRNA. Pioneering studies aiming to identify expressed genes in environmental samples date back to 2005^84,85 and represent the dawn of metatranscriptomics. However, these were limited to a relatively narrow group of genes. As for metagenomics, it is now possible to perform whole metatranscriptomics shotgun sequencing. This (meta)genome-wide expression provides the expression and functional profile of a microbiome.^48,86,87

When processing reads, a typical metatranscriptomics analysis pipeline will either (1) map reads to a reference genome or (2) perform de novo assembly of the reads into transcript contigs and supercontigs. The first strategy, in a manner similar to the alignment-based methods in WMS, maps reads to reference databases, thus gathering information to infer the relative expression of individual genes. The second strategy infers the same but with assembled sequences. The first strategy is limited by the information in the database of reference genomes. The second strategy is limited by the ability of software programs to assemble contigs and supercontigs correctly from short reads data.

The application of metatranscriptomics to the study of the microbiome is far less common relative to other omics reviewed in this article. Most analysis pipelines described in the literature were built ad hoc. The majority of these methods follow the aforementioned first strategy based on read mapping.^88–92 In this case, metatranscriptomic reads are generally mapped to specialized databases (usually downloaded from the NCBI) using alignment tools, such as Bowtie2, BWA, and BLAST. The results are then annotated using resources, such as GO, KEGG, COG, and Swiss-Prot. Finally, different types of downstream analysis are carried out depending on the goal of the study (eg, PCA-based phylogenetic analysis or enrichment analysis). The latest metatranscriptomics techniques include stable isotope probing (SIP), which has been used to retrieve specific targeted transcriptomes of aerobic microbes in lake sediment.⁹³ This not only helps to target specific organisms but also contributes significantly to metabolomics studies.

The second strategy requires assembling metatranscriptomic reads into longer fragments called contigs. For this purpose, numerous software packages are available. Celaj et al.⁹⁴ compared de novo sequence assemblers to reference-based mapping tools. The compared tools included Trinity,⁹⁵ MetaVelvet,⁹⁶ Oases,⁹⁷ ABySS, Trans-ABySS, and SOAPdenovo,^98–100 as well as tools such as Scripture and Cufflinks.^101,102 It was found that compared to other tools Trinity not only outperformed all of them but also appeared to be best tuned for sensitivity across the broadest range of expression levels. This was particularly noticeable in reconstructing transcripts within the highest expression quintiles, in which other de novo strategies failed to perform well.⁹⁵ Li and Dewey¹⁰³ developed RNA-Seq by Expectation Maximization (RSEM), a quantitative pipeline for transcriptomic analysis, currently provided as stand-alone software or a plug-in within Trinity. RSEM takes as input a reference transcriptome or assembly (most likely obtained through Trinity) along with RNA-Seq reads generated from the sample and calculates normalized transcript abundance (ie, the number of RNA-Seq reads corresponding to each reference transcriptome or assembly).^104,105 Although both Trinity and RSEM were designed for transcriptomic datasets (ie, obtained from a single organism), it may be possible to apply them to metatranscriptomic data (ie, obtained from a whole microbial community). MEGAN annotates results with GO to perform enrichment analysis.¹⁰⁶
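To make the notion of normalized transcript abundance concrete, the sketch below computes transcripts per million (TPM) from read counts and transcript lengths. This shows only a common final normalization step under the assumption that per-transcript read counts are already available; it does not reproduce the expectation-maximization procedure RSEM uses to assign ambiguously mapped reads.

```python
from typing import Dict


def transcripts_per_million(
    read_counts: Dict[str, float],
    lengths: Dict[str, float],
) -> Dict[str, float]:
    """Convert per-transcript read counts to TPM.

    Each count is first divided by the transcript length, and the resulting
    rates are then rescaled so that they sum to one million.
    """
    rates = {t: read_counts[t] / lengths[t] for t in read_counts}
    total = sum(rates.values())
    if total == 0:
        return {t: 0.0 for t in rates}
    return {t: rate / total * 1e6 for t, rate in rates.items()}
```

Length normalization matters because, at equal expression, a longer transcript yields proportionally more reads than a shorter one.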

Although current metatranscriptomic techniques are promising, there are still several obstacles that limit their large-scale application. First, much of the harvested RNA comes from ribosomal RNA, and its dominating abundance can dramatically reduce the coverage of mRNA, which is the main focus of transcriptomic studies. Some efforts have been made to effectively remove rRNA.¹⁰⁷ Second, mRNA is notoriously unstable, compromising the integrity of the sample before sequencing. Third, differentiating between host and microbial RNA can be challenging, although commercial enrichment kits are available. This may also be done in silico if a reference genome is available for the host, as in the work of Perez-Losada et al.¹⁰⁸ who consider the impact of host-pathogen interactions on the human airway microbiome. Finally, transcriptome reference databases are limited in their coverage.

WMS approaches provide information on the taxonomic profile of a microbial community as well as its potential functional profile; in contrast, whole metatranscriptome sequencing describes the active functional profile. This would help in studying the dynamics of functional profiles with varying conditions. We now discuss metabolomics, which studies the consequences of the shifts in the collective gene expression of the microbial community that modifies the very medium where the microbial community must feed, grow, reproduce, and cooperate or compete to survive.

Metabolomics is the comprehensive analysis by which all metabolites of a sample (small molecules released by the organism into the immediate environment) are identified and quantified.¹⁰⁹ The metabolome is considered the most direct indicator of the health of an environment or of the alterations in homeostasis (ie, dysbiosis).¹¹⁰ Variations in the production of signature metabolites are related to changes in the activity of metabolic routes, and therefore, metabolomics represents an applicable approach to pathway analysis.¹¹¹ Additionally, the application of metabolomics for drug discovery and pharmacogenomics represents a promising avenue for personalized medicine.¹¹²

The metabolomic profile associated with the microbiome may show a strong dependence on environmental factors (eg, diet, exposure to xenobiotics, and environmental stressors), providing valuable information not just about the characteristics of the microbiome but also about the interactions of the microbial community with the host environment.^113–115 Thus, metabolomics aims to improve our understanding of the role of the microbiome in the transformation of nutrients and pollutants as well as other abiotic factors that may affect the homeostasis of the host environment. Microbial communities exert a strong influence on critical biogeochemical cycles, and the study of their metabolome can help to develop predictive biomarkers for environmental stressors.¹¹⁶ The microbiome is regarded as a biological reactor that, based on its genetic pool, can transform resources and hazardous elements into products that are either beneficial or detrimental to the health of its environment. A good example is bioremediation and its application to reduce the consequences of pollution.¹¹⁷

Most interestingly, the metabolome can illustrate signaling processes involved during communication between bacteria, such as quorum sensing, which relates gene expression responses to changes in cell population density.^118–123 A deeper understanding of the communication mechanisms within microbial communities could possibly revolutionize the current strategies in areas such as infectious disease control, and optimize agricultural exploitation in environmental conservation. Thus, metabolomics complements the information provided by the other omics (mentioned earlier) by describing not just biological systems themselves, but how they interact internally and externally.

Generating metabolomics data differs significantly from generating metagenomics and metatranscriptomics data, which rely heavily on sequencing. Identifying and quantifying metabolites is typically carried out using a combination of chromatography techniques (ie, liquid chromatography, LC, and gas chromatography, GC) and detection methods, such as mass spectrometry (MS) and nuclear magnetic resonance (NMR). For a more detailed review of these technologies and their many variants, we refer the reader to a recent review by Aldridge and Rhee.¹²⁴ These technologies produce spectra consisting of patterns of peaks that allow both the identification and quantification of metabolites. These patterns (either predicted or experimentally obtained) are stored in spectral databases, allowing automated analysis and generation of metabolomic profiles. With these technological resources, metabolomics fulfills the requirements of a high-throughput analytical method, and thus data analysis represents a critical step in knowledge generation. As a result, we have seen a rise in software development, large data repositories, and initiatives for standardization. This in turn paves the road for data integration.

The analysis pipeline for spectral metabolomic data involves three steps: (1) preprocessing, (2) statistical analysis, and (3) machine learning techniques for pattern recognition.¹²⁵ In the first step, denoising and peak-picking improve the quality of the data to be processed. Once the peak pattern has been established, a comparison against spectral databases identifies the metabolites in the sample, and the area under each peak gives the respective quantity. To automate this process, spectral databases are maintained and curated by specialized international consortia that emphasize standardization. These include the following: the Human Metabolome Database, a cross-referenced database about the small metabolites found in the human body^126–128; the BioMagResBank, which works as a central repository for experimental NMR data including both small metabolites and macromolecules¹²⁹; the Madison-Qingdao Metabolomics Consortium Database,¹³⁰ which includes thoroughly annotated NMR and MS data collected from other databases and the literature; MassBank,¹³¹ which merges spectral data from different collision-induced dissociation conditions to improve the precision in the identification of compounds; the Golm Metabolome Database,¹³² which stores spectral data with retention indexes, useful for automated identification of compounds analyzed with GC-MS; and the METLIN Metabolite Database,¹³³ which contains curated spectral information of biological metabolites without information on the environmental context from which the samples were obtained. Each of them differs slightly in functionality but pursues similar goals, serving as repositories of spectral data and offering links to their biological interpretation.
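The peak-picking and quantification steps in the first stage can be illustrated with a very small sketch: local maxima above a noise threshold are taken as peaks, and the area under a peak is approximated with the trapezoidal rule. This is an assumption-laden toy version; real pipelines add baseline correction, smoothing, and peak deconvolution, and the function names are hypothetical.

```python
from typing import List, Sequence


def pick_peaks(intensities: Sequence[float], min_height: float) -> List[int]:
    """Return indices of local maxima whose intensity reaches min_height."""
    peaks = []
    for i in range(1, len(intensities) - 1):
        y = intensities[i]
        # Strictly above the left neighbor so a flat top is reported once.
        if y >= min_height and y > intensities[i - 1] and y >= intensities[i + 1]:
            peaks.append(i)
    return peaks


def peak_area(xs: Sequence[float], ys: Sequence[float],
              start: int, end: int) -> float:
    """Trapezoidal area under the signal between indices start and end."""
    return sum((xs[i + 1] - xs[i]) * (ys[i] + ys[i + 1]) / 2.0
               for i in range(start, end))
```

Matching the detected peak pattern against a spectral database, which is what actually identifies the metabolite, is outside the scope of this sketch.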

By cataloging all metabolites present in a sample, metabolomics offers a powerful way to relate the metabolites to the cellular processes of which they are the byproducts. The combination of metabolomic and pathways information can lead to new hypotheses. One important challenge of this approach is difficulty in determining whether a metabolite was generated by the host or by the microbiome. In addition, if conclusions are to be made about which genes, enzymes, or pathways are associated with a specific metabolite, the results obtained from a metabolomic study must be combined with other omic data. This highlights the need for new approaches that deal with integrated omics, as discussed in the “Integrating multiomic data” section.

Standard analyses of individual omic datasets focus on the community structure and functional roles of individual taxa or groups of taxa. The remaining challenge lies in elucidating the large, dynamic, and complex network of interactions between its constituent entities. With the increasing availability of heterogeneous multiomic datasets,¹¹ the need for integrative analyses has become even more urgent. A reasonable approach (Fig. 2) is to perform separate analysis, adding an extra integrative step within downstream analysis.

Figure 2. Generic multiomic analysis pipeline.

Integrating multiple omic datasets is a problem that researchers are just beginning to tackle.¹² Bringing together different studies will allow researchers to build and test mathematical models of microbial activity and interaction, enabling a better understanding of the interplay between the environment and the microbial community.^134,135 For example, the combination of metagenomics and metatranscriptomics may reveal overexpression or underexpression of particular functions and, in some cases, the activities of specific organisms.^90,136–138 The addition of metabolomics could provide insight into the outcome of those changes in gene expression, which may lead to differential expression of specific metabolites that impact the health of the host environment.^139–144 Understanding the whole ecosystem opens new avenues and exciting approaches for generating new knowledge. By combining multiple (potentially noisy and heterogeneous) data types, we can build support for specific hypotheses; if independent lines of evidence arrive at the same conclusion, then our confidence in that conclusion will grow.

Current studies indicate that integrating metagenomics and metatranscriptomics has the potential of attributing functional changes in gene expression to specific members of the microbial community. Franzosa et al.¹⁴⁵ showed a relationship between genomic abundances and differential regulations of microbial transcripts, discovering up- and downregulated pathways within the human gut microbiome. Shi et al.¹⁴⁶ applied this integrative approach relating the functional and taxonomic profiles of marine environmental samples. Current studies also indicate that integrating the results of metagenomics with metabolomics can provide insight into how members of a microbial community interact with each other and with their environment.¹⁴⁷ For example, Lu et al.¹⁴⁸ observed a simultaneous effect on both microbiome composition and metabolite production upon introducing arsenic into the mouse gut environment. Zhang et al.¹⁴⁹ performed a similar study with the introduction of disinfection byproducts from drinking water. These studies illustrate that the different omics are interdependent and that an integrated approach can lead to more useful discoveries.
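One simple way to relate paired metagenomic and metatranscriptomic data is to compare the relative abundance of each gene in the RNA data with its relative abundance in the DNA data. The sketch below illustrates that idea only; it is an assumption made for illustration and not the statistical procedure used in the studies cited above. A ratio above 1 suggests that a gene is expressed more than its genomic abundance alone would predict.

```python
from typing import Dict


def relative_abundance(counts: Dict[str, float]) -> Dict[str, float]:
    """Scale counts so that they sum to 1."""
    total = sum(counts.values())
    return {k: v / total for k, v in counts.items()} if total else {}


def expression_ratios(dna_counts: Dict[str, float],
                      rna_counts: Dict[str, float]) -> Dict[str, float]:
    """RNA/DNA relative-abundance ratio for every gene detected in the DNA."""
    dna = relative_abundance(dna_counts)
    rna = relative_abundance(rna_counts)
    # Genes absent from the metagenome are skipped to avoid division by zero.
    return {gene: rna.get(gene, 0.0) / dna[gene]
            for gene in dna if dna[gene] > 0}
```

In practice, such ratios would need replicate samples and a statistical test before any gene is called up- or downregulated.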

Several current studies suggest that integrating all three omic data – metagenomics, metatranscriptomics, and metabolomics – would provide a complete picture from genes to phenotype.^150,151 With the wealth of datasets available but not currently integrated, Abram¹⁵² argues for a system-based approach to multiomics, which would allow predictive modeling. In particular, he points out that studying interrelationships between entities (which he refers to as SIP-omics) would provide some guidance to establishing linkages between various datasets.

Interrelationships also form the basis of the reverse ecology algorithm,¹⁵³ which attempts to connect microbial communities with properties of their environment under the assumption that adaptation to the environment is most fundamental to their structure and topology. The set of metabolites that are acquired by an organism from external sources is called the seed set and represents the metabolic interface with the environment. Borenstein et al.¹⁵⁴ showed how to compute the seed set for individual organisms and how it can be used to characterize the effective biochemical habitat. Ebenhöh et al.¹⁵⁵ offered predictive models of an organism's ability to flourish in specific environments.
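A graph-based way to approximate a seed set is to represent the metabolic network as a directed graph (an edge from substrate to product), decompose it into strongly connected components, and keep the components that receive no edges from outside. The sketch below follows that idea under simplifying assumptions; it should be read as an illustration of the concept rather than the exact procedure of Borenstein et al., and the toy network is hypothetical.

```python
from typing import Dict, Hashable, List, Set

Graph = Dict[Hashable, Set[Hashable]]


def strongly_connected_components(graph: Graph) -> Dict[Hashable, Hashable]:
    """Map each node to a representative of its component (Kosaraju)."""
    nodes = set(graph) | {v for targets in graph.values() for v in targets}
    visited, order = set(), []
    for start in nodes:  # first pass: record nodes by DFS finish time
        if start in visited:
            continue
        visited.add(start)
        stack = [(start, iter(graph.get(start, ())))]
        while stack:
            node, successors = stack[-1]
            for nxt in successors:
                if nxt not in visited:
                    visited.add(nxt)
                    stack.append((nxt, iter(graph.get(nxt, ()))))
                    break
            else:
                order.append(node)
                stack.pop()
    reverse: Graph = {n: set() for n in nodes}
    for u, targets in graph.items():
        for v in targets:
            reverse[v].add(u)
    component: Dict[Hashable, Hashable] = {}
    for root in reversed(order):  # second pass on the reversed graph
        if root in component:
            continue
        component[root] = root
        stack2 = [root]
        while stack2:
            node = stack2.pop()
            for prev in reverse[node]:
                if prev not in component:
                    component[prev] = root
                    stack2.append(prev)
    return component


def seed_sets(graph: Graph) -> List[Set[Hashable]]:
    """Components with no incoming edges from any other component."""
    component = strongly_connected_components(graph)
    has_incoming = {component[v] for u, targets in graph.items()
                    for v in targets if component[u] != component[v]}
    groups: Dict[Hashable, Set[Hashable]] = {}
    for node, root in component.items():
        if root not in has_incoming:
            groups.setdefault(root, set()).add(node)
    return sorted(groups.values(), key=sorted)


# Toy network: compounds "a" and "b" can be interconverted.
toy = {"glucose": {"g6p"}, "g6p": {"pyruvate"},
       "a": {"b"}, "b": {"a", "pyruvate"}, "pyruvate": set()}
seeds = seed_sets(toy)
```

In the toy network, `glucose` cannot be produced from anything else and so must be acquired externally, while `a` and `b` form a cycle from which acquiring either compound suffices.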

In this article, we have discussed how three different omic approaches – metagenomics, metatranscriptomics, and metabolomics – provide useful information toward understanding microbiomes. We also discussed how the value of an integrative approach is greater than the sum of its parts.

Biological networks have long been used to model interactions between biological entities, with applications to areas, such as gene regulation, metabolic and signaling pathways, protein-protein networks, and food webs in ecology.^156–159 With its proven application to analyzing interrelationships and their critical role in multiomics, we believe biological network analysis will be critical to future multiomic approaches to studying the microbiome. In addition, network analyses offer the possibility of exploring both local (eg, relationship with neighbors) as well as global properties (eg, connectivity) of a community. Dutkowski et al.¹⁶⁰ studied the assignment of ontologies using networks and developed tools, such as Cytoscape,¹⁶¹ to perform these analyses.

Metagenomic studies have shown that interactions within a microbiome can be naturally modeled using a network representation,^14,42,162 with properties closely related to social networks.^15,24 Macroscale community structures have been observed in these types of networks, indicating clubs (ie, groups of co-occurring bacteria) as well as rival clubs (ie, groups of bacteria that tend to not co-occur).^15,42
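A co-occurrence network of this kind can be sketched by correlating the abundance profiles of pairs of taxa across samples: strongly positive correlations become co-occurrence edges and strongly negative ones become co-avoidance edges. The sketch below uses Pearson correlation and a fixed threshold purely as illustrative assumptions; published methods often use measures designed for compositional data together with significance testing.

```python
from itertools import combinations
from math import sqrt
from typing import Dict, List, Sequence, Tuple


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; returns 0.0 if either profile is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def cooccurrence_edges(
    abundance: Dict[str, Sequence[float]],
    threshold: float = 0.8,
) -> List[Tuple[str, str, str]]:
    """Label taxon pairs as co-occurring or co-avoiding across samples."""
    edges = []
    for a, b in combinations(sorted(abundance), 2):
        r = pearson(abundance[a], abundance[b])
        if r >= threshold:
            edges.append((a, b, "co-occur"))
        elif r <= -threshold:
            edges.append((a, b, "co-avoid"))
    return edges
```

Groups of taxa densely linked by co-occurrence edges would correspond to the clubs described above, and co-avoidance edges between such groups to rival clubs.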

In order to integrate data from various omic sources, microbiomes can also be modeled as heterogeneous networks. Figure 3 provides a visual description of what such a network would look like in the context of the microbiome. A heterogeneous network would allow researchers to generate new interesting hypotheses that involve entities from the different omics described in this article (represented in the figure by nodes with different shapes and colors). For instance, we could potentially have a club that includes genes, microbes, and metabolites. Heterogeneous networks have been used in other applications, such as associations between genetic interactions and protein-protein interactions in order to infer cellular function.¹⁶³ Another study couples these same types of networks to infer gene dependencies and new processes, such as DNA damage repair, and also different types of co-expression networks.¹⁶⁴ Many types of omic networks were also integrated to study gene regulation in the bacterium Mycobacterium tuberculosis.¹⁶⁵ Other omic areas not included in this study include metaproteomics, metalipidomics, and metaglycomics. We believe that analyzing heterogeneous networks built across multiple omic datasets is critical to linking the different levels of complexity inherent to biological systems, thus establishing a more comprehensive understanding of the nature and dynamics of microbiomes.

Figure 3. Integrated networks for multiomic data.
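One way to represent a heterogeneous network of the kind described above is to attach an omic type to every node and a relation label to every edge. The sketch below is a hypothetical illustration only: the node names, the three types (gene, microbe, metabolite), and the relation labels are invented for the example and do not come from the article or its figure.

```python
from collections import defaultdict

# Hypothetical nodes: name -> omic type.
node_type = {
    "geneX": "gene",
    "geneY": "gene",
    "microbe1": "microbe",
    "microbe2": "microbe",
    "butyrate": "metabolite",
}

# Hypothetical typed edges: (source, target, relation).
edges = [
    ("microbe1", "geneX", "encodes"),
    ("geneX", "butyrate", "produces"),
    ("microbe1", "microbe2", "co-occurs"),
    ("microbe2", "geneY", "encodes"),
]

def build_adjacency(edge_list):
    """Undirected adjacency map that keeps the relation label on each link."""
    adjacency = defaultdict(set)
    for source, target, relation in edge_list:
        adjacency[source].add((target, relation))
        adjacency[target].add((source, relation))
    return adjacency

def neighbors_of_type(adjacency, node, wanted_type):
    """Direct neighbors of a node restricted to one omic type."""
    return sorted(n for n, _ in adjacency[node] if node_type[n] == wanted_type)

def types_within(adjacency, start, max_hops):
    """Omic types reachable from a start node within max_hops edges."""
    frontier, seen = {start}, {start}
    for _ in range(max_hops):
        frontier = {n for node in frontier for n, _ in adjacency[node]} - seen
        seen |= frontier
    return {node_type[n] for n in seen}

adjacency = build_adjacency(edges)
```

With this structure, a query such as `types_within(adjacency, "microbe1", 2)` shows whether a neighborhood spans several omic layers, which is the kind of mixed gene, microbe, and metabolite grouping the text refers to as a club.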

Conceived and designed the experiments: VAP, GN. Analyzed the data: VAP, WH, VSU, TC, GN. Wrote the first draft of the manuscript: VAP, WH, VSU, TC. Contributed to the writing of the manuscript: VAP, WH, VSU, TC, GN. Agree with manuscript results and conclusions: VAP, WH, VSU, TC, KM, GN. Jointly developed the structure and arguments for the paper: VAP, GN. Made critical revisions and approved final version: VAP, KM, GN. All authors reviewed and approved of the final manuscript.

References

1.	Ley, R.E., Peterson, D.A., Gordon, J.I. Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell. 2006; 124(4): 837–48.
2.	Costello, E.K., Lauber, C.L., Hamady, M. Bacterial community variation in human body habitats across space and time. Science. 2009; 326: 1694–7.
3.	Hibbing, M.E., Fuqua, C., Parsek, M.R., Peterson, S.B. Bacterial competition: surviving and thriving in the microbial jungle. Nat Rev Microbiol. 2010; 8: 15–25.
4.	Liu, L., Chen, X., Skogerbø, G. The human microbiome: a hot spot of microbial horizontal gene transfer. Genomics. 2012; 100(5): 265–70.
5.	Proft, T. Microbial Toxins: Current Research and Future Trends. Caister Academic Press, Norfolk, UK; 2009.
6.	Sharon, G., Garg, N., Debelius, J., Knight, R., Dorrestein, P.C., Mazmanian, S.K. Specialized metabolites from the microbiome in health and disease. Cell Metab. 2014; 20(5): 719–30.
7.	Kau, A.L., Ahern, P.P., Griffin, N.W., Goodman, A.L., Gordon, J.I. Human nutrition. Nature. 2011; 474(7351): 327–36.
8.	Foxman, B., Martin, E.T. Use of the microbiome in the practice of epidemiology: a primer on -omic technologies. Am J Epidemiol. 2015; 182(1): 1–8.
9.	Betts, K. A study in balance: how microbiomes are changing the shape of environmental health. Environ Health Perspect. 2011; 119(8): 340–6.
10.	Whipps, J.M., Lewis, K., Cooke, R.C. Mycoparasitism and Plant Disease Control. Manchester University Press, Manchester, UK; 1988: 161–87.
11.	Segata, N., Boernigen, D., Tickle, T.L., Morgan, X.C., Garrett, W.S., Huttenhower, C. Computational metaomics for microbial community studies. Mol Syst Biol. 2013; 9(1): 666–80.
12.	Franzosa, E.A., Hsu, T., Sirota-Madi, A. Sequencing and beyond: integrating molecular ‘omics’ for microbial community profiling. Nat Rev Microbiol. 2015; 13(6): 360–72.
13.	Barberan, A., Bates, S.T., Casamayor, E.O., Fierer, N. Using network analysis to explore cooccurrence patterns in soil microbial communities. ISME J. 2011; 6: 343–51.
14.	Faust, K., Raes, J. Microbial interactions: from networks to models. Nat Rev Microbiol. 2012; 10: 538–50.
15.	Fernandez, M., Riveros, J.D., Campos, M., Mathee, K., Narasimhan, G. Microbial “Social Networks”. BMC Genomics. 2015; 16(Suppl 11): S6.
16.	Fernandez, M., Aguiar-Pulido, V., Huang, W. Microbiome analysis: state-of-the-art and future trends. In: Mandoiu, I., Zelikovsky, A., eds. Computational Methods for Next Generation Sequencing Data Analysis. Wiley, Hoboken, NJ; 2015: 333–51.
17.	Peterson, J., Garges, S., Giovanni, M. The NIH Human Microbiome Project. Genome Res. 2009; 19: 2317–23.
18.	Human Microbiome Project Consortium. Structure, function and diversity of the healthy human microbiome. Nature. 2012; 486: 207–14.
19.	Human Microbiome Project Consortium. A framework for human microbiome research. Nature. 2012; 486(7402): 215–21.
20.	Turnbaugh, P.J., Ley, R.E., Hamady, M., Fraser-Liggett, C.M., Knight, R., Gordon, J.I. The human microbiome project. Nature. 2007; 449: 804–10.
21.	Turnbaugh, P.J., Gordon, J.I. The core gut microbiome, energy balance and obesity. J Physiol (Lond). 2009; 587: 4153–8.
22.	Marrazzo, J.M., Martin, D.H., Watts, D.H. Bacterial vaginosis: identifying research gaps proceedings of a workshop sponsored by DHHS/NIH/NIAID. Sex Transm Dis. 2010; 37: 732–44.
23.	Qin, J., Li, R., Raes, J. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010; 464: 59–65.
24.	Ackerman, J. The ultimate social network. Sci Am. 2012; 306(6): 36–43.
25.	Brown, K., DeCoffe, D., Molcan, E., Gibson, D.L. Diet-induced dysbiosis of the intestinal microbiota and the effects on immunity and disease. Nutrients. 2012; 4(8): 1095.
26.	Cho, I., Blaser, M.J. The human microbiome: at the interface of health and disease. Nat Rev Genet. 2012; 13: 260–70.
27.	Integrative HMP (iHMP) Research Network Consortium. The integrative human microbiome project: dynamic analysis of microbiome-host omics profiles during periods of human health and disease. Cell Host Microbe. 2014; 16(3): 276–89.
28.	Human Microbiome Project Consortium. HMSCP – Shotgun Community Profiling. Available at: http://hmpdacc.org/HMSCP/. Last accessed: Jan. 2016.
29.	Ehrlich, S.D., MetaHIT Consortium. Metagenomics of the intestinal microbiota: potential applications. Gastroenterol Clin Biol. 2010; 34: S23–8.
30.	Goedert, J.J., Hua, X., Yu, G., Shi, J. Diversity and composition of the adult fecal microbiome associated with history of cesarean birth or appendectomy: analysis of the American gut project. EBioMedicine. 2014; 1(2): 167–72.
31.	Gilbert, J.A., Jansson, J.K., Knight, R. The earth microbiome project: successes and aspirations. BMC Biol. 2014; 12(1): 69.
32.	Venter, J.C., Remington, K., Heidelberg, J.F. Environmental genome shotgun sequencing of the Sargasso sea. Science. 2004; 304(5667): 66–74.
33.	Nealson, K.H., Venter, J.C. Metagenomics and the global ocean survey: what's in it for us, and why should we care? ISME J. 2007; 1(3): 185.
34.	Lima-Mendez, G., Faust, K., Henry, N. Determinants of community structure in the global plankton interactome. Science. 2015; 348(6237): 1262073.
35.	Karsenti, E., Acinas, S.G., Bork, P. A holistic approach to marine eco-systems biology. PLoS Biol. 2011; 9(10): e1001177.
36.	Sunagawa, S., Coelho, L.P., Chaffron, S. Structure and function of the global ocean microbiome. Science. 2015; 348(6237): 1261359.
37.	Marchesi, J.R., Ravel, J. The vocabulary of microbiome research: a proposal. Microbiome. 2015; 3: 31.
38.	Chaffron, S., Rehrauer, H., Pernthaler, J., von Mering, C. A global network of coexisting microbes from environmental and whole-genome sequence data. Genome Res. 2010; 20: 947–59.
39.	Gonzalez, A., Knight, R. Advancing analytical algorithms and pipelines for billions of microbial sequences. Curr Opin Biotechnol. 2012; 23: 64–71.
40.	Freilich, S., Kreimer, A., Meilijson, I., Gophna, U., Sharan, R., Ruppin, E. The large-scale organization of the bacterial network of ecological co-occurrence interactions. Nucleic Acids Res. 2010; 38(12): 3857–68.
41.	Kuczynski, J., Liu, Z., Lozupone, C., McDonald, D., Fierer, N., Knight, R. Microbial community resemblance methods differ in their ability to detect biologically relevant patterns. Nat Methods. 2010; 7: 813–9.
42.	Faust, K., Sathirapongsasuti, J.F., Izard, J. Microbial co-occurrence relationships in the human microbiome. PLoS Comput Biol. 2012; 8(7): e1002606.
43.	Cole, J.R., Wang, Q., Fish, J.A. Ribosomal Database Project: data and tools for high throughput rRNA analysis. Nucleic Acids Res. 2014; 42: D633–42.
44.	Pruesse, E., Quast, C., Knittel, K. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. 2007; 35(21): 7188–96.
45.	DeSantis, T.Z., Hugenholtz, P., Larsen, N. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006; 72: 5069–72.
46.	Sharpton, T.J. An introduction to the analysis of shotgun metagenomic data. Front Plant Sci. 2014; 5: 209.
47.	Nelson, K.E., Weinstock, G.M., Highlander, S.K. A catalog of reference genomes from the human microbiome. Science. 2010; 328: 994–9.
48.	Frias-Lopez, J., Shi, Y., Tyson, G.W. Microbial community gene expression in ocean surface waters. Proc Natl Acad Sci (PNAS). 2008; 105(10): 3805–10.
49.	Chain, P.S., Grafham, D.V., Fulton, R.S. Genome project standards in a new era of sequencing. Science. 2009; 326: 236–7.
50.	Kim, M., Lee, K-H, Yoon, S-W, Kim, B-S, Chun, J., Yi, H. Analytical tools and databases for metagenomics in the next-generation sequencing era. Genomics Inform. 2013; 11(3): 102–13.
51.	Gallopoulos, E., Houstis, E., Rice, J. Computer as thinker/doer: problem-solving environments for computational science. Comput Sci Eng IEEE. 1994; 1: 11–23.
52.	Goecks, J., Nekrutenko, A., Taylor, J. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010; 11: R86.
53.	Caporaso, J.G., Kuczynski, J., Stombaugh, J. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 2010; 7: 335–6.
54.	Schloss, P.D., Westcott, S.L., Ryabin, T. Introducing Mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. 2009; 75(23): 7537–41.
55.	Hong, C., Manimaran, S., Shen, Y. PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples. Microbiome. 2014; 2(1): 33.
56.	Chen, W., Zhang, C.K., Cheng, Y., Zhang, S., Zhao, H. A comparison of methods for clustering 16S rRNA sequences into OTUs. PLoS One. 2013; 8(8): e70837.
57.	Mande, S.S., Mohammed, M.H., Ghosh, T.S. Classification of metagenomic sequences: methods and challenges. Brief Bioinform. 2012; 13(6): 669–81.
58.	Colwell, R. Estimates, Version 7.5: Statistical Estimation of Species Richness and Shared Species from Samples (Software and Users Guide); 2005. Available at: http://viceroy.eeb.uconn.edu/estimates
59.	Colwell, R.K. Biodiversity: concepts, patterns, and measurement. In: Levin, S.A., ed. The Princeton Guide to Ecology. Princeton University Press, Princeton, NJ; 2009: 257–63.
60.	Ondov, B.D., Bergman, N.H., Phillippy, A.M. Interactive metagenomic visualization in a web browser. BMC Bioinformatics. 2011; 12(1): 385.
61.	Huse, S.M., Welch, D.B.M., Voorhis, A. VAMPS: a website for visualization and analysis of microbial population structures. BMC Bioinformatics. 2014; 15(1): 41.
62.	Wu, M., Scott, A.J. Phylogenomic analysis of bacterial and archaeal sequences with amphora2. Bioinformatics. 2012; 28(7): 1033–4.
63.	Matsen, F., Evans, S.N. Edge principal components and squash clustering: using the special structure of phylogenetic placement data for sample comparison. PLoS One. 2013; 8: 3.
64.	Brady, A., Salzberg, S.L. Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models. Nat Methods. 2009; 6: 673–6.
65.	Brady, A., Salzberg, S. Phymmbl expanded: confidence scores, custom databases, parallelization and more. Nat Methods. 2011; 8(5): 367–7.
66.	Matsen, F.A., Kodner, R.B., Armbrust, E.V. pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics. 2010; 11(1): 538.
67.	Meyer, F., Paarmann, D., D'Souza, M. The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics. 2008; 9(1): 386.
68.	Stark, M., Berger, S.A., Stamatakis, A., von Mering, C. Mltreemap-accurate maximum likelihood placement of environmental DNA sequences into taxonomic and functional reference phylogenies. BMC Genomics. 2010; 11(1): 461.
69.	Ashburner, M., Ball, C.A., Blake, J.A. Gene ontology: tool for the unification of biology. Nat Genet. 2000; 25(1): 25–9.
70.	Gene Ontology Consortium. Gene ontology consortium: going forward. Nucleic Acids Res. 2015; 43(D1): D1049–56.
71.	Kanehisa, M., Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000; 28: 27–30.
72.	Kotera, M., Moriya, Y., Tokimatsu, T., Goto, S. Kegg and genomenet, new developments, metagenomic analysis. In: Nelson, K.E., ed. Encyclopedia of Metagenomics. Springer, New York; 2015: 329–39.
73.	Tatusov, R.L., Koonin, E.V., Lipman, D.J. A genomic perspective on protein families. Science. 1997; 278: 631–7.
74.	Tatusov, R.L., Galperin, M.Y., Natale, D.A., Koonin, E.V. The cog database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000; 28(1): 33–6.
75.	Abubucker, S., Segata, N., Goll, J. Metabolic reconstruction for metagenomic data and its application to the human microbiome. PLoS Comput Biol. 2012; 8(6): e1002358.
76.	Markowitz, V.M., Chen, I-MM, Palaniappan, K. IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Res. 2012; 40: D115–22.
77.	Seshadri, R., Kravitz, S.A., Smarr, L. Camera: a community resource for metagenomics. PLoS Biol. 2007; 5(3): e75.
78.	Goll, J., Rusch, D.B., Tanenbaum, D.M. METAREP: JCVI metagenomics reports – an open source tool for high-performance comparative metagenomics. Bioinformatics. 2010; 26(20): 2631–2.
79.	Huson, D.H., Mitra, S., Ruscheweyh, H-J, Schuster, S.C. Integrative analysis of environmental sequences using MEGAN4. Genome Res. 2011; 21(9): 1552–60.
80.	Lingner, T., Aßhauer, K.P., Schreiber, F., Meinicke, P. Comet – a web server for comparative functional profiling of metagenomes. Nucleic Acids Res. 2011; 39(Web Server issue): W518–23.
81.	Langille, M.G., Zaneveld, J., Caporaso, J.G. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol. 2013; 31(9): 814–21.
82.	Eisen, J. Environmental shotgun sequencing: its potential and challenges for studying the hidden world of microbes. PLoS Biol. 2007; 5(3): e82.
83.	Moran, M.A. Metatranscriptomics: eavesdropping on complex microbial communities. Microbiome. 2009; 4(7): 329–34.
84.	Poretsky, R.S., Bano, N., Buchan, A. Analysis of microbial gene transcripts in environmental samples. Appl Environ Microbiol. 2005; 71(7): 4121–6.
85.	Botero, L.M., D'imperio, S., Burr, M., McDermott, T.R., Young, M., Hassett, D.J. Poly(A) polymerase modification and reverse transcriptase PCR amplification of environmental RNA. Appl Environ Microbiol. 2005; 71(3): 1267–75.
86.	Carvalhais, L.C., Dennis, P.G., Tyson, G.W., Schenk, P.M. Application of metatranscriptomics to soil environments. J Microbiol Methods. 2012; 91(2): 246–51.
87.	Gilbert, J.A., Field, D., Huang, Y. Detection of large numbers of novel sequences in the metatranscriptomes of complex marine microbial communities. PLoS One. 2008; 3(8): e3042.
88.	Leimena, M.M., Ramiro-Garcia, J., Davids, M. A comprehensive metatranscriptome analysis pipeline and its validation using human small intestine microbiota datasets. BMC Genomics. 2013; 14(1): 530.
89.	Yost, S., Duran-Pinedo, A.E., Teles, R., Krishnan, K., Frias-Lopez, J. Functional signatures of oral dysbiosis during periodontitis progression revealed by microbial metatranscriptome analysis. Genome Med. 2015; 7(1): 27.
90.	Duran-Pinedo, A.E., Chen, T., Teles, R. Community-wide transcriptome of the oral microbiome in subjects with and without periodontitis. ISME J. 2014; 8(8): 1659–72.
91.	Jorth, P., Turner, K.H., Gumus, P., Nizam, N., Buduneli, N., Whiteley, M. Metatranscriptomics of the human oral microbiome during health and disease. mBio. 2014; 5(2): e1012–4.
92.	Xiong, X., Frank, D.N., Robertson, C.E. Generation and analysis of a mouse intestinal meta-transcriptome through illumina based RNA-sequencing. PLoS One. 2012; 7(4): e36009.
93.	Dumont, M.G., Pommerenke, B., Casper, P. Using stable isotope probing to obtain a targeted metatranscriptome of aerobic methanotrophs in lake sediment. Environ Microbiol Rep. 2013; 5(5): 757–64.
94.	Celaj, A., Markle, J., Danska, J., Parkinson, J. Comparison of assembly algorithms for improving rate of metatranscriptomic functional annotation. Microbiome. 2014; 2(1): 39.
95.	Grabherr, M.G., Haas, B.J., Yassour, M. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat Biotechnol. 2011; 29(7): 644–52.
96.	Namiki, T., Hachiya, T., Tanaka, H., Sakakibara, Y. Metavelvet: an extension of velvet assembler to de novo metagenome assembly from short sequence reads. Nucleic Acids Res. 2012; 40(20): e155.
97.	Schulz, M.H., Zerbino, D.R., Vingron, M., Birney, E. Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels. Bioinformatics. 2012; 28(8): 1086–92.
98.	Birol, I., Jackman, S.D., Nielsen, C.B. De novo transcriptome assembly with abyss. Bioinformatics. 2009; 25(21): 2872–7.
99.	Li, R., Zhu, H., Ruan, J. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 2010; 20: 265–72.
100.	Robertson, G., Schein, J., Chiu, R. De novo assembly and analysis of RNA-seq data. Nat Methods. 2010; 7(11): 909–12.
101.	Guttman, M., Garber, M., Levin, J.Z. Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincrnas. Nat Biotechnol. 2010; 28(5): 503–10.
102.	Trapnell, C., Williams, B.A., Pertea, G. Transcript assembly and quantification by RNA-seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010; 28(5): 511–5.
103.	Li, B., Dewey, C.N. Rsem: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinformatics. 2011; 12(1): 323.
104.	Haas, B.J., Papanicolaou, A., Yassour, M. De novo transcript sequence reconstruction from RNA-seq using the trinity platform for reference generation and analysis. Nat Protoc. 2013; 8(8): 1494–512.
105.	De Bona, F., Ossowski, S., Schneeberger, K., Rätsch, G. Optimal spliced alignments of short sequence reads. BMC Bioinformatics. 2008; 9(Suppl 10): i174–80.
106.	Cao, H.X., Schmutzer, T., Scholz, U., Pecinka, A., Schubert, I., Vu, G.T.H. Metatranscriptome analysis reveals host-microbiome interactions in traps of carnivorous Genlisea species. Front Microbiol. 2015; 6: 526.
107.	Peano, C., Pietrelli, A., Consolandi, C. An efficient rRNA removal method for RNA sequencing in GC-rich bacteria. Microb Inform Exp. 2013; 3(1): 1.
108.	Perez-Losada, M., Castro-Nallar, E., Bendall, M.L., Freishtat, R.J., Crandall, K.A. Dual transcriptomic profiling of host and microbiota during health and disease in pediatric asthma. PLoS One. 2015; 10: e0131819.
109.	Fiehn, O. Metabolomics – the link between genotypes and phenotypes. Plant Mol Biol. 2002; 48(1-2): 155–71.
110.	Bernini, P., Bertini, I., Luchinat, C. Individual human phenotypes in metabolic space and time. J Proteome Res. 2009; 8(9): 4264–71.
111.	Krumsiek, J., Mittelstrass, K., Do, K.T. Gender-specific pathway differences in the human serum metabolome. Metabolomics. 2015; 11(6): 1815–33.
112.	Mastrangelo, A., Armitage, E.G., Garcia, A., Barbas, C. Metabolomics as a tool for drug discovery and personalised medicine. A review. Curr Top Med Chem. 2014; 14(23): 2627–36.
113.	Xu, J., Mahowald, M.A., Ley, R.E. Evolution of symbiotic bacteria in the distal human intestine. PLoS Biol. 2007; 5(7): e156.
114.	Manor, O., Levy, R., Borenstein, E. Mapping the inner workings of the microbiome: genomic-and metagenomic-based study of metabolism and metabolic interactions in the human microbiome. Cell Metab. 2014; 20(5): 742–52.
115.	Wu, G.D., Compher, C., Chen, E.Z. Comparative metabolomics in vegans and omnivores reveal constraints on diet-dependent gut microbiota metabolite production. Gut. 2014; 65(1): 63–72.
116.	Lankadurai, B.P., Nagato, E.G., Simpson, M.J. Environmental metabolomics: an emerging approach to study organism responses to environmental stressors. Environ Rev. 2013; 21(3): 180–205.
117.	Kimes, N.E., Callaghan, A.V., Aktas, D.F. Metagenomic analysis and metabolite profiling of deep-sea sediments from the gulf of Mexico following the deepwater horizon oil spill. Front Microbiol. 2013; 4: 50.
118.	Bassler, B.L., Greenberg, E.P., Stevens, A.M. Cross-species induction of luminescence in the quorum-sensing bacterium Vibrio harveyi. J Bacteriol. 1997; 179(12): 1943–5.
119.	Miller, M.B., Bassler, B.L. Quorum sensing in bacteria. Ann Rev Microbiol. 2001; 55(1): 165–99.
120.	Bassler, B.L. Small talk: cell-to-cell communication in bacteria. Cell. 2002; 109(4): 421–4.
121.	Henke, J.M., Bassler, B.L. Three parallel quorum-sensing systems regulate gene expression in Vibrio harveyi. J Bacteriol. 2004; 186(20): 6902–14.
122.	Waters, C.M., Bassler, B.L. Quorum sensing: cell-to-cell communication in bacteria. Annu Rev Cell Dev Biol. 2005; 21: 319–46.
123.	Camilli, A., Bassler, B.L. Bacterial small-molecule signaling pathways. Science. 2006; 311(5764): 1113–6.
124.	Aldridge, B.B., Rhee, K.Y. Microbial metabolomics: innovation, application, insight. Curr Opin Microbiol. 2014; 19: 90–6.
125.	Smolinska, A., Blanchet, L., Buydens, L.M., Wijmenga, S.S. NMR and pattern recognition methods in metabolomics: from data acquisition to biomarker discovery: a review. Anal Chim Acta. 2012; 750: 82–97.
126.	Wishart, D.S., Tzur, D., Knox, C. HMDB: the human metabolome database. Nucleic Acids Res. 2007; 35(Suppl 1): D521–6.
127.	Wishart, D.S., Knox, C., Guo, A.C. HMDB: a knowledgebase for the human metabolome. Nucleic Acids Res. 2009; 37(Suppl 1): D603–10.
128.	Wishart, D.S., Jewison, T., Guo, A.C. HMDB 3.0-the human metabolome database in 2013. Nucleic Acids Res. 2012; 41(Database issue): D801–7.
129.	Ulrich, E.L., Akutsu, H., Doreleijers, J.F. Biomagresbank. Nucleic Acids Res. 2008; 36(Suppl 1): D402–8.
130.	Cui, Q., Lewis, I.A., Hegeman, A.D. Metabolite identification via the Madison metabolomics consortium database. Nat Biotechnol. 2008; 26(2): 162–4.
131.	Horai, H., Arita, M., Kanaya, S. Massbank: a public repository for sharing mass spectral data for life sciences. J Mass Spectrom. 2010; 45(7): 703–14.
132.	Kopka, J., Schauer, N., Krueger, S. Gmdcsb.db: the Golm metabolome database. Bioinformatics. 2005; 21(8): 1635–8.
133.	Smith, C.A., O'Maille, G., Want, E.J. Metlin: a metabolite mass spectral database. Ther Drug Monit. 2005; 27(6): 747–51.
134.	Reigstad, C.S., Kashyap, P.C. Beyond phylotyping: understanding the impact of gut microbiota on host biology. Neurogastroenterol Motil. 2013; 25(5): 358–72.
135.	Aw, W., Fukuda, S. Toward the comprehensive understanding of the gut ecosystem via metabolomics-based integrated omics approach. Semin Immunopathol. 2015; 37(1): 5–16.
136.	Mason, O.U., Hazen, T.C., Borglin, S. Metagenome, metatranscriptome and single-cell sequencing reveal microbial response to deepwater horizon oil spill. ISME J. 2012; 6(9): 1715–27.
137.	McNulty, N.P., Yatsunenko, T., Hsiao, A. The impact of a consortium of fermented milk strains on the gut microbiome of gnotobiotic mice and monozygotic twins. Sci Transl Med. 2011; 3(106): 106ra106.
138.	Maurice, C.F., Haiser, H.J., Turnbaugh, P.J. Xenobiotics shape the physiology and gene expression of the active human gut microbiome. Cell. 2013; 152(1): 39–50.
139.	Verberkmoes, N.C., Russell, A.L., Shah, M. Shotgun metaproteomics of the human distal gut microbiota. ISME J. 2009; 3(2): 179–89.
140.	Weir, T.L., Manter, D.K., Sheflin, A.M., Barnett, B.A., Heuberger, A.L., Ryan, E.P. Stool microbiome and metabolome differences between colorectal cancer patients and healthy adults. PLoS One. 2013; 8(8): e70803.
141.	Wang, Z., Klipfell, E., Bennett, B.J. Gut flora metabolism of phosphatidylcholine promotes cardiovascular disease. Nature. 2011; 472(7341): 57–63.
142.	Koeth, R.A., Wang, Z., Levison, B.S. Intestinal microbiota metabolism of l-carnitine, a nutrient in red meat, promotes atherosclerosis. Nat Med. 2013; 19(5): 576–85.
143.	Kaddurah-Daouk, R., Baillie, R.A., Zhu, H. Enteric microbiome metabolites correlate with response to simvastatin treatment. PLoS One. 2011; 6(10): e25482.
144.	Haiser, H.J., Gootenberg, D.B., Chatman, K., Sirasani, G., Balskus, E.P., Turnbaugh, P.J. Predicting and manipulating cardiac drug inactivation by the human gut bacterium Eggerthella lenta. Science. 2013; 341(6143): 295–8.
145.	Franzosa, E.A., Morgan, X.C., Segata, N. Relating the metatranscriptome and metagenome of the human gut. Proc Natl Acad Sci. 2014; 111(22): E2329–38.
146.	Shi, Y., Tyson, G.W., Eppley, J.M., DeLong, E.F. Integrated metatranscriptomic and metagenomic analyses of stratified microbial assemblages in the open ocean. ISME J. 2011; 5(6): 999–1013.
147.	Turnbaugh, P.J., Gordon, J.I. An invitation to the marriage of metagenomics and metabolomics. Cell. 2008; 134(5): 708–13.
148.	Lu, K., Abo, R.P., Schlieper, K.A. Arsenic exposure perturbs the gut microbiome and its metabolic profile in mice: an integrated metagenomics and metabolomics analysis. Environ Health Perspect. 2014; 122(3): 284–91.
149.	Zhang, Y., Zhao, F., Deng, Y., Zhao, Y., Ren, H. Metagenomic and metabolomic analysis of the toxic effects of trichloroacetamide-induced gut microbiome and urine metabolome perturbations in mice. J Proteome Res. 2015; 14(4): 1752–61.
150.	Narayanasamy, S., Muller, E.E., Sheik, A.R., Wilmes, P. Integrated omics for the identification of key functionalities in biological wastewater treatment microbial communities. Microb Biotechnol. 2015; 8(3): 363–8.
151.	Muller, E.E., Glaab, E., May, P., Vlassis, N., Wilmes, P. Condensing the omics fog of microbial communities. Trends Microbiol. 2013; 21(7): 325–33.
152.	Abram, F. Systems-based approaches to unravel multi-species microbial community functioning. Comput Struct Biotechnol J. 2015; 13: 24–32.
153.	Levy, R., Borenstein, E. Reverse ecology: from systems to environments and back. In: Soyer, O.S., ed. Evolutionary Systems Biology. Springer, New York; 2012: 329–45.
154.	Borenstein, E., Kupiec, M., Feldman, M.W., Ruppin, E. Large-scale reconstruction and phylogenetic analysis of metabolic environments. Proc Natl Acad Sci. 2008; 105(38): 14482–7.
155.	Ebenhöh, O., Handorf, T., Heinrich, R. Structural analysis of expanding metabolic networks. Genome Inform. 2004; 15(1): 35–45.
156.	Bachmaier, C., Brandes, U., Schreiber, F. Chapter 20: Biological networks. In: Tamassia, R., ed. Handbook of Graph Drawing and Visualization. CRC Press, Boca Raton, FL; 2013: 621–51.
157.	Wuchty, S., Ravasz, E., Barabasi, A-L. The architecture of biological networks. In: Deisboek, T.S., Kresh, J.Y., eds. Complex Systems Science in Biomedicine. Springer, New York; 2006: 165–81.
158.	Barabási, A-L, Oltvai, Z.N., Wuchty, S. Characteristics of biological networks. In: Ben-Naim, E., Frauenfelder, H., Tonoczkai, Z., eds. Complex Networks. Springer-Verlag, Berlin; 2004: 443–57.
159.	Pawson, T., Nash, P. Protein-protein interactions define specificity in signal transduction. Genes Dev. 2000; 9: 1027–47.
160.	Dutkowski, J., Kramer, M., Surma, M.A. A gene ontology inferred from molecular networks. Nat Biotechnol. 2013; 31: 38–45.
161.	Demchak, B., Hull, T., Reich, M. Cytoscape: the network visualization tool for genomespace workflows. F1000Res. 2014; 2014(3): 151–63.
162.	Friedman, J., Alm, E.J. Inferring correlation networks from genomic survey data. PLoS Comput Biol. 2012; 8(9): e1002687.
163.	Srivas, R., Hannum, G., Ruscheinski, J. Assembling global maps of cellular function through integrative analysis of physical and genetic networks. Nat Protoc. 2011; 6(9): 1308–23.
164.	Amar, D., Shamir, R. Constructing module maps for integrated analysis of heterogeneous biological networks. Nucleic Acids Res. 2014; 42(7): 4208–19.
165.	van Dam, J.C., Schaap, P.J., dos Santos, V.A.M., Suárez-Diez, M. Integration of heterogeneous molecular networks to unravel gene-regulation in Mycobacterium tuberculosis. BMC Syst Biol. 2014; 8(1): 111.


Metagenomics allows the study of microbial communities like those present in this stream receiving acid drainage from surface coal mining.

Metagenomics is the study of genetic material recovered directly from environmental samples. The broad field may also be referred to as environmental genomics, ecogenomics or community genomics.

While traditional microbiology and microbial genome sequencing and genomics rely upon cultivated clonal cultures, early environmental gene sequencing cloned specific genes (often the 16S rRNA gene) to produce a profile of diversity in a natural sample. Such work revealed that the vast majority of microbial biodiversity had been missed by cultivation-based methods.^[1]

Because of its ability to reveal the previously hidden diversity of microscopic life, metagenomics offers a powerful lens for viewing the microbial world that has the potential to revolutionize understanding of the entire living world.^[2] As the price of DNA sequencing continues to fall, metagenomics now allows microbial ecology to be investigated at a much greater scale and in much greater detail than before. Recent studies use either 'shotgun' or PCR-directed sequencing to get largely unbiased samples of all genes from all the members of the sampled communities.^[3]


Etymology

The term 'metagenomics' was first used by Jo Handelsman, Jon Clardy, Robert M. Goodman, Sean F. Brady, and others, and first appeared in publication in 1998.^[4] The term metagenome referenced the idea that a collection of genes sequenced from the environment could be analyzed in a way analogous to the study of a single genome. In 2005, Kevin Chen and Lior Pachter (researchers at the University of California, Berkeley) defined metagenomics as 'the application of modern genomics technique without the need for isolation and lab cultivation of individual species'.^[5]

History

Conventional sequencing begins with a culture of identical cells as a source of DNA. However, early metagenomic studies revealed that there are probably large groups of microorganisms in many environments that cannot be cultured and thus cannot be sequenced. These early studies focused on 16S ribosomal RNA sequences, which are relatively short, often conserved within a species, and generally different between species. Many 16S rRNA sequences have been found which do not belong to any known cultured species, indicating that there are numerous non-isolated organisms. These surveys of ribosomal RNA (rRNA) genes taken directly from the environment revealed that cultivation-based methods find less than 1% of the bacterial and archaeal species in a sample.^[1] Much of the interest in metagenomics comes from these discoveries, which showed that the vast majority of microorganisms had previously gone unnoticed.

Early molecular work in the field was conducted by Norman R. Pace and colleagues, who used PCR to explore the diversity of ribosomal RNA sequences.^[6] The insights gained from these breakthrough studies led Pace to propose the idea of cloning DNA directly from environmental samples as early as 1985.^[7] This led to the first report of isolating and cloning bulk DNA from an environmental sample, published by Pace and colleagues in 1991^[8] while Pace was in the Department of Biology at Indiana University. Considerable efforts ensured that these were not PCR false positives and supported the existence of a complex community of unexplored species. Although this methodology was limited to exploring highly conserved, non-protein coding genes, it did support early microbial morphology-based observations that diversity was far more complex than was known by culturing methods. Soon after that, Healy reported the metagenomic isolation of functional genes from 'zoolibraries' constructed from a complex culture of environmental organisms grown in the laboratory on dried grasses in 1995.^[9] After leaving the Pace laboratory, Edward DeLong continued in the field and has published work that has largely laid the groundwork for environmental phylogenies based on signature 16S sequences, beginning with his group's construction of libraries from marine samples.^[10]

In 2002, Mya Breitbart, Forest Rohwer, and colleagues used environmental shotgun sequencing (see below) to show that 200 liters of seawater contains over 5000 different viruses.^[11] Subsequent studies showed that there are more than a thousand viral species in human stool and possibly a million different viruses per kilogram of marine sediment, including many bacteriophages. Essentially all of the viruses in these studies were new species. In 2004, Gene Tyson, Jill Banfield, and colleagues at the University of California, Berkeley and the Joint Genome Institute sequenced DNA extracted from an acid mine drainage system.^[12] This effort resulted in the complete, or nearly complete, genomes for a handful of bacteria and archaea that had previously resisted attempts to culture them.^[13]

Flow diagram of a typical metagenome project^[14]

Beginning in 2003, Craig Venter, leader of the privately funded parallel of the Human Genome Project, led the Global Ocean Sampling Expedition (GOS), circumnavigating the globe and collecting metagenomic samples throughout the journey. All of these samples were sequenced using shotgun sequencing, in the hope that new genomes (and therefore new organisms) would be identified. The pilot project, conducted in the Sargasso Sea, found DNA from nearly 2000 different species, including 148 types of bacteria never before seen.^[15] Venter has circumnavigated the globe and thoroughly explored the West Coast of the United States, and completed a two-year expedition to explore the Baltic, Mediterranean and Black Seas. Analysis of the metagenomic data collected during this journey revealed two groups of organisms, one composed of taxa adapted to environmental conditions of 'feast or famine', and a second composed of relatively fewer but more abundantly and widely distributed taxa primarily composed of plankton.^[16]

In 2005 Stephan C. Schuster at Penn State University and colleagues published the first sequences of an environmental sample generated with high-throughput sequencing, in this case massively parallel pyrosequencing developed by 454 Life Sciences.^[17] Another early paper in this area appeared in 2006 by Robert Edwards, Forest Rohwer, and colleagues at San Diego State University.^[18]

Sequencing

Recovery of DNA sequences longer than a few thousand base pairs from environmental samples was very difficult until recent advances in molecular biological techniques allowed the construction of libraries in bacterial artificial chromosomes (BACs), which provided better vectors for molecular cloning.^[19]

Environmental Shotgun Sequencing (ESS). (A) Sampling from habitat; (B) filtering particles, typically by size; (C) Lysis and DNA extraction; (D) cloning and library construction; (E) sequencing the clones; (F) sequence assembly into contigs and scaffolds.

Shotgun metagenomics

Advances in bioinformatics, refinements of DNA amplification, and the proliferation of computational power have greatly aided the analysis of DNA sequences recovered from environmental samples, allowing the adaptation of shotgun sequencing to metagenomic samples (known also as whole metagenome shotgun or WMGS sequencing). The approach, used to sequence many cultured microorganisms and the human genome, randomly shears DNA, sequences many short sequences, and reconstructs them into a consensus sequence. Shotgun sequencing reveals genes present in environmental samples. Historically, clone libraries were used to facilitate this sequencing. However, with advances in high throughput sequencing technologies, the cloning step is no longer necessary and greater yields of sequencing data can be obtained without this labour-intensive bottleneck step. Shotgun metagenomics provides information both about which organisms are present and what metabolic processes are possible in the community.^[20] Because the collection of DNA from an environment is largely uncontrolled, the most abundant organisms in an environmental sample are most highly represented in the resulting sequence data. To achieve the high coverage needed to fully resolve the genomes of under-represented community members, large samples, often prohibitively so, are needed. On the other hand, the random nature of shotgun sequencing ensures that many of these organisms, which would otherwise go unnoticed using traditional culturing techniques, will be represented by at least some small sequence segments.^[12] An emerging approach combines shotgun sequencing and chromosome conformation capture (Hi-C), which measures the proximity of any two DNA sequences within the same cell, to guide microbial genome assembly.^[21]
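
The abundance bias described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the two random 5 kb "genomes", the 99:1 abundance ratio, the read length and the function names `shotgun_reads` and `mean_coverage` are all hypothetical and chosen for illustration, not taken from any published pipeline.

```python
import random

def shotgun_reads(genomes, abundances, n_reads, read_len, seed=0):
    """Sample reads from random positions across a mixed community.

    A genome is chosen with probability proportional to
    (relative abundance x genome length), i.e. its share of the DNA pool.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    names = list(genomes)
    weights = [abundances[n] * len(genomes[n]) for n in names]
    reads = []
    for _ in range(n_reads):
        name = rng.choices(names, weights=weights, k=1)[0]
        seq = genomes[name]
        start = rng.randrange(len(seq) - read_len + 1)
        reads.append((name, seq[start:start + read_len]))
    return reads

def mean_coverage(reads, genomes):
    """Average number of times each base of a genome is covered by reads."""
    bases = {name: 0 for name in genomes}
    for name, read in reads:
        bases[name] += len(read)
    return {name: bases[name] / len(genomes[name]) for name in genomes}

# Toy community (assumption): two random genomes, one 99 times more abundant.
_rng = random.Random(1)
genomes = {
    "dominant": "".join(_rng.choice("ACGT") for _ in range(5000)),
    "rare": "".join(_rng.choice("ACGT") for _ in range(5000)),
}
abundances = {"dominant": 0.99, "rare": 0.01}
reads = shotgun_reads(genomes, abundances, n_reads=2000, read_len=100)
coverage = mean_coverage(reads, genomes)
print(coverage)
```

In this toy setting the rare genome still contributes a few reads, but far too few to resolve it, which is why deep sequencing is needed for under-represented community members.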

High-throughput sequencing

The first metagenomic studies conducted using high-throughput sequencing used massively parallel 454 pyrosequencing.^[17] Three other technologies commonly applied to environmental sampling are the Ion Torrent Personal Genome Machine, the Illumina MiSeq or HiSeq and the Applied Biosystems SOLiD system.^[22] These techniques for sequencing DNA generate shorter fragments than Sanger sequencing; the Ion Torrent PGM System and 454 pyrosequencing typically produce ~400 bp reads, Illumina MiSeq produces 400-700 bp reads (depending on whether paired-end options are used), and SOLiD produces 25-75 bp reads.^[23] Historically, these read lengths were significantly shorter than the typical Sanger sequencing read length of ~750 bp, although the Illumina technology is quickly coming close to this benchmark. This limitation is compensated for by the much larger number of sequence reads. In 2009, pyrosequenced metagenomes generated 200–500 megabases, and Illumina platforms generated around 20–50 gigabases, but these outputs have increased by orders of magnitude in recent years.^[24] An additional advantage of high-throughput sequencing is that this technique does not require cloning the DNA before sequencing, removing one of the main biases and bottlenecks in environmental sampling.

Bioinformatics

The data generated by metagenomics experiments are both enormous and inherently noisy, containing fragmented data representing as many as 10,000 species.^[25] The sequencing of the cow rumen metagenome generated 279 gigabases, or 279 billion base pairs of nucleotide sequence data,^[26] while the human gut microbiome gene catalog identified 3.3 million genes assembled from 567.7 gigabases of sequence data.^[27] Collecting, curating, and extracting useful biological information from datasets of this size represent significant computational challenges for researchers.^[20]^[28]^[29]^[30]

Sequence pre-filtering

The first step of metagenomic data analysis requires the execution of certain pre-filtering steps, including the removal of redundant, low-quality sequences and sequences of probable eukaryotic origin (especially in metagenomes of human origin).^[31]^[32] The methods available for the removal of contaminating eukaryotic genomic DNA sequences include Eu-Detect and DeConseq.^[33]^[34]
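
A minimal sketch of two of these pre-filtering steps, dropping low-quality reads and exact duplicates, might look as follows. The read representation (a sequence paired with a list of Phred quality scores), the function name `prefilter` and the mean-quality threshold of 20 are assumptions made for illustration. Removal of eukaryotic contamination, as done by tools such as Eu-Detect or DeConseq, is not attempted here.

```python
def prefilter(reads, min_mean_quality=20):
    """Keep the first copy of each sequence whose mean quality passes the threshold.

    `reads` is an iterable of (sequence, quality_scores) pairs, where
    quality_scores is a list of Phred-style integers (an assumed format).
    """
    seen = set()
    kept = []
    for seq, quals in reads:
        # Drop reads with no quality information or a low mean quality.
        if not quals or sum(quals) / len(quals) < min_mean_quality:
            continue
        # Drop exact duplicates of a sequence that was already kept.
        if seq in seen:
            continue
        seen.add(seq)
        kept.append((seq, quals))
    return kept

example = [
    ("ACGT", [30, 30, 30, 30]),
    ("ACGT", [35, 35, 35, 35]),  # exact duplicate of the first read
    ("GGGG", [5, 5, 5, 5]),      # low quality
    ("TTTT", [20, 20, 20, 20]),
]
filtered = prefilter(example)
print([seq for seq, _ in filtered])  # ['ACGT', 'TTTT']
```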

Assembly

DNA sequence data from genomic and metagenomic projects are essentially the same, but genomic sequence data offers higher coverage while metagenomic data is usually highly non-redundant.^[29] Furthermore, the increased use of second-generation sequencing technologies with short read lengths means that much of future metagenomic data will be error-prone. Taken in combination, these factors make the assembly of metagenomic sequence reads into genomes difficult and unreliable. Misassemblies are caused by repetitive DNA sequences, which are especially difficult to resolve because the species present in the sample differ in relative abundance.^[35] Misassemblies can also involve the combination of sequences from more than one species into chimeric contigs.^[35]

There are several assembly programs, most of which can use information from paired-end tags in order to improve the accuracy of assemblies. Some programs, such as Phrap or Celera Assembler, were designed to be used to assemble single genomes but nevertheless produce good results when assembling metagenomic data sets.^[25] Other programs, such as Velvet assembler, have been optimized for the shorter reads produced by second-generation sequencing through the use of de Bruijn graphs. The use of reference genomes allows researchers to improve the assembly of the most abundant microbial species, but this approach is limited by the small subset of microbial phyla for which sequenced genomes are available.^[35] After an assembly is created, an additional challenge is 'metagenomic deconvolution', or determining which sequences come from which species in the sample.^[36]
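
To make the de Bruijn graph idea concrete, the sketch below builds such a graph from short reads and walks an unambiguous path to recover a contig. It is a toy illustration only, not how Velvet or any other assembler is implemented: it ignores sequencing errors, reverse complements, coverage and paired-end information, and the function names `de_bruijn` and `walk_unambiguous` are hypothetical.

```python
from collections import defaultdict

def de_bruijn(reads, k):
    """Build a de Bruijn graph: nodes are (k-1)-mers, edges are k-mers."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph

def walk_unambiguous(graph, start):
    """Extend a contig from `start` while each node has exactly one successor."""
    contig = start
    node = start
    seen = {start}
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        if nxt in seen:  # stop on cycles (repeats) instead of looping forever
            break
        contig += nxt[-1]
        seen.add(nxt)
        node = nxt
    return contig

# Three overlapping toy reads drawn from the sequence ATGGCGTGCA.
reads = ["ATGGCG", "GGCGTG", "CGTGCA"]
graph = de_bruijn(reads, k=4)
print(walk_unambiguous(graph, "ATG"))  # ATGGCGTGCA
```

The walk stops at any node with more than one successor, which is where repeats or sequences shared between species would otherwise produce the chimeric contigs mentioned above.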

Gene prediction

Metagenomic analysis pipelines use two approaches in the annotation of coding regions in the assembled contigs.^[35] The first approach is to identify genes based upon homology with genes that are already publicly available in sequence databases, usually by BLAST searches. This type of approach is implemented in the program MEGAN4.^[37] The second, ab initio, uses intrinsic features of the sequence to predict coding regions based upon gene training sets from related organisms. This is the approach taken by programs such as GeneMark^[38] and GLIMMER. The main advantage of ab initio prediction is that it enables the detection of coding regions that lack homologs in the sequence databases; however, it is most accurate when there are large regions of contiguous genomic DNA available for comparison.^[25]
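
As a rough illustration of predicting coding regions from intrinsic sequence features alone, the sketch below scans the forward strand for open reading frames (an ATG followed in frame by a stop codon). This is a deliberately simplified stand-in: programs such as GeneMark and GLIMMER rely on trained statistical models rather than a plain ORF scan, and the function name `find_orfs` and the `min_codons` threshold are assumptions for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(seq, min_codons=3):
    """Return (start, end, frame) for forward-strand ORFs.

    An ORF starts at an ATG and ends at the first in-frame stop codon;
    `end` is exclusive and includes the stop codon. Only ORFs with at
    least `min_codons` codons before the stop are reported.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None
    return orfs

contig = "CCATGAAACCCGGGTAACC"
print(find_orfs(contig))  # [(2, 17, 2)]
```

On short, fragmented contigs many true genes are truncated and lack either the start or the stop codon, which is one reason ab initio prediction works best on long contiguous regions.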

Species diversity

A 2016 representation of the tree of life^[39]

Gene annotations provide the 'what', while measurements of species diversity provide the 'who'.^[40] In order to connect community composition and function in metagenomes, sequences must be binned. Binning is the process of associating a particular sequence with an organism.^[35] In similarity-based binning, methods such as BLAST are used to rapidly search for phylogenetic markers or otherwise similar sequences in existing public databases. This approach is implemented in MEGAN.^[41] Another tool, PhymmBL, uses interpolated Markov models to assign reads.^[25] MetaPhlAn and AMPHORA are methods based on unique clade-specific markers for estimating organismal relative abundances with improved computational performance.^[42] Other tools, like mOTUs^[43]^[44] and MetaPhyler,^[45] use universal marker genes to profile prokaryotic species. With the mOTUs profiler it is possible to profile species without a reference genome, improving the estimation of microbial community diversity.^[44] Recent methods, such as SLIMM, use the read coverage landscape of individual reference genomes to minimize false-positive hits and get reliable relative abundances.^[46] In composition-based binning, methods use intrinsic features of the sequence, such as oligonucleotide frequencies or codon usage bias.^[25] Once sequences are binned, it is possible to carry out comparative analysis of diversity and richness.
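
Composition-based binning can be sketched with tetranucleotide frequencies. The example below assigns a contig to whichever reference sequence has the closest 4-mer frequency profile. Nearest-profile assignment with Euclidean distance is a simplification chosen for illustration rather than the method of any tool named above, and the function names and toy reference sequences are hypothetical.

```python
from itertools import product
from math import sqrt

def kmer_profile(seq, k=4):
    """Relative frequency of every possible k-mer over A, C, G, T."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in counts:  # skip words containing ambiguous bases such as N
            counts[word] += 1
    total = sum(counts.values()) or 1
    return [counts[w] / total for w in kmers]

def euclidean(a, b):
    """Euclidean distance between two equal-length frequency vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def assign_bin(contig, references, k=4):
    """Return the name of the reference whose k-mer profile is closest."""
    profile = kmer_profile(contig, k)
    return min(
        references,
        key=lambda name: euclidean(profile, kmer_profile(references[name], k)),
    )

# Toy references with very different composition (assumption).
references = {
    "at_rich": "ATATTAATTATAAT" * 20,
    "gc_rich": "GCGGCCGCGCCGGC" * 20,
}
contig = references["at_rich"][5:65]
print(assign_bin(contig, references))  # at_rich
```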

Data integration

The massive amount of exponentially growing sequence data is a daunting challenge that is complicated by the complexity of the metadata associated with metagenomic projects. Metadata includes detailed information about the three-dimensional (including depth, or height) geography and environmental features of the sample, physical data about the sample site, and the methodology of the sampling.^[29] This information is necessary both to ensure replicability and to enable downstream analysis. Because of its importance, metadata and collaborative data review and curation require standardized data formats located in specialized databases, such as the Genomes OnLine Database (GOLD).^[47]

Several tools have been developed to integrate metadata and sequence data, allowing downstream comparative analyses of different datasets using a number of ecological indices. In 2007, Folker Meyer and Robert Edwards and a team at Argonne National Laboratory and the University of Chicago released the Metagenomics Rapid Annotation using Subsystem Technology server (MG-RAST), a community resource for metagenome data set analysis.^[48] As of June 2012, over 14.8 terabases (14.8 × 10¹² bases) of DNA had been analyzed, with more than 10,000 public data sets freely available for comparison within MG-RAST. Over 8,000 users had by then submitted a total of 50,000 metagenomes to MG-RAST. The Integrated Microbial Genomes/Metagenomes (IMG/M) system also provides a collection of tools for functional analysis of microbial communities based on their metagenome sequence, drawing on reference isolate genomes from the Integrated Microbial Genomes (IMG) system and the Genomic Encyclopedia of Bacteria and Archaea (GEBA) project.^[49]

One of the first standalone tools for analysing high-throughput metagenome shotgun data was MEGAN (MEta Genome ANalyzer).^[37]^[41] A first version of the program was used in 2005 to analyse the metagenomic context of DNA sequences obtained from a mammoth bone.^[17] Based on a BLAST comparison against a reference database, this tool performs both taxonomic and functional binning, by placing the reads onto the nodes of the NCBI taxonomy using a simple lowest common ancestor (LCA) algorithm or onto the nodes of the SEED or KEGG classifications, respectively.^[50]
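
The lowest common ancestor idea can be sketched in a few lines: a read that matches several taxa is placed on the deepest taxonomy node shared by all of them. The child-to-parent dictionary below is a heavily simplified, hypothetical fragment of a taxonomy (most ranks are omitted), and the functions are an illustration of the general principle rather than MEGAN's actual implementation.

```python
def lineage(taxon, parent):
    """Path from the root down to `taxon`, following child -> parent links."""
    path = [taxon]
    while taxon in parent:
        taxon = parent[taxon]
        path.append(taxon)
    return path[::-1]  # root first

def lowest_common_ancestor(taxa, parent):
    """Deepest node shared by the lineages of all `taxa` (None if no taxa)."""
    paths = [lineage(t, parent) for t in taxa]
    lca = None
    for level in zip(*paths):
        if len(set(level)) != 1:
            break
        lca = level[0]
    return lca

# Toy taxonomy (assumption): intermediate ranks are left out for brevity.
parent = {
    "Escherichia coli": "Escherichia",
    "Escherichia": "Enterobacteriaceae",
    "Salmonella enterica": "Salmonella",
    "Salmonella": "Enterobacteriaceae",
    "Enterobacteriaceae": "Bacteria",
    "Bacillus subtilis": "Bacillus",
    "Bacillus": "Bacteria",
}

hits = ["Escherichia coli", "Salmonella enterica"]
print(lowest_common_ancestor(hits, parent))  # Enterobacteriaceae
```

A read whose hits span distant taxa is therefore assigned conservatively to a high-level node, while a read with hits to a single species stays at that species.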

With the advent of fast and inexpensive sequencing instruments, the growth of databases of DNA sequences is now exponential (e.g., the NCBI GenBank database^[51]). Faster, more efficient tools are needed to keep pace with high-throughput sequencing, because BLAST-based approaches such as MG-RAST or MEGAN are slow to annotate large samples (e.g., several hours to process a small or medium-sized dataset^[52]). Thus, ultra-fast classifiers have recently emerged, thanks to more affordable powerful servers. These tools can perform taxonomic annotation at extremely high speed; for example, according to its authors, CLARK^[53] can accurately classify '32 million metagenomic short reads per minute'. At such a speed, a very large dataset of a billion short reads can be processed in about 30 minutes.
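
The 30-minute figure follows directly from the quoted throughput, as the short calculation below shows. The function name is hypothetical, and the calculation assumes the quoted rate is sustained for the whole dataset.

```python
def minutes_to_classify(n_reads, reads_per_minute):
    """Processing time in minutes at a constant classification rate."""
    return n_reads / reads_per_minute

# One billion reads at the quoted 32 million reads per minute.
print(minutes_to_classify(1_000_000_000, 32_000_000))  # 31.25
```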

With the increasing availability of samples containing ancient DNA, and because of the uncertainty associated with the nature of those samples (ancient DNA damage), FALCON,^[54] a fast tool capable of producing conservative similarity estimates, has been made available. According to FALCON's authors, it can use relaxed thresholds and edit distances without affecting memory and speed performance.

Comparative metagenomics

Comparative analyses between metagenomes can provide additional insight into the function of complex microbial communities and their role in host health.^[55] Pairwise or multiple comparisons between metagenomes can be made at the level of sequence composition (comparing GC-content or genome size), taxonomic diversity, or functional complement. Comparisons of population structure and phylogenetic diversity can be made on the basis of 16S and other phylogenetic marker genes, or—in the case of low-diversity communities—by genome reconstruction from the metagenomic dataset.^[56] Functional comparisons between metagenomes may be made by comparing sequences against reference databases such as COG or KEGG, and tabulating the abundance by category and evaluating any differences for statistical significance.^[50] This gene-centric approach emphasizes the functional complement of the community as a whole rather than taxonomic groups, and shows that the functional complements are analogous under similar environmental conditions.^[56] Consequently, metadata on the environmental context of the metagenomic sample is especially important in comparative analyses, as it provides researchers with the ability to study the effect of habitat upon community structure and function.^[25]
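
Two of the comparison levels mentioned above, sequence composition and taxonomic diversity, can be sketched with simple summary statistics. The example computes GC content for a sequence and the Shannon diversity index for a table of taxon counts. The Shannon index is used here as one commonly applied diversity measure, not one prescribed by the text, and the sample counts are invented for illustration.

```python
from math import log

def gc_content(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)

def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over taxa with non-zero counts."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    props = [c / total for c in counts.values() if c > 0]
    return -sum(p * log(p) for p in props)

# Hypothetical taxon counts for two samples.
sample_a = {"taxon1": 25, "taxon2": 25, "taxon3": 25, "taxon4": 25}
sample_b = {"taxon1": 97, "taxon2": 1, "taxon3": 1, "taxon4": 1}
print(shannon_index(sample_a), shannon_index(sample_b))
print(gc_content("ATGC"))  # 0.5
```

Both samples contain four taxa, but the evenly distributed sample has the higher index, which is the kind of difference a comparative analysis would then test for statistical significance.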

Additionally, several studies have utilized oligonucleotide usage patterns to identify the differences across diverse microbial communities. Examples of such methodologies include the dinucleotide relative abundance approach by Willner et al.^[57] and the HabiSign approach of Ghosh et al.^[58] This latter study also indicated that differences in tetranucleotide usage patterns can be used to identify genes (or metagenomic reads) originating from specific habitats. Some methods, such as TriageTools^[59] or Compareads,^[60] detect similar reads between two read sets. The similarity measure they apply to reads is based on the number of identical words of length k shared by pairs of reads.
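
A similarity measure based on shared words of length k can be sketched as follows. This illustrates the general idea only; it does not reproduce the data structures or exact criteria used by TriageTools or Compareads, and the function names and the `min_shared` threshold are assumptions.

```python
def kmer_set(read, k):
    """Set of distinct words of length k occurring in a read."""
    return {read[i:i + k] for i in range(len(read) - k + 1)}

def shared_kmers(read_a, read_b, k):
    """Number of distinct k-mers that two reads have in common."""
    return len(kmer_set(read_a, k) & kmer_set(read_b, k))

def similar_pairs(set_a, set_b, k, min_shared):
    """Index pairs (i, j) of reads sharing at least `min_shared` k-mers."""
    return [
        (i, j)
        for i, a in enumerate(set_a)
        for j, b in enumerate(set_b)
        if shared_kmers(a, b, k) >= min_shared
    ]

reads_a = ["ACGTACGT", "TTTTTTTT"]
reads_b = ["CGTACGTT"]
print(shared_kmers(reads_a[0], reads_b[0], k=4))          # 4
print(similar_pairs(reads_a, reads_b, k=4, min_shared=2))  # [(0, 0)]
```

The all-against-all loop is quadratic in the number of reads, so it is only workable for toy inputs; practical tools index the k-mers of one read set first.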

A key goal in comparative metagenomics is to identify the microbial group(s) responsible for conferring specific characteristics on a given environment. However, artifacts arising from the sequencing technologies need to be accounted for, as is done in metagenomeSeq.^[28] Others have characterized inter-microbial interactions between the resident microbial groups. A GUI-based comparative metagenomic analysis application called Community-Analyzer has been developed by Kuntal et al.^[61] It implements a correlation-based graph layout algorithm that not only facilitates quick visualization of the differences in the analyzed microbial communities (in terms of their taxonomic composition), but also provides insights into the inherent inter-microbial interactions occurring therein. Notably, this layout algorithm also enables grouping of the metagenomes based on the probable inter-microbial interaction patterns rather than simply comparing abundance values of various taxonomic groups. In addition, the tool implements several interactive GUI-based functionalities that enable users to perform standard comparative analyses across microbiomes.

Data analysis

Community metabolism

In many bacterial communities, natural or engineered (such as bioreactors), there is significant division of labor in metabolism (syntrophy), in which the waste products of some organisms are metabolites for others.^[62] In one such system, the methanogenic bioreactor, functional stability requires the presence of several syntrophic species (Syntrophobacterales and Synergistia) working together in order to turn raw resources into fully metabolized waste (methane).^[63] Using comparative gene studies and expression experiments with microarrays or proteomics, researchers can piece together a metabolic network that goes beyond species boundaries. Such studies require detailed knowledge about which versions of which proteins are coded by which species and even by which strains of which species. Therefore, community genomic information is another fundamental tool (with metabolomics and proteomics) in the quest to determine how metabolites are transferred and transformed by a community.^[64]

Metatranscriptomics

Metagenomics allows researchers to access the functional and metabolic diversity of microbial communities, but it cannot show which of these processes are active.^[56] The extraction and analysis of metagenomic mRNA (the metatranscriptome) provides information on the regulation and expression profiles of complex communities. Because of the technical difficulties (the short half-life of mRNA, for example) in the collection of environmental RNA there have been relatively few in situ metatranscriptomic studies of microbial communities to date.^[56] While originally limited to microarray technology, metatranscriptomics studies have made use of transcriptomics technologies to measure whole-genome expression and quantification of a microbial community,^[56] first employed in analysis of ammonia oxidation in soils.^[65]

Viruses

Metagenomic sequencing is particularly useful in the study of viral communities. As viruses lack a shared universal phylogenetic marker (such as 16S rRNA for bacteria and archaea, and 18S rRNA for eukarya), the only way to access the genetic diversity of the viral community from an environmental sample is through metagenomics. Viral metagenomes (also called viromes) should thus provide more and more information about viral diversity and evolution.^[66]^[67]^[68]^[69]^[70] For example, a metagenomic pipeline called Giant Virus Finder showed the first evidence of the existence of giant viruses in a saline desert^[71] and in Antarctic dry valleys.^[72]

Applications

Metagenomics has the potential to advance knowledge in a wide variety of fields. It can also be applied to solve practical challenges in medicine, engineering, agriculture, sustainability and ecology.^[29]

Agriculture

The soils in which plants grow are inhabited by microbial communities, with one gram of soil containing around 10⁹-10¹⁰ microbial cells which comprise about one gigabase of sequence information.^[73]^[74] The microbial communities which inhabit soils are some of the most complex known to science, and remain poorly understood despite their economic importance.^[75] Microbial consortia perform a wide variety of ecosystem services necessary for plant growth, including fixing atmospheric nitrogen, nutrient cycling, disease suppression, and sequestering iron and other metals.^[76] Functional metagenomics strategies are being used to explore the interactions between plants and microbes through cultivation-independent study of these microbial communities.^[77]^[78] By allowing insights into the role of previously uncultivated or rare community members in nutrient cycling and the promotion of plant growth, metagenomic approaches can contribute to improved disease detection in crops and livestock and the adoption of enhanced farming practices which improve crop health by harnessing the relationship between microbes and plants.^[29]

Biofuel

Bioreactors allow the observation of microbial communities as they convert biomass into cellulosic ethanol.

Biofuels are fuels derived from biomass conversion, as in the conversion of cellulose contained in corn stalks, switchgrass, and other biomass into cellulosic ethanol.^[29] This process is dependent upon microbial consortia (associations) that transform the cellulose into sugars, followed by the fermentation of the sugars into ethanol. Microbes also produce a variety of sources of bioenergy including methane and hydrogen.^[29]

The efficient industrial-scale deconstruction of biomass requires novel enzymes with higher productivity and lower cost.^[26] Metagenomic approaches to the analysis of complex microbial communities allow the targeted screening of enzymes with industrial applications in biofuel production, such as glycoside hydrolases.^[79] Furthermore, knowledge of how these microbial communities function is required to control them, and metagenomics is a key tool in their understanding. Metagenomic approaches allow comparative analyses between convergent microbial systems like biogas fermenters^[80] or insect herbivores such as the fungus garden of the leafcutter ants.^[81]

Biotechnology

Microbial communities produce a vast array of biologically active chemicals that are used in competition and communication.^[76] Many of the drugs in use today were originally uncovered in microbes; recent progress in mining the rich genetic resource of non-culturable microbes has led to the discovery of new genes, enzymes, and natural products.^[56]^[82] The application of metagenomics has allowed the development of commodity and fine chemicals, agrochemicals and pharmaceuticals where the benefit of enzyme-catalyzed chiral synthesis is increasingly recognized.^[83]

Two types of analysis are used in the bioprospecting of metagenomic data: function-driven screening for an expressed trait, and sequence-driven screening for DNA sequences of interest.^[84] Function-driven analysis seeks to identify clones expressing a desired trait or useful activity, followed by biochemical characterization and sequence analysis. This approach is limited by availability of a suitable screen and the requirement that the desired trait be expressed in the host cell. Moreover, the low rate of discovery (less than one per 1,000 clones screened) and its labor-intensive nature further limit this approach.^[85] In contrast, sequence-driven analysis uses conserved DNA sequences to design PCR primers to screen clones for the sequence of interest.^[84] In comparison to cloning-based approaches, using a sequence-only approach further reduces the amount of bench work required. The application of massively parallel sequencing also greatly increases the amount of sequence data generated, which require high-throughput bioinformatic analysis pipelines.^[85] The sequence-driven approach to screening is limited by the breadth and accuracy of gene functions present in public sequence databases. In practice, experiments make use of a combination of both functional and sequence-based approaches based upon the function of interest, the complexity of the sample to be screened, and other factors.^[85]^[86] An example of success using metagenomics as a biotechnology for drug discovery is illustrated with the malacidin antibiotics.^[87]

Ecology

Metagenomics can provide valuable insights into the functional ecology of environmental communities.^[88] Metagenomic analysis of the bacterial consortia found in the defecations of Australian sea lions suggests that nutrient-rich sea lion faeces may be an important nutrient source for coastal ecosystems. This is because the bacteria that are expelled simultaneously with the defecations are adept at breaking down the nutrients in the faeces into a bioavailable form that can be taken up into the food chain.^[89]


DNA sequencing can also be used more broadly to identify species present in a body of water,^[90] debris filtered from the air, or a sample of dirt. This can establish the range of invasive species and endangered species, and track seasonal populations.

Environmental remediation

Metagenomics can improve strategies for monitoring the impact of pollutants on ecosystems and for cleaning up contaminated environments. Increased understanding of how microbial communities cope with pollutants improves assessments of the potential of contaminated sites to recover from pollution and increases the chances of bioaugmentation or biostimulation trials to succeed.^[91]

Gut microbe characterization

Microbial communities play a key role in preserving human health, but their composition and the mechanism by which they do so remain mysterious.^[92] Metagenomic sequencing is being used to characterize the microbial communities from 15-18 body sites from at least 250 individuals. This is part of the Human Microbiome initiative, whose primary goals are to determine if there is a core human microbiome, to understand the changes in the human microbiome that can be correlated with human health, and to develop new technological and bioinformatics tools to support these goals.^[93]

Another medical study, part of the MetaHIT (Metagenomics of the Human Intestinal Tract) project, examined 124 individuals from Denmark and Spain, comprising healthy, overweight, and inflammatory bowel disease patients. The study attempted to categorize the depth and phylogenetic diversity of gastrointestinal bacteria. Using Illumina GA sequence data and SOAPdenovo, a de Bruijn graph-based tool specifically designed for assembling short reads, the researchers generated 6.58 million contigs longer than 500 bp, with a total contig length of 10.3 Gb and an N50 length of 2.2 kb.
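
The N50 statistic quoted for this assembly can be computed from a list of contig lengths. The following is a minimal Python sketch using the usual definition of N50; the function name and the `min_length` parameter are illustrative and do not come from the study or from SOAPdenovo.

```python
def n50(contig_lengths, min_length=0):
    """Return the N50 of an assembly.

    N50 is the length L such that contigs of length >= L together
    account for at least half of the total assembled length.
    Contigs not longer than `min_length` are ignored, mirroring the
    common practice of reporting statistics above a size cutoff.
    """
    lengths = sorted(
        (length for length in contig_lengths if length > min_length),
        reverse=True,
    )
    if not lengths:
        raise ValueError("no contigs above the length cutoff")
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length


# Toy example: total length 54, so half is 27; 10 + 9 + 8 reaches 27.
example = n50([2, 3, 4, 5, 6, 7, 8, 9, 10])
```

Because N50 is weighted by length, a few long contigs can raise it considerably even when most contigs are short, which is why it is usually reported alongside the contig count and total length.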

The study demonstrated that two bacterial divisions, Bacteroidetes and Firmicutes, constitute over 90% of the known phylogenetic categories that dominate distal gut bacteria. Using the relative gene frequencies found within the gut, the researchers identified 1,244 metagenomic clusters that are critically important for the health of the intestinal tract. These clusters carry two types of function: housekeeping functions and functions specific to the intestine. The housekeeping gene clusters are required in all bacteria and are often major players in the main metabolic pathways, including central carbon metabolism and amino acid synthesis. The gut-specific functions include adhesion to host proteins and the harvesting of sugars from globoseries glycolipids. Patients with inflammatory bowel disease were shown to harbour 25% fewer genes and lower bacterial diversity than individuals without the condition, indicating that changes in patients' gut biome diversity may be associated with it.
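
Statements about "lower bacterial diversity" depend on the diversity measure chosen. The text above does not say which measure the study used, so the sketch below assumes the Shannon index, a common choice, and applies it to made-up abundance counts. The function name and the sample values are illustrative only.

```python
import math


def shannon_diversity(counts):
    """Return the Shannon index H = -sum(p_i * ln p_i) for a list of
    per-taxon (or per-gene) abundance counts. Zero counts are skipped."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    h = 0.0
    for count in counts:
        if count > 0:
            p = count / total
            h -= p * math.log(p)
    return h


# Hypothetical samples: an even community versus one dominated by one taxon.
even_sample = [25, 25, 25, 25]
skewed_sample = [85, 5, 5, 5]
h_even = shannon_diversity(even_sample)      # equals ln(4) for four equal taxa
h_skewed = shannon_diversity(skewed_sample)  # lower, since one taxon dominates
```

The index rises with both the number of taxa and the evenness of their abundances, so two samples with the same taxa can still differ in diversity.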

While these studies highlight some potentially valuable medical applications, only 31–48.8% of the reads could be aligned to 194 public human gut bacterial genomes and 7.6–21.2% to bacterial genomes available in GenBank, which indicates that far more research is still needed to capture novel bacterial genomes.^[94]

Infectious disease diagnosis

Differentiating between infectious and non-infectious illness, and identifying the underlying etiology of infection, can be quite challenging. For example, more than half of cases of encephalitis remain undiagnosed, despite extensive testing using state-of-the-art clinical laboratory methods. Metagenomic sequencing shows promise as a sensitive and rapid method to diagnose infection by comparing genetic material found in a patient's sample to a database of thousands of bacteria, viruses, and other pathogens.
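
The idea of comparing reads from a sample against a reference database can be illustrated with a toy k-mer matching scheme. This is a simplified Python sketch under stated assumptions, not a clinical pipeline: production tools use alignment or specialized classifiers against curated databases, and the function names, threshold and sequences below are all hypothetical.

```python
def kmers(seq, k):
    """Return the set of all substrings of length k in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_index(references, k):
    """Map each k-mer to the set of reference names that contain it."""
    index = {}
    for name, genome in references.items():
        for kmer in kmers(genome, k):
            index.setdefault(kmer, set()).add(name)
    return index


def classify_read(read, index, k, min_fraction=0.5):
    """Return the reference sharing the most k-mers with the read, or
    None when the best match covers less than `min_fraction` of the
    read's distinct k-mers (treated here as 'unclassified')."""
    read_kmers = kmers(read, k)
    if not read_kmers:
        return None
    votes = {}
    for kmer in read_kmers:
        for name in index.get(kmer, ()):
            votes[name] = votes.get(name, 0) + 1
    if not votes:
        return None
    best = max(votes, key=votes.get)
    if votes[best] / len(read_kmers) < min_fraction:
        return None
    return best


# Made-up reference sequences standing in for a pathogen database.
references = {
    "virus_A": "ACGTACGTTAGCTAGCTAGGATCC",
    "bacterium_B": "TTGACCGGTTAACCGGATATATCG",
}
index = build_index(references, k=5)
call = classify_read("TAGCTAGCTAGG", index, k=5)   # read drawn from virus_A
unknown = classify_read("GGGGGGGGGG", index, k=5)  # matches nothing
```

Reads that match no reference stay unclassified, which mirrors the limitation noted for sequence-based methods in general: results are only as good as the database being searched.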

References

^ ^a^bHugenholtz, P; Goebel BM; Pace NR (1 September 1998). 'Impact of Culture-Independent Studies on the Emerging Phylogenetic View of Bacterial Diversity'. J. Bacteriol. 180 (18): 4765–74. PMC107498. PMID9733676.
^Marco, D, ed. (2011). Metagenomics: Current Innovations and Future Trends. Caister Academic Press. ISBN978-1-904455-87-5.
^Eisen, JA (2007). 'Environmental Shotgun Sequencing: Its Potential and Challenges for Studying the Hidden World of Microbes'. PLoS Biology. 5 (3): e82. doi:10.1371/journal.pbio.0050082. PMC1821061. PMID17355177.
^Handelsman, J.; Rondon, M. R.; Brady, S. F.; Clardy, J.; Goodman, R. M. (1998). 'Molecular biological access to the chemistry of unknown soil microbes: A new frontier for natural products'. Chemistry & Biology. 5 (10): R245–R249. doi:10.1016/S1074-5521(98)90108-9. PMID9818143.
^Chen, K.; Pachter, L. (2005). 'Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities'. PLoS Computational Biology. 1 (2): 106–12. Bibcode:2005PLSCB..1..24C. doi:10.1371/journal.pcbi.0010024. PMC1185649. PMID16110337.
^Lane, DJ; Pace B; Olsen GJ; Stahl DA; Sogin ML; Pace NR (1985). 'Rapid determination of 16S ribosomal RNA sequences for phylogenetic analyses'. Proceedings of the National Academy of Sciences. 82 (20): 6955–9. Bibcode:1985PNAS..82.6955L. doi:10.1073/pnas.82.20.6955. PMC391288. PMID2413450.
^Pace, NR; DA Stahl; DJ Lane; GJ Olsen (1985). 'Analyzing natural microbial populations by rRNA sequences'. ASM News. 51: 4–12. Archived from the original on 4 April 2012.
^Schmidt, TM; DeLong, EF; Pace, NR (1991). 'Analysis of a marine picoplankton community by 16S rRNA gene cloning and sequencing'. Journal of Bacteriology. 173 (14): 4371–4378. doi:10.1128/jb.173.14.4371-4378.1991. PMC208098. PMID2066334.
^Healy, FG; RM Ray; HC Aldrich; AC Wilkie; LO Ingram; KT Shanmugam (1995). 'Direct isolation of functional genes encoding cellulases from the microbial consortia in a thermophilic, anaerobic digester maintained on lignocellulose'. Appl. Microbiol. Biotechnol. 43 (4): 667–74. doi:10.1007/BF00164771. PMID7546604.
^Stein, JL; TL Marsh; KY Wu; H Shizuya; EF DeLong (1996). 'Characterization of uncultivated prokaryotes: isolation and analysis of a 40-kilobase-pair genome fragment from a planktonic marine archaeon'. Journal of Bacteriology. 178 (3): 591–599. doi:10.1128/jb.178.3.591-599.1996. PMC177699. PMID8550487.
^Breitbart, M; Salamon P; Andresen B; Mahaffy JM; Segall AM; Mead D; Azam F; Rohwer F (2002). 'Genomic analysis of uncultured marine viral communities'. Proceedings of the National Academy of Sciences of the United States of America. 99 (22): 14250–14255. Bibcode:2002PNAS..9914250B. doi:10.1073/pnas.202488399. PMC137870. PMID12384570.
^ ^a^bTyson, GW; Chapman J; Hugenholtz P; Allen EE; Ram RJ; Richardson PM; Solovyev VV; Rubin EM; Rokhsar DS; Banfield JF (2004). 'Insights into community structure and metabolism by reconstruction of microbial genomes from the environment'. Nature. 428 (6978): 37–43. Bibcode:2004Natur.428..37T. doi:10.1038/nature02340. PMID14961025.
^Hugenholtz, P (2002). 'Exploring prokaryotic diversity in the genomic era'. Genome Biology. 3 (2): 1–8. doi:10.1186/gb-2002-3-2-reviews0003. PMC139013. PMID11864374.
^Thomas, T.; Gilbert, J.; Meyer, F. (2012). 'Metagenomics - a guide from sampling to data analysis'. Microbial Informatics and Experimentation. 2 (1): 3. doi:10.1186/2042-5783-2-3. PMC3351745. PMID22587947.
^Venter, JC; Remington K; Heidelberg JF; Halpern AL; Rusch D; Eisen JA; Wu D; Paulsen I; Nelson KE; Nelson W; Fouts DE; Levy S; Knap AH; Lomas MW; Nealson K; White O; Peterson J; Hoffman J; Parsons R; Baden-Tillson H; Pfannkoch C; Rogers Y; Smith HO (2004). 'Environmental Genome Shotgun Sequencing of the Sargasso Sea'. Science. 304 (5667): 66–74. Bibcode:2004Sci..304..66V. CiteSeerX10.1.1.124.1840. doi:10.1126/science.1093857. PMID15001713.
^Yooseph, Shibu; Kenneth H. Nealson; Douglas B. Rusch; John P. McCrow; Christopher L. Dupont; Maria Kim; Justin Johnson; Robert Montgomery; Steve Ferriera; Karen Beeson; Shannon J. Williamson; Andrey Tovchigrechko; Andrew E. Allen; Lisa A. Zeigler; Granger Sutton; Eric Eisenstadt; Yu-Hui Rogers; Robert Friedman; Marvin Frazier; J. Craig Venter (4 November 2010). 'Genomic and functional adaptation in surface ocean planktonic prokaryotes'. Nature. 468 (7320): 60–66. Bibcode:2010Natur.468..60Y. doi:10.1038/nature09530. ISSN0028-0836. PMID21048761.
^ ^a^b^cPoinar, HN; Schwarz, C; Qi, J; Shapiro, B; Macphee, RD; Buigues, B; Tikhonov, A; Huson, D; Tomsho, LP; Auch, A; Rampp, M; Miller, W; Schuster, SC (2006). 'Metagenomics to Paleogenomics: Large-Scale Sequencing of Mammoth DNA'. Science. 311 (5759): 392–394. Bibcode:2006Sci..311.392P. doi:10.1126/science.1123360. PMID16368896.
^Edwards, RA; Rodriguez-Brito B; Wegley L; Haynes M; Breitbart M; Peterson DM; Saar MO; Alexander S; Alexander EC; Rohwer F (2006). 'Using pyrosequencing to shed light on deep mine microbial ecology'. BMC Genomics. 7: 57. doi:10.1186/1471-2164-7-57. PMC1483832. PMID16549033.
^Beja, O.; Suzuki, MT; Koonin, EV; Aravind, L; Hadd, A; Nguyen, LP; Villacorta, R; Amjadi, M; Garrigues, C (2000). 'Construction and analysis of bacterial artificial chromosome libraries from a marine microbial assemblage'. Environmental Microbiology. 2 (5): 516–29. doi:10.1046/j.1462-2920.2000.00133.x. PMID11233160.
^ ^a^bSegata, Nicola; Daniela Boernigen; Timothy L Tickle; Xochitl C Morgan; Wendy S Garrett; Curtis Huttenhower (2013). 'Computational meta'omics for microbial community studies'. Molecular Systems Biology. 9 (666): 666. doi:10.1038/msb.2013.22. PMC4039370. PMID23670539.
^Watson, Mick; Roehe, Rainer; Walker, Alan W.; Dewhurst, Richard J.; Snelling, Timothy J.; Ivan Liachko; Langford, Kyle W.; Press, Maximilian O.; Wiser, Andrew H. (28 February 2018). 'Assembly of 913 microbial genomes from metagenomic sequencing of the cow rumen'. Nature Communications. 9 (1): 870. doi:10.1038/s41467-018-03317-6. ISSN2041-1723. PMC5830445. PMID29491419.
^Rodrigue, S. B.; Materna, A. C.; Timberlake, S. C.; Blackburn, M. C.; Malmstrom, R. R.; Alm, E. J.; Chisholm, S. W. (2010). Gilbert, Jack Anthony (ed.). 'Unlocking Short Read Sequencing for Metagenomics'. PLoS ONE. 5 (7): e11840. Bibcode:2010PLoSO..511840R. doi:10.1371/journal.pone.0011840. PMC2911387. PMID20676378.
^Schuster, S. C. (2007). 'Next-generation sequencing transforms today's biology'. Nature Methods. 5 (1): 16–18. doi:10.1038/nmeth1156. PMID18165802.
^'Metagenomics versus Moore's law'. Nature Methods. 6 (9): 623. 2009. doi:10.1038/nmeth0909-623.
^ ^a^b^c^d^e^fWooley, J. C.; Godzik, A.; Friedberg, I. (2010). Bourne, Philip E. (ed.). 'A Primer on Metagenomics'. PLoS Computational Biology. 6 (2): e1000667. Bibcode:2010PLSCB..6E0667W. doi:10.1371/journal.pcbi.1000667. PMC2829047. PMID20195499.
^ ^a^bHess, Matthias; Alexander Sczyrba; Rob Egan; Tae-Wan Kim; Harshal Chokhawala; Gary Schroth; Shujun Luo; Douglas S Clark; Feng Chen; Tao Zhang; Roderick I Mackie; Len A Pennacchio; Susannah G Tringe; Axel Visel; Tanja Woyke; Zhong Wang; Edward M Rubin (28 January 2011). 'Metagenomic discovery of biomass-degrading genes and genomes from cow rumen'. Science. 331 (6016): 463–467. Bibcode:2011Sci..331.463H. doi:10.1126/science.1200387. ISSN1095-9203. PMID21273488.
^Qin, Junjie; Ruiqiang Li; Jeroen Raes; Manimozhiyan Arumugam; Kristoffer Solvsten Burgdorf; Chaysavanh Manichanh; Trine Nielsen; Nicolas Pons; Florence Levenez; Takuji Yamada; Daniel R. Mende; Junhua Li; Junming Xu; Shaochuan Li; Dongfang Li; Jianjun Cao; Bo Wang; Huiqing Liang; Huisong Zheng; Yinlong Xie; Julien Tap; Patricia Lepage; Marcelo Bertalan; Jean-Michel Batto; Torben Hansen; Denis Le Paslier; Allan Linneberg; H. Bjorn Nielsen; Eric Pelletier; Pierre Renault; Thomas Sicheritz-Ponten; Keith Turner; Hongmei Zhu; Chang Yu; Shengting Li; Min Jian; Yan Zhou; Yingrui Li; Xiuqing Zhang; Songgang Li; Nan Qin; Huanming Yang; Jian Wang; Soren Brunak; Joel Dore; Francisco Guarner; Karsten Kristiansen; Oluf Pedersen; Julian Parkhill; Jean Weissenbach; Peer Bork; S. Dusko Ehrlich; Jun Wang (4 March 2010). 'A human gut microbial gene catalogue established by metagenomic sequencing'. Nature. 464 (7285): 59–65. Bibcode:2010Natur.464..59. doi:10.1038/nature08821. ISSN0028-0836. PMC3779803. PMID20203603.
^ ^a^bPaulson, Joseph; O Colin Stine; Hector Corrada Bravo; Mihai Pop (2013). 'Differential abundance analysis for microbial marker-gene surveys'. Nature Methods. 10 (12): 1200–1202. doi:10.1038/nmeth.2658. PMC4010126. PMID24076764.
^ ^a^b^c^d^e^f^gCommittee on Metagenomics: Challenges and Functional Applications, National Research Council (2007). The New Science of Metagenomics: Revealing the Secrets of Our Microbial Planet. Washington, D.C.: The National Academies Press. doi:10.17226/11902. ISBN978-0-309-10676-4. PMID21678629.
^Oulas, A; Pavloudi, C; Polymenakou, P; Pavlopoulos, GA; Papanikolaou, N; Kotoulas, G; Arvanitidis, C; Iliopoulos, I (2015). 'Metagenomics: tools and insights for analyzing next-generation sequencing data derived from biodiversity studies'. Bioinformatics and Biology Insights. 9: 75–88. doi:10.4137/BBI.S12462. PMC4426941. PMID25983555.
^Mende, Daniel R.; Alison S. Waller; Shinichi Sunagawa; Aino I. Järvelin; Michelle M. Chan; Manimozhiyan Arumugam; Jeroen Raes; Peer Bork (23 February 2012). 'Assessment of Metagenomic Assembly Using Simulated Next Generation Sequencing Data'. PLoS ONE. 7 (2): e31386. Bibcode:2012PLoSO..731386M. doi:10.1371/journal.pone.0031386. ISSN1932-6203. PMC3285633. PMID22384016.
^Balzer, S.; Malde, K.; Grohme, M. A.; Jonassen, I. (2013). 'Filtering duplicate reads from 454 pyrosequencing data'. Bioinformatics. 29 (7): 830–836. doi:10.1093/bioinformatics/btt047. PMC3605598. PMID23376350.
^Mohammed, MH; Sudha Chadaram; Dinakar Komanduri; Tarini Shankar Ghosh; Sharmila S Mande (2011). 'Eu-Detect: an algorithm for detecting eukaryotic sequences in metagenomic data sets'. Journal of Biosciences. 36 (4): 709–717. doi:10.1007/s12038-011-9105-2. PMID21857117.
^Schmieder, R; Edwards, R (2011). 'Fast identification and removal of sequence contamination from genomic and metagenomic datasets'. PLoS ONE. 6 (3): e17288. Bibcode:2011PLoSO..617288S. doi:10.1371/journal.pone.0017288. PMC3052304. PMID21408061.
^ ^a^b^c^d^eKunin, V.; Copeland, A.; Lapidus, A.; Mavromatis, K.; Hugenholtz, P. (2008). 'A Bioinformatician's Guide to Metagenomics'. Microbiology and Molecular Biology Reviews. 72 (4): 557–578. doi:10.1128/MMBR.00009-08. PMC2593568. PMID19052320.
^Burton, J. N.; Liachko, I.; Dunham, M. J.; Shendure, J. (2014). 'Species-Level Deconvolution of Metagenome Assemblies with Hi-C-Based Contact Probability Maps'. G3: Genes, Genomes, Genetics. 4 (7): 1339–1346. doi:10.1534/g3.114.011825. PMC4455782. PMID24855317.
^ ^a^bHuson, Daniel H; S. Mitra; N. Weber; H. Ruscheweyh; Stephan C. Schuster (June 2011). 'Integrative analysis of environmental sequences using MEGAN4'. Genome Research. 21 (9): 1552–1560. doi:10.1101/gr.120618.111. PMC3166839. PMID21690186.
^Zhu, Wenhan; Lomsadze Alex; Borodovsky Mark (2010). 'Ab initio gene identification in metagenomic sequences'. Nucleic Acids Research. 38 (12): e132. doi:10.1093/nar/gkq275. PMC2896542. PMID20403810.
^Hug, Laura A.; Baker, Brett J.; Anantharaman, Karthik; Brown, Christopher T.; Probst, Alexander J.; Castelle, Cindy J.; Butterfield, Cristina N.; Hernsdorf, Alex W.; Amano, Yuki; Ise, Kotaro; Suzuki, Yohey; Dudek, Natasha; Relman, David A.; Finstad, Kari M.; Amundson, Ronald; Thomas, Brian C.; Banfield, Jillian F. (11 April 2016). 'A new view of the tree of life'. Nature Microbiology. 1 (5): 16048. doi:10.1038/nmicrobiol.2016.48. PMID27572647.
^Konopka, A. (2009). 'What is microbial community ecology?'. The ISME Journal. 3 (11): 1223–1230. doi:10.1038/ismej.2009.88. PMID19657372.
^ ^a^bHuson, Daniel H; A. Auch; Ji Qi; Stephan C Schuster (January 2007). 'MEGAN Analysis of Metagenomic Data'. Genome Research. 17 (3): 377–386. doi:10.1101/gr.5969107. PMC1800929. PMID17255551.
^Segata, Nicola; Levi Waldron; Annalisa Ballarini; Vagheesh Narasimhan; Olivier Jousson; Curtis Huttenhower (2012). 'Metagenomic microbial community profiling using unique clade-specific marker genes'. Nature Methods. 9 (8): 811–814. doi:10.1038/nmeth.2066. PMC3443552. PMID22688413.
^Sunagawa, Shinichi; et al. (2013). 'Metagenomic species profiling using universal phylogenetic marker genes'. Nature Methods. 10 (12): 1196–1199. doi:10.1038/nmeth.2693. PMID24141494.
^ ^a^bMilanese, Alessio; et al. (2019). 'Microbial abundance, activity and population genomic profiling with mOTUs2'. Nature Communications. 10 (1): 1014. doi:10.1038/s41467-019-08844-4. PMC6399450. PMID30833550.
^Liu, Bo; et al. (2011). 'Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences'. BMC Genomics. 12: S4. doi:10.1186/1471-2164-12-S2-S4. PMC3194235. PMID21989143.
^Dadi, Temesgen Hailemariam; Renard, Bernhard Y.; Wieler, Lothar H.; Semmler, Torsten; Reinert, Knut (2017). 'SLIMM: species level identification of microorganisms from metagenomes'. PeerJ. 5: e3138. doi:10.7717/peerj.3138. ISSN2167-8359. PMC5372838. PMID28367376.
^Pagani, Ioanna; Konstantinos Liolios; Jakob Jansson; I-Min A Chen; Tatyana Smirnova; Bahador Nosrat; Victor M Markowitz; Nikos C Kyrpides (1 December 2011). 'The Genomes OnLine Database (GOLD) v.4: status of genomic and metagenomic projects and their associated metadata'. Nucleic Acids Research. 40 (1): D571–9. doi:10.1093/nar/gkr1100. ISSN1362-4962. PMC3245063. PMID22135293.
^Meyer, F; Paarmann D; D'Souza M; Olson R; Glass EM; Kubal M; Paczian T; Rodriguez A; Stevens R; Wilke A; Wilkening J; Edwards RA (2008). 'The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes'. BMC Bioinformatics. 9: 386. doi:10.1186/1471-2105-9-386. PMC2563014. PMID18803844.
^Markowitz, V. M.; Chen, I. -M. A.; Chu, K.; Szeto, E.; Palaniappan, K.; Grechkin, Y.; Ratner, A.; Jacob, B.; Pati, A.; Huntemann, M.; Liolios, K.; Pagani, I.; Anderson, I.; Mavromatis, K.; Ivanova, N. N.; Kyrpides, N. C. (2011). 'IMG/M: The integrated metagenome data management and comparative analysis system'. Nucleic Acids Research. 40 (Database issue): D123–D129. doi:10.1093/nar/gkr975. PMC3245048. PMID22086953.
^ ^a^bMitra, Suparna; Paul Rupek; Daniel C Richter; Tim Urich; Jack A Gilbert; Folker Meyer; Andreas Wilke; Daniel H Huson (2011). 'Functional analysis of metagenomes and metatranscriptomes using SEED and KEGG'. BMC Bioinformatics. 12 Suppl 1: S21. doi:10.1186/1471-2105-12-S1-S21. ISSN1471-2105. PMC3044276. PMID21342551.
^Benson, Dennis; Mark Cavanaugh; Karen Clark; et al. (2013). 'Genbank'. Nucleic Acids Research. 41 (Database issue): D36–D42. doi:10.1093/nar/gks1195. PMC3531190. PMID23193287.
^Bazinet, Adam; Michael Cummings (2012). 'A comparative evaluation of sequence classification programs'. BMC Bioinformatics. 13: 92. doi:10.1186/1471-2105-13-92. PMC3428669. PMID22574964.
^Ounit, Rachid; Steve Wanamaker; Timothy Close; Stefano Lonardi (2015). 'CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers'. BMC Genomics. 16: 236. doi:10.1186/s12864-015-1419-2. PMC4428112. PMID25879410.
^Pratas D; Pinho AJ; Silva RM; Rodrigues JMOS; Hosseini M; Caetano T; Ferreira PJSG (February 2018). 'FALCON: a method to infer metagenomic composition of ancient DNA'. bioRxiv267179.
^Kurokawa, Ken; Takehiko Itoh; Tomomi Kuwahara; Kenshiro Oshima; Hidehiro Toh; Atsushi Toyoda; Hideto Takami; Hidetoshi Morita; Vineet K. Sharma; Tulika P. Srivastava; Todd D. Taylor; Hideki Noguchi; Hiroshi Mori; Yoshitoshi Ogura; Dusko S. Ehrlich; Kikuji Itoh; Toshihisa Takagi; Yoshiyuki Sakaki; Tetsuya Hayashi; Masahira Hattori (1 January 2007). 'Comparative Metagenomics Revealed Commonly Enriched Gene Sets in Human Gut Microbiomes'. DNA Research. 14 (4): 169–181. doi:10.1093/dnares/dsm018. PMC2533590. PMID17916580. Retrieved 18 December 2011.
^ ^a^b^c^d^e^fSimon, C.; Daniel, R. (2010). 'Metagenomic Analyses: Past and Future Trends'. Applied and Environmental Microbiology. 77 (4): 1153–1161. doi:10.1128/AEM.02345-10. PMC3067235. PMID21169428.
^Willner, D; RV Thurber; F Rohwer (2009). 'Metagenomic signatures of 86 microbial and viral metagenomes'. Environmental Microbiology. 11 (7): 1752–66. doi:10.1111/j.1462-2920.2009.01901.x. PMID19302541.
^Ghosh, Tarini Shankar; Monzoorul Haque Mohammed; Hannah Rajasingh; Sudha Chadaram; Sharmila S Mande (2011). 'HabiSign: a novel approach for comparison of metagenomes and rapid identification of habitat-specific sequences'. BMC Bioinformatics. 12 (Supplement 13): S9. doi:10.1186/1471-2105-12-s13-s9. PMC3278849. PMID22373355.
^Fimereli, D.; Detours, V.; Konopka, T. (13 February 2013). 'TriageTools: tools for partitioning and prioritizing analysis of high-throughput sequencing data'. Nucleic Acids Research. 41 (7): e86. doi:10.1093/nar/gkt094. PMC3627586. PMID23408855.
^Maillet, Nicolas; Lemaitre, Claire; Chikhi, Rayan; Lavenier, Dominique; Peterlongo, Pierre (2012). 'Compareads: comparing huge metagenomic experiments'. BMC Bioinformatics. 13 (Suppl 19): S10. doi:10.1186/1471-2105-13-S19-S10. PMC3526429. PMID23282463.
^Bhusan, Kuntal Kumar; Tarini Shankar Ghosh; Sharmila S Mande (2013). 'Community-analyzer: a platform for visualizing and comparing microbial community structure across microbiomes'. Genomics. 102 (4): 409–418. doi:10.1016/j.ygeno.2013.08.004. PMID23978768.
^Werner, Jeffrey J.; Dan Knights; Marcelo L. Garcia; Nicholas B. Scalfone; Samual Smith; Kevin Yarasheski; Theresa A. Cummings; Allen R. Beers; Rob Knight; Largus T. Angenent (8 March 2011). 'Bacterial community structures are unique and resilient in full-scale bioenergy systems'. Proceedings of the National Academy of Sciences of the United States of America. 108 (10): 4158–4163. Bibcode:2011PNAS.108.4158W. doi:10.1073/pnas.1015676108. ISSN0027-8424. PMC3053989. PMID21368115.
^McInerney, Michael J.; Jessica R. Sieber; Robert P. Gunsalus (December 2009). 'Syntrophy in Anaerobic Global Carbon Cycles'. Current Opinion in Biotechnology. 20 (6): 623–632. doi:10.1016/j.copbio.2009.10.001. ISSN0958-1669. PMC2790021. PMID19897353.
^Klitgord, N.; Segrè, D. (2011). 'Ecosystems biology of microbial metabolism'. Current Opinion in Biotechnology. 22 (4): 541–546. doi:10.1016/j.copbio.2011.04.018. PMID21592777.
^Leininger, S.; Urich, T.; Schloter, M.; Schwark, L.; Qi, J.; Nicol, G. W.; Prosser, J. I.; Schuster, S. C.; Schleper, C. (2006). 'Archaea predominate among ammonia-oxidizing prokaryotes in soils'. Nature. 442 (7104): 806–809. Bibcode:2006Natur.442.806L. doi:10.1038/nature04983. PMID16915287.
^Paez-Espino, D; Eloe-Fadrosh, EA; Pavlopoulos, GA; Thomas, AD; Huntemann, M; Mikhailova, N; Rubin, E; Ivanova, NN; Kyrpides, NC (25 August 2016). 'Uncovering Earth's virome'. Nature. 536 (7617): 425–30. Bibcode:2016Natur.536.425P. doi:10.1038/nature19094. PMID27533034.
^Paez-Espino, D; Chen, IA; Palaniappan, K; Ratner, A; Chu, K; Szeto, E; Pillay, M; Huang, J; Markowitz, VM; Nielsen, T; Huntemann, M; K Reddy, TB; Pavlopoulos, GA; Sullivan, MB; Campbell, BJ; Chen, F; McMahon, K; Hallam, SJ; Denef, V; Cavicchioli, R; Caffrey, SM; Streit, WR; Webster, J; Handley, KM; Salekdeh, GH; Tsesmetzis, N; Setubal, JC; Pope, PB; Liu, WT; Rivers, AR; Ivanova, NN; Kyrpides, NC (4 January 2017). 'IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses'. Nucleic Acids Research. 45 (D1): D457–D465. doi:10.1093/nar/gkw1030. PMC5210529. PMID27799466.
^Paez-Espino D, Roux S, Chen IA, Palaniappan K, Ratner A, Chu K, et al. (2018). 'IMG/VR v.2.0: an integrated data management and analysis system for cultivated and environmental viral genomes'. Nucleic Acids Res. 47 (D1): D678–D686. doi:10.1093/nar/gky1127. PMC6323928. PMID30407573.
^Paez-Espino, D; Pavlopoulos, GA; Ivanova, NN; Kyrpides, NC (August 2017). 'Nontargeted virus sequence discovery pipeline and virus clustering for metagenomic data'. Nature Protocols. 12 (8): 1673–1682. doi:10.1038/nprot.2017.063. PMID28749930.
^Kristensen, DM; Mushegian AR; Dolja VV; Koonin EV (2009). 'New dimensions of the virus world discovered through metagenomics'. Trends in Microbiology. 18 (1): 11–19. doi:10.1016/j.tim.2009.11.003. PMC3293453. PMID19942437.
^Kerepesi, Csaba; Grolmusz, Vince (2016). 'Giant Viruses of the Kutch Desert'. Archives of Virology. 161 (3): 721–724. arXiv:1410.1278. doi:10.1007/s00705-015-2720-8. PMID26666442.
^Kerepesi, Csaba; Grolmusz, Vince (2017). 'The 'Giant Virus Finder' Discovers an Abundance of Giant Viruses in the Antarctic Dry Valleys'. Archives of Virology. 162 (6): 1671–1676. arXiv:1503.05575. doi:10.1007/s00705-017-3286-4. PMID28247094.
^Jansson, Janet (2011). 'Towards 'Tera-Terra': Terabase Sequencing of Terrestrial Metagenomes'. Microbe. 6 (7). p. 309. Archived from the original on 31 March 2012.
^Vogel, T. M.; Simonet, P.; Jansson, J. K.; Hirsch, P. R.; Tiedje, J. M.; Van Elsas, J. D.; Bailey, M. J.; Nalin, R.; Philippot, L. (2009). 'TerraGenome: A consortium for the sequencing of a soil metagenome'. Nature Reviews Microbiology. 7 (4): 252. doi:10.1038/nrmicro2119.
^'TerraGenome Homepage'. TerraGenome international sequencing consortium. Retrieved 30 December 2011.
^ ^a^bCommittee on Metagenomics: Challenges and Functional Applications, National Research Council (2007). Understanding Our Microbial Planet: The New Science of Metagenomics(PDF). The National Academies Press.
^Charles T (2010). 'The Potential for Investigation of Plant-microbe Interactions Using Metagenomics Methods'. Metagenomics: Theory, Methods and Applications. Caister Academic Press. ISBN978-1-904455-54-7.
^Bringel, Françoise; Couée, Ivan (22 May 2015). 'Pivotal roles of phyllosphere microorganisms at the interface between plant functioning and atmospheric trace gas dynamics'. Frontiers in Microbiology. 6: 486. doi:10.3389/fmicb.2015.00486. PMC4440916. PMID26052316.
^Li, Luen-Luen; Sean R McCorkle; Sebastien Monchy; Safiyh Taghavi; Daniel van der Lelie (18 May 2009). 'Bioprospecting metagenomes: glycosyl hydrolases for converting biomass'. Biotechnology for Biofuels. 2: 10. doi:10.1186/1754-6834-2-10. ISSN1754-6834. PMC2694162. PMID19450243.
^Jaenicke, Sebastian; Christina Ander; Thomas Bekel; Regina Bisdorf; Marcus Dröge; Karl-Heinz Gartemann; Sebastian Jünemann; Olaf Kaiser; Lutz Krause; Felix Tille; Martha Zakrzewski; Alfred Pühler; Andreas Schlüter; Alexander Goesmann (26 January 2011). Aziz, Ramy K (ed.). 'Comparative and Joint Analysis of Two Metagenomic Datasets from a Biogas Fermenter Obtained by 454-Pyrosequencing'. PLoS ONE. 6 (1): e14519. Bibcode:2011PLoSO..614519J. doi:10.1371/journal.pone.0014519. PMC3027613. PMID21297863.
^Suen, Garret; Jarrod J Scott; Frank O Aylward; Sandra M Adams; Susannah G Tringe; Adrián A Pinto-Tomás; Clifton E Foster; Markus Pauly; Paul J Weimer; Kerrie W Barry; Lynne A Goodwin; Pascal Bouffard; Lewyn Li; Jolene Osterberger; Timothy T Harkins; Steven C Slater; Timothy J Donohue; Cameron R Currie (September 2010). Sonnenburg, Justin (ed.). 'An insect herbivore microbiome with high plant biomass-degrading capacity'. PLoS Genetics. 6 (9): e1001129. doi:10.1371/journal.pgen.1001129. ISSN1553-7404. PMC2944797. PMID20885794.
^Simon, C.; Daniel, R. (2009). 'Achievements and new knowledge unraveled by metagenomic approaches'. Applied Microbiology and Biotechnology. 85 (2): 265–276. doi:10.1007/s00253-009-2233-z. PMC2773367. PMID19760178.
^Wong D (2010). 'Applications of Metagenomics for Industrial Bioproducts'. Metagenomics: Theory, Methods and Applications. Caister Academic Press. ISBN978-1-904455-54-7.
^ ^a^bSchloss, Patrick D; Jo Handelsman (June 2003). 'Biotechnological prospects from metagenomics'(PDF). Current Opinion in Biotechnology. 14 (3): 303–310. doi:10.1016/S0958-1669(03)00067-3. ISSN0958-1669. PMID12849784. Retrieved 3 January 2012.
^ ^a^b^cKakirde, Kavita S.; Larissa C. Parsley; Mark R. Liles (1 November 2010). 'Size Does Matter: Application-driven Approaches for Soil Metagenomics'. Soil Biology & Biochemistry. 42 (11): 1911–1923. doi:10.1016/j.soilbio.2010.07.021. ISSN0038-0717. PMC2976544. PMID21076656.
^Parachin, Nádia Skorupa; Marie F Gorwa-Grauslund (2011). 'Isolation of xylose isomerases by sequence- and function-based screening from a soil metagenomic library'. Biotechnology for Biofuels. 4 (1): 9. doi:10.1186/1754-6834-4-9. ISSN1754-6834. PMC3113934. PMID21545702.
^Hover BM, Kim S, Katz M, Charlop-Powers Z, Owen JG, Ternei MA, et al. (12 February 2018). 'Culture-independent discovery of the malacidins as calcium-dependent antibiotics with activity against multidrug-resistant Gram-positive pathogens'. Nature Microbiology. 3 (4): 415–422. doi:10.1038/s41564-018-0110-1. PMC5874163. PMID29434326.
^Raes, J.; Letunic, I.; Yamada, T.; Jensen, L. J.; Bork, P. (2011). 'Toward molecular trait-based ecology through integration of biogeochemical, geographical and metagenomic data'. Molecular Systems Biology. 7: 473. doi:10.1038/msb.2011.6. PMC3094067. PMID21407210.
^Lavery, T. J.; Roudnew, B.; Seymour, J.; Mitchell, J. G.; Jeffries, T. (2012). Steinke, Dirk (ed.). 'High Nutrient Transport and Cycling Potential Revealed in the Microbial Metagenome of Australian Sea Lion (Neophoca cinerea) Faeces'. PLoS ONE. 7 (5): e36478. Bibcode:2012PLoSO..736478L. doi:10.1371/journal.pone.0036478. PMC3350522. PMID22606263.
^'What's Swimming In The River? Just Look For DNA'. NPR.org. 24 July 2013. Retrieved 10 October 2014.
^George I; et al. (2010). 'Application of Metagenomics to Bioremediation'. Metagenomics: Theory, Methods and Applications. Caister Academic Press. ISBN978-1-904455-54-7.
^Zimmer, Carl (13 July 2010). 'How Microbes Defend and Define Us'. New York Times. Retrieved 29 December 2011.
^Nelson KE and White BA (2010). 'Metagenomics and Its Applications to the Study of the Human Microbiome'. Metagenomics: Theory, Methods and Applications. Caister Academic Press. ISBN978-1-904455-54-7.
^Qin, Junjie; Ruiqiang Li; Jeroen Raes; Manimozhiyan Arumugam; Kristoffer Solvesten Burgdorf (March 2010). 'A human gut microbial gene catalogue established by metagenomic sequencing'. Nature. 464 (7285): 59–65. Bibcode:2010Natur.464..59. doi:10.1038/nature08821. PMC3779803. PMID20203603.

External links

Focus on Metagenomics at Nature Reviews Microbiology journal website
The “Critical Assessment of Metagenome Interpretation” (CAMI) initiative to evaluate methods in metagenomics

Retrieved from 'https://en.wikipedia.org/w/index.php?title=Metagenomics&oldid=899005851'