DOE Joint Genome Institute


November 9, 2020

Uncovering Novel Genomes from Earth’s Microbiomes

Genome resource expands known diversity of bacteria and archaea by 44 percent.

Artistic interpretation of how microbial genome sequences from the GEM catalog can help fill in gaps of knowledge about the microbes that play key roles in the Earth’s microbiomes. (Rendered by Zosia Rostomian, Berkeley Lab)

Despite advances in sequencing technologies and computational methods in the past decade, researchers have uncovered genomes for just a small fraction of Earth’s microbial diversity. Because most microbes cannot be cultivated under laboratory conditions, their genomes can’t be sequenced using traditional approaches. Identifying and characterizing the planet’s microbial diversity is key to understanding the roles of microorganisms in regulating nutrient cycles, as well as gaining insights into potential applications they may have in a wide range of research fields.

A public repository of 52,515 microbial draft genomes generated from environmental samples around the world, expanding the known diversity of bacteria and archaea by 44%, is now available and described November 9, 2020 in Nature Biotechnology. Known as the GEM (Genomes from Earth’s Microbiomes) catalog, this work results from a collaboration involving more than 200 scientists, researchers at the U.S. Department of Energy (DOE) Joint Genome Institute (JGI), a DOE Office of Science User Facility located at Lawrence Berkeley National Laboratory (Berkeley Lab), and the DOE Systems Biology Knowledgebase (KBase).

Metagenomics is the study of microbial communities in environmental samples without needing to isolate individual organisms, using various methods for processing, sequencing and analysis. “Using a technique called metagenome binning, we were able to reconstruct thousands of metagenome-assembled genomes (MAGs) directly from sequenced environmental samples without needing to cultivate the microbes in the lab,” noted Stephen Nayfach, the study’s first author and research scientist in Nikos Kyrpides’ Microbiome Data Science group. “What makes this study really stand out from previous efforts is the remarkable environmental diversity of the samples we analyzed.”

Emiley Eloe-Fadrosh, head of the JGI Metagenome Program and senior author on the study, elaborated on Nayfach’s comments. “This study was designed to encompass the broadest and most diverse range of samples and environments, including natural and agricultural soils, human- and animal-host associated, and ocean and other aquatic environments – that’s pretty remarkable.”

The GEM catalog expands the bacterial and archaeal orders as seen on the phylogenetic tree, with new lineages of uncultivated genomes from the GEM catalog (in green) and previously existing reference genomes (in gray). Around the phylogenetic tree, the strip charts indicate if an order is uncultured (blue; represented only by metagenome-assembled genomes or MAGs) or cultured (gray; represented by an isolate genome). The next four strip charts indicate the environmental distribution, while the bar plot indicates the number of genomes from the GEM catalog recovered from each order. (Stephen Nayfach)

Adding Value Beyond Genome Sequences

Much of the data had been generated from environmental samples sequenced by the JGI through the Community Science Program and was already available on the JGI’s Integrated Microbial Genomes & Microbiomes (IMG/M) platform. Eloe-Fadrosh noted that it was a nice example of “big-data” mining to gain a deeper understanding of the data and to enhance its value by making the data publicly available.

To acknowledge the efforts of the investigators who had done the sampling, Eloe-Fadrosh reached out to more than 200 researchers around the world in accordance with the JGI data use policy. “I felt it is important to acknowledge the significant efforts to collect and extract DNA from these samples, many of which come from unique, difficult to access environments, and invited these researchers to be co-authors as part of IMG data consortium,” she said.

Listen to study co-author Dan Udwary talk about genome mining using the GEM catalog in this episode of the JGI Natural Prodcast.

Using this massive dataset, Nayfach clustered the MAGs into 18,000 candidate species groups, 70% of which were novel compared to over 500,000 existing genomes available at that time. “Looking across the tree of life, it’s striking how many uncultivated lineages are only represented by MAGs,” he said. “While these draft genomes are imperfect, they can still reveal a lot about the biology and diversity of uncultured microbes.”
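As a rough illustration of what grouping genomes into candidate species and counting novel groups involves, here is a minimal sketch. It is not the study's pipeline: the similarity scores, the 0.95 cutoff, the single-linkage rule, and all names (`cluster_genomes`, `novel_fraction`, the toy genome IDs) are assumptions made for the example.

```python
from collections import defaultdict


def cluster_genomes(genomes, similarities, threshold=0.95):
    """Group genomes by single linkage: any pair at or above the
    (assumed) similarity threshold ends up in the same group."""
    parent = {g: g for g in genomes}

    def find(g):
        # Follow parent links to the group representative, compressing the path.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for (a, b), sim in similarities.items():
        if sim >= threshold:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    groups = defaultdict(set)
    for g in genomes:
        groups[find(g)].add(g)
    return list(groups.values())


def novel_fraction(groups, reference_ids):
    """Fraction of groups that contain no previously known reference genome."""
    novel = [grp for grp in groups if not (grp & reference_ids)]
    return len(novel) / len(groups)


# Toy data (hypothetical): four MAGs and one existing reference genome.
genomes = ["mag1", "mag2", "mag3", "mag4", "ref1"]
similarities = {
    ("mag1", "mag2"): 0.97,  # same candidate species
    ("mag3", "ref1"): 0.96,  # MAG matches a known genome
    ("mag2", "mag4"): 0.80,  # below the cutoff, stays separate
}

groups = cluster_genomes(genomes, similarities)
fraction = novel_fraction(groups, reference_ids={"ref1"})
print(len(groups), round(fraction, 2))
```

In this toy case, a group counts as novel only if none of its members is a reference genome, which mirrors the idea of lineages "only represented by MAGs."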

Teams of researchers worked on multiple analyses harnessing the genome repository, and the IMG/M team developed several updates and features to mine the GEM catalog. (Watch this IMG webinar on Metagenome Bins to learn more.) One group mined the dataset for novel secondary metabolite biosynthetic gene clusters (BGCs), increasing the number of these BGCs in IMG/ABC (Atlas of Biosynthetic Gene Clusters) by 31%. (Listen to this JGI Natural Prodcast episode on genome mining.) Nayfach also worked with another team on predicting host-virus connections between all viruses in IMG/VR (Virus) and the GEM catalog, associating 81,000 viruses – 70% of which had not already been associated with a host – with 23,000 MAGs.

Modeling A New Path for Metagenomics Researchers

Data in IMG from environmental samples collected at Antarctica’s Dry Valleys were part of the massive dataset used to generate the genomic catalog of Earth’s microbiomes. (Craig Cary, International Centre for Terrestrial Antarctic Research, University of Waikato)

Building upon these resources, KBase, a multi-institutional collaborative knowledge creation and discovery environment designed for biologists and bioinformaticians, developed metabolic models for thousands of MAGs. The models are now available in a public Narrative, which provides shareable, reproducible workflows. “Metabolic modeling is a routine analysis for isolate genomes, but has not been done at scale for uncultivated microbes,” said Eloe-Fadrosh, “and we felt that the collaboration with KBase would add value beyond clustering and analysis of these MAGs.”

“Just bringing this dataset into KBase has immediate value because people can find the high-quality MAGs and use them to inform future analyses,” said José P. Faria, a KBase computational biologist at Argonne National Laboratory. “The process of building a metabolic model is simple: you just select a genome or MAG and press a button to build a model from our database of mappings between biochemical reactions and annotations. We look at what was annotated in the genome and at the resulting model to assess the metabolic capabilities of the organism.” (Watch this KBase webinar on metabolic modeling.)
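The idea of building a draft model from mappings between annotations and biochemical reactions can be sketched in a few lines. This is only a toy illustration and not the KBase API or its reaction database: the mapping table, the gene IDs, and the function name `build_draft_model` are all hypothetical.

```python
# Hypothetical mapping from functional annotations to reactions.
REACTION_MAP = {
    "glucokinase": ["glucose + ATP -> glucose-6-phosphate + ADP"],
    "pyruvate kinase": ["phosphoenolpyruvate + ADP -> pyruvate + ATP"],
}


def build_draft_model(annotations, reaction_map):
    """Collect the reactions implied by a genome's annotations.

    annotations: dict of gene ID -> functional annotation (toy data).
    Returns the mapped reactions plus the genes that had no mapping,
    so gaps in the draft model stay visible.
    """
    reactions = set()
    unmapped = []
    for gene, function in annotations.items():
        mapped = reaction_map.get(function)
        if mapped:
            reactions.update(mapped)
        else:
            unmapped.append(gene)
    return {"reactions": sorted(reactions), "unmapped_genes": sorted(unmapped)}


# Toy annotations for a single MAG (hypothetical).
annotations = {
    "geneA": "glucokinase",
    "geneB": "pyruvate kinase",
    "geneC": "hypothetical protein",
}

model = build_draft_model(annotations, REACTION_MAP)
print(model["reactions"])
print(model["unmapped_genes"])
```

Looking at both the mapped reactions and the unmapped genes corresponds loosely to assessing "what was annotated in the genome" alongside the resulting model.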

KBase User Engagement lead Elisha Wood-Charlson added that by demonstrating the ease with which metabolic models were generated from the GEM dataset, metagenomics researchers might consider branching into this space. “Most metagenomics researchers might not be willing to dive into an entirely new research field [metabolic modeling], but they might be interested in how biochemistry impacts what they work on. The genomics community can now explore metabolism using KBase’s easy path from genomes or MAGs to modeling that may not have been considered,” she said.

A Community Resource for Facilitating Research

Data in IMG from algal samples were used for the study. (Erica Young)

Kostas Konstantinidis of Georgia Institute of Technology, one of the co-authors whose data were part of the catalog, said, “I don’t think there are many institutions that can do this kind of large-scale metagenomics and that have the capacity for large scale analyses. The beauty of this study is that it’s done at this scale that individual labs cannot do, and it gives us new insights into microbial diversity and function.”

He is already finding ways to utilize the catalog in his own research on how microbes respond to climate change. “With this dataset I can see where every microbe is found, and how abundant it is. That’s very useful for my work and for others doing similar research.” Additionally, he’s interested in expanding the diversity of the reference database he’s developing called the Microbial Genomes Atlas to allow for more robust analyses by adding the MAGs.

“This is a great resource for the community,” Konstantinidis added. “It’s a dataset that is going to facilitate many more studies subsequently. And I hope JGI and other institutions continue to do this kind of projects.”

The work also used resources of the National Energy Research Scientific Computing Center (NERSC), another DOE Office of Science User Facility located at Berkeley Lab.

Publication: Nayfach S et al. A Genomic Catalog of Earth’s Microbiomes. Nature Biotechnology. 2020 Nov 9. doi: 10.1038/s41587-020-0718-6.

Behind the Paper: Eloe-Fadrosh E et al. Building a genomic resource across Earth’s biomes for the community.

Byline: Massie S. Ballon

The U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility at Lawrence Berkeley National Laboratory, is committed to advancing genomics in support of DOE missions related to clean energy generation and environmental characterization and cleanup. JGI provides integrated high-throughput sequencing and computational analysis that enable systems-based scientific approaches to these challenges. Follow @jgi on Twitter.

DOE’s Office of Science is the largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.

JGI is a DOE Office of Science User Facility managed by Lawrence Berkeley National Laboratory

© 1997-2023 The Regents of the University of California