DOE Joint Genome Institute


June 12, 2017

Uncovered: 1000 New Microbial Genomes

Potential biotech applications seen with release of 1,003 reference bacterial and archaeal genomes.  

The release of 1,003 phylogenetically diverse bacterial and archaeal reference genomes, the single largest release to date, is part of the DOE JGI’s Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative. (Zosia Rostomian, Berkeley Lab Creative Services.)


The number of microbes in a handful of soil exceeds the number of stars in the Milky Way galaxy, but researchers know less about what’s on Earth because they have only recently had the tools to deeply explore what is just underfoot. Now scientists at the U.S. Department of Energy Joint Genome Institute (DOE JGI), a DOE Office of Science User Facility, have taken a decisive step forward in uncovering the planet’s microbial diversity. In a paper published June 12, 2017 in Nature Biotechnology, DOE JGI’s Prokaryotic Super Program head Nikos Kyrpides and his team of researchers report the release of 1,003 phylogenetically diverse bacterial and archaeal reference genomes—the single largest release to date.

“Bacteria and archaea comprise the largest amount of biodiversity of free-living organisms on Earth,” said Kyrpides, senior author of the paper. “They have already conquered every environment on the planet, so they have found ways to survive under the harshest of conditions with different enzymes and with different biochemistry.”

The U.S. Department of Energy is interested in learning more about this biodiversity because microbes play important roles in regulating Earth’s biogeochemical cycles—processes that govern nutrient circulation in terrestrial and marine environments, for example. Uncovering the functions of genes, enzymes and metabolic pathways through genome sequencing and analysis has wide applications in the fields of bioenergy, biomedicine, agriculture and environmental sciences.

New Functions, New Applications

The effort is part of the DOE JGI’s Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative that aims to sequence thousands of bacterial and archaeal genomes to fill in unexplored branches of the tree of life. “In addition to identifying over half a million new protein families, this effort has more than doubled the coverage of phylogenetic diversity of all type strains with genome sequences,” said Supratim Mukherjee, a DOE JGI computational biologist and co-first author of the paper.

Since a great portion of research in microbial genomics has been focused on human pathogens or biotechnological workhorses, GEBA is the main effort worldwide attempting to address the phylogenetic coverage knowledge gap by sequencing a diverse set of cultured but poorly characterized microbial type strains. “It was recognized that we weren’t sampling many parts of the tree of life,” said Rekha Seshadri, a DOE JGI computational biologist and co-first author of the paper. “And if we sampled some of those parts of the tree, we’d discover new functions, which could be an important resource for new applications.”

The release of these genomes is the culmination of almost a decade’s worth of work, with the first 56 GEBA genomes published in 2009. The microorganisms were isolated from environments ranging from seawater and soil to plants, cow rumen and termite guts. Genome sequencing and analysis were done at the DOE JGI through the Community Science Program, and the 1,003 genomes are publicly available through the Integrated Microbial Genomes with Microbiomes (IMG/M) system, with all associated metadata, compliant with Genomic Standards Consortium standards, available through the Genomes OnLine Database. In fact, all these genomes were publicly released immediately after sequencing to maximize their use by the larger scientific community, in accordance with the DOE JGI’s practice of immediate data release, said co-author Tanja Woyke, head of the DOE JGI Microbial Genomics Program, who oversaw the sequencing for the project.

With the release of high quality genomic information from the 1,003 reference genomes, DOE JGI is providing a wealth of new sequences that will be invaluable to scientists interested in experiments such as characterizing biotechnologically relevant secondary metabolites or studying enzymes that work under specific conditions, Seshadri said. And because Kyrpides’ research team sequenced type strains that are readily available from culture collections, scientists can perform follow-up experiments with them in the lab, she added.

“The partnership with culture collection centers, such as the Leibniz Institute DSMZ in Germany and the ATCC Global Bioresource Center in the U.S., was critical in accomplishing this endeavor,” said Kyrpides.

Though it’s evident that bacteria can jumpstart innovations in biotechnology—such as the species Streptococcus pyogenes, which produces the Cas9 protein that functions as the “scissors” in the breakthrough CRISPR-Cas9 gene editing tool—scientists have only just begun to uncover the hidden potential that exists within the wide genetic diversity of bacterial and archaeal phyla.

A Reference Framework to Anchor Data

Jonathan Eisen, a microbiologist at the University of California, Davis who initiated the GEBA project at the DOE JGI in 2007 with Kyrpides and Phil Hugenholtz, and Hans-Peter Klenk at the Leibniz Institute DSMZ, believes that the paper reinforces that having a goal to achieve phylogenetic diversity is a more useful approach than random selection when choosing microbial organisms for sequencing.


Some of the DOE JGI authors on the Nature Biotechnology paper. Front Row: Neha Varghese; co-first author Rekha Seshadri; Emiley Eloe-Fadrosh. Back Row: Tanja Woyke; George Pavlopoulos; David Paez-Espino; senior author Nikos Kyrpides; former DOE JGI microbial ecologist Phil Hugenholtz, now at the University of Queensland; Natalia Ivanova. Top Left Inset: co-first author Supratim Mukherjee.

He said filling out the tree of life will provide researchers with a reference framework with which to understand their own results. “It’s incredibly helpful for interpreting environmental data. For example, if you go and find a fossil bed somewhere and find tons of bones, but if no one had ever assembled skeletons before, it’d be useless,” Eisen said. “But with an assembled skeleton to use as a reference, you can say ‘this looks like a mammal.’ The same is true with metagenomic data: if you have reference genomes from across the tree [of life], you can anchor environmental data much more accurately.”

“At a time when we are witnessing the public databases being flooded by an infusion of low or questionable quality, highly fragmented and chimeric or contaminated genomes, the significance of genomes from the type strains as invaluable taxonomic signposts cannot be overstated,” Kyrpides said.

Collaborators on this work included researchers at the Leibniz Institute DSMZ in Germany, the University of Georgia, Michigan State University, the University of Queensland in Australia and Newcastle University in the United Kingdom.


The U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility at Lawrence Berkeley National Laboratory, is committed to advancing genomics in support of DOE missions related to clean energy generation and environmental characterization and cleanup. JGI provides integrated high-throughput sequencing and computational analysis that enable systems-based scientific approaches to these challenges. Follow @jgi on Twitter.

DOE’s Office of Science is the largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.
