DOE Joint Genome Institute

  • About Us
  • Phone Book
  • Contact Us
  • Our Science
    • Our DOE Mission Areas
    • DOE Bioenergy Research Centers
    • Science Programs
    • Science Highlights
    • Scientists at JGI
    TEM image of O. tauri strain RCC4221. Credit: Herve Moreau
    Tiny Green Algae Reveal Large Genomic Variation
    A decade after the complete representative genomes of three Ostreococcus picoplankton groups were sequenced, researchers have sequenced and analyzed the genomes of 13 members of a natural Ostreococcus population.

    Read more

    Genome-wide distribution of fast neutron-induced mutations in the Kitaake rice mutant population. (Guotian Li and Rashmi Jain)
    Mutant Rice Database for Bioenergy Research
    Boosting yields of bioenergy feedstock crops such as grasses requires a functional genomics resource for grass models involved in plant cell wall biosynthesis studies. Using fast-neutron irradiation, rice researchers were able to create the first major, large-scale collection of mutations for grass models.

    Read more

    The oil slick in the Gulf of Mexico from the Deepwater Horizon oil spill as seen on April 29, 2010. (NASA Earth Observatory image created by Jesse Allen, using data provided courtesy of the University of Wisconsin’s Space Science and Engineering Center MODIS Direct Broadcast system)
    Lessons from Simulating A Deep Ocean Oil Spill
    Researchers present the first complete picture of how successive waves of microbial populations degraded the released oil. They were also able to recover high-quality genomes of the key microbial players, and determine the metabolic factors driving the shifts between microbial communities.

    Read more

  • Our Projects
    • Search JGI Project List
    • DOE Metrics/Statistics
    • Approved User Proposals
    • Legacy Projects
    Developing an Ice Plant Gene Atlas
    The ice plant was the first reported halophytic, facultative crassulacean acid metabolism (CAM) species, meaning that the plant can be induced to switch from C3 photosynthesis to CAM following water-deficit or salinity stress treatment.

    Read more

    Mark Dopson of Sweden’s Linnaeus University has a project that deploys DOE JGI’s single cell genomics resources on samples sourced from the deep subsurface, including the Pyhäsalmi mine. (Courtesy of Mark Dopson)
    Exploring Deep Biosphere Microbial Communities
    The data from one of the least understood habitats on Earth will have implications for our understanding of global energy and nutrient cycles, the potential for deep terrestrial disposal of nuclear waste and geo-engineering for CO2 storage, while also providing insights about how life could be sustained on other planets.

    Read more

    Suillus ampliporus, shown here, is part of the genus-wide molecular phylogeny of Suillus project from Nhu Nguyen of the University of California, Berkeley. Suillus fungi tolerate heavy metals, but the protection varies among hosts. (Image by Nhu Nguyen)
    A Suillus Genus Genome Atlas
    As symbiotic ectomycorrhizal fungi (EMF), Suillus spp. play critical roles in the function of forest ecosystems by providing their plant hosts with mineral resources in exchange for photosynthetically fixed carbon.

    Read more

  • Data & Tools
    • IMG
    • Genome Portal
    • MycoCosm
    • Phytozome
    • GOLD
    This graphic depicts the geographic distribution of GOLD biosamples and organisms. Organism location of isolation is marked in pink while Biosample location of collection is denoted with blue dots. Updates to the Genomes OnLine Database (GOLD) are reported in the upcoming Database issue of Nucleic Acids Research. (Image from Supratim Mukherjee et al. Nucl. Acids Res. 2016;nar.gkw992)
    DOE JGI Database of DNA viruses and retroviruses debuts on IMG platform
    In a series of four articles published in the Database issue of the Nucleic Acids Research journal, DOE JGI researchers report on the latest updates to several publicly accessible databases and computational tools that benefit the global community of microbial researchers.

    Read more

    Sakinaw Lake UBC
    Benchmark Study for Improving Microbial Community Profiling
    Researchers used synthetic and natural microbial lake communities to compare the microbial community profiles generated from high throughput short-read sequencing and high throughput long-read sequencing approaches.

    Read more

    Better Microbial Genome Binning with MetaBAT
    An automated tool called MetaBAT that automatically groups large genomic fragments assembled from metagenome sequences to reconstruct single microbial genomes.

    Read more about MetaBAT

  • User Program Info
    • Community Science Program (CSP)
    • Other Programs
    • User Support
    • Product Offerings
    • Submit a Proposal
    JGI-NERSC Microbiome Data Science Call
    The Microbiome Data Science call will enable users to perform state-of-the-art computational genomics and metagenomics research and help them translate sequence information, generated by the DOE JGI or elsewhere, into biological discovery.

    Read more

    Large-scale CSP Call
    The CSP Annual Call is focused on large-scale sequence-based genomic science projects that address questions of relevance to DOE missions in sustainable biofuel production, global carbon cycling, and biogeochemistry.

    Read more

    JGI-EMSL Collaborative Science Call
    The FICUS call between DOE JGI and Environmental Molecular Science Laboratory (EMSL) represents a unique opportunity for researchers to combine the power of genomics and molecular characterization in one proposed research project.

    Read more

  • News & Publications
    • News Releases
    • Blog
    • Publications
    • Scientific Posters
    • Newsletter
    • Logos
    • Photos
    The release of 1,003 phylogenetically diverse bacterial and archaeal reference genomes, the single largest release to date, is part of the DOE JGI’s Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative. (Zosia Rostomian, Berkeley Lab Creative Services.)
    Uncovered: 1000 New Microbial Genomes
    DOE JGI scientists have taken a decisive step forward in uncovering the planet’s microbial diversity, reporting the release of 1,003 phylogenetically diverse bacterial and archaeal reference genomes—the single largest release to date—in Nature Biotechnology.

    Read more

    Scanning electron micrograph (SEM) of Neocallimastix californiae, a representative of the Neocallimastigomycetes, a clade of the early-diverging fungal lineages that are not well-studied. It's one of three Neocallimastigomycetes sequenced and annotated by the DOE JGI for this study. (Chuck Smallwood, PNNL)
    Fungal Enzymes Team Up to More Efficiently Break Down Cellulose
    In Nature Microbiology, a team led by researchers at UC Santa Barbara has found for the first time that early lineages of fungi can form complexes of enzymes capable of degrading plant biomass. The work was enabled by harnessing the capabilities of two U.S. Department of Energy (DOE) Office of Science User Facilities: the DOE JGI and EMSL.

    Read more

    Linderina pennispora ZyGoLife Research Consortium Flickr CC BY-SA 2.0
    Finding A New Major Gene Expression Regulator in Fungi
    DOE JGI scientists report the prevalence of a particular DNA base modification (6mA) in the earliest branches of the fungal kingdom.

    Read more

Our Science
Home › Our Science › Science Programs › Microbial Program › Phylogenetic Diversity

Phylogenetic Diversity

Phylogenetic Tree

Image: Phylogenetic Tree, Chris Rinke

Genome sequencing has revolutionized our understanding of microorganisms and the role they play in important processes, including pathogenesis, energy production, bioremediation, global nutrient cycles; and the origins, evolution, and diversity of life. Currently, there are approximately 15,000 complete or draft genome sequences of Bacteria and Archaea available. These genome sequences show a highly biased phylogenetic distribution when compared to the extent of microbial diversity known today. This bias has resulted in a major gap in our knowledge of microbial genome complexity and our understanding of the evolution, physiology, and metabolic capacity of microbes.

Under the umbrella of the Genomic Encyclopedia of Bacteria and Archaea (GEBA), the DOE JGI is sequencing thousands of bacterial and archaeal genomes from diverse branches of the Tree of Life. This phylogenomic approach is of great value in multiple areas of public and general scientific interest. The potential benefits include: (a) improved identification of protein families and ortholog groups across species, which will improve annotation of other microbial genomes; (b) improved phylogenetic anchoring of metagenomic data; (c) gene discovery (which tends to be maximized by selecting phylogenetically novel organisms); (d) a better understanding of the processes underlying the evolutionary diversification of microbes (e.g., lateral gene transfer and gene duplication); (e) a better understanding of the classification and evolutionary history of microbial species; and, (f) improved correlations of phenotype and genotype in microbes.

GEBA-family Projects

The original GEBA project was completed in collaboration with the DSMZ.  In a pilot study, the  DOE JGI sequenced 53 bacterial and 3 archaeal novel and highly diverse genomes, representing a first step towards a phylogenetically balanced sequence space in the microbial tree of life (Wu et al, 2009, Nature). An additional approximately 200 GEBA genomes have since been sequenced; the bulk of which are of finished quality. A phylum-level GEBA study, the CyanoGEBA effort, sequenced the genomes of approximately 50 phylogenetically diverse cyanobacterial strains from all five morphological sections, including several from the never-before-sequenced Pleurocapsales and Stigonematales (Shih et al, 2013, PNAS). In an effort to sequence type strains and provides links between sequence data and phenotypic information, the GEBA type strain project is currently sequencing types strains of high phylogenetic diversity and isolated from soil habitats as well as associated with plant hosts. To study Root Nodulating Bacteria (RNB) and their symbioses with legumes after infection of root hairs, 100 RNB strains isolated from various locations around the world have been sequenced under the GEBA-RNB initiative. Lastly, the GEBA-MDM project makes use of the high throughput single cell genomics capabilities at the DOE JGI to explored the genomes of candidate phyla representatives. In a pilot study, approximately 200 single-cell genomes from 29 mostly underexplored branches of the tree of life were sequenced shedding light on the metabolic potential and phylogeny of this microbial dark matter (Rinke et al, 2013, Nature).

GEBA type strain

Type strains map from http://microbial-earth.namesforlife.com/v2/

Image: Type strains map from http://microbial-earth.namesforlife.com/v2/

The type strain commonly refers to the nomenclatural type or the element of a taxon with which the name is permanently associated. In practice, this is usually a living culture that was chosen to represent a prokaryotic species when the species name was proposed. Because of their importance, type strains for species are usually carefully maintained in a number of culture collections throughout the world. Because of the rules of nomenclature, type strains of species should not be identical or highly similar with the type strain of any other species.

The goal of the GEBA-type strain project is generating a comprehensive genomic encyclopedia of the validly named bacterial and archaeal species in order to (i) catalog bacterial and archaeal diversity, (ii) unravel novel functions derived from novel protein families, and (iii) improve the binning and annotation of metagenomes. Type strains play a crucial role in defining the phylogenomic and taxonomic space of Bacteria and Archaea. They constitute the living cultures that serve as a fixed reference point for the assignment of bacterial and archaeal names and exhibit all the relevant phenotypic and genotypic properties cited in the original published taxonomic circumscriptions.

During the first phase of GEBA-type strain study we have identified and sequenced 1,000 new phylogenetic diverse type strains. Our ongoing activities include the scrutiny of our data set to search novel functions, protein families, and undiscovered biosynthetic gene clusters –a key aspect for detection of novel natural products. Finally, we will be able to study the effect of our findings on metagenomic analyses.

PIs: Nikos Kyrpidis, David Paez Espino, JGI; Hans-Peter-Klenk, DSMZ Germany; Barny Whitman, University of Georgia.

GEBA-RNB

Image: Nodules induced by burkholderia on Lebeckia ambigua, John Howieson and Wayne Reeve.

Nodules induced by burkholderia on Lebeckia ambigua. (John Howieson and Wayne Reeve)

Root Nodulating Bacteria (RNB) are soil inhabiting bacteria that form symbioses with legumes after infection of root hairs. The GEBA-RNB initiative at the JGI is sequencing 100 RNB strains isolated from various locations around the world. This project will support the systematic sequence-based studies and understanding of the biogeographical effects on species evolution as well as the mechanisms of symbiotic nitrogen fixation (SNF) by RNB. The latter is a significant asset for world agricultural productivity, farming economy and environmental sustainability.

Image: Nitrogen fixation by burkholderia on lebeckia ambigua (centre pot), John Howieson and Wayne Reeve.

Nitrogen fixation by burkholderia on Lebeckia ambigua (centre pot). (John Howieson and Wayne Reeve)

SNF reduces energy consumption required to produce nitrogenous fertilizer, saving $US 6.8 billion per year. SNF significantly reduces greenhouse gas emissions compared to intensive agriculture practice using artificial N-input. SNF benefits the environment (reduces dry-land salinity, increases soil fertility, and prevents waterway eutrophication).

Root endosymbioses promote carbon sequestration. Shared genetic mechanisms between fungal and bacterial root endosymbioses exist and a detailed understanding of endosymbionts will be beneficial to drive bioenergy development from trees. The GEBA-RNB project is based on collaboration between JGI and an international consortium from 15 countries coordinated by Wayne Reeve from Murdoch University, Australia.

PIs: Nikos Kyrpidis, JGI; Wayne Reeve, University of Perth, Australia.

GEBA-MDM

Image: Great Boiling Springs (Nevada), Robert Dodsworth

Great Boiling Springs in Nevada. (Robert Dodsworth)

While the bulk of the microbial genomes sequenced to date are derived from cultivated bacterial and archaeal representatives, the vast majority of microorganisms elude current culturing attempts, severely limiting the ability to recover complete or even partial genomes of these largely mysterious species referred to as the microbial dark matter.

Image: Main Great Boiling Spring in Nevada, Robert Dodsworth

Main Great Boiling Spring in Nevada. (Jeremy Dodsworth)

The GEBA-Microbial Dark Matter (GEBA-MDM) project represents a natural extension of the original GEBA project and aims to use single-cell genomics to explore the uncultivated majority of Bacteria and Archaea. The first phase of GEBA-MDM targeted 201 single cell representatives of 29 mostly underexplored branches of the tree of life, with a special focus on candidate phyla (phyla proposed on the basis of environmental sequences that have no cultivated representatives). We uncovered unexpected metabolic features including a novel amino acid use for the opal stop codon, an archaeal-type purine synthesis in Bacteria and complete sigma factors in Archaea similar to those in Bacteria. Single-cell genomes also served to phylogenetically anchor up to 20% of metagenomic reads in some habitats improving the organism-level interpretation of ecosystem function. Overall this study greatly expanded the genomic representation of the tree of life and provided a systematic step towards a better understanding of microbial evolution. The phase II GEBA-MDM project is currently underway, which is seeking out new environments with high levels of candidate phyla representatives.

PIs: Tanja Woyke, Christian Rinke, Natalia N. Ivanova, Nikos C. Kyrpides, JGI; Phil Hugenholtz, Center of Ecogenomics, Australia; Alexander Sczyrba, Bielefeld University, Germany; Aaron Darling, University of Technology Sydney; Brian P. Hedlund, University of Nevada; Jonathan Eisen, University of California, Davis; George Tsiamis, University of Patras, Greece; Stefan M. Sievert, Woods Hole Oceanographic Institution; Wen-Tso Liu, University of Illinois at Urbana-Champaign; Steven J. Hallam, University of British Columbia; Ramunas Stepanauskas, Bigelow Laboratory for Ocean Sciences.

References:

Wu D et al. A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea. Nature. 2009 Dec 24;462(7276):1056-60. doi: 10.1038/nature08656.

Shih et al. Improving the coverage of the cyanobacterial phylum using diversity-driven genome sequencing. PNAS. 2013 Jan 15;110(3):1053-8. doi:10.1073/pnas.1217107110.

Rinke C et al. Insights into the phylogeny and coding potential of microbial dark matter. Nature. 2013 Jul 25;499(7459):431-7. doi: 10.1038/nature12352.

  • Plant Program
  • Fungal Program
  • Metagenome Program
  • Microbial Program
    • Phylogenetic Diversity
    • Function-driven Genomics
    • Microbial Publications
  • DNA Synthesis Science Program
  • Metabolomics Technology
  • Careers
  • Contact Us
  • Events
  • User Meeting
  • MGM Workshops
  • Internal
  • Disclaimer
  • Credits
  • Emergency Info
  • Accessibility / Section 508 Statement
  • Facebook
  • Flickr
  • Google+
  • Instagram
  • Linkedin
  • RSS
  • Twitter
  • YouTube
Lawrence Berkeley National Lab Biosciences Area
A project of the US Department of Energy, Office of Science

JGI is a DOE Office of Science User Facility managed by Lawrence Berkeley National Laboratory© 1997-2017 The Regents of the University of California