DOE Joint Genome Institute

  • COVID-19
  • About Us
  • Contact Us
  • Our Science
    • DOE Mission Areas
    • Bioenergy Research Centers
    • Science Programs
    • Science Highlights
    • Scientists
    Data yielded from RIViT-seq increased the number of sigma factor-gene pairs confirmed in Streptomyces coelicolor from 209 to 399. Here, grey arrows denote previously known regulation and red arrows are regulation identified by RIViT-seq; orange nodes mark sigma factors while gray nodes mark other genes. (Otani, H., Mouncey, N.J. Nat Commun 13, 3502 (2022). https://doi.org/10.1038/s41467-022-31191-w)
    Streamlining Regulon Identification in Bacteria
    Regulons are a group of genes that can be turned on or off by the same regulatory protein. RIViT-seq technology could speed up associating transcription factors with their target genes.

    More

    (PXFuel)
    Designer DNA: JGI Helps Users Blaze New Biosynthetic Pathways
    In a special issue of the journal Synthetic Biology, JGI scientific users share how they’ve worked with the JGI DNA Synthesis Science Program and what they’ve discovered through their collaborations.

    More

    A genetic element that generates targeted mutations, called diversity-generating retroelements (DGRs), are found in viruses, as well as bacteria and archaea. Most DGRs found in viruses appear to be in their tail fibers. These tail fibers – signified in the cartoon by the blue virus’ downward pointing ‘arms’— allow the virus to attach to one cell type (red), but not the other (purple). DGRs mutate these ‘arms,’ giving the virus opportunities to switch to different prey, like the purple cell. (Courtesy of Blair Paul)
    A Natural Mechanism Can Turbocharge Viral Evolution
    A team has discovered that diversity generating retroelements (DGRs) are not only widespread, but also surprisingly active. In viruses, DGRs appear to generate diversity quickly, allowing these viruses to target new microbial prey.

    More

  • Our Projects
    • Search JGI Projects
    • DOE Metrics/Statistics
    • Approved User Proposals
    • Legacy Projects
    Photograph of a stream of diatoms beneath Arctic sea ice.
    Polar Phytoplankton Need Zinc to Cope with the Cold
    As part of a long-term collaboration with the JGI Algal Program, researchers studying function and activity of phytoplankton genes in polar waters have found that these algae rely on dissolved zinc to photosynthesize.

    More

    This data image shows the monthly average sea surface temperature for May 2015. Between 2013 and 2016, a large mass of unusually warm ocean water--nicknamed the blob--dominated the North Pacific, indicated here by red, pink, and yellow colors signifying temperatures as much as three degrees Celsius (five degrees Fahrenheit) higher than average. Data are from the NASA Multi-scale Ultra-high Resolution Sea Surface Temperature (MUR SST) Analysis product. (Courtesy NASA Physical Oceanography Distributed Active Archive Center)
    When “The Blob” Made It Hotter Under the Water
    Researchers tracked the impact of a large-scale heatwave event in the ocean known as “The Blob” as part of an approved proposal through the Community Science Program.

    More

    A plantation of poplar trees. (David Gilbert)
    Genome Insider podcast: THE Bioenergy Tree
    The US Department of Energy’s favorite tree is poplar. In this episode, hear from ORNL scientists who have uncovered remarkable genetic secrets that bring us closer to making poplar an economical and sustainable source of energy and materials.

    More

  • Data & Tools
    • IMG
    • Data Portal
    • MycoCosm
    • PhycoCosm
    • Phytozome
    • GOLD
    HPCwire Editor's Choice Award (logo crop) for Best Use of HPC in the Life Sciences
    JGI Part of Berkeley Lab Team Awarded Best Use of HPC in Life Sciences
    The HPCwire Editors Choice Award for Best Use of HPC in Life Sciences went to the Berkeley Lab team comprised of JGI and ExaBiome Project team, supported by the DOE Exascale Computing Project for MetaHipMer, an end-to-end genome assembler that supports “an unprecedented assembly of environmental microbiomes.”

    More

    With a common set of "baseline metadata," JGI users can more easily access public data sets. (Steve Wilson)
    A User-Centered Approach to Accessing JGI Data
    Reflecting a structural shift in data access, the JGI Data Portal offers a way for users to more easily access public data sets through a common set of metadata.

    More

    Phytozome portal collage
    A More Intuitive Phytozome Interface
    Phytozome v13 now hosts upwards of 250 plant genomes and provides users with the genome browsers, gene pages, search, BLAST and BioMart data warehouse interfaces they have come to rely on, with a more intuitive interface.

    More

  • User Programs
    • Calls for Proposals
    • Special Initiatives & Programs
    • Product Offerings
    • User Support
    • Policies
    • Submit a Proposal
    screencap from Amundson and Wilkins subsurface microbiome video
    Digging into Microbial Ecosystems Deep Underground
    JGI users and microbiome researchers at Colorado State University have many questions about the microbial communities deep underground, including the role viral infection may play in other natural ecosystems.

    Read more

    Yeast strains engineered for the biochemical conversion of glucose to value-added products are limited in chemical output due to growth and viability constraints. Cell extracts provide an alternative format for chemical synthesis in the absence of cell growth by isolating the soluble components of lysed cells. By separating the production of enzymes (during growth) and the biochemical production process (in cell-free reactions), this framework enables biosynthesis of diverse chemical products at volumetric productivities greater than the source strains. (Blake Rasor)
    Boosting Small Molecule Production in Super “Soup”
    Researchers supported through the Emerging Technologies Opportunity Program describe a two-pronged approach that starts with engineered yeast cells but then moves out of the cell structure into a cell-free system.

    More

    These bright green spots are fluorescently labelled bacteria from soil collected from the surface of plant roots. For reference, the scale bar at bottom right is 10 micrometers long. (Rhona Stuart)
    A Powerful Technique to Study Microbes, Now Easier
    In JGI's Genome Insider podcast: LLNL biologist Jennifer Pett-Ridge collaborated with JGI scientists through the Emerging Technologies Opportunity Program to semi-automate experiments that measure microbial activity in soil.

    More

  • News & Publications
    • News
    • Blog
    • Podcasts
    • Webinars
    • Publications
    • Newsletter
    • Logos and Templates
    • Photos
    A view of the mangroves from which the giant bacteria were sampled in Guadeloupe. (Hugo Bret)
    Giant Bacteria Found in Guadeloupe Mangroves Challenge Traditional Concepts
    Harnessing JGI and Berkeley Lab resources, researchers characterized a giant - 5,000 times bigger than most bacteria - filamentous bacterium discovered in the Caribbean mangroves.

    More

    In their approved proposal, Frederick Colwell of Oregon State University and colleagues are interested in the microbial communities that live on Alaska’s glacially dominated Copper River Delta. They’re looking at how the microbes in these high latitude wetlands, such as the Copper River Delta wetland pond shown here, cycle carbon. (Courtesy of Rick Colwell)
    Monitoring Inter-Organism Interactions Within Ecosystems
    Many of the proposals approved through JGI's annual Community Science Program call focus on harnessing genomics to developing sustainable resources for biofuels and bioproducts.

    More

    Coloring the water, the algae Phaeocystis blooms off the side of the sampling vessel, Polarstern, in the temperate region of the North Atlantic. (Katrin Schmidt)
    Climate Change Threatens Base of Polar Oceans’ Bountiful Food Webs
    As warm-adapted microbes edge polewards, they’d oust resident tiny algae. It's a trend that threatens to destabilize the delicate marine food web and change the oceans as we know them.

    More

News & Publications
Home › Blog › IMG/VR Database Triples in Size

January 26, 2018

IMG/VR Database Triples in Size

Got 12 Gb of free storage to download it all?

To better understand how Earth’s vast and diverse microbial population helps regulate global nutrient cycles, it helps to understand how viruses infect microbes, and affect their functions and metabolic processes. Two years ago, even though the number of viruses is estimated to be at least two orders of magnitude more than the microbial cells on the planet, sequence databases held less than 2,200 sequenced DNA virus genomes, compared to the approximately 50,000 bacterial genomes on file. That ratio changed dramatically when JGI researchers unveiled more than 125,000 partial and complete viral genomes, described in an August 2016 Nature paper, and boosted the number of known viral genes 16-fold.

Growth over time in the total and unique number of viral sequences in IMG/VR. (Image composite by David Paez-Espino).

Growth rate of virus identification and microbial host prediction for the new release of IMG/VR database. Growth over time in the total and unique number of viral sequences in IMG/VR. (Image composite by David Paez-Espino).

On the heels of that release, the JGI launched IMG/VR, describing it then in the January 2017 “Database” issue of Nucleic Acids Research as the largest publicly available database with isolate reference DNA viruses and computationally identified contiguous viral sequences (“contigs”) from thousands of ecologically diverse metagenomics samples. By then, the contents of the IMG/VR database had doubled from the figure referenced in August 2016.

This month, IMG/VR unveiled its largest release of viral datasets yet, with a year’s worth of data mined from metagenome sequences both by the JGI and externally submitted, and isolates sourced from the National Center for Biotechnology Information (NCBI). There are now over 715,000 viral datasets, with nearly 8,500 isolate viruses, and over 700,000 viral contigs.

“The viral diversity in IMG/VR has tripled since January 2017,” said JGI bioinformaticist David Paez-Espino. “We have now identified over 34,000 viral sequences targeting several microbial taxa for the very first time, we’ve associated new viruses to known microbial genomes, and the vast majority of the gene content (over 15 million genes in total) remains hypothetical or unknown, meaning that there is tremendous potential for new discoveries out of that gene pool.”

“The idea is to release all the viral data we could find along with key information, such as samples in which a virus was identified, and what its expected host might be, so that the community can efficiently analyze and mine these data through this integrated viral genomics resource,” said JGI scientist Simon Roux.

As detailed in an August 2017 Nature Protocols report, the data generation was made possible by a semi-automated discovery pipeline. Paez-Espino adds that in addition to the data release, IMG/VR has a new feature that allows researchers to download the entire database. “You can search IMG/VR for homologous viruses if you provide your sequences, or search for a host using our spacer BLAST database, but now you can also bulk download the whole database with metadata associated.” Despite the massive data release, he notes, downloading more than 715,000 viral contigs and the rest of the database would take up just 12 GB of storage on a computer. That’s up from the database size of 4GB had it been downloadable a year ago.

“We anticipate that the release of such massive viral sequence data to the research community will drive novel discoveries and understanding of the viral world,” said Prokaryote Super Program head Nikos Kyrpides, “We invite the community to explore these data and provide feedback on how we can improve IMG/VR.”

Hear how researchers are already mining the IMG/VR database at the upcoming Viral EcoGenomics & Applications (VEGA) Symposium to be held March 14-15, 2018 during the JGI’s 13th Annual Genomics of Energy and Environment Meeting at the Hilton San Francisco Union Square. Additionally, on March 13, the Prokaryote Super Program hosts a workshop detailing the microbial and metagenomics resources and capabilities available to users through partnering with the JGI – and how current collaborators are developing technologies to further innovate – and harnessing the Integrated Microbial Genomes suite of tools. Learn more at usermeeting.jgi.doe.gov/vega.

Through the Community Science Program (CSP), the JGI has launched a New Investigator call for proposals, with the emphasis on providing pilot data to form the foundation of a large-scale CSP proposal submission. Proposals are due by March 1, 2018, and must be independent of ongoing accepted proposals. Additionally, the lead PIs on these New Investigator proposals cannot have been lead PI on any previously accepted JGI CSP or FICUS proposal. The New Investigator calls for proposal will go out four times a year, and replace the CSP Small Scale Calls – click here to learn more.

 

References:

  • IMG/VR portal: https://img.jgi.doe.gov/vr
  • Bulk Download the IMG/VR database: https://genome.jgi.doe.gov/portal/IMG_VR/IMG_VR.home.html
  • JGI News Release: Unveiled: Earth’s Viral Diversity
  • JGI Science Highlight: DOE JGI Database of DNA viruses and retroviruses debuts on IMG platform
  • JGI Publication: “Uncovering Earth’s virome” http://rdcu.be/FDVo
  • JGI Publication: “IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses” https://doi.org/10.1093/nar/gkw1030
  • JGI Publication: “Nontargeted virus sequence discovery pipeline and virus clustering for metagenomic data” https://www.nature.com/articles/nprot.2017.063

Share this:

  • Click to share on Facebook (Opens in new window)
  • Click to share on LinkedIn (Opens in new window)
  • Click to share on Pinterest (Opens in new window)
  • Click to share on Twitter (Opens in new window)
  • Click to print (Opens in new window)

Filed Under: Blog

More topics:

  • COVID-19 Status
  • News
  • Science Highlights
  • Blog
  • Webinars
  • CSP Plans
  • Featured Profiles

Related Content:

JGIota: Sequencing Shiitakes with David Hibbett

A Genome Insider Logo Image

JGI at 25: Mapping Switchgrass Traits with Common Gardens

Aerial photo of the switchgrass diversity panel late in the 2020 season at the Kellogg Biological Station in Michigan. (Robert Goodwin)

JGI at 25: Following Fungi that Pry Apart Plant Polymers

A brown goat with white horns looks at green hay

Exploring Possibilities: 2022 JGI-UC Merced Interns

2022 JGI-UC Merced interns (Thor Swift/Berkeley Lab)

JGI at 25: Using team science to build communities around data

JGI at 25: Expanding Metagenomics to Capture Viral Diversity

Artist rendering of genome standards being applied to deciphering the extensive diversity of viruses. (Illustration by Leah Pantea)
  • Careers
  • Contact Us
  • Events
  • User Meeting
  • MGM Workshops
  • Internal
  • Disclaimer
  • Credits
  • Policies
  • Emergency Info
  • Accessibility / Section 508 Statement
  • Flickr
  • LinkedIn
  • RSS
  • Twitter
  • YouTube
Lawrence Berkeley National Lab Biosciences Area
A project of the US Department of Energy, Office of Science

JGI is a DOE Office of Science User Facility managed by Lawrence Berkeley National Laboratory

© 1997-2023 The Regents of the University of California