DOE Joint Genome Institute

  • COVID-19
  • About Us
  • Contact Us
  • Our Science
    • DOE Mission Areas
    • Science Programs
    • Science Highlights
    • Scientists
    A vertical tree stump outdoors with about a dozen shiitake mushrooms sprouting from its surface.
    Tracing the Evolution of Shiitake Mushrooms
    Understanding Lentinula genomes and their evolution could provide strategies for converting plant waste into sugars for biofuel production. Additionally, these fungi play a role in the global carbon cycle.

    More

    Soil Virus Offers Insight into Maintaining Microorganisms
    Through a collaborative effort, researchers have identified a protein in soil viruses that may promote soil health.

    More

    Data yielded from RIViT-seq increased the number of sigma factor-gene pairs confirmed in Streptomyces coelicolor from 209 to 399. Here, grey arrows denote previously known regulation and red arrows are regulation identified by RIViT-seq; orange nodes mark sigma factors while gray nodes mark other genes. (Otani, H., Mouncey, N.J. Nat Commun 13, 3502 (2022). https://doi.org/10.1038/s41467-022-31191-w)
    Streamlining Regulon Identification in Bacteria
    Regulons are a group of genes that can be turned on or off by the same regulatory protein. RIViT-seq technology could speed up associating transcription factors with their target genes.

    More

  • Our Projects
    • Search JGI Projects
    • DOE Metrics/Statistics
    • Approved User Proposals
    • Legacy Projects
    A panoramic view of a lake reflecting a granite mountain.
    Genome Insider: Methane Makers in Yosemite’s Lakes
    Meet researchers who sampled the microbial communities living in the mountaintop lakes of the Sierra Nevada mountains to see how climate change affects freshwater ecosystems, and how those ecosystems work.

    Listen

    A light green shrub with spiny leaves, up close.
    Genome Insider: A Shrubbier Version of Rubber
    Hear from the consortium working on understanding the guayule plant's genome, which could lead to an improved natural rubber plant.

    Listen

    The switchgrass diversity panel growing at the Kellogg Biological Station in Michigan. (David Lowry)
    Mapping Switchgrass Traits with Common Gardens
    The combination of field data and genetic information has allowed researchers to associate climate adaptations with switchgrass biology.

    More

  • Data & Tools
    • IMG
    • Data Portal
    • MycoCosm
    • PhycoCosm
    • Phytozome
    • GOLD
    iPHoP image (Simon Roux)
    iPHoP: A Matchmaker for Phages and their Hosts
    Building on existing virus-host prediction approaches, a new tool combines and evaluates multiple predictions to reliably match viruses with their archaea and bacteria hosts.

    More

    Abstract image of gold lights and squares against a black backdrop
    Silver Age of GOLD Introduces New Features
    The Genomes OnLine Database makes curated microbiome metadata that follows community standards freely available and enables large-scale comparative genomics analysis initiatives.

    More

    Graphical overview of the RNA Virus MetaTranscriptomes Project. (Courtesy of Simon Roux)
    A Better Way to Find RNA Virus Needles in the Proverbial Database Haystacks
    Researchers combed through more than 5,000 data sets of RNA sequences generated from diverse environmental samples around the world, resulting in a five-fold increase of RNA virus diversity.

    More

  • User Programs
    • Calls for Proposals
    • Special Initiatives & Programs
    • Product Offerings
    • User Support
    • Policies
    • Submit a Proposal
    Green plant matter grows from the top, with the area just beneath the surface also visible as soil, root systems and a fuzzy white substance surrounding them.
    Supercharging SIP in the Fungal Hyphosphere
    Applying high-throughput stable isotope probing to the study of a particular fungi, researchers identified novel interactions between bacteria and the fungi.

    More

    Digital ID card with six headshots reads: Congratulations to our 2022 Function Genomics recipients!
    Final Round of 2022 CSP Functional Genomics Awardees
    Meet the final six researchers whose proposals were selected for the 2022 Community Science Program Functional Genomics call.

    More

    croppe image of the JGI helix sculpture
    Tips for a Winning Community Science Program Proposal
    In the Genome Insider podcast, tips to successfully avail of the JGI's proposal calls, many through the Community Science Program.

    Listen

  • News & Publications
    • News
    • Blog
    • Podcasts
    • Webinars
    • Publications
    • Newsletter
    • Logos and Templates
    • Photos
    2022 JGI-UC Merced interns (Thor Swift/Berkeley Lab)
    Exploring Possibilities: 2022 JGI-UC Merced Interns
    The 2022 UC Merced intern cohort share how their summer internship experiences have influenced their careers in science.

    More

    image from gif that shows where in the globe JGI fungal collaborators are located.
    Using Team Science to Build Communities Around Data
    As the data portals grow and evolve, the research communities further expand around them. But with two projects, communities are forming to generate high quality genomes to benefit researchers.

    More

    Cow Rumen and the Early Days of Metagenomics
    Tracing a cow rumen dataset from the lab to material for a hands-on undergraduate research course at CSU-San Marcos that has since expanded into three other universities.

    More

Archived Educator Resources
Home › Archived Educator Resources › Assembly Exercise › Teacher Instructions

Teacher Instructions

Creation Instructions

  1. Print the AssemblyPoster.xls file onto a poster. It is preferable for the paper to be of good quality with a glossy finish. Optimum paper size is ANSI E, usually found in the advanced properties of your print options.
  2. Spray-mount the poster onto posterboard.
  3. Cut out each of the of the fragments so that it includes the numeric label on the 5′ end. Example: The top left piece has the label 1-371164C. This will be used later.
  4. Cut a 14-foot piece of string. Note: the length represents all Ns in the 40-kb fragment.
  5. Make holes in the 3′ end of the B1-153495H fragment and the 5′ end of the B2-153495H fragment. These holes should be about half an inch from the ends and only large enough to pass the string in step 4 through.
  6. Tie together B1-153495H and B2-153495H with the string.

Step 1

Separate the students into two groups (or four groups if running two assemblies). Generally three to seven students per group works best.

Step 2

Separate the pieces into two groups. The pieces have a barcode on the 5′ end. The barcodes have a format that consists of a single character followed by a dash followed by six numbers and then finally a letter. The initial character before the dash designates the group that the piece belongs to. Thus the barcode 2-664789P belongs to group 2. Barcode 1-371164C belongs to group 1. The barcode for the 40-kb piece has an initial character of B, which represents bridge. B1 is the part of the 40-kb piece that goes with group one. B2 goes with group 2. This 40-kb piece is introduced later.

Step 3

Go through the Student Instructions with the students.

Step 4

Give the students the pieces.

Notes

  1. If the students need help during the assembly, you can assist them. The barcodes also contain information on the placement of the pieces. The third number after the dash in the barcode indicates which position (reading from left to right) the piece has within the group. Thus the barcode 2-664789P is the fourth piece from the left piece in group 2. Barcode 1-371164C is the left-most piece in group 1. The third positions in the 40-kb fragment pieces do not have any significance.
  2. The two pieces that contain mismatches are 1-653687F and 2-664789P.

Step 5

  • Once they have completed the assembly, make the following points:
  • Show the idea of depth of coverage (the number of times a particular base is sequenced on different reads) by pointing out areas where the assembly is 3 to 4 times in depth.
  • A true assembly is much more difficult than this due to the following reasons:
    • The clones and reads are 50 times longer.
    • The clones are double stranded and their orientation is unknown.
    • Genomes contain repeat areas.
    • There is potential for contamination.
    • There may be polymorphisms.
    • The number of reads in an assembly is in the thousands to millions. Ideally, there are enough reads so that every base is represented 10 times, thus giving us 10x depth of coverage.
  • These difficulties make it necessary to use computers to perform the assemblies.
  • Note that the 40kb piece bridged the two contigs thus resulting in one contig. This is the main reason why the JGI does 40kb libraries. If the portion of the assembly where a 40kb fragment does not have enough depth of coverage, then often times that fragment is broken up into a 3kb library to fill in the area.
  • Note that the base-pair mismatch was in the last base of the read. This is there to illustrate that the quality scores typically drop towards the end of the read.
  • To illustrate other points relating to sequence quality, work through the quality activity.

Step 6

Go through the Computer Demo with the students.

  • Student Instructions
  • Teacher Instructions
  • Sequence Quality
  • Computer Demo

More topics:

  • COVID-19 Status
  • News
  • Science Highlights
  • Blog
  • Webinars
  • CSP Plans
  • Featured Profiles
  • Careers
  • Contact Us
  • Events
  • User Meeting
  • MGM Workshops
  • Internal
  • Disclaimer
  • Credits
  • Policies
  • Emergency Info
  • Accessibility / Section 508 Statement
  • Flickr
  • LinkedIn
  • RSS
  • Twitter
  • YouTube
Lawrence Berkeley National Lab Biosciences Area
A project of the US Department of Energy, Office of Science

JGI is a DOE Office of Science User Facility managed by Lawrence Berkeley National Laboratory

© 1997-2023 The Regents of the University of California