Published in:
OMICS: A Journal of Integrative Biology 12(2), 123-127 (Jun 2008)
Author(s):
DOI:
10.1089/omi.2008.0020
Abstract:
Given the growing wealth of downstream information, the integration of molecular and non-molecular data on a given organism has become a major challenge. For micro-organisms, this information now includes a growing collection of sequenced genes and complete genomes, and for communities of organisms it includes metagenomes. Integration of the data is facilitated by the existence of authoritative, community-recognized, consensus identifiers that may form the heart of so-called information knuckles. The Genomic Standards Consortium (GSC) is building a mapping of identifiers across a group of federated databases with the aim of improving navigation across these resources and enabling the integration of this information in the near future. In particular, this is possible because of the existence of INSDC Genome Project Identifiers (GPIDs) and accession numbers, and the ability of the community to define new consensus identifiers such as the culture identifiers used in the StrainInfo.net bioportal. Here we (1) outline the general design of the Genomic Rosetta Stone project, (2) introduce example linkages between key databases (that cover information about genomes, 16S rRNA gene sequences, and microbial biological resource centers), and (3) make an open call for participation in this project by providing a vision for its future use.