Generic Model Organism Database

The Generic Model Organism Database (GMOD) Project began as an effort to create reusable software tools for developing Model Organism Databases (MODs). MODs describe genome and other information about important experimental organisms in the life sciences. Also called organism-specific databases, these databases capture the large volumes of data and information being generated by modern biology.

Behind every MOD is a software system that is designed to help manage the data within the MOD, and to help users query and access those data. In the past, every MOD project developed its own software tools. GMOD is a loose federation of software applications (components) aimed at providing functionality that is needed by many or all MODs. Some of these software components are linked together by their use of a common database schema known as Chado. This project is funded by the United States National Institutes of Health, National Science Foundation and the USDA Agricultural Research Service.

Contents

Chado database schema

Chado makes extensive use of controlled vocabularies to type all entities in the database, so there is a feature table where gene, transcripts, exons, transposable elements, etc. are stored and their type is provided by the Sequence Ontology. When a new datatype comes along, the feature table requires no modification, only an update of the data in the database. The same is largely true of analysis data that can be stored in Chado as well.

The existing core modules of Chado are:

Software

The full list of GMOD software components is found on the GMOD Components page. These components include:

  • GMOD Core (Chado database and tools)
    • Chado : the Chado schema and tools to install it.
    • XORT  : a tool for loading and dumping chado-xml
    • GMODTools : extracts data from a Chado database into common genome bulk formats (GFF, Fasta, etc)
  • MOD website
  • Genome Editing and Visualization
    • Apollo  : a Java application for viewing and editing genome annotations
    • GBrowse  : a CGI application for displaying genome annotations
    • JBrowse : a JavaScript application for displaying genome annotations
    • Pathway Tools  : a genome browser with a comparative mode
  • Comparative Genomics
    • GBrowse_syn : a GBrowse based synteny viewer
    • CMap  : a CGI application for displaying comparative maps
  • Literature curation
    • Textpresso  : a text mining system for scientific literature
  • Database querying tools
    • BioMart  : a query-oriented data management system
    • InterMine : open source data warehouse system
  • Biological Pathways
    • Pathway Tools : tools for metabolic pathway information, and analysis of high-throughput functional genomics data
  • Regulatory Networks
    • Pathway Tools : supports definition of regulatory interactions and browsing of regulatory networks
  • Analysis


References

Participating databases

The following organism databases are contributing to and/or adopting GMOD components for model organism databases.

ANISEED AntonosporaDB ATIDB Arabidopsis
BeeBase BeetleBase Bovine Genome Database (BGD)
BioHealthBase Bovine QTL Viewer Cattle EST Gene Family Database
CGD CGL ChromDB
Chromosome 7 Annotation Project CSHLmpd Database of Genomic Variants
DictyBase DroSpeGe EcoCyc
FlyBase Fungal Comparative Genomics Fungal Telomere Browser
Gallus Genome Browser GeneDB GrainGenes
Gramene HapMap Human 2q33
Human Genome Segmental Duplication Database IVDB MAGI
Marine Biological Lab Organism Databases MGI Non-Human Segmental Duplication Database
OMAP OryGenesDB Oryza Chromosome 8
Pathway Tools ParameciumDB PeanutMap
PlantsDB PlasmoDB PseudoCAP
PossumBase PUMAdb RGD
SGD SGD Lite SmedDB
Sol Genomics Network Soybase Soybean Gbrowse Database
T1DBase TAIR TGD
TGI TIGR TIGR Rice Genome Browser
ToxoDB TriAnnot BAC Viewer VectorBase
wFleaBase WormBase
XanthusBase Xenbase

Related projects

See also