Identifiers.org
Identifiers.org is a project providing stable and perennial identifiers for data records used in the Life Sciences. The identifiers are provided in the form of Uniform Resource Identifiers (URIs). Identifiers.org is also a resolving system, that relies on collections listed in the MIRIAM Registry to provide direct access to different instances of the identified records.
Identifiers.org URIs and resolving system
The Identifiers.org URIs[1][2] are perennial identifiers, that specify at once the data collection, using the namespaces of the Registry, and the record identifier within the collection in the form of a unique resolvable URI. The Identifiers.org resolving system is built upon the information stored in the MIRIAM Registry, which is a database that stores namespaces assigned to commonly used data collections (databases and ontologies) for the Life Sciences. It transforms an Identifiers.org URI into the various URLs leading to the various instances of the record identified by the URI.
Identifier structure
An Identifiers.org URI is formed of several parts:
- Protocol. Identifiers.org URIs are HTTP URIs and start with "http:/"
- Data collection. These are namespaces listed in the MIRIAM Registry. For instance "pubmed" for the publication resource pubmed, "ec-code" for the enzyme nomenclature and "go" for gene ontology
- Record in the collection. For instance "9606" is "3-fluorotoluene" in the collection PubChem, it is "Homo sapiens" in the collection "taxonomy" and it is a social science publication in the collection "pubmed".
- Optional: Identifiers.org URIs can be suffixed with parameters, for instance imposing which resource to use for resolving, "profiles" that control the resolver's behaviour etc.
Usage
The systems allows a consistent and uniform annotation of datasets. This is turn facilitates data alignment and integration. Identifiers.org URIs are used to encode the metadata in the standard formats of the COMBINE initiative,[3] such as SBML. In particular, databases such as BioModels Database and Reactome export their data in SBML with cross-references encoded using Identifiers.org URIs. These URIs are also used in various semantic web projects such as Bio2RDF, Open PHACTS and the EBI RDF platform[4]
Comparison with other URI systems
Identifiers.org URIs have been developed since 2011 as a resolvable version of the MIRIAM identifiers, developed since 2005, which were of a URN form, and not directly resolvable. Identifiers.org URIs are similar to PURLs, albeit providing alternative resolutions for collections with several instances. They are also similar to DOIs, but provide human readable collection names, and re-use the record identifier assigned by the data provider.
See also
- MIRIAM Registry
- BioModels Database
- SBML
- CellML
- LSID
- Digital object identifier
- Persistent uniform resource locator
References
- ↑ Juty, N; Le Novère, N; Laibe, C (2012). "Identifiers.org and MIRIAM Registry: Community resources to provide persistent identification". Nucleic Acids Research. 40 (Database issue): D580–6. PMC 3245029 . PMID 22140103. doi:10.1093/nar/gkr1097.
- ↑ http://identifiers.org/ Identifiers.org Website
- ↑ http://co.mbine.org/ COmputational Modeling in BIology NEtwork Web site
- ↑ S Jupp, J Malone, J Bolleman, M Brandizi, M Davies, L Garcia, A Gaulton, C Laibe, N Redaschi, S Wimalaratne, Helen P, Ewan B, AM Jenkinson (2014) The EBI RDF Platform: Linked Open Data for the Life Sciences. Bioinformatics doi:10.1093/bioinformatics/btt765
External links
- identifiers.org website
- standards of the COMBINE initiative
- Open PHACTS, the Open Pharmacological Space