Warning: mysql_fetch_assoc(): supplied argument is not a valid MySQL result resource in /home/www/web203/html/t3lib/class.t3lib_db.php on line 723
neuroscientific.net: bio-zen Ontology Framework

The bio-zen Ontology Framework


The development of the bio-zen  ontology framework is an attempt to represent data, information and knowledge from research in all facets of the life sciences on the  Semantic Web. The goal of this project is the unification of information that is now scattered through a multitude of different data structures, exchange formats and databases. Through the use of Semantic Web technologies, the decentralised and barrier-free development and exchange of experimental data, hypotheses and biological models becomes possible.

Conventional databases (e.g. relational databases or XML databases) do a poor job of representing biological reality. Researchers that want to publish or search for information do not only have to know about the biological structures they are investigating, they also have to deal with the structures of the database tables, file formats or XML documents – all of which are in most cases only remotely similar to the mental representation we have in mind when thinking about biological facts.

Furthermore, most databases in use nowadays were designed with only a small and limited field of investigation in mind, and so we are now confronted with a convolute of small databases that can only be made to work together through a lot of additional work.

We also see that systems biology with its focus on the simulation of complex biological systems has an ever growing impact on classical molecular biology. However, the realm of qualitative information that is represented in texts and databases like Uniprot or BIND is completely disconnected from the world of simulation and modelling, which is currently represented through languages like SBML, CellML or NEURON models. To realize the promises of systems biology, the division between these two worlds has to be bridged.

The bio-zen framework uses Semantic Web technologies to overcome the limitations of current information systems in the life sciences. The descriptions of biological reality in the bio-zen framework are very similar to the cognitive models researchers have about their subjects of investigation, making the work with information systems more intuitive for the individual scientist. bio-zen is exceptionally flexible and extensible, making it easy to represent information from a wide variety of fields in a common framework. It also allows for a seamless integration of mathematical descriptions and simulation parameters into qualitative information, enabling a quick transition from data and information to model simulations and back.

If successful, the Semantic Web and the bio-zen framework could mark the beginning of a whole new paradigm in the organisation and dissemination of information in the life sciences.

The following Semantic Web ontologies are incorporated in bio-zen (all ontologies are mapped to DOLCE):

DOLCE

Descriptive Ontology for Linguistic and Cognitive Engineering

DOLCE is an established foundational ontology available in the OWL format. It defines basic ontological classes and properties and acts as a foundation for other ontologies.

bio-zen is based on DOLCE, which gives bio-zen a sound ontological base and makes it highly interoperable with other ontologies based on DOLCE.

 http://www.loa-cnr.it/DOLCE.html 

SKOS

Simple Knowledge Organisation System

The SKOS ontology can be used to represent thesauri, classification schemes, subject heading lists, taxonomies and terminologies in a Semantic Web format.

In bio-zen, SKOS is used for the flexible representation of existing biomedical ontologies and taxonomies (e.g. the Gene Ontology [6]).

 http://www.w3.org/2004/02/skos/ 

FOAF

FOAF is a popular Semantic Web ontology for the description of personal data, personal relationships and organisations.

 http://www.foaf-project.org/ 

SIOC

Semantically Interlinked Open Communities

“SIOC provides methods for interconnecting discussion methods such as blogs, forums and mailing lists to each other. It consists of the SIOC ontology, an open-standard machine readable format for expressing the information contained both explicitly and implicitly in internet discussion methods, of SIOC metadata producers for a number of popular blogging platforms and content management systems, and of storage and browsing / searching systems for leveraging this SIOC data.”

In bio-zen, SIOC is adapted for the representation of basic scientific discourse on the web.

 http://sioc-project.org/ 

Dublin Core

Dublin core is a widely accepted metadata standard for the description of resources (e.g. title, creator, copyright etc.)

 http://dublincore.org/ 

Creative Commons

Creative Commons metadata is a widely used standard for the description of copyright information. In bio-zen, it is used to make machine-readable statements about the copyright restrictions of scientific publications and databases.

 http://sciencecommons.org/ 

 

Basic design principles of the bio-zen framework:

• The ontology is built upon existing foundational ontologies ( DOLCE and  SKOS). This means that it is rooted in a sound ontological framework, easing the interoperability with ontologies from other domains.

• Over-specialisation is avoided where possible. Generic properties like ‘part of’, ‘constituent of’, ‘broader concept than’. ‘caused by’ etc. are the backbone of the whole ontology. The class hierarchy is kept simple and lean.

• Dublin Core and other existing metadata standards are used.

• The ontology integrates two different approaches of information representation in a common framework: ‘realist’ ontological descriptions and ‘conceptualist’ taxonomies and concept hierarchies.

- Making a clear distinction between both approaches reduces the susceptibility to inconsistencies.

- Unifying both approaches in one common framework makes it possible to combine the specific advantages of each approach in the best way possible. The consistency of the realist approach is complemented with the flexibility of the conceptualist approach.

• Users of the ontology only need to make OWL individuals to represent information. The definition of new classes is not necessary, which helps to avoid many of the associated problems.

• The ontology is focused on the description of spatio-temporal particulars (concrete biological things and models). This focus limits the problems that arise when users of the ontology can define universals.

• The ontology allows the seamless integration of mathematical models (similar to and compatible with SBML 2) into qualitative information.

• Molecular interactions are modelled as stochastic processes involving populations of molecules, not as singular events that involve singular molecules. This approach is much closer to biological reality in most occasions, and avoids some grave consistency problems associated with the other approach.

• Strict use of SI units for physical measurements.

• The ontology includes simple means to 'fuzzify' information (i.e. basic support for fuzzy logic - like constructs).

 

The first version of the bio-zen manual is ready for download (PDF, 500 kb)

Go to the download section / view class structure

Go to the bio-zen discussion forum at Google Groups

You are not logged in.
LOGIN
Register as new user
Breaking News:
01-10-09 11:23
Making the web work for pharma data

The Linking Open Drug Data project is gaining traction

[more]

go to Archive ->