IT703: Semantic Digitization of Experimental Data in Biological Sciences

TitleIT703: Semantic Digitization of Experimental Data in Biological Sciences
Publication TypeConference Paper
Year of Publication2016
AuthorsRaghuvanshi S
Conference NameInternational Conference on Biomedical Ontology and BioCreative (ICBO BioCreative 2016)
Date Published11/30/16 Volume 1747
Other NumbersVol-1747|urn:nbn:de:0074-1747-1

A major bulk of published experimental data, referred to as ÔGold StandardÕ data, is available in a format that cannot be easily accessed by computers unless effectively curated. Most curation techniques bank on mining the text for information. Here we propose and demonstrate the efficacy of curating the experimental data itself. The data models facilitate digitization of the every aspect of the information associated with the experimental data. The models utilize several universally accepted ontologies as well as in-house developed alphanumeric notations for digitizing different aspect of the data. The data models have sufficient flexibility to address the extensive variability in experimental data. They have a very generic nature and can be used to curate and digitize experimental data from any organism. The digitized data is easily stored in a relational database management system and can thus be rapidly searched and integrated. These models have been successfully used to digitize data from over 20,000 experiments spanning over 500 research articles on rice biology. The entire dataset is available as a database entitled ÔManually Curated Database of Rice ProteinsÕ at