BioExtract server-An integrated workflow-enabling system to access and analyze heterogeneous, distributed biomolecular data

Carol Lushbough, Michael K. Bergman, Carolyn J. Lawrence, Doug Jennewein, Volker Brendel

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Many in silico investigations in bioinformatics require access to multiple, distributed data sources and analytic tools. The requisite data sources may include large public data repositories, community databases, and project databases for use in domain-specific research. Different data sources frequently utilize distinct query languages and return results in unique formats, and therefore researchers must either rely upon a small number of primary data sources or become familiar with multiple query languages and formats. Similarly, the associated analytic tools often require specific input formats and produce unique outputs which make it difficult to utilize the output from one tool as input to another. The BioExtract Server (http://bioextract.org) is a Web-based data integration application designed to consolidate, analyze, and serve data from heterogeneous biomolecular databases in the form of a mash-up. The basic operations of the BioExtract Server allow researchers, via their Web browsers, to specify data sources, flexibly query data sources, apply analytic tools, download result sets, and store query results for later reuse. As a researcher works with the system, their steps are saved in the background. At any time, these steps can be preserved long-term as a workflow simply by providing a workflow name and description.

Original languageEnglish (US)
Article number4626945
Pages (from-to)12-24
Number of pages13
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume7
Issue number1
DOIs
StatePublished - Jan 2010
Externally publishedYes

Keywords

  • Bioinformatics (genome or protein) databases
  • Data integration
  • Database integration
  • Distributed architectures
  • Heterogeneous Databases.
  • Heterogeneous databases
  • Mash-up
  • Scientific workflow automation.

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'BioExtract server-An integrated workflow-enabling system to access and analyze heterogeneous, distributed biomolecular data'. Together they form a unique fingerprint.

Cite this