Using Workflow to Build an Information Management System for a Geographically Distributed Genome Sequence Initiative

TitleUsing Workflow to Build an Information Management System for a Geographically Distributed Genome Sequence Initiative
Publication TypeBook Chapter
Year of Publication2003
AuthorsMichael Weise, John Miller, Krzysztof Kochut, Jonathan Arnold, R. David Hall, Amit Sheth
Abstract

Genome projects are very demanding undertakings and are often carried out collaboratively by multiple research centers. There are many different types of tasks that must be performed both by researchers and automated tools. These include such activities as shotgun sequencing, sequence finishing, sequence similarity searches, data annotation, oligonucleotide synthesis, library construction, and data submission. These individual tasks are organized into workflows that carry out a particular function, such as providing an annotated sequence of a cosmid clone. The individual tasks may be carried out at one or more of the participating institutions. A single workflow may be spread across multiple research centers. Creating software systems to support distributed workflows presents developers with a number of challenges, such as coordinating the execution of applications running on different systems, transporting data between systems, integrating legacy applications, providing recovery mechanisms, and creating user interfaces. Additionally, there will likely be frequent change to the organizational procedures, especially in the early stages of a genome project. This paper discusses using a general purpose workflow management system (WfMS) to address these challenges in the implementation of an information system that manages a geographically distributed genome project. A prototype application built with the METEOR WfMS is described which is running on several systems at the University of Georgia.

Full Text

R. David Hall, John A. Miller,Jonathan Arnold, Krys J. Kochut, Amit P. Sheth, and Michael J. Weise, 'Using Workflow to Build an Information Management System for a Geographically Distributed Genome Sequence Initiative,'in Genomics of Plants and Fungi, R.A. Prade and J. Bohnert (Eds.), New York: Marcel Dekker, 2003, ch. 12, pp. 359-372.
pages: ch. 12, pp. 359-372
publisher: New York: Marcel Dekker, 2003
year: 2003
hasEditor: .J. Bohnert
hasURL: http://knoesis.wright.edu/library/download/HMA+01.pdf
hasBookTitle: Genomics of Plants and Fungi

Related Files: