Galapagos: Model-driven discovery of end-to-end application-storage relationships in distributed systems

Kostas Magoutis, Murthy Devarakonda, Nikolai Joukov, Norbert G. Vogl

Research output: Contribution to journalArticlepeer-review

29 Scopus citations


Modern business information systems are typically multi-tiered distributed systems comprising Web services, application services, databases, enterprise information systems, file systems, storage controllers, and other storage systems. In such environments, data is stored in different forms at multiple tiers, with each tier associated with some level of data abstraction. An information entity owned by an application generally maps to several data entities, logically associated across tiers and related to the application. Discovery of such relationships in a distributed system is a challenging problem, complicated by the widespread adoption of virtualization technologies and by the traditional tendency to manage each tier as an independent domain. In this paper, we present a system and methodology for model-driven discovery of end-to-end application-data relationships spanning multiple tiers, from the applications to the lowest levels of the storage hierarchy. The key to our methodology involves modeling how data is used and transformed by distributed software components. An important benefit of our system, which we call Galapagos, is the ability to reflect business decisions expressed at the application level to the level of storage.

Original languageEnglish (US)
Pages (from-to)367-377
Number of pages11
JournalIBM Journal of Research and Development
Issue number4-5
StatePublished - 2008

ASJC Scopus subject areas

  • General Computer Science


Dive into the research topics of 'Galapagos: Model-driven discovery of end-to-end application-storage relationships in distributed systems'. Together they form a unique fingerprint.

Cite this