Difference between revisions of "Prima"

From Schema Evolution
Jump to: navigation, search
(Performance Issues)
Line 1: Line 1:
PRIMA is the name of a system that support management and querying of transaction-time database with evolving schema over history. PRIMA is an acronym for Panta Rhei Information Management and Archival.
+
PRIMA is a system that support management and querying of transaction-time database with evolving schema over history. PRIMA is an acronym for Panta Rhei Information Management and Archival.
  
 
== Overview ==
 
== Overview ==

Revision as of 15:49, 13 June 2008

PRIMA is a system that support management and querying of transaction-time database with evolving schema over history. PRIMA is an acronym for Panta Rhei Information Management and Archival.

Contents

Overview

The old problem of managing the history of database information is now made more urgent and complex by fast-spreading web information systems. Indeed, systems such as Wikipedia are faced with the challenge of managing the history of their databases in the face of intense database schema evolution. Our PRIMA system addresses this difficult problem by introducing two key pieces of new technology. The first is a method for publishing the history of a relational database in XML, whereby the evolution of the schema and its underlying database are given a unified representation. This temporally grouped representation makes it easy to formulate sophisticated historical queries on any given schema version using standard XQuery. The second key piece of technology provided by PRIMA is that schema evolution is transparent to the user: she writes queries against the current schema while retrieving the data from one or more schema versions. The system then performs the labor-intensive and error-prone task of rewriting such queries into equivalent ones for the appropriate versions of the schema. This feature is particularly relevant for historical queries spanning over potentially hundreds of different schema versions. The latter one is realized by (i) introducing Schema Modification Operators (SMO)s to represent the mappings between successive schema versions and (ii) an XML integrity constraint language (XIC) to efficiently rewrite the queries using the constraints established by the SMOs. The scalability of the approach has been tested against both synthetic data and real-world data from the Wikipedia DB schema evolution history.

Data set

Employee Database Schema Evolution: Synthetic data

Wikipedia Database Schema Evolution: Real-world data

H-PRIMA

PRIMA was initially based on XML DB that execute XQuery queries. In order to improve the efficiency of the system, we are pursuing RDBMS-based system, which we call H-PRIMA.

Publications

"Managing and querying transaction-time databases under schema evolution" Hyun J. Moon, Carlo A. Curino, Alin Deutsch, Chien-Yi Hou, and Carlo Zaniolo. Accepted for publication at Very Large Data Base VLDB, 2008. (PDF will be available soon)

Personal tools