Difference between revisions of "Prism"

From Schema Evolution
Jump to: navigation, search
(Related Works)
Line 97: Line 97:
  
 
Some of the most relevant approaches to the general problem of schema evolution are the impact-minimizing methodology of  
 
Some of the most relevant approaches to the general problem of schema evolution are the impact-minimizing methodology of  
  \cite{Ra:2005vj}, the unified approach to application and database evolution of  
+
 
  \cite{1228375}, the application-code generation of  
+
 
  \cite{cleve2006} and the framework for metadata model management  of  
+
  <bibref f="defbib.bib">Ra:2005vj</bibref>, the unified approach to application and database evolution of  
  \cite{melnik03rondo} and the further contributions \cite{bernstein03applying, bernstein03data, velegrakis03mapping, yu05semantic}.
+
  <bibref f="defbib.bib">1228375</bibref>, the application-code generation of  
While these and other interesting attempts provide solid theoretical foundations and interesting methodological approaches, the lack of operational tools for graceful schema evolution observed by Roddick in  \cite{roddick95schema} remains largely unsolved twelve years later.
+
  <bibref f="defbib.bib">cleve2006</bibref> and the framework for metadata model management  of  
\PRISM\ represents, at the best of our knowledge, the most advanced attempt in this direction available to date.
+
  <bibref f="defbib.bib">melnik03rondo</bibref> and the further contributions <bibref f="defbib.bib">bernstein03applying, bernstein03data, velegrakis03mapping, yu05semantic</bibref>.
 +
While these and other interesting attempts provide solid theoretical foundations and interesting methodological approaches, the lack of operational tools for graceful schema evolution observed by Roddick in  <bibref f="defbib.bib">roddick95schema</bibref> remains largely unsolved twelve years later.
 +
'''PRISM''' represents, at the best of our knowledge, the most advanced attempt in this direction available to date.
  
 
The operational answer to the issue of schema evolution used by '''PRISM''' exploits some of the most recent results on mapping composition
 
The operational answer to the issue of schema evolution used by '''PRISM''' exploits some of the most recent results on mapping composition
  \cite{nash05composition}, mapping invertibility
+
  <bibref f="defbib.bib">nash05composition</bibref>, mapping invertibility
  \cite{fagin2007c}, and query rewriting   
+
  <bibref f="defbib.bib">fagin2007c</bibref>, and query rewriting   
  \cite{deutsch03mars}.
+
  <bibref f="defbib.bib">deutsch03mars</bibref>.
 
The SMO language used here captures the essence of existing works  
 
The SMO language used here captures the essence of existing works  
  \cite{BernsteinGMN08},  but extends them with functions,
+
  <bibref f="defbib.bib">BernsteinGMN08</bibref>,  but extends them with functions,
 
for expressing data type and semantic conversions.  
 
for expressing data type and semantic conversions.  
 
The translation between SMOs and Disjunctive Embedded Dependencies (DED) exploited here is similar to the incremental adaptation approach of  
 
The translation between SMOs and Disjunctive Embedded Dependencies (DED) exploited here is similar to the incremental adaptation approach of  
  \cite{velegrakis03mapping}, but achieves different goals.
+
  <bibref f="defbib.bib">velegrakis03mapping</bibref>, but achieves different goals.
 
The query rewriting portion of '''PRISM''' exploits theories and tools developed in the context of the MARS project  
 
The query rewriting portion of '''PRISM''' exploits theories and tools developed in the context of the MARS project  
  \cite{deutsch01ded,deutsch03mars}.  
+
  <bibref f="defbib.bib">deutsch01ded,deutsch03mars</bibref>.  
The theories of mapping composition studied in \cite{madhavan03composing, FaginPKT04,nash05composition, BernsteinGMN08}, and the concept of invertibility recently investigated by Fagin et al. in  
+
The theories of mapping composition studied in <bibref f="defbib.bib">madhavan03composing, FaginPKT04,nash05composition, BernsteinGMN08</bibref>, and the concept of invertibility recently investigated by Fagin et al. in  
  \cite{fagin2007c,fagin2007a} support the notion of SMO composition and inversion.
+
  <bibref f="defbib.bib">fagin2007c,fagin2007a</bibref> support the notion of SMO composition and inversion.
  
 
  The big players in the world of commercial DBMSs have been mainly focusing on reducing the downtime when the schema is updated  
 
  The big players in the world of commercial DBMSs have been mainly focusing on reducing the downtime when the schema is updated  
  \cite{oraclewhitepaper} and on assistive design tools  \cite{db2changemanagementexpert}, and lack the automatic query rewriting features provided in \PRISM.  
+
  <bibref f="defbib.bib">oraclewhitepaper</bibref> and on assistive design tools  \cite{db2changemanagementexpert</bibref>, and lack the automatic query rewriting features provided in '''PRISM'''.  
  Other tools of interest are \cite{chimera} and LiquiBase\footnote{Available on-line: \url{http://www.liquibase.org/}}.
+
  Other tools of interest are <bibref f="defbib.bib">chimera</bibref> and LiquiBase (Available on-line: http://www.liquibase.org/).
  
 
Further related works include the results on mapping information preservation by Barbosa et al.  
 
Further related works include the results on mapping information preservation by Barbosa et al.  
  \cite{BarbosaFM05}, the ontology-based repository of  
+
  <bibref f="defbib.bib">BarbosaFM05</bibref>, the ontology-based repository of  
  \cite{bounif2006}, the schema versioning approaches of  
+
  <bibref f="defbib.bib">bounif2006</bibref>, the schema versioning approaches of  
  \cite{jagadish97}. XML schema evolution has been addressed in  \cite{MoroML07} by means of a guideline-driven approach. Object-oriented schema evolution has been investigated in  
+
  <bibref f="defbib.bib">jagadish97</bibref>. XML schema evolution has been addressed in  <bibref f="defbib.bib">MoroML07</bibref> by means of a guideline-driven approach. Object-oriented schema evolution has been investigated in  
  \cite{Galante:2005eq}.  
+
  <bibref f="defbib.bib">Galante:2005eq</bibref>.  
 
In the context of data warehouse X-TIME represents an interesting step toward schema versioning by means of the notion of augmenting schema  
 
In the context of data warehouse X-TIME represents an interesting step toward schema versioning by means of the notion of augmenting schema  
  \cite{GolfarelliLRV04,Rizzi07}.  
+
  <bibref f="defbib.bib">GolfarelliLRV04,Rizzi07</bibref>.  
 
'''PRISM''' differs form all the above in terms of both goals and techniques.
 
'''PRISM''' differs form all the above in terms of both goals and techniques.

Revision as of 17:13, 29 March 2009

PRISM-LOGO.png

PRISM is a joint project of Politecnico di Milano and University of California, Los Angeles. The main investigators are:

Carlo A. Curino (contact author): [1]

Hyun J. Moon: [2]

Carlo Zaniolo: [3]

Contents

Abstract

Supporting graceful schema evolution represents an unsolved problem for traditional information systems that is further exacerbated in web information systems, such as Wikipedia and public scientific databases: in these projects based on multiparty cooperation the frequency of database schema changes has increased while tolerance for downtimes has nearly disappeared. As of today, schema evolution remains an error-prone and time-consuming undertaking, because the DB Administrator (DBA) lacks the methods and tools needed to manage and automate this endeavor by (i) predicting and evaluating the effects of the proposed schema changes, (ii) rewriting queries and applications to operate on the new schema, and (iii) migrating the database.

Our PRISM system takes a big first step toward addressing this pressing need by providing: (i) a language of Schema Modification Operators (SMO) to express concisely complex schema changes, (ii) tools that allow the DBA to evaluate the effects of such changes, (iii) optimized translation of old queries to work on the new schema version, (iv) automatic data migration, and (v) full documentation of intervened changes as needed to support data provenance, database flash back, and historical queries. PRISM solves these problems by integrating recent theoretical advances on mapping composition and invertibility, into a design that also achieves usability and scalability. Wikipedia and its 170+ schema versions provided an invaluable testbed for validating PRISM tools and their ability to support legacy queries.

On-line Demo

Prism: a tool for schema evolution support

While the actual demo is available online at: Prism a tool for schema evolution support. The Demo is still under development and has limited functionalities w.r.t. the internal prototype, but we are working on it... stay tuned!

Screencasts

There is a Video of the Demo available at: http://yellowstone.cs.ucla.edu/schema-evolution/documents/Prism-Demo.mov

There is a Video of the VLDB presentation at: http://yellowstone.cs.ucla.edu/schema-evolution/documents/prism-vldb2008.mov

Publications

"Graceful database schema evolution: the prism workbench", Carlo A. Curino, Hyun J. Moon, Carlo Zaniolo, to appear in VLDB 2008 PDF

"Information Systems Integration and Evolution: Ontologies at Rescue", Carlo A. Curino, Letizia Tanca, Carlo Zaniolo, STSM 2008 PDF

"The PRISM Workwench: Database Schema Evolution Without Tears", Carlo A. Curino, Hyun J. Moon, MyungWon Ham, Carlo Zaniolo, DEMO paper at ICDE 2009.

Bibtex

 @INPROCEEDINGS{curino-vldb2008a,
   author = {Carlo A. Curino and Hyun J. Moon and Carlo Zaniolo},
   title = {Graceful database schema evolution: the prism workbench},
   booktitle = {Very Large Data Base (VLDB)},
   year = {2008}
 }
 @INPROCEEDINGS{curino-stsm2008,
   author = {Carlo A. Curino and Letizia Tanca and Carlo Zaniolo},
   title = {Information Systems Integration and Evolution: Ontologies at Rescue},
   booktitle = {International Workshop on Semantic Technologies in System Maintenance (STSM)},
   year = {2008}
 }

Commercial Competitors

Here we list a series of commercial tools which partially tackle the problem of Schema Evolution. None of this support automatic query rewriting and data migration impact analysis as done by prototype PRISM. Most of them focus on comparing schema versions and creating report of the differences. Another common features is analyzing the impact of the change w.r.t. other DB objects such as store procedures, views, constraints, but none of this provide analysis of the impact on query and applications.

  • DB2 Change Management Expert [4]
  • Oracle Change Management Pack [5]
  • MySQL Workbench for Schema Change [6]
  • SwisSQL DBChangeManager (MSSQL) (DB compare and synchronization) [7]
  • Idera SQL Change [8]
  • Embarcadero Change Manager [9]
  • Red-Gate SQL Compare [10]
  • Best Soft Tool [11]
  • Toad® DBA Suite [12] [13]
  • Aldon lifecycle management solution [14]

Useful Links

  • Management-hub.com [15]


Related Works

Some of the most relevant approaches to the general problem of schema evolution are the impact-minimizing methodology of


<bibref f="defbib.bib">Ra:2005vj</bibref>, the unified approach to application and database evolution of 
<bibref f="defbib.bib">1228375</bibref>, the application-code generation of 
<bibref f="defbib.bib">cleve2006</bibref> and the framework for metadata model management  of 
<bibref f="defbib.bib">melnik03rondo</bibref> and the further contributions <bibref f="defbib.bib">bernstein03applying, bernstein03data, velegrakis03mapping, yu05semantic</bibref>.

While these and other interesting attempts provide solid theoretical foundations and interesting methodological approaches, the lack of operational tools for graceful schema evolution observed by Roddick in <bibref f="defbib.bib">roddick95schema</bibref> remains largely unsolved twelve years later. PRISM represents, at the best of our knowledge, the most advanced attempt in this direction available to date.

The operational answer to the issue of schema evolution used by PRISM exploits some of the most recent results on mapping composition

<bibref f="defbib.bib">nash05composition</bibref>, mapping invertibility
<bibref f="defbib.bib">fagin2007c</bibref>, and query rewriting  
<bibref f="defbib.bib">deutsch03mars</bibref>.

The SMO language used here captures the essence of existing works

<bibref f="defbib.bib">BernsteinGMN08</bibref>,  but extends them with functions,

for expressing data type and semantic conversions. The translation between SMOs and Disjunctive Embedded Dependencies (DED) exploited here is similar to the incremental adaptation approach of

<bibref f="defbib.bib">velegrakis03mapping</bibref>, but achieves different goals.

The query rewriting portion of PRISM exploits theories and tools developed in the context of the MARS project

<bibref f="defbib.bib">deutsch01ded,deutsch03mars</bibref>. 

The theories of mapping composition studied in <bibref f="defbib.bib">madhavan03composing, FaginPKT04,nash05composition, BernsteinGMN08</bibref>, and the concept of invertibility recently investigated by Fagin et al. in

<bibref f="defbib.bib">fagin2007c,fagin2007a</bibref> support the notion of SMO composition and inversion.
The big players in the world of commercial DBMSs have been mainly focusing on reducing the downtime when the schema is updated 
<bibref f="defbib.bib">oraclewhitepaper</bibref> and on assistive design tools   \cite{db2changemanagementexpert</bibref>, and lack the automatic query rewriting features provided in PRISM. 
Other tools of interest are <bibref f="defbib.bib">chimera</bibref> and LiquiBase (Available on-line: http://www.liquibase.org/).

Further related works include the results on mapping information preservation by Barbosa et al.

<bibref f="defbib.bib">BarbosaFM05</bibref>, the ontology-based repository of 
<bibref f="defbib.bib">bounif2006</bibref>, the schema versioning approaches of 
<bibref f="defbib.bib">jagadish97</bibref>. XML schema evolution has been addressed in  <bibref f="defbib.bib">MoroML07</bibref> by means of a guideline-driven approach. Object-oriented schema evolution has been investigated in 
<bibref f="defbib.bib">Galante:2005eq</bibref>. 

In the context of data warehouse X-TIME represents an interesting step toward schema versioning by means of the notion of augmenting schema

<bibref f="defbib.bib">GolfarelliLRV04,Rizzi07</bibref>. 

PRISM differs form all the above in terms of both goals and techniques.

Personal tools