Difference between revisions of "Benchmark Extension"

From Schema Evolution
Jump to: navigation, search
(GridCC)
(GridCC)
Line 182: Line 182:
 
The [http://www.gridcc.org GRIDCC] is a three-year project funded by the European Commission. Its goal is integrating instruments and sensors with the traditional Grid resources. The GRIDCC middleware is being designed bearing in mind use cases from a very diverse set of applications, and as the result, the GRIDCC architecture provides access to the instruments in as generic a way as possible. GRIDCC is also developing an adaptable user interface and a mechanism for executing complex workflows in order to increase both the usability and the usefulness of the system. The new middleware is incorporated into significant applications that will allow the software validation in terms both of functionality and quality of service. The pilot application this paper focuses on is applying GRIDCC to support Remote Operations of the ELETTRA synchrotron radiation facility. We describe the results of implementing via GRIDCC complex workflows involved in the both routine operations and troubleshooting scenarios. In particular, the implementation of an orbit correction feedback shows the level of integration of instruments and traditional Grid resources which can be reached using the GRIDCC middleware.  
 
The [http://www.gridcc.org GRIDCC] is a three-year project funded by the European Commission. Its goal is integrating instruments and sensors with the traditional Grid resources. The GRIDCC middleware is being designed bearing in mind use cases from a very diverse set of applications, and as the result, the GRIDCC architecture provides access to the instruments in as generic a way as possible. GRIDCC is also developing an adaptable user interface and a mechanism for executing complex workflows in order to increase both the usability and the usefulness of the system. The new middleware is incorporated into significant applications that will allow the software validation in terms both of functionality and quality of service. The pilot application this paper focuses on is applying GRIDCC to support Remote Operations of the ELETTRA synchrotron radiation facility. We describe the results of implementing via GRIDCC complex workflows involved in the both routine operations and troubleshooting scenarios. In particular, the implementation of an orbit correction feedback shows the level of integration of instruments and traditional Grid resources which can be reached using the GRIDCC middleware.  
  
 
+
Number of Schema Versions: '''7'''
  
 
SVN for the MySQL DB Schema: http://sadgw.lnl.infn.it:8000/cgi-cvs/gridCC/framework/installation/configuration/databases/mysql/mysqlRunNumber.sql?sortby=date&only_with_tag=MAIN
 
SVN for the MySQL DB Schema: http://sadgw.lnl.infn.it:8000/cgi-cvs/gridCC/framework/installation/configuration/databases/mysql/mysqlRunNumber.sql?sortby=date&only_with_tag=MAIN

Revision as of 07:19, 27 October 2008

This section report the temporary results of an ongoing effort aimed at extending the existing dataset. The data available must be considered raw material to be used "as is".

Contents

CMS and Wiki

MediaWiki Schema Evolution

This is an update of the schema history to the current 05/23/2008

The SVN revision of the SQL script of the MediaWiki schema is available at:

The following .tar.gz file:

contains a dump of all the revisions (194) of the schema and few simple scripts that can be used for:

  • re-download an updated set of schemas from the SVN repository
  • batch install all the schema versions in a MySQL system
  • batch remove all the schema versions from a MySQL system
  • compute a simple set of statistics


Joomla! 1.5 Schema Evolution

Joomla! is an award-winning Content Management System (CMS) that will help you build websites and other powerful online applications. Best of all, Joomla! is an open source solution that is freely available to everybody.

The SVN revision of the SQL script of the Joomla! 1.5 schema is available at:

The following .tar.gz file:

contains a dump of all the revisions (46) of the schema and few simple scripts that can be used for:

  • re-download an updated set of schemas from the SVN repository
  • batch install all the schema versions in a MySQL system
  • batch remove all the schema versions from a MySQL system
  • (todo) compute a simple set of statistics

TikiWiki Schema Evolution

TikiWiki (Tiki) is your Groupware/CMS (Content Management System) solution. Tiki has the features you need:

  • Wikis (like Wikipedia)
  • Forums (like phpBB)
  • Blogs (like WordPress)
  • Articles (like Digg)
  • Image Gallery (like Flickr)
  • Map Server (like Google Maps)
  • Link Directory (like DMOZ)
  • Multilingual (like Babel Fish)
  • Bug tracker (like Bugzilla)
  • Free source software (LGPL)

The SVN revision of the SQL script of the TikiWiki schema is available at:

The following .tar.gz file:

contains a dump of all the revisions (152) of the schema and few simple scripts that can be used for:

  • re-download an updated set of schemas from the SVN repository
  • batch install all the schema versions in a MySQL system
  • batch remove all the schema versions from a MySQL system
  • (todo) compute a simple set of statistics

XOOPS Dynamic Web CMS

XOOPS is a dynamic web content management system written in PHP for the MySQL database. Its object orientation makes it an ideal tool for developing small or large community websites, intra company and corporate portals, weblogs and much more.

Popularity: 6,559,127 download from sourceforge at 05/22/2008

The SVN revision of the SQL script of the TikiWiki schema is available at:

The following .tar.gz file:

contains a dump of all the revisions (14) of the schema and few simple scripts that can be used for:

  • re-download an updated set of schemas from the SVN repository
  • batch install all the schema versions in a MySQL system
  • batch remove all the schema versions from a MySQL system
  • (todo) compute a simple set of statistics

Coppermine Photo Gallery:

Coppermine is an easily set-up, fast, feature-rich photo gallery script with mySQL database, user management, private galleries, automatic thumbnail creation, ecard feature and a template system for easy customization to match the rest of a site.

Popularity: 4,681,872 download from sourceforge at 05/22/2008

The SVN revision of the SQL script of the TikiWiki schema is available at:

https://coppermine.svn.sourceforge.net/svnroot/coppermine/trunk/cpg1.5.x/sql/schema.sql

The following .tar.gz file:

contains a dump of all the revisions (69) of the schema and few simple scripts that can be used for:

  • re-download an updated set of schemas from the SVN repository
  • batch install all the schema versions in a MySQL system
  • batch remove all the schema versions from a MySQL system
  • (todo) compute a simple set of statistics

TYPO3 Content Management Framework

TYPO3 is an enterprise class Web CMS written in PHP/MySQL. It's designed to be extended with custom written backend modules and frontend libraries for special functionality. It has very powerful integration of image manipulation.

Popularity: 3,277,323 download from sourceforge at 05/22/2008

The SVN revision of the SQL script of the TikiWiki schema is available at:

https://typo3.svn.sourceforge.net/svnroot/typo3/TYPO3core/trunk/t3lib/stddb/tables.sql

The following .tar.gz file:

contains a dump of all the revisions (39) of the schema and few simple scripts that can be used for:

  • re-download an updated set of schemas from the SVN repository
  • batch install all the schema versions in a MySQL system
  • batch remove all the schema versions from a MySQL system
  • (todo) compute a simple set of statistics

Scientific Databases

GrainGene

The GrainGenes 2.0 is a DB for Triticeae and Avena, releasing the current schema of the DB Backend available at: http://wheat.pw.usda.gov/ggmigration/gg_schema_mysql/

An on-line interface to formulate SQL queries to the DB: http://wheat.pw.usda.gov/cgi-bin/graingenes/sql.cgi?pre=0

UCSC Genome Bioinformatics

The UCSC database is a MySQL based project. http://genome.ucsc.edu/

BioSQL

BioSQL is a joint effort between the OBF projects (BioPerl, BioJava etc) to support a shared database schema for storing sequence data. In theory, you could load a GenBank file into the database with BioPerl, then using Biopython extract this from the database as a record object with featues - and get more or less the same thing as if you had loaded the GenBank file directly as a SeqRecord using SeqIO.

This is a promising source of data for our benchmark!

SVN: http://code.open-bio.org/svnweb/index.cgi/biosql/view/biosql-schema/trunk/sql/biosqldb-mysql.sql

The schema we collect at 05 Sep. 2008 are 46 and are available here: [1]

GUS

The Genomics Unified Schema (GUS) is an extensive relational database schema and associated application framework designed to store, integrate, analyze and present functional genomics data. The GUS schema supports a wide range of data types including genomics, gene expression, transcript assemblies, proteomics and others. It emphasizes standards-based ontologies and strong-typing.

The GUS Application Framework offers an object-relational layer and a Plugin API used to rapidly create robust data loading programs for diverse data sources. The GUS distribution includes plugins for standard data sources. The GUS Web Development Kit (WDK) is a rich environment for efficiently designing sophisticated query-based websites with little programming required.

Their about page: http://www.gusdb.org/about.php The SVN: https://www.cbil.upenn.edu/svn/gus/

NCBO

The National Center for Biomedical Ontology is a consortium of leading biologists, clinicians, informaticians, and ontologists who develop innovative technology and methods allowing scientists to create, disseminate, and manage biomedical information and knowledge in machine-processable form.

In this Context they use relational DB backend.

The SVN: http://smi-protege.stanford.edu/repos/cbio/ncbo/trunk/conf/ncbo_tables.sql


Open EMR

OpenEMR is a free medical practice management, electronic medical records, prescription writing, and medical billing application. These programs are also referred to as electronic health records. OpenEMR is licensed under the General Gnu Public License (General GPL). It is a free open source replacement for medical applications such as Medical Manager, Health Pro, and Misys. It features support for EDI billing to clearing houses such as MedAvant and ZirMED using ANSI X12. Medical claim and accounts receivable are accomplished through SQL-Ledger, which has been customized. Calendar features include categories for appointment types, colors associated with appointment types, repeating appointments, and the ability to restrict appointments based on type. There are customizable medical encounter forms, support for voice recognition software, and electronic or scanned digital document management for records.

The homepage: http://www.oemr.org/ The SourceForge project URL: http://sourceforge.net/projects/openemr/

Genomic DB Survey

Another relevant source is [2] where Erika De Francesco and Simona Rombo provide a survey of almost 80 genomic databases.


CERN Scientific DB

In this subsection we collect datasets coming from the CERN research center.


GridCC

The GRIDCC is a three-year project funded by the European Commission. Its goal is integrating instruments and sensors with the traditional Grid resources. The GRIDCC middleware is being designed bearing in mind use cases from a very diverse set of applications, and as the result, the GRIDCC architecture provides access to the instruments in as generic a way as possible. GRIDCC is also developing an adaptable user interface and a mechanism for executing complex workflows in order to increase both the usability and the usefulness of the system. The new middleware is incorporated into significant applications that will allow the software validation in terms both of functionality and quality of service. The pilot application this paper focuses on is applying GRIDCC to support Remote Operations of the ELETTRA synchrotron radiation facility. We describe the results of implementing via GRIDCC complex workflows involved in the both routine operations and troubleshooting scenarios. In particular, the implementation of an orbit correction feedback shows the level of integration of instruments and traditional Grid resources which can be reached using the GRIDCC middleware.

Number of Schema Versions: 7

SVN for the MySQL DB Schema: http://sadgw.lnl.infn.it:8000/cgi-cvs/gridCC/framework/installation/configuration/databases/mysql/mysqlRunNumber.sql?sortby=date&only_with_tag=MAIN

Personal tools