CINXE.COM
Data Conversion Area |
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en"> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <title>Data Conversion Area | </title> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <link rel="shortcut icon" href="/sites/all/themes/acquia_marina/favicon.ico" type="image/x-icon" /> <link type="text/css" rel="stylesheet" media="all" href="/modules/node/node.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/modules/system/defaults.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/modules/system/system.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/modules/system/system-menus.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/modules/user/user.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/modules/cck/theme/content-module.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/modules/fckeditor/fckeditor.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/themes/acquia_marina/style.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/themes/acquia_marina/icons.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/themes/acquia_marina/dblog.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/themes/acquia_marina/theme_settings/fixed.css?Y" /> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/themes/acquia_marina/theme_settings/fonts_1.css?Y" /> <!--[if IE 7]> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/themes/acquia_marina/ie7-fixes.css" /> <![endif]--> <!--[if lte IE 6]> <link type="text/css" rel="stylesheet" media="all" href="/sites/all/themes/acquia_marina/ie6-fixes.css" /> <![endif]--> <script type="text/javascript" src="/misc/jquery.js?Y"></script> <script type="text/javascript" src="/misc/drupal.js?Y"></script> <script type="text/javascript" src="/sites/all/themes/acquia_marina/script.js?Y"></script> <script type="text/javascript"> <!--//--><![CDATA[//><!-- jQuery.extend(Drupal.settings, { "basePath": "/" }); //--><!]]> </script> </head> <body class="not-logged-in not-front full-node node-type-page layout-first-main preface-first-middle-last"> <div id="page" class="clearfix"> <div id="skip"> <a href="#main-content">Skip to Main Content</a> </div> <div id="header-wrapper"> <div id="header" class="clearfix"> <div id="header-first"> <h1><a href="/" title="Home">D-NET LAB</a></h1> </div><!-- /header-first --> <div id="header-middle"> </div><!-- /header-middle --> <div id="header-last"> </div><!-- /header-last --> </div><!-- /header --> </div><!-- /header-wrapper --> <div id="primary-menu-wrapper" class="clearfix"> <div id="primary-menu"> <ul class="menu"><li class="expanded first"><a href="/node/1" title="Overview">Overview</a><ul class="menu"><li class="leaf first"><a href="/node/4" title="What is D-Net for?">What is D-NET for?</a></li> <li class="leaf"><a href="/node/3" title="General Concepts">General Concepts</a></li> <li class="leaf"><a href="/node/5" title="System Architecture">System Architecture</a></li> <li class="leaf last"><a href="/node/34" title="Distributed Applications Vs Service Oriented Architectures">Why SOA?</a></li> </ul></li> <li class="expanded active-trail"><a href="/node/8" title="Software">D-NET Software</a><ul class="menu"><li class="leaf first"><a href="/node/11" title="Enabling Area">Enabling Service Area</a></li> <li class="leaf"><a href="/node/10" title="Data Area">Data Mediation Area</a></li> <li class="leaf active-trail"><a href="/node/43" title="Data Conversion Area" class="active">Data Conversion Area</a></li> <li class="leaf"><a href="/node/44" title="Data Storage and Indexing Area">Data Storage and Indexing Area</a></li> <li class="leaf"><a href="/node/42" title="Data Curation and Enrichment Area">Data Curation and Enrichment Area</a></li> <li class="leaf"><a href="/node/9" title="End User Functionality Area">Data Provision Area</a></li> <li class="leaf last"><a href="/node/32" title="Service typologies">Service patterns</a></li> </ul></li> <li class="expanded"><a href="/node/24" title="D-Net Framework">D-NET Resource Framework</a><ul class="menu"><li class="leaf first"><a href="/node/29" title="Resource registration and resource profiles">Resource registration</a></li> <li class="leaf"><a href="/node/30" title="Application patterns">Service interaction patterns</a></li> <li class="leaf"><a href="/node/31" title="Resource Model">Resource Model</a></li> <li class="leaf last"><a href="/node/33" title="Service Orchestration and blackboard protocol">Blackboard protocol</a></li> </ul></li> <li class="expanded"><a href="/node/12" title="Community">Community</a><ul class="menu"><li class="leaf first"><a href="/node/15" title="Sponsors">Technical Partners</a></li> <li class="leaf"><a href="/node/22" title="D-Net Installations">D-NET Installations</a></li> <li class="leaf last"><a href="/node/7" title="Publications">Publications</a></li> </ul></li> <li class="expanded"><a href="/node/16" title="Getting Started">Getting Started</a><ul class="menu"><li class="leaf first"><a href="/node/41" title="Downlaod">Download</a></li> <li class="leaf"><a href="/node/18" title="Installation">Installation</a></li> <li class="leaf"><a href="/node/20" title="License">License</a></li> <li class="leaf"><a href="/node/40" title="Documentation">Documentation</a></li> <li class="collapsed last"><a href="/node/37" title="FAQ">FAQ</a></li> </ul></li> <li class="leaf last"><a href="/node/46" title="Credits">Credits</a></li> </ul> </div><!-- /primary_menu --> </div><!-- /primary-menu-wrapper --> <div id="preface"> <div id="preface-wrapper" class="prefaces-3 clearfix"> <div id="preface-first" class="column"> <!-- start block.tpl.php --> <div class="block-wrapper odd"> <div id="block-block-4" class="block block-block"> <div class="content"> <p><img width="240" height="210" alt="" src="/sites/default/files/image/d-netAgain.png" /></p> </div> </div> </div> <!-- /end block.tpl.php --> </div><!-- /preface-first --> <div id="preface-middle" class="column"> <!-- start block.tpl.php --> <div class="block-wrapper odd"> <div id="block-block-5" class="block block-block"> <h2 class="title block-title pngfix">Getting Started</h2> <div class="content"> <p><a href="http://www.d-net.research-infrastructures.eu/node/41">Download</a><br /> <a href="http://www.d-net.research-infrastructures.eu/?q=node/18">Installation</a><br /> <a href="http://www.d-net.research-infrastructures.eu/?q=node/20">License</a></p> </div> </div> </div> <!-- /end block.tpl.php --> </div><!-- /preface-middle --> <div id="preface-last" class="column"> <!-- start block.tpl.php --> <div class="block-wrapper odd"> <div id="block-block-3" class="block block-block"> <h2 class="title block-title pngfix">Overview</h2> <div class="content"> <p><a href="http://www.d-net.research-infrastructures.eu/?q=node/3">General Concepts</a><br /> <a href="http://www.d-net.research-infrastructures.eu/?q=node/5">System Architecture</a><br /> <a href="http://www.d-net.research-infrastructures.eu/?q=node/6">Resource Model</a><br /> <a href="http://www.d-net.research-infrastructures.eu/?q=node/7">Publications</a></p> </div> </div> </div> <!-- /end block.tpl.php --> </div><!-- /preface-last --> </div><!-- /preface-wrapper --> </div><!-- /preface --> <div id="main-wrapper"> <div id="main" class="clearfix"> <div id="breadcrumb"> <div class="breadcrumb"><a href="/">Home</a></div> </div><!-- /breadcrumb --> <div id="sidebar-first"> <!-- start block.tpl.php --> <div class="block-wrapper odd"> <!-- see preprocess_block() --> <div class="rounded-block"> <div class="rounded-block-top-left"></div> <div class="rounded-block-top-right"></div> <div class="rounded-outside"> <div class="rounded-inside"> <p class="rounded-topspace"></p> <div id="block-menu-primary-links" class="block block-menu"> <div class="block-icon pngfix"></div> <h2 class="title block-title pngfix">Main Menu</h2> <div class="content"> <ul class="menu"><li class="expanded first"><a href="/node/1" title="Overview">Overview</a><ul class="menu"><li class="leaf first"><a href="/node/4" title="What is D-Net for?">What is D-NET for?</a></li> <li class="leaf"><a href="/node/3" title="General Concepts">General Concepts</a></li> <li class="leaf"><a href="/node/5" title="System Architecture">System Architecture</a></li> <li class="leaf last"><a href="/node/34" title="Distributed Applications Vs Service Oriented Architectures">Why SOA?</a></li> </ul></li> <li class="expanded active-trail"><a href="/node/8" title="Software">D-NET Software</a><ul class="menu"><li class="leaf first"><a href="/node/11" title="Enabling Area">Enabling Service Area</a></li> <li class="leaf"><a href="/node/10" title="Data Area">Data Mediation Area</a></li> <li class="leaf active-trail"><a href="/node/43" title="Data Conversion Area" class="active">Data Conversion Area</a></li> <li class="leaf"><a href="/node/44" title="Data Storage and Indexing Area">Data Storage and Indexing Area</a></li> <li class="leaf"><a href="/node/42" title="Data Curation and Enrichment Area">Data Curation and Enrichment Area</a></li> <li class="leaf"><a href="/node/9" title="End User Functionality Area">Data Provision Area</a></li> <li class="leaf last"><a href="/node/32" title="Service typologies">Service patterns</a></li> </ul></li> <li class="expanded"><a href="/node/24" title="D-Net Framework">D-NET Resource Framework</a><ul class="menu"><li class="leaf first"><a href="/node/29" title="Resource registration and resource profiles">Resource registration</a></li> <li class="leaf"><a href="/node/30" title="Application patterns">Service interaction patterns</a></li> <li class="leaf"><a href="/node/31" title="Resource Model">Resource Model</a></li> <li class="leaf last"><a href="/node/33" title="Service Orchestration and blackboard protocol">Blackboard protocol</a></li> </ul></li> <li class="expanded"><a href="/node/12" title="Community">Community</a><ul class="menu"><li class="leaf first"><a href="/node/15" title="Sponsors">Technical Partners</a></li> <li class="leaf"><a href="/node/22" title="D-Net Installations">D-NET Installations</a></li> <li class="leaf last"><a href="/node/7" title="Publications">Publications</a></li> </ul></li> <li class="expanded"><a href="/node/16" title="Getting Started">Getting Started</a><ul class="menu"><li class="leaf first"><a href="/node/41" title="Downlaod">Download</a></li> <li class="leaf"><a href="/node/18" title="Installation">Installation</a></li> <li class="leaf"><a href="/node/20" title="License">License</a></li> <li class="leaf"><a href="/node/40" title="Documentation">Documentation</a></li> <li class="collapsed last"><a href="/node/37" title="FAQ">FAQ</a></li> </ul></li> <li class="leaf last"><a href="/node/46" title="Credits">Credits</a></li> </ul> </div> </div> <p class="rounded-bottomspace"></p> </div><!-- /rounded-inside --> </div> <div class="rounded-block-bottom-left"></div> <div class="rounded-block-bottom-right"></div> </div><!-- /rounded-block --> </div> <!-- /end block.tpl.php --> <!-- start block.tpl.php --> <div class="block-wrapper even"> <!-- see preprocess_block() --> <div class="rounded-block"> <div class="rounded-block-top-left"></div> <div class="rounded-block-top-right"></div> <div class="rounded-outside"> <div class="rounded-inside"> <p class="rounded-topspace"></p> <div id="block-user-1" class="block block-user"> <div class="block-icon pngfix"></div> <h2 class="title block-title pngfix">Navigation</h2> <div class="content"> <ul class="menu"><li class="leaf first last"><a href="/login" title="">Login</a></li> </ul> </div> </div> <p class="rounded-bottomspace"></p> </div><!-- /rounded-inside --> </div> <div class="rounded-block-bottom-left"></div> <div class="rounded-block-bottom-right"></div> </div><!-- /rounded-block --> </div> <!-- /end block.tpl.php --> </div><!-- /sidebar-first --> <div id="content-wrapper"> <div id="content"> <a name="main-content" id="main-content"></a> <div id="content-inner"> <h1 class="title"> Data Conversion Area </h1> <div id="content-content"> <!-- start node.tpl.php --> <div id="node-43" class="node odd full-node node-type-page"> <div class="meta"> </div> <div class="content"> <p>The <strong>Data Conversion Area</strong> contains services used to convert data from one domain into data of another domain. The services can deal with metadata content (in XML format), to be converted from one format onto another format (i.e., XML schema to XML schema), or woth generic file content, e.g., images to thumbnails.<br /> Metadata records can be applied structural mappings or semantics mappings (e.g., from input vocabulary to output vocabulary). Such transformations can be one-to-one, i.e., an XML record is transformed into another XML record, or one-to-many, an XML record is transformed into a number of XML records, and vice versa.<br /> More specifically, the services are:</p> <ul> <li><strong>Metadata Transformation service</strong>. It is used to convert one XML file onto another XML file by following a number of transformation rules (which can be specified through a user interface)</li> <li><strong>Metadata Cleaner service</strong>. It is used to convert the values of XML elements from one format to another (e.g., data format converson) or from one vocabulary to another.</li> <li><strong>Metadata Unpackaging service</strong>. It is used to apply XSLT scripts to input XML records in order to obtain a set of (possibly interlinked) XML records. The service is also capable of packaging XML records to become one XML record.</li> <li><strong>Feature Extraction Service</strong>. It is used to process a list of incoming files (provided as a list of URLs) with a given algorithm, to produce an output file for each processed file. The Service is designed to be extended with new algorithms when required. Examples of processing algorithms are: extraction of text from PDF file, language detection from text file, extraction of thumbnails from images, extraction of text from images (OCR).</li> </ul> </div> </div> <!-- /#node-43 --> </div> </div><!-- /content-inner --> </div><!-- /content --> </div><!-- /content-wrapper --> <div id="footer" class="clearfix"> <!-- start block.tpl.php --> <div class="block-wrapper odd"> <div id="block-block-8" class="block block-block"> <div class="content"> <script type="text/javascript"> var _gaq = _gaq || []; _gaq.push(['_setAccount', 'UA-24837644-1']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script> </div> </div> </div> <!-- /end block.tpl.php --> </div><!-- /footer --> </div><!-- /main --> </div><!-- /main-wrapper --> </div><!-- /page --> </body> </html>