OAI-PMH Validator & data extractor
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Validator & data extractor Tool
Download and evaluate XML metadata from OAI-PMH enabled digital libraries. <ul class="nav nav-tabs"> <li class="active"><a href="#uri" data-toggle="tab"><i class="fa fa-check-circle-o"></i> Validate URL</a></li> <li><a href="#file" data-toggle="tab"><i class="fa fa-check"></i> Validate By Direct Input</a></li> <li><a href="#batch_download" data-toggle="tab"><i class="fa fa-download"></i> Download XML</a></li> <li><a href="#api" data-toggle="tab" title="Application Programming Interface"><i class="fa fa-code"></i> REST API</a></li> <li><a href="#about" data-toggle="tab" title=""><i class="fa fa-info-circle"></i> About</a></li> <li><a href="#ref" data-toggle="tab" title="References in scientific publications"><i class="fa fa-bookmark"></i> References</a></li> </ul> <div class="tab-content"> <div class="tab-pane active" id="uri"> <div class="row"> <div class="col-sm-12 form-horizontal" role="form" id="urlForm"> <div class="form-group"> <label for="url" class="col-sm-2 control-label">OAI-PMH URL:</label> <div class="col-sm-7"> <div class="input-group"> <input type="url" id="url" name="url" autofocus required placeholder="OAI-PMH URL" class="form-control" title="Please enter an OAI-PMH URL and click Go" /> <span class="input-group-btn"> <button id="goButton" class="btn" type="button">Check now <i class="fa fa-angle-double-right"></i></button> </span> </div> <small class="text-muted">Example OAI-PMH URL:</small> </div> </div> </div> <div class="col-sm-9 col-sm-offset-2"></div> </div> <div class="row"> <div class="col-md-2" id="vertical-tabs"> <ul class="nav nav-pills nav-stacked" id="availableCommands"> </ul> </div> <div id="oaipmhResult" class="col-md-10" style ="display: none;"> <a href="" target="_blank" id="help-button" class="pull-right"><i class="fa fa-question-circle"></i> Help</a> <ul id="subtabs" class="nav nav-tabs"> <li class="active"><a href="#validationTab" class="validation" data-toggle="tab"><i class="fa fa-check"></i> <span id="validationTabLabel">Validation</span> <span class="badge" id="validationCount"></span></a></li> <li><a 
href="#resultTab" data-toggle="tab"><i class="fa fa-list"></i> XML Result <span class="badge" id="resultTabSize"></span></a></li> </ul> <div class="tab-content"> <div id="validationTab" class="tab-pane active validation_tab"> </div> <div id="resultTab" class="tab-pane"> <div class="actions"> <form name="downloadXMLForm" id="downloadXMLForm" method="post" action="/analysers/downloadXml" target="_blank"> <input type="hidden" name="data[xmlUrl]" id="xmlUrl" /> </form> <form name="printXMLForm" id="printXMLForm" method="post" action="/analysers/oaipmhRequest" target="_blank"> <input type="hidden" name="data[printUrl]" id="printUrl" /> </form> <a href="javascript:;" title="Download results in XML" onclick="javascript:$('#downloadXMLForm').submit()"><i class="fa fa-download"></i> Download XML</a> <a href="javascript:;" title="Print results" onclick="javascript:$('#printXMLForm').submit()"><i class="fa fa-print"></i> Print XML</a> </div> <pre class="source-code"></pre> </div> </div> </div> </div> </div> <div id="file" class="tab-pane validation_tab"> <div class="form-horizontal"> <div class="form-group"> <label class="col-sm-2 control-label">Please enter XML markup in order to validate:</label> <div class="col-sm-8"> <textarea rows="12" cols="50" name="data[directInput]" id="directInput" class="form-control"></textarea> </div> </div> <div class="form-group"> <div class="col-sm-offset-2 col-sm-10"> <button id="goDirectInputButton" class="btn" type="button"><i class="fa fa-check"></i> Validate</button> </div> </div> </div> <p id="directInputResults"></p> </div> <div id="batch_download" class="tab-pane"> <p>Download all the records from a digital library. 
Enter an OAI-PMH URL, select the metadata prefix, and review the results.</p> <div class="form-horizontal"> <div class="form-group"> <label class="col-sm-2 control-label">OAI-PMH URL:</label> <div class="col-sm-7"> <div class="input-group"> <input type="url" id="harvest_url" name="harvest_url" class="form-control" placeholder="OAI-PMH URL" /> <span class="input-group-btn"> <button id="goBatchButton" class="btn" type="button"><i class="fa fa-download"></i> Download records</button> </span> </div> </div> </div> <div class="form-group"> <label class="col-sm-2 control-label">Metadata prefix:</label> <div class="col-sm-5"> <select name="batchDownloadMetadataPrefix" id="batchDownloadMetadataPrefix" class="form-control"> <option value="datacite">DATACITE</option> <option value="ese">ESE</option> <option value="edm">EDM</option> <option value="heal">HEAL</option> <option value="lido">LIDO</option> <option value="marcxml">MARCXML</option> <option value="mets">METS</option> <option value="oai_agris">OAI_AGRIS</option> <option value="oai_dc" selected>OAI_DC</option> <option value="oai_datacite">OAI_DATACITE</option> <option value="oai_scolomfr">OAI_SCOLOMFR</option> <option value="rdf">RDF</option> <option value="unimarc">UNIMARC</option> </select> </div> </div> <p>Optional parameters:</p> <div class="form-group form-group-sm"> <label class="col-sm-2 control-label">Set:</label> <div class="col-sm-2"> <input type="text" id="set" name="set" class="form-control input-sm" /> </div> <label class="col-sm-1 control-label">From:</label> <div class="col-sm-2"> <input type="text" id="from" name="from" class="form-control input-sm" /> </div> <label class="col-sm-1 control-label">Until:</label> <div class="col-sm-2"> <input type="text" id="until" name="until" class="form-control input-sm" /> </div> </div> </div> <div class="row"> <div id="harvesting_results" class="col-sm-7 col-sm-offset-2 col-md-offset-2"></div> </div> </div> <div id="api" 
class="tab-pane"> <h2>REST API</h2> <p>This service is <strong>completely FREE</strong> for personal or academic use.</p> <p>Large-scale use of the service via its <strong>REST API</strong>, as well as technical and/or scientific support, is available for a fee.</p> <p>Please contact the author, <a href="" style="text-decoration: underline">Vangelis Banos</a>, to learn more.</p> </div> <div id="about" class="tab-pane"> <h2>About</h2> <p>The OAI-PMH validator and data extractor tool is a free service created by <a href="">Vangelis Banos</a>. The aim of this project is to support digital repository operators and developers by automating the harvesting and validation of OAI-PMH services. The author has accumulated 10+ years of relevant experience working on projects such as:</p> <ul> <li><a href=""></a> - The Hellenic aggregator for Europeana (2010 -),</li> <li><a href=""></a> - Greek digital libraries search engine (2006 - 2012),</li> <li><a href=""></a> - Open Access Digital Libraries search engine (2008 - 2014, now closed),</li> <li><a href=""></a> - National Documentation Centre of Greece Digital Library Interoperability Testing Service (2011 - 2015).</li> </ul> <h2>About OAI-PMH</h2> <p>OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) is a protocol developed by the <a href="">Open Archives Initiative</a>. It is used to harvest (or collect) the metadata descriptions of the records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.</p> <p>The protocol is usually just referred to as the OAI Protocol. <a href="">Read the full article on Wikipedia »</a></p> <h2>What is my OAI-PMH URL?</h2> <p>The OAI-PMH URL address of a digital library varies depending on the software it is based on. 
Here are some widely used software packages and their sample URLs.</p> <table class="table table-bordered"> <tr> <th>Software</th> <th>Sample URL</th> <th>Sample OAI-PMH URL</th> </tr> <tr> <td>DSpace version 1.5 and newer</td> <td><a href=""></a></td> <td><a href=""></a></td> </tr> <tr> <td>Open Journal Systems (OJS)</td> <td><a href=""></a></td> <td><a href=""></a></td> </tr> </table> <p>These URLs are just the defaults and can be changed by the library's administrator.</p> <h2>How does it work?</h2> <p>This tool is a web application. It runs on our servers and presents the final output to users. There is no need to install software on your computer; everything is done in a web browser.</p> <h2>What kind of validation tests are performed?</h2> <b>Generic Checks</b> <ol> <li>Check HTTP Response code.</li> <li>Check HTTP Response document content type.</li> <li>Check XML document file size.</li> <li>Check HTTP Request &amp; Response Time.</li> <li>Check XML document compliance against the OAI-PMH XML Schema <a href=""></a>.</li> </ol> <b>Content-specific Checks</b> <ol start="6"> <li>Check OAI-PMH Protocol version.</li> <li>Check Administrator email address.</li> <li>Check ListSets command for sets.</li> <li>Check available Metadata formats.</li> <li>Check Total Records number.</li> <li>Check Dublin Core metadata in Records.</li> <li>Check ESE metadata in Records.</li> <li>Check XML document compliance against the ESE XML Schema <a href=""></a></li> <li>Also check various ESE XML elements for common mistakes, such as: <ul> <li>Check for invalid europeana:isShownAt URL</li> <li>Check for invalid europeana:isShownBy URL</li> </ul> </li> <li>Check various metadata elements in Records: <ul> <li>Empty dc:title</li> <li>Empty or invalid dc:identifier</li> <li>Empty setSpec</li> </ul> </li> </ol> <h3>I want an extra feature of some kind, what can I do?</h3> <p>Please contact the author, <a href="">Vangelis Banos</a>.</p> <p>Features include:</p>
<ul> <li>View, print or download the output of all OAI-PMH supported commands.</li>
<li>Detect problems with metadata records (e.g. invalid URLs, empty titles, invalid date formats, etc.).</li>
<li>Download all records from one or more digital libraries in parallel.</li>
<li>Check compliance with OAI-PMH, Dublin Core (DC), Europeana Semantic Elements (ESE) and other standards.</li> </ul>
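<p>To illustrate the kind of record-level problem detection listed above (empty dc:title, empty dc:identifier, empty setSpec), here is a minimal Python sketch. It is not the tool's actual implementation: the function name <code>check_records</code>, the sample response and the <code>example.org</code> identifiers are all hypothetical, and it only tests for empty values, not for invalid ones.</p>

```python
import xml.etree.ElementTree as ET

# Namespaces used by OAI-PMH responses that carry oai_dc metadata.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def _all_blank(elements):
    """True when no element in the list has non-whitespace text."""
    return not any((el.text or "").strip() for el in elements)


def check_records(xml_text):
    """Return (record identifier, problem) tuples for a ListRecords response.

    Hypothetical helper: only empty values are reported.
    """
    root = ET.fromstring(xml_text)
    problems = []
    for record in root.iterfind(".//oai:record", NS):
        header_id = record.findtext(
            "oai:header/oai:identifier", default="", namespaces=NS
        ).strip()
        label = header_id or "(no header identifier)"
        if _all_blank(record.findall(".//dc:title", NS)):
            problems.append((label, "empty dc:title"))
        if _all_blank(record.findall(".//dc:identifier", NS)):
            problems.append((label, "empty dc:identifier"))
        # setSpec is optional, so only report elements that exist but are blank.
        set_specs = record.findall("oai:header/oai:setSpec", NS)
        if any(not (s.text or "").strip() for s in set_specs):
            problems.append((label, "empty setSpec"))
    return problems


# Made-up sample response: the first record is fine, the second is not.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:example.org:1</identifier>
        <setSpec>books</setSpec>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A title</dc:title>
          <dc:identifier>http://example.org/1</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header>
        <identifier>oai:example.org:2</identifier>
        <setSpec></setSpec>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title> </dc:title>
          <dc:identifier>http://example.org/2</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    for identifier, problem in check_records(SAMPLE):
        print(identifier, "-", problem)
```

<p>Run against the sample, the sketch reports an empty dc:title and an empty setSpec for the second record only. A real check would also need to fetch the response over HTTP and follow resumption tokens, which this example leaves out.</p>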