CINXE.COM
OAI-PMH Validator & data extractor - OAIPMH.com
<!doctype html> <html lang="en"> <head> <meta charset="utf-8"/> <title>OAI-PMH Validator & data extractor - OAIPMH.com</title> <meta name="keywords" content="OAIPMH, OAI-PMH, OAI-PMH validator, OAI-PMH checker, OAI-PMH harvester, OAI tool, OAI PMH testing, Europeana Semantic Elements validator, ESE validation" /> <meta name="description" content="OAI-PMH validator and data extractor web app" /> <meta http-equiv="X-UA-Compatible" content="IE=edge" /> <meta name="viewport" content="initial-scale=1.0, width=device-width, maximum-scale=1.0"/> <link rel="shortcut icon" type="image/x-icon" href="/favicon.ico"> <!-- Google tag (gtag.js) --> <script async src="https://www.googletagmanager.com/gtag/js?id=G-V2GDXLPKPQ"></script> <script> window.dataLayer = window.dataLayer || []; function gtag(){dataLayer.push(arguments);} gtag('js', new Date()); gtag('config', 'G-V2GDXLPKPQ'); </script> <script src="https://code.jquery.com/jquery-3.7.1.min.js" integrity="sha256-/JqT3SQfawRcv/BIHPThkBvs0OEvtFFmqPF/lYI/Cxo=" crossorigin="anonymous"></script> <link rel="stylesheet" type="text/css" href="//fonts.googleapis.com/css?family=Roboto&subset=latin,greek"/> <link rel="stylesheet" type="text/css" href="//netdna.bootstrapcdn.com/bootstrap/3.4.1/css/bootstrap.min.css"/> <link rel="stylesheet" type="text/css" href="//netdna.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.css"/> <link rel="stylesheet" type="text/css" href="/css/style.css?1676156044"/> <script type="text/javascript" src="/js/main.js?1733488052" type="module"></script></head> <body> <div class="headerContainer"> <div class="container"> <div class="row"> <div class="col-sm-3"> <a href="/" title="Home" id="logo"><img src="/img/oaipmh-logo230x75.png?1676156044" class="img-responsive" alt=""/></a> </div> <div class="col-sm-9"> <p class="lead">Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)<br /><strong>Validator & data extractor</strong> Tool</p> </div> </div> </div> </div> <div class="container"> <div class="row"> <div class="col-md-12"> <p class="lead">Download and evaluate XML metadata from <strong>OAI-PMH enabled</strong> digital libraries.</p> <ul id="main-tabs" class="nav nav-tabs"> <li class="active"><a href="#uri" data-toggle="tab" class="active"><i class="fa fa-check-circle-o"></i> Validate URL</a></li> <li><a href="#file" data-toggle="tab"><i class="fa fa-check"></i> Validate By Direct Input</a></li> <li><a href="#batch_download" data-toggle="tab"><i class="fa fa-download"></i> Download XML</a></li> <li><a href="#api" data-toggle="tab" title="Application Programming Interface"><i class="fa fa-code"></i> REST API</a></li> <li><a href="#about" data-toggle="tab" title=""><i class="fa fa-info-circle"></i> About</a></li> <li><a href="#ref" data-toggle="tab" title="References in scientific publications"><i class="fa fa-bookmark"></i> References</a></li> </ul> <div class="tab-content"> <div class="tab-pane active" id="uri"> <div class="row"> <div class="col-sm-12 form-horizontal" role="form" id="urlForm"> <div class="form-group"> <label for="url" class="col-sm-2 control-label">OAI-PMH URL:</label> <div class="col-sm-7"> <div class="input-group"> <input type="url" id="url" name="url" autofocus required placeholder="OAI-PMH URL" class="form-control" title="Please enter an OAI-PMH URL and click Go" /> <span class="input-group-btn"> <button id="goButton" class="btn" type="button">Check now <i class="fa fa-angle-double-right"></i></button> </span> </div> <small class="text-muted">Example OAI-PMH URL: https://oai.datacite.org/oai</small> </div> </div> </div> <div class="col-sm-9 col-sm-offset-2"></div> </div> <div class="row"> <div class="col-md-2" id="vertical-tabs"> <ul class="nav nav-pills nav-stacked" id="availableCommands"> </ul> </div> <div id="oaipmhResult" class="col-md-10" style ="display: none;"> <a href="http://www.openarchives.org/OAI/openarchivesprotocol.html" target="_blank" id="help-button" class="pull-right"><i class="fa fa-question-circle"></i> Help</a> <ul id="subtabs" class="nav nav-tabs"> <li class="active"><a href="#validationTab" class="validation" data-toggle="tab"><i class="fa fa-check"></i> <span id="validationTabLabel">Validation</span> <span class="badge" id="validationCount"></span></a></li> <li><a href="#resultTab" data-toggle="tab"><i class="fa fa-list"></i> XML Result <span class="badge" id="resultTabSize"></span></a></li> </ul> <div class="tab-content"> <div id="validationTab" class="tab-pane active validation_tab"> </div> <div id="resultTab" class="tab-pane"> <div class="actions"> <form name="downloadXMLForm" id="downloadXMLForm" method="post" action="/analysers/downloadXml" target="_blank"> <input type="hidden" name="data[xmlUrl]" id="xmlUrl" /> </form> <form name="printXMLForm" id="printXMLForm" method="post" action="/analysers/oaipmhRequest" target="_blank"> <input type="hidden" name="data[printUrl]" id="printUrl" /> </form> <a href="javascript:;" title="Download results in XML" onclick="javascript:$('#downloadXMLForm').submit()"><i class="fa fa-download"></i> Download XML</a> <a href="javascript:;" title="Print results" onclick="javascript:$('#printXMLForm').submit()"><i class="fa fa-print"></i> Print XML</a> </div> <pre class="source-code"></pre> </div> </div> </div> </div> </div> <div id="file" class="tab-pane validation_tab"> <div class="form-horizontal"> <div class="form-group"> <label class="col-sm-2 control-label">Please enter XML markup in order to validate:</label> <div class="col-sm-8"> <textarea rows="12" cols="50" name="data[directInput]" id="directInput" class="form-control"></textarea> </div> </div> <div class="form-group"> <div class="col-sm-offset-2 col-sm-10"> <button id="goDirectInputButton" class="btn" type="button"><i class="fa fa-check"></i> Validate</button> </div> </div> </div> <p id="directInputResults"></p> </div> <div id="batch_download" class="tab-pane"> <p>Download all the records from a digital library. Please insert an OAI-PMH URL, select the metadata prefix and check out the results.</p> <div class="form-horizontal"> <div class="form-group"> <label class="col-sm-2 control-label">OAI-PMH URL:</label> <div class="col-sm-7"> <div class="input-group"> <input type="url" id="harvest_url" name="harvest_url" class="form-control" placeholder="OAI-PMH URL" /> <span class="input-group-btn"> <button id="goBatchButton" class="btn" type="button"><i class="fa fa-download"></i> Download records</button> </span> </div> </div> </div> <div class="form-group"> <label class="col-sm-2 control-label">Metadata prefix:</label> <div class="col-sm-5"> <select name="batchDownloadMetadataPrefix" id="batchDownloadMetadataPrefix" class="form-control"> <option value="datacite">DATACITE</option> <option value="ese">ESE</option> <option value="edm">EDM</option> <option value="heal">HEAL</option> <option value="lido">LIDO</option> <option value="marcxml">MARCXML</option> <option value="mets">METS</option> <option value="oai_agris">OAI_AGRIS</option> <option value="oai_dc" selected>OAI_DC</option> <option value="oai_datacite">OAI_DATACITE</option> <option value="oai_scolomfr">OAI_SCOLOMFR</option> <option value="rdf">RDF</option> <option value="unimarc">UNIMARC</option> </select> </div> </div> <p>Optional parameters:</p> <div class="form-group form-group-sm"> <label class="col-sm-2 control-label control-label">Set:</label> <div class="col-sm-2"> <input type="text" id="set" name="set" class="form-control input-sm" /> </div> <label class="col-sm-1 control-label">From:</small></label> <div class="col-sm-2"> <input type="text" id="from" name="from" class="form-control input-sm" /> </div> <label class="col-sm-1 control-label">Until:</label> <div class="col-sm-2"> <input type="text" id="until" name="until" class="form-control input-sm" /> </div> </div> </div> <div class="row"> <div id="harvesting_results" class="col-sm-7 col-sm-offset-2 col-md-offset-2"></div> </div> </div> <div id="api" class="tab-pane"> <h2>REST API</h2> <p>OAIPMH.com is <strong>completely FREE</strong> for personal or academic use.</p> <p>Large scale use of the OAIPMH.com service via its <strong>REST API</strong> as well as technical and/or scientific support is available on a fee.</p> <p>Please contact the author, <a href="http://vbanos.gr/" style="text-decoration: underline">Vangelis Banos</a>, to learn more.</p> </div> <div id="about" class="tab-pane"> <h2>About</h2> <p>The OAI-PMH validator and data extractor tool is a free service created by <a href="http://vbanos.gr">Vangelis Banos</a>. The aim of this project it to support digital repository operators and developers by automating the harvesting and validation of OAI-PMH services. The author has accumulated 10+ years of relevant experience working on projects like: <ul> <li><a href="http://aggregator.libver.gr">aggregator.libver.gr</a> - The Hellenic aggregator for Europeana (2010 -),</li> <li><a href="http://openarchives.gr">openarchives.gr</a> - Greek digital libraries search engine (2006 - 2012),</li> <li><a href="http://www.libsearch.com">www.libsearch.com</a> - Open Access Digital Libraries search engine (2008 - 2014 now closed),</li> <li><a href="http://saas.ekt.gr/content/Interoperability-Testing">http://saas.ekt.gr/content/Interoperability-Testing</a> - National Documentation Centre of Greece Digital Library Interoperability Testing Service (2011 - 2015).</li> </ul> <h2>About OAI-PMH</h2> <p>OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) is a protocol developed by the <a href="http://openarchives.org">Open Archives Initiative</a>. It is used to harvest (or collect) the metadata descriptions of the records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.</p> <p>The protocol is usually just referred to as the OAI Protocol. <a href="http://en.wikipedia.org/wiki/OAI-PMH">Check full article in wikipedia »</a></p> <h2>What is my OAI-PMH URL?</h2> <p>The OAI-PMH URL address of a digital library varies depending on the software it is based on. Please check out some widely used software and their Sample URLs.</p> <table class="table table-bordered"> <tr> <th>Software</th> <th>Sample URL</th> <th>Sample OAI-PMH URL</th> </tr> <tr> <td>Dspace version 1.5 and newer</td> <td><a href="http://medusa.libver.gr/">http://medusa.libver.gr/</a></td> <td><a href="http://medusa.libver.gr/dspace-oai/request">http://medusa.libver.gr/oai/request</a></td> </tr> <tr> <td>Open Journal System (OJS)</td> <td><a href="http://dl.acs.org.au/">http://dl.acs.org.au/</a> <td><a href="http://dl.acs.org.au/index.php/index/oai">http://dl.acs.org.au/index.php/index/oai</a> </tr> </table> <p>These URLs are just the defaults and can be changed by the library's administrator.</p> <h2>How does OAIPMH.com works?</h2> <p>OAIPMH.com is a web application. It works in the background on our servers and presents the final output to the users. There is no need to install software in your computer, everything is done using a web browser</p> <h2>What kind of validation tests are performed?</h2> <b>Generic Checks</b> <ol> <li>Check HTTP Response code.</li> <li>Check HTTP Response document content type.</li> <li>Check XML document file size.</li> <li>Check HTTP Request & Response Time.</li> <li>Check XML document compliance against the OAI-PMH XML Schema <a href="http://oaipmh.com/files/OAI-PMH.xsd">http://oaipmh.com/files/OAI-PMH.xsd</a>.</li> </ol> <b>Content-specific Checks</b> <ol start="6"> <li>Check OAI-PMH Protocol version.</li> <li>Check Administrator email address.</li> <li>Check ListSets command for sets.</li> <li>Check available Metadata formats.</li> <li>Check Total Records number.</li> <li>Check Dublin Core metadata in Records.</li> <li>Check ESE metadata in Records.</li> <li>Check XML document compliance against the ESE XML Schema <a href="http://www.europeana.eu/schemas/ese/ESE-V3.3.xsd">http://www.europeana.eu/schemas/ese/ESE-V3.3.xsd</a></li> <li>Also check various ESE XML elements for common mistakes such as <ul> <li>Check for invalid europeana:isShownAt URL</li> <li>Check for invalid europeana:isShownBy URL</li> </ul> </li> <li>Check various metadata elements in Records: <ul> <li>Empty dc:title</li> <li>Empty or invalid dc:identifier</li> <li>Empty setSpec</li> </ul> </li> </ol> <h3>I want an extra feature of some kind, what can I do?</h3> <p>Please contact the author, <a href="http://vbanos.gr">Vangelis Banos</a>. Any suggestions for improvements and new features are always welcome.</p> </div> <div id="ref" class="tab-pane"> <h2>References in scientific publications</h2> <ol> <li>Miranda, Jesús María Aramberri. "ONDAREA, DE ALMACÉN A FÁBRICA." Nuevos Extractos de la RSBAP 26-G (2022).</li> <li>Sosa, Jared David Tadeo Guerrero, et al. "Un generador de metadatos openaire conforme con el repositorio nacional de México." Journal of Science and Research 5.CININGEC (2020): 808-830.</li> <li>Sebastián Blanco-Olea, Fernando. "Comparative Quality Evaluation of Universities' Institutional Repositories of Peru." JLIS. it, Italian Journal of Library, Archives & Information Science 12.2 (2021).</li> <li>Mesquita, Giovana Cavalcanti de. "Análise do repositório do Ministério da Economia no contexto do acesso aberto." (2021).</li> <li>Gottardo, Taís Virgínia, and Ivanildo Barbosa. "INDE METADATA CONFORMITY INDICATOR." Boletim de Ciências Geodésicas 25 (2019): e2019S002.</li> <li>Costa, Michelli Pereira da, and Fernando César Lima Leite. "Open access institutional repositories in Latin America." (2019).</li> <li>Pereira da Costa, Michelli, and Fernando César Lima Leite. "Repositorios institucionales de acceso abierto en América Latina." Biblios 74 (2019): 1-14.</li> <li><a href="http://hdl.handle.net/2268/221340">Myriam Bastin, François Renaville. ULiège experience with aggregators and discovery tools providers. Be proactive and apply best practices (if you can...), 2018</a></li> <li>Deinzer, Gernot. "6b Repositoriensoftware." Praxishandbuch Open Access (2017): 290.</li> <li>COSTA, Michelli, and Fernando LEITE. "Repositórios institucionais de acesso aberto à informação científica da América Latina." (2017).</li> <li>林信成, and 周庭郁. "圖書資訊學開放取用期刊聯合目錄系統之設計與實作." Journal of Educational Media & Library Sciences 54.1 (2017).</li> <li>Bettoni, E. M., M. B. de Carvalho, and Patricia Marchiori. Geração de indicadores para periódicos científicos: um estudo na AtoZ. Federal University of Paraná, 2016.</li> <li>Smith, Ina. Open access infrastructure. Vol. 2. UNESCO Publishing, 2015.</li> <li>Rousidis, Dimitris, et al. "Evaluation of Metadata in Research Data Repositories: The Case of the DC. Subject Element." Metadata and Semantics Research. Springer International Publishing, 2015. 203-213.</li> <li>Koulouris, Alexandros, Vangelis Banos, and Emmanouel Garoufallou. "Aggregating metadata for Europeana: the Greek paradigm." Proceedings of the International Conference on Integrated Information (IC-ININFO). 2011.</li> <li>Garoufallou, Emmanouel, Vangelis Banos, and Alexandros Koulouris. "Solving aggregation problems of Greek cultural and educational repositories in the framework of Europeana." International Journal of Metadata, Semantics and Ontologies 8.2 (2013): 134-144.</li> <li>Rousidis, Dimitris, et al. "Metadata for Big Data: A preliminary investigation of metadata quality issues in research data repositories." Information Services and Use 34.3 (2014): 279-286.</li> <li>Ivanović, Lidija, Dragan Ivanović, and Bojana Dimić Surla. "IMPROVING DISSEMINATION OF HUMAN KNOWLEDGE BY EXPORTING DATA FROM RESEARCH INFORMATION SYSTEMS."</li> <li>Georgiadis, Haris, et al. "Ensuring the quality and interoperability of open cultural digital content: System architecture and scalability." Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on. IEEE, 2014.</li> <li>Rousidis, Dimitris, et al. "Data Quality Issues and Content Analysis for Research Data Repositories: The Case of Dryad." Let’s Put Data to Use: Digital Scholarship for the Next Generation, 18th International Conference on Electronic Publishing, Thessaloniki, Greece. 2014.</li> <li>Stathopoulou, Ioanna-Ourania, et al. "An Open Cultural Digital Content Infrastructure." Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on. IEEE, 2014.</li> <li>Hirschmann, Barbara. "DOI Registration Manual." (2014).</li> <li>Calzolari, Nicoletta, Monica Monachini, and Valeria Quochi. "Interoperability framework: The FLaReNet action plan proposal." Language Resources, Technology and Services in the Sharing Paradigm (2011): 41.</li> <li>Birello, Giancarlo, et al. "Step by step installation guide of a digital preservation infrastructure." (2012).</li> <li>Antonius Rachmat, C. "Analisis Rancang Bangun Sistem Repositori Institusi Berbasis Metadata Dublin Core di UKDW Yogyakarta."</li> <li>Houssos, Nikos, et al. "Enhanced OAI-PMH services for metadata sharing in heterogeneous environments." Library Review 63.6/7 (2014): 465-489.</li> <li>Kapidakis, Sarantos. "Comparing metadata quality in the Europeana context." Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments. ACM, 2012.</li> <li>Calzolari, Nicoletta, et al. "Final FLaReNet Deliverable Language Resources for the Future–The Future of Language Resources." The Strategic Language Resource Agenda. FLaReNet project (2011).</li> </ol> </div> </div> </div> </div> <div class="row"> <div class="col-md-6"> <p>Features include:</p> <ul> <li>View, print or download the output of all OAI-PMH supported commands.</li> <li>Detect problems with metadata records (e.g. invalid URLs, empty titles, invalid date formats etc.)</li> </ul> </div> <div class="col-md-6"> <ul> <li>Download all records from one or more digital libraries in parallel.</li> <li>Check compliance with OAI-PMH, Dublin Core (DC), Europeana Semantic Elements (ESE) and other standards.</li> </ul> </div> </div> </div> <div class="footerContainer"> <div class="container"> <div class="row"> <div class="col-md-6"> <p><b>60900</b> digital libraries, repositories and e-journals have been already tested with oaipmh.com.</p> </div> <div class="col-md-6"> Created by <a href="http://vbanos.gr" title="Vangelis Banos, Information and Communication Systems Engineer, Msc">Vangelis Banos</a>, © 2011 - 2025 <a href="http://archiveready.com/check?url=http://validator.oaipmh.com/" title="Website Archivability Testing"><img src="http://archiveready.com/static/img/archiveready-badge.png" alt="" /></a> </div> </div> </div> </div> </body> </html>