<!DOCTYPE html> <html xmlns="http://www.w3.org/1999/xhtml" lang="en"> <head> <meta charset="utf-8" /> <meta http-equiv="X-UA-Compatible" content="IE=edge" /> <meta name="viewport" content="width=device-width, initial-scale=1" /> <link rel="icon" href="/static/favicon.ico" /> <title>OpenCitations - Download</title> <!-- Bootstrap core CSS --> <link href="/static/css/bootstrap.min.css" rel="stylesheet" /> <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/bootstrap-icons@1.10.0/font/bootstrap-icons.css" /> <!-- Font Awesome --> <link href="/static/css/font-awesome.min.css" rel="stylesheet" /> <!-- IE10 viewport hack for Surface/desktop Windows 8 bug --> <link href="/static/css/ie10-viewport-bug-workaround.css" rel="stylesheet" /> <!-- Custom styles for this template --> <link href="/static/css/cover.css" rel="stylesheet" /> <!-- HTML5 shim and Respond.js for IE8 support of HTML5 elements and media queries --> <!--[if lt IE 9]> <script src="/static/js/html5shiv.min.js"><![CDATA[ ]]></script> <script src="/static/js/respond.min.js"><![CDATA[ ]]></script> <![endif]--> </head> <body> <div class="cover-container"> <div class="masthead clearfix"> <nav class="navbar"> <div class="navbar-header"> <button type="button" class="navbar-toggle" data-toggle="collapse" data-target=".navbar-collapse"> <span class="icon-bar"></span> <span class="icon-bar"></span> <span class="icon-bar"></span> </button> <h3 class="masthead-brand"> <a href="/"><span class="oc-purple">Open</span><span class="oc-blue">Citations</span></a> </h3> </div> <div class="navbar-collapse collapse"> <div class="masthead-search"> <form class="input-group" action="/index/search" method="get"> <input type="text" class="form-control oc-purple" placeholder="Search for a DOI, e.g. 
10.1162/qss_a_00023" name="text"> <input type="hidden" name="rule" value="citeddoi"> <div class="input-group-btn"> <button class="btn btn-default oc-purple" type="submit"><i class="glyphicon glyphicon-search"></i></button> </div> </form> </div> <div class="masthead-pages"> <ul class="nav masthead-nav navbar-nav"> <li> <a href="/"> Home </a> <li> <a href="/about"> About </a> <li> <a href="/membership"> Help us </a> <li> <a href="/model"> Data Model </a> <li> <a href="/datasets"> Datasets </a> <li> <a href="/querying"> Querying Data </a> <li> <a href="/tools"> Tools </a> <li class="active"> <a href="/download"> Download </a> <li> <a href="/publications"> Publications </a> </li> </ul> </div> </div> </nav> </div> <div class="cover left"> <h3>Download</h3> <p>This page contains details of and links to all the data dumps of the <a href="#meta"><span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Meta</a> and <a href="#index"><span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Index</a>. 
They are made available online with the support of <a href="http://figshare.com">Figshare</a> and of the <a href="https://archive.org/">Internet Archive</a>.</p> <div class="toc"> <h4>Table of contents</h4> <ul> <li><p><a href="#meta"><span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Meta</a></p></li> <li><p><a href="#index"><span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Index</a></p></li> </ul> </div> <a class="anchor" id="meta"></a> <h4><span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Meta</h4> <p>The <span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Meta database stores and delivers bibliographic metadata for all publications involved in the <span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Index.</p> <h5><strong>Most recent OpenCitations Meta data dump - June 2024 Dump</strong></h5> <p>Released on 2024-06-20, this dump extends the previous version with new data from the <a href="https://crossref.org/snapshots/monthly/2024/03" target="_blank">Crossref March 2024 dump</a>. 
This dump includes information on:</p> <ul> <li><p>116,705,111 bibliographic entities</p></li> <li><p>348,815,548 authors and 2,561,336 editors (counted by their roles, without disambiguating individuals)</p></li> <li><p>715,711 publication venues</p></li> <li><p>245,783 publishers</p></li> </ul> <div class="table-responsive"> <table class="table"> <tr><th>Type and format</th><th>Archive</th><th>Size</th></tr> <tr><td>Metadata (CSV)</td><td><a href="https://doi.org/10.6084/m9.figshare.21747461.v9">tar.gz</a></td><td>11G (47G zipped) on ext4</td></tr> <tr><td>Metadata and provenance (RDF)</td><td><a href="https://doi.org/10.6084/m9.figshare.21747536.v7">tar.gz</a></td><td>44G (145G compressed) on ext4</td></tr> </table> </div> <p>In addition, a CSV dump is available that maps each bibliographic resource identified by an OMID (e.g., br/12345) to its corresponding PID(s) (e.g., DOI, PMID):</p> <div class="table-responsive"> <table class="table"> <tr><td>BR OMID map (CSV)</td><td><a href="https://doi.org/10.6084/m9.figshare.24427156.v2">ZIP</a></td><td>4.4 GB (1.7 GB zipped)</td></tr> </table> </div> <h5>Previous dumps</h5> <ul> <li><p>Dump created on 2024-04-06. Compared to the previous dump, this one adds OpenAlex IDs taken from the OpenAlex dump. Dump available in <a href="https://doi.org/10.6084/m9.figshare.21747461.v8">CSV (metadata)</a> and <a href="https://doi.org/10.6084/m9.figshare.21747536.v6">RDF (metadata and provenance)</a> formats.</p></li> <li><p>Dump created on 2023-11-29. Compared to the previous dump, this one adds the metadata contained in the <a href="https://api.japanlinkcenter.org/">Japan Link Center (JaLC)</a>. Dump available in <a href="https://doi.org/10.6084/m9.figshare.21747461.v7">CSV (metadata)</a> format.</p></li> <li><p>Dump created on 2023-10-24. 
Compared to the previous dump, this one adds the metadata contained in <a href="https://doi.org/10.5281/zenodo.7845968">OpenAIRE</a> and in the <a href="https://api.crossref.org/snapshots/monthly/2023/09">Crossref dump dated September 2023</a>. Dump available in <a href="https://doi.org/10.6084/m9.figshare.21747461.v5">CSV (metadata)</a> and <a href="https://doi.org/10.6084/m9.figshare.21747536.v5">JSON-LD (metadata and provenance)</a> formats.</p></li> <li><p>Dump created on 2023-06-28. Compared to the previous dump, this one adds the metadata contained in the <a href="https://nih.figshare.com/collections/iCite_Database_Snapshots_NIH_Open_Citation_Collection_/4586573/36">dump of the NIH Open Citation Collection dated November 2022</a>. Dump available in <a href="https://doi.org/10.6084/m9.figshare.21747461.v4">CSV (metadata)</a> and <a href="https://doi.org/10.6084/m9.figshare.21747536.v4">JSON-LD (metadata and provenance)</a> formats.</p></li> <li><p>Dump created on 2023-02-24. Compared to the previous dump, this one adds the metadata contained in the <a href="https://archive.org/details/datacite_dump_20211022">last dump of DataCite dated 22 October 2021</a>. Dump available in <a href="https://doi.org/10.6084/m9.figshare.21747461.v3">CSV (metadata)</a> and <a href="https://doi.org/10.6084/m9.figshare.21747536.v3">JSON-LD (metadata and provenance)</a> formats.</p></li> <li><p>Dump created on 2022-12-20, based on open references to works with DOIs within the Crossref dump dated December 2022. 
Dump available in <a href="https://doi.org/10.6084/m9.figshare.21747461.v2">CSV (metadata)</a> and <a href="https://doi.org/10.6084/m9.figshare.21747536.v2">JSON-LD (metadata and provenance)</a> formats.</p></li> </ul> <a class="anchor" id="index"></a> <h4><span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Index</h4> <p>The <span class="oc-purple">Open</span><span class="oc-blue">Citations</span> Index stores OMID-to-OMID citations representing all the references gathered from several sources.</p> <h5><strong>Most recent OpenCitations Index data dump - July 2024 Dump</strong></h5> <p>Dump created on 2024-07-01. Compared to the previous dump, this one adds the citation data contained in the <a href="https://api.crossref.org/snapshots/monthly/2024/03">Crossref dump dated March 2024</a>. This dump includes information on:</p> <ul> <!--<li><p>__ bibliographic resources;</p></li>--> <li><p>2,012,939,079 citations</p></li> </ul> <div class="table-responsive"> <table class="table"> <tr><th>Type and format</th><th>Archive</th><th>Size</th></tr> <tr><td>Citation data (CSV)</td><td><a href="https://doi.org/10.6084/m9.figshare.24356626.v3">ZIP</a></td><td>179 GB (28.1 GB zipped)</td></tr> <tr><td>Citation data (N-Triple)</td><td><a href="https://doi.org/10.6084/m9.figshare.24369136.v3">ZIP</a></td><td>1.5 TB (65.6 GB zipped)</td></tr> <tr><td>Citation data (Scholix)</td><td><a href="https://doi.org/10.6084/m9.figshare.24416749.v3">ZIP</a></td><td>1.8 TB (39 GB zipped)</td></tr> <tr><td>Provenance data (CSV)</td><td><a href="https://doi.org/10.6084/m9.figshare.24417733.v3">ZIP</a></td><td>329 GB (15 GB zipped)</td></tr> <tr><td>Provenance data (N-Triple)</td><td><a href="https://doi.org/10.6084/m9.figshare.24417736.v3">ZIP</a></td><td>2.6 TB (83 GB zipped)</td></tr> </table> </div> <p>In addition:</p> <div class="table-responsive"> <table class="table"> <tr><th style="width: 65%">Type and format</th><th>Archive</th><th>Size</th></tr> <tr><td>Citation 
data sources' info (N-Triple): information on the data source collection (e.g., COCI, DOCI, POCI) of each citation</td><td><a href="https://doi.org/10.6084/m9.figshare.24427051.v3">ZIP</a></td><td>364 GB (20.4 GB zipped)</td></tr> <tr><td>Citation count data (CSV): the number of incoming citations to each bibliographic entity (identified by an OMID) in the OpenCitations Index</td><td><a href="https://doi.org/10.6084/m9.figshare.24747375.v3">ZIP</a></td><td>1.1 GB (0.4 GB zipped)</td></tr> <!--<tr><td>Reference count data (CSV): the number of references of each bibliographic entity (identified by an OMID) in OpenCitations Index</td><td><a href="https://doi.org/10.6084/m9.figshare.24747498.v1">ZIP</a></td><td>1.7 GB (0.35 GB zipped)</td></tr>--> </table> </div> <h5>Previous dumps</h5> <ul> <li><p>Dump created on 2023-11-29. Dump available in <a href="https://doi.org/10.6084/m9.figshare.24356626.v2">CSV (citation data)</a>, <a href="https://doi.org/10.6084/m9.figshare.24369136.v2">N-Triple (citation data)</a>, <a href="https://doi.org/10.6084/m9.figshare.24416749.v2">Scholix (citation data)</a>, <a href="https://doi.org/10.6084/m9.figshare.24417733.v2">CSV (provenance data)</a>, and <a href="https://doi.org/10.6084/m9.figshare.24417736.v2">N-Triple (provenance data)</a>. Also available: an <a href="https://doi.org/10.6084/m9.figshare.24427051.v2">N-Triple dump</a> containing information regarding the data source collection, and a <a href="https://doi.org/10.6084/m9.figshare.24747375.v1">citation count dump</a> with the number of incoming citations to each bibliographic entity (identified by an OMID).</p></li> <li><p>Dump created on 2023-10-25. 
Dump available in <a href="https://doi.org/10.6084/m9.figshare.24356626.v1">CSV (citation data)</a>, <a href="https://doi.org/10.6084/m9.figshare.24369136.v1">N-Triple (citation data)</a>, <a href="https://doi.org/10.6084/m9.figshare.24416749.v1">Scholix (citation data)</a>, <a href="https://doi.org/10.6084/m9.figshare.24417733.v1">CSV (provenance data)</a>, and <a href="https://doi.org/10.6084/m9.figshare.24417736.v1">N-Triple (provenance data)</a>. Also available: an <a href="https://doi.org/10.6084/m9.figshare.24427051.v1">N-Triple dump</a> containing information regarding the data source collection.</p></li> </ul> </div> </div> <footer> <div class="container"> <div class="row"> <div id="disi" class="col-xs-12 col-sm-2"> <p> <a href="http://www.unibo.it"><img src="/static/img/unibo.png" /></a></p> </div> <div id="ficlit" class="col-xs-12 col-sm-2"> <p> <a href="http://www.ficlit.unibo.it"><img src="/static/img/ficlit.png" /></a></p> </div> <div id="openscholarlymetadata" class="col-xs-12 col-sm-2"> <p> <a href="https://openscholarlymetadata.org"><img src="/static/img/openscholarlymetadata.png" /></a></p> </div> <!-- <div id="oerc" class="col-xs-12 col-sm-2"> <p> <a href="http://www.oerc.ox.ac.uk/"><img src="/static/img/oerc.png" /></a> </p> </div> <div id="ox" class="col-xs-12 col-sm-2"> <p> <a href="http://www.ox.ac.uk"><img src="/static/img/uniox.png" /></a> </p> </div> --> </div> </div> <p> <a class="policy oc-purple" href="/policy">Privacy Policies</a> <span class="separator"></span> <a class="social oc-purple" href="mailto:contact@opencitations.net"><i class="fa fa-envelope" aria-hidden="true"></i></a> <a class="social oc-purple" href="http://www.twitter.com/opencitations"><i class="fa fa-twitter" aria-hidden="true"></i></a> <a class="social oc-blue" href="http://github.com/opencitations"><i class="fa fa-github" aria-hidden="true"></i></a> <a class="social oc-blue" href="https://www.linkedin.com/company/opencitations/"><i class="fa fa-linkedin" 
aria-hidden="true"></i></a> <a class="social oc-blue" href="https://opencitations.hypotheses.org/"><i class="fa fa-newspaper-o" aria-hidden="true"></i></a> <a class="social oc-blue" href="https://scicomm.xyz/@opencitations"><i class="bi bi-mastodon" aria-hidden="true"></i></a> </p> </footer> <!-- Bootstrap core JavaScript ================================================== --> <!-- Placed at the end of the document so the pages load faster --> <script src="/static/js/jquery.min.js"><![CDATA[ ]]></script> <script>window.jQuery || document.write('<script src="/static/js/jquery.min.js"><\/script>')</script> <script src="/static/js/bootstrap.min.js"><![CDATA[ ]]></script> <!-- IE10 viewport hack for Surface/desktop Windows 8 bug --> <script src="/static/js/ie10-viewport-bug-workaround.js"><![CDATA[ ]]></script> <!-- Cookies handler ================================================== --> <script src="/static/js/cookies.js"></script> </body> </html>