CINXE.COM
Identifiers
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8" /> <meta property="og:title" content="Identifiers" /> <meta property="og:url" content="https://www.ddbj.nig.ac.jp/ddbj/identifiers-e.html" /> <meta property="og:description" content="The organism names (values of /organism qualifier) are required to submit your s..." /> <meta property6="og:image" content="/images/thumbnail/logo_ddbj_fb.png" /> <meta name="viewport" content="width=device-width, initial-scale=1.0" /> <title>Identifiers</title> <script async src="https://www.google-analytics.com/analytics.js"></script> <script src="https://code.jquery.com/jquery-3.5.0.js" integrity="sha256-r/AaFHrszJtwpe+tHyNi/XCfMxYpbsRg2Uqn0x3s2zc=" crossorigin="anonymous"></script> <script src="https://cdnjs.cloudflare.com/ajax/libs/jquery.hoverintent/1.10.1/jquery.hoverIntent.min.js" integrity="sha512-gx3WTM6qxahpOC/hBNUvkdZARQ2ObXSp/m+jmsEN8ZNJPymj8/Jamf8+/3kJQY1RZA2DR+KQfT+b3JEB0r9YRg==" crossorigin="anonymous"></script> <script src="https://cdnjs.cloudflare.com/ajax/libs/spin.js/4.1.0/spin.min.js" integrity="sha512-CbohqWjAgarTqRHcX1MbwkF2pujwbsCee1PABpnBWC+VqSldvlNEEI5+4OSsR/HbFQOFFpwY2YvZZNjBMxNnXg==" crossorigin="anonymous"></script> <script type="text/javascript" src="https://cdnjs.cloudflare.com/ajax/libs/jquery.colorbox/1.6.4/jquery.colorbox-min.js"></script> <script type="text/javascript" src="https://cdnjs.cloudflare.com/ajax/libs/jquery-deparam/0.5.3/jquery-deparam.min.js"></script> <script type="text/javascript" src="https://www.ddbj.nig.ac.jp/assets/js/jquery.trace.js"></script> <script type="text/javascript" src="https://www.ddbj.nig.ac.jp/assets/js/jquery.json_search.js"></script> <link rel="icon" href="https://www.ddbj.nig.ac.jp/assets/images/favicon_ddbj.ico"> <link rel="stylesheet" href="https://www.ddbj.nig.ac.jp/assets/css/colorbox.css" /> <link rel="stylesheet" href="https://www.ddbj.nig.ac.jp/assets/css/main.css" /> <link rel="alternate" type="application/rss+xml" title="My Site RSS" href="/feed.xml" /> <script src="https://www.ddbj.nig.ac.jp/assets/js/main.js"></script> </head> <body data-category="ddbj"> <script src="https://www.ddbj.nig.ac.jp/assets/js/ddbj_common_framework.js" id="DDBJ_common_framework" style="display: block; height: 40px;" data-bottom-menu="true" data-ddbj-home-page="true" data-search="true" ></script> <section class="top-news-view"> <div class="inner"> <ul> <li class="item"> <a href="https://www.ddbj.nig.ac.jp/news/en/2024-10-22-e">On Cyber Threats against DDBJ, a node of the International Nucleotide Sequence Database Collaboration</a> </li> <li class="item"> <a href="https://www.ddbj.nig.ac.jp/news/en/2024-11-22-e">(27th November 9:00-November 28th 12:00)Announcement of D-way/MSS suspension</a> </li> <li class="item"> <a href="https://www.ddbj.nig.ac.jp/news/en/">(12th December 8:00-20th December 12:00(JST)) Suspension of DDBJ services due to NIG supercomputer maintenance</a> </li> </ul> </div> </section> <div id="primary"> <header id="PageHeader"> <div class="inner"> <div class="page-title"> <p class="title -normal">DDBJ Annotated/Assembled Sequences</p> </div> <nav class="tab-menu-view"> <ul class="tabmenucontainer"> <li class=""> <a href="/ddbj/index-e.html">Home</a> </li> <li class=" -haschild"> <a href="/ddbj/submission-e.html">Submission</a> <ul> <li> <a href="/ddbj/submission-e.html">Before Submission</a> </li> <li> <a href="/ddbj/web-submission-e.html">Web submission</a> </li> <li> <a href="/ddbj/mss-e.html">Mass Submission</a> </li> <li> <a href="/ddbj/update-e.html">Data Update</a> </li> </ul> </li> <li class=" -haschild"> <a href="http://ddbj.nig.ac.jp/arsa/?lang=en">Search</a> <ul> <li> <a href="http://getentry.ddbj.nig.ac.jp/top-e.html">getentry</a> </li> <li> <a href="http://ddbj.nig.ac.jp/arsa/?lang=en">ARSA</a> </li> </ul> </li> <li class=" -haschild -current"> <a href="/ddbj/flat-file-e.html">Flat file</a> <ul> <li> <a href="/ddbj/feature-table-e.html">Feature Table</a> </li> <li> <a href="/ddbj/features-e.html">Feature key</a> </li> <li> <a href="/ddbj/qualifiers-e.html">Qualifier key</a> </li> <li> <a href="/ddbj/sequence-e.html">Nucleotide Sequences</a> </li> <li> <a href="/ddbj/organism-e.html">Organism qualifier</a> </li> <li> <a href="/ddbj/identifiers-e.html">Identifiers</a> </li> <li> <a href="/ddbj/location-e.html">Description of Location</a> </li> <li> <a href="/ddbj/cds-e.html">Protein Coding Sequence</a> </li> <li> <a href="/ddbj/geneticcode-e.html">The Genetic Codes</a> </li> <li> <a href="/ddbj/code-e.html">Codes Used in Sequence Description</a> </li> <li> <a href="/ddbj/example-e.html">Description Examples of Sequence Data</a> </li> </ul> </li> <li class=" -haschild"> <a href="/ddbj/data-categories-e.html">Data categories</a> <ul> <li> <a href="/ddbj/genome-e.html">Data Submission from Genome Project</a> </li> <li> <a href="/ddbj/pseudohaplotype-e.html">Pseudohaplotype</a> </li> <li> <a href="/ddbj/wgs-e.html">WGS</a> </li> <li> <a href="/ddbj/finished_level_genome-e.html">Finished level genomic sequences</a> </li> <li> <a href="/ddbj/metagenome-assembly-e.html">Metagenome Assembly</a> </li> <li> <a href="/ddbj/single-amplified-genome-e.html">Single amplified genome</a> </li> <li> <a href="/ddbj/htg-e.html">HTG</a> </li> <li> <a href="/ddbj/environmental-e.html">Environmental sample</a> </li> <li> <a href="/ddbj/env-e.html">ENV</a> </li> <li> <a href="/ddbj/tls-e.html">TLS</a> </li> <li> <a href="/ddbj/transcriptome-e.html">Data Submission from Transcriptome Project</a> </li> <li> <a href="/ddbj/tsa-e.html">TSA</a> </li> <li> <a href="/ddbj/est-e.html">EST</a> </li> <li> <a href="/ddbj/htc-e.html">HTC</a> </li> <li> <a href="/ddbj/tpa-e.html">Third Party Data (TPA)</a> </li> </ul> </li> <li class=""> <a href="/faq/en/index-e.html?tag=ddbj">FAQ</a> </li> <li class=" -haschild"> <a href="/ddbj/index-e.html">Other</a> <ul> <li> <a href="/ddbj/patent-data-e.html">Patent</a> </li> <li> <a href="/ddbj/mga-e.html">MGA</a> </li> </ul> </li> </ul> </nav> </div> </header> <section id="NavigationAndMainView"> <div class="inner"> <div class="subview"> <nav id="TableOfContents" class="internal-link"> </nav> </div> <section id="MainContentView" class="mainview"> <header class="header"> <nav class="breadcrumb-view"> <ul> <li> <a href="https://www.ddbj.nig.ac.jp/index-e.html">Home</a> </li> <li> <a href="https://www.ddbj.nig.ac.jp/ddbj/index-e.html">ddbj</a> </li> <li><a>Identifiers</a></li> </ul> </nav> <h1 class="title">Identifiers</h1> </header> <main class="md-content"> <p><a href="/ddbj/organism-e.html">The organism names (values of /organism qualifier)</a> are required to submit your sequence data.<br /> If you do not identify the species of the organisms, or if you need to submit a large number of sequences from the same species, it may be required to use some identifiers of the sample organisms for your sequence data.</p> <p>For many cases of bacteria, yeasts, and other microorganisms, especially <span class="red"><strong>to submit whole genome sequences of the microorganisms</strong></span>, you may need to add identifiers of strains and/or culture collections for the organisms. <br /> Further, when you propose a new species name, you have to deposit the organism to two or more culture collections.</p> <h2 id="type">Type of Idetifiers</h2> <p>There are two types of identifiers that are acceptable for us.</p> <h3 id="sample">Assign to the Sample Organisms</h3> <p>In general, we expect you to use this type of identifier. <br /> A typical case is to use the <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier. The other qualifiers would be <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a>, <a href="/ddbj/qualifiers-e.html#culture_collection">/culture_collection</a>, <a href="/ddbj/qualifiers-e.html#ecotype">/ecotype</a>, <a href="/ddbj/qualifiers-e.html#specimen_voucher">/specimen_voucher</a>, <a href="/ddbj/qualifiers-e.html#cultivar">/cultivar</a>, <a href="/ddbj/qualifiers-e.html#bio_material">/bio_material</a>, <a href="/ddbj/qualifiers-e.html#cell_line">/cell_line</a>, and so on.</p> <h3 id="seq">Assign to the Sequences</h3> <p>This type of identifier is operatively described. In cases of your research purpose, you may need to use this type of identifier.</p> <p>The typical ones are <a href="/ddbj/qualifiers-e.html#clone">/clone</a> and <a href="/ddbj/qualifiers-e.html#submitter_seqid">/submitter_seqid</a> qualifiers. <br /> Also, <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifiers for OTU of environmental samples, <a href="/ddbj/qualifiers-e.html#bio_material">/bio_material</a> qualifiers in some conditions would be assigned to the sequences.</p> <p>For using every identical sequence, <a href="/ddbj/qualifiers-e.html#haplotype">/haplotype</a> qualifier is also corresponding to the case. For submitting data related to population genetics, see <a href="/ddbj/representative-sequence-e.html">Representative submissions of identical sequences for variation studies</a>.</p> <h2 id="format">Format of Identifiers</h2> <ul> <li>All sample identifiers must be distinct from each other.</li> <li>Identical samples should be indicated with the same identifier.</li> <li>In cases of commercial samples, model strains and any other official names, you should use them ‘as is’. <ul> <li>For <a href="/ddbj/qualifiers-e.html#bio_material">/bio_material</a>, <a href="/ddbj/qualifiers-e.html#culture_collection">/culture_collection</a> and <a href="/ddbj/qualifiers-e.html#specimen_voucher">/specimen_voucher</a> qualifiers, we have the formats to enhance their traceability, so, please follow the formats.</li> </ul> </li> <li>When you isolate the sample organisms, please name them by yourself. <ul> <li>Please note that “1”, “2”, “A”, “B” , or something like them are too simple to keep uniqueness among the data submitted from many researchers, in advance.</li> <li>For example, to make it easier keeping uniqueness by adding acronyms of geographical names of collection sites or the initials of the person who collected samples and the collected years to serial numbers, <br /> i.e. [serial number]-[acronym of geographical name]-[collected year], “1-MS-2021” etc.</li> <li>We strongly recommend that you do not include the name of an organism or the abbreviation of an organism name in the sample identifier.</li> </ul> </li> </ul> <p>Please note we do not systematically manage the identifiers of samples. So, at least, it is required to assign unique names within your submissions for your study.</p> <h2 id="detail">In Detail about Identifiers</h2> <h3 id="not_required">Situations where the identifiers not required</h3> <p>When your samples meet all of the following conditions, it is not required to use identifiers for your samples. <br /> Please note we do not intend to prohibit using identifiers for your samples, so, we will accept the identifiers of your samples if you need.</p> <ul> <li>Sequences derived from multicellular organisms</li> <li>Aimed to study neither of identifying species, biogeography, epidemiology, population genetics or the like.</li> <li>Aimed to study identifying and/or functional analysis genes/proteins and so on to describe general features of the subject organisms.</li> </ul> <h3 id="bact">Bacteria or Archaea</h3> <p>Some identifiers are required regardless of the taxonomic identification level of the organisms or the number of sequences.</p> <ul> <li>For the pure cultured lines that are expected to be genetically uniform, use <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier</li> <li>For cultured strains deposited to or divided from culture collections, use <a href="/ddbj/qualifiers-e.html#culture_collection">/culture_collection</a> qualifier</li> <li>In cases difficult to call the pure cultured lines, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> </ul> <h3 id="fungi">Fungi</h3> <p>In cases of unicellular organisms like yeasts, some identifiers are required regardless of the taxonomic identification level of the organisms or the number of sequences.</p> <ul> <li>For the pure cultured lines that are expected to be genetically uniform, use <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier</li> <li>For cultured strains deposited to or divided from culture collections, use <a href="/ddbj/qualifiers-e.html#culture_collection">/culture_collection</a> qualifier</li> <li>In cases difficult to call the pure cultured lines, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> </ul> <p>In cases of multicellular organisms like mushrooms, some identifiers may be required when the species is not identified or when you submit many sequences from multiple individuals of the same species.</p> <ul> <li>For the identifiers of individuals, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> <li>For specimens deposited to or divided from museum/collections, use <a href="/ddbj/qualifiers-e.html#specimen_voucher">/specimen_voucher</a> qualifier</li> <li>For the pure lines that are expected to be genetically uniform, <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier</li> </ul> <h3 id="plant">Plants</h3> <p>In cases of unicellular organisms like algae, some identifiers are required regardless of the taxonomic identification level of the organisms or the number of sequences.</p> <ul> <li>For the pure cultured lines that are expected to be genetically uniform, use <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier</li> <li>For cultured strains deposited to or divided from culture collections, use <a href="/ddbj/qualifiers-e.html#culture_collection">/culture_collection</a> qualifier</li> <li>In cases difficult to call the pure cultured lines, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> </ul> <p>In cases of multicellular organisms, some identifiers may be required when the species is not identified or when you submit many sequences from multiple individuals of the same species.</p> <ul> <li>For the identifiers of individuals, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> <li>For specimens deposited to or divided from museum/collections, use <a href="/ddbj/qualifiers-e.html#specimen_voucher">/specimen_voucher</a> qualifier</li> <li>For variants phenotypically/geographically distinguished, but not yet subspecies, use <a href="/ddbj/qualifiers-e.html#ecotype">/ecotype</a> qualifier</li> <li>For cultivated varieties, use <a href="/ddbj/qualifiers-e.html#cultivar">/cultivar</a> qualifier</li> <li>For the pure lines that are expected to be genetically uniform, use <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier</li> </ul> <h3 id="animal">Animals</h3> <p>In cases of unicellular organisms like protists, some identifiers are required regardless of the taxonomic identification level of the organisms or the number of sequences.</p> <ul> <li>For the pure cultured lines that are expected to be genetically uniform, use <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier</li> <li>For cultured strains deposited to or divided from culture collections, use <a href="/ddbj/qualifiers-e.html#culture_collection">/culture_collection</a> qualifier</li> <li>In cases difficult to call the pure cultured lines, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> </ul> <p>In cases of multicellular organisms, some identifiers may be required when the species is not identified or when you submit many sequences from multiple individuals of the same species.</p> <ul> <li>For the identifiers of individuals, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> <li>For specimens deposited to or divided from museum/collections, use <a href="/ddbj/qualifiers-e.html#specimen_voucher">/specimen_voucher</a> qualifier</li> <li>For breeded varieties, use <a href="/ddbj/qualifiers-e.html#note">/note</a> qualifier with “breed:” prefix.</li> <li>For pure or congenic lines of model organisms, use <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifier</li> <li>For mutated strains of model organisms, use <a href="/ddbj/qualifiers-e.html#bio_material">/bio_material</a> qualifier</li> <li>For immortalised cell lines deposited to or divided from collections, <a href="/ddbj/qualifiers-e.html#cell_line">/cell_line</a> qualifier</li> </ul> <h3 id="human">Human Subjects</h3> <p>For the data derived from human subjects, description of individual identifiers may be needed to report polymorphisms or so. <br /> Even in these cases, the identifiers must be anonymized. We prohibit using any personal names or any identifiers that enable inferring personal names.</p> <ul> <li>For the identifiers of individuals, use <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifier</li> <li>For immortalised cell lines deposited to or divided from collections, use <a href="/ddbj/qualifiers-e.html#cell_line">/cell_line</a> qualifier</li> </ul> <h3 id="virus">Viruses</h3> <p>Usually, we recommend using <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifiers for viral data. <br /> For frequently submitted pathogenic viruses like SARS-CoV-2 and Influenza virus, we strongly recommend you using <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifiers with the identifiers in the following format recommended by <a href="https://ictv.global/">ICTV</a>.</p> <ul> <li>Format: <code class="language-plaintext highlighter-rouge">[virus_type]/[host_common_name]/[locality_name]/[sample_identifier]/[year]</code></li> <li>Example: <code class="language-plaintext highlighter-rouge">SARS-CoV-2/human/Japan/A12/2021</code></li> </ul> <table> <tbody> <tr> <td><code class="language-plaintext highlighter-rouge">virus_type</code></td> <td>the abbreviated name, genotype or type of the virus</td> </tr> <tr> <td><code class="language-plaintext highlighter-rouge">host_common_name</code></td> <td>the common name of the viral host organism indicated in <a href="/ddbj/qualifiers-e.html#host">/host</a> qualifier</td> </tr> <tr> <td><code class="language-plaintext highlighter-rouge">locality_name</code></td> <td>the name of country or region of the collection site indicated in <a href="/ddbj/qualifiers-e.html#geo_loc_name">/geo_loc_name</a> qualifier</td> </tr> <tr> <td><code class="language-plaintext highlighter-rouge">sample_identifier</code></td> <td>any identifiers/numbers assigned by the submitters or the collectors</td> </tr> <tr> <td><code class="language-plaintext highlighter-rouge">year</code></td> <td>the year of the sampling date indicated in <a href="/ddbj/qualifiers-e.html#collection_date">/collection_date</a> qualifier</td> </tr> </tbody> </table> <p>Though it is difficult for viruses to confirm the pure lines without culture by infected cells, we conventionally accepted <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifiers for viral sequences. <br /> However, we recommend using /isolate qualifiers. <br /> Please note that a genotype (or genogroup etc.) is similar to species/lineage of virus, so, it cannot be sample identifiers.</p> <h3 id="env">Environmental Samples</h3> <p>In general, please describe <a href="/ddbj/qualifiers-e.html#isolate">/isolate</a> qualifiers as names of individuals or OTUs.<br /> You can not use <a href="/ddbj/qualifiers-e.html#strain">/strain</a> qualifiers for environmental samples. <br /> When you have cloned the corresponding DNA samples, describe the clone names in <a href="/ddbj/qualifiers-e.html#clone">/clone</a> qualifiers.</p> <h2 id="organism">Organism Names (/organism qualifier) and Identifiers</h2> <p>When you describe the organism names in species, subspecies, varieties or lower levels, it is not required to include any identifiers into the organism names. <br /> However, please pay attention to the following cases.</p> <h3 id="org_uni">Bacteria, Archaea, or Unicellular Fungi</h3> <p>In cases of unicellular organisms, it is not required to include any identifiers in organism names regardless of the taxonomic identification level.</p> <ul> <li>When you propose a new species, it is recommended to include some identifier such as strain name in organism name.</li> <li>In cases of cyanobacteria, it is required to include some identifiers such as strain names in organism names.</li> <li>To submit whole-genome-scale sequences of microorganisms, etc., currently, if the species has not yet been identified, it is a general rule to include strain, etc. in the organism name. <br /> Previously, regardless of species identification, it is required for genome-wide sequence submissions to include strain names, etc. in the organism names.</li> </ul> <h3 id="org_multi">Animals, Plants, or Multicellular Fungi</h3> <p>In cases of multicellular organisms, it is required to include some identifiers in organism names when the species is not identified or when you propose new species. <br /> The identifiers included in organism names do not have to be assigned to every individual, so you should assign an identifier in each unit that is considered to be the same species.</p> <h3 id="org_virus">Viruses</h3> <ul> <li>Different from other domains, descriptions like scientific names can be accepted during species proposing. <br />In that case, we recommend to include some identifiers into the virus names.</li> <li>By 2017, we used informal names for frequently submitted pathogenic viruses including their strain names and serotypes in the description of organism names. However, the rule has been discontinued for future submissions.</li> </ul> <h3 id="org_env">Environmental Samples</h3> <p>For most cases, the species names of the organisms can not be identified, however, it is not required to include any identifiers into the organism names.</p> </main> <aside class="related-pages"> <h2 class="caption">Related pages</h2> <div class="navigation"> <nav> <ul> <li> <a href="/ddbj/flat-file-e.html">DDBJ flat file format</a> </li> <li> <a href="/ddbj/features-e.html">Feature Key</a> </li> <li> <a href="/ddbj/qualifiers-e.html">Qualifier key</a> </li> <li> <a href="/ddbj/organism-e.html">Organism qualifier</a> </li> <li> <a href="/ddbj/example-e.html">Description Examples of Sequence Data</a> </li> <li> <a href="/ddbj/representative-sequence-e.html">Representative submissions of identical sequences for variation studies</a> </li> </ul> </nav> </div> </aside> </section> </div> </section> </div> <footer></footer> <div id="back-top"></div> </body> </html>