CINXE.COM
MSS - Mass Submission System
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8" /> <meta property="og:title" content="MSS - Mass Submission System" /> <meta property="og:url" content="https://www.ddbj.nig.ac.jp/ddbj/mss-e.html" /> <meta property="og:description" content=" Submission of research data from human subjects For all data from human subje..." /> <meta property6="og:image" content="/images/thumbnail/logo_ddbj_fb.png" /> <meta name="viewport" content="width=device-width, initial-scale=1.0" /> <title>MSS - Mass Submission System</title> <script async src="https://www.google-analytics.com/analytics.js"></script> <script src="https://code.jquery.com/jquery-3.5.0.js" integrity="sha256-r/AaFHrszJtwpe+tHyNi/XCfMxYpbsRg2Uqn0x3s2zc=" crossorigin="anonymous"></script> <script src="https://cdnjs.cloudflare.com/ajax/libs/jquery.hoverintent/1.10.1/jquery.hoverIntent.min.js" integrity="sha512-gx3WTM6qxahpOC/hBNUvkdZARQ2ObXSp/m+jmsEN8ZNJPymj8/Jamf8+/3kJQY1RZA2DR+KQfT+b3JEB0r9YRg==" crossorigin="anonymous"></script> <script src="https://cdnjs.cloudflare.com/ajax/libs/spin.js/4.1.0/spin.min.js" integrity="sha512-CbohqWjAgarTqRHcX1MbwkF2pujwbsCee1PABpnBWC+VqSldvlNEEI5+4OSsR/HbFQOFFpwY2YvZZNjBMxNnXg==" crossorigin="anonymous"></script> <script type="text/javascript" src="https://cdnjs.cloudflare.com/ajax/libs/jquery.colorbox/1.6.4/jquery.colorbox-min.js"></script> <script type="text/javascript" src="https://cdnjs.cloudflare.com/ajax/libs/jquery-deparam/0.5.3/jquery-deparam.min.js"></script> <script type="text/javascript" src="https://www.ddbj.nig.ac.jp/assets/js/jquery.trace.js"></script> <script type="text/javascript" src="https://www.ddbj.nig.ac.jp/assets/js/jquery.json_search.js"></script> <link rel="icon" href="https://www.ddbj.nig.ac.jp/assets/images/favicon_ddbj.ico"> <link rel="stylesheet" href="https://www.ddbj.nig.ac.jp/assets/css/colorbox.css" /> <link rel="stylesheet" href="https://www.ddbj.nig.ac.jp/assets/css/main.css" /> <link rel="alternate" type="application/rss+xml" title="My Site RSS" href="/feed.xml" /> <script src="https://www.ddbj.nig.ac.jp/assets/js/main.js"></script> </head> <body data-category="ddbj"> <script src="https://www.ddbj.nig.ac.jp/assets/js/ddbj_common_framework.js" id="DDBJ_common_framework" style="display: block; height: 40px;" data-bottom-menu="true" data-ddbj-home-page="true" data-search="true" ></script> <section class="top-news-view"> <div class="inner"> <ul> <li class="item"> <a href="https://www.ddbj.nig.ac.jp/news/en/2024-10-22-e">On Cyber Threats against DDBJ, a node of the International Nucleotide Sequence Database Collaboration</a> </li> <li class="item"> <a href="https://www.ddbj.nig.ac.jp/news/en/2024-11-22-e">(27st November 9:00-November 28th 12:00)Announcement of D-way/MSS suspension</a> </li> </ul> </div> </section> <div id="primary"> <header id="PageHeader"> <div class="inner"> <div class="page-title"> <p class="title -normal">DDBJ Annotated/Assembled Sequences</p> </div> <nav class="tab-menu-view"> <ul class="tabmenucontainer"> <li class=""> <a href="/ddbj/index-e.html">Home</a> </li> <li class=" -haschild -current"> <a href="/ddbj/submission-e.html">Submission</a> <ul> <li> <a href="/ddbj/submission-e.html">Before Submission</a> </li> <li> <a href="/ddbj/web-submission-e.html">Web submission</a> </li> <li> <a href="/ddbj/mss-e.html">Mass Submission</a> </li> <li> <a href="/ddbj/update-e.html">Data Update</a> </li> </ul> </li> <li class=" -haschild"> <a href="http://ddbj.nig.ac.jp/arsa/?lang=en">Search</a> <ul> <li> <a href="http://getentry.ddbj.nig.ac.jp/top-e.html">getentry</a> </li> <li> <a href="http://ddbj.nig.ac.jp/arsa/?lang=en">ARSA</a> </li> </ul> </li> <li class=" -haschild"> <a href="/ddbj/flat-file-e.html">Flat file</a> <ul> <li> <a href="/ddbj/feature-table-e.html">Feature Table</a> </li> <li> <a href="/ddbj/features-e.html">Feature key</a> </li> <li> <a href="/ddbj/qualifiers-e.html">Qualifier key</a> </li> <li> <a href="/ddbj/sequence-e.html">Nucleotide Sequences</a> </li> <li> <a href="/ddbj/organism-e.html">Organism qualifier</a> </li> <li> <a href="/ddbj/identifiers-e.html">Identifiers</a> </li> <li> <a href="/ddbj/location-e.html">Description of Location</a> </li> <li> <a href="/ddbj/cds-e.html">Protein Coding Sequence</a> </li> <li> <a href="/ddbj/geneticcode-e.html">The Genetic Codes</a> </li> <li> <a href="/ddbj/code-e.html">Codes Used in Sequence Description</a> </li> <li> <a href="/ddbj/example-e.html">Description Examples of Sequence Data</a> </li> </ul> </li> <li class=" -haschild"> <a href="/ddbj/data-categories-e.html">Data categories</a> <ul> <li> <a href="/ddbj/genome-e.html">Data Submission from Genome Project</a> </li> <li> <a href="/ddbj/pseudohaplotype-e.html">Pseudohaplotype</a> </li> <li> <a href="/ddbj/wgs-e.html">WGS</a> </li> <li> <a href="/ddbj/finished_level_genome-e.html">Finished level genomic sequences</a> </li> <li> <a href="/ddbj/metagenome-assembly-e.html">Metagenome Assembly</a> </li> <li> <a href="/ddbj/single-amplified-genome-e.html">Single amplified genome</a> </li> <li> <a href="/ddbj/htg-e.html">HTG</a> </li> <li> <a href="/ddbj/environmental-e.html">Environmental sample</a> </li> <li> <a href="/ddbj/env-e.html">ENV</a> </li> <li> <a href="/ddbj/tls-e.html">TLS</a> </li> <li> <a href="/ddbj/transcriptome-e.html">Data Submission from Transcriptome Project</a> </li> <li> <a href="/ddbj/tsa-e.html">TSA</a> </li> <li> <a href="/ddbj/est-e.html">EST</a> </li> <li> <a href="/ddbj/htc-e.html">HTC</a> </li> <li> <a href="/ddbj/tpa-e.html">Third Party Data (TPA)</a> </li> </ul> </li> <li class=""> <a href="/faq/en/index-e.html?tag=ddbj">FAQ</a> </li> <li class=" -haschild"> <a href="/ddbj/index-e.html">Other</a> <ul> <li> <a href="/ddbj/patent-data-e.html">Patent</a> </li> <li> <a href="/ddbj/mga-e.html">MGA</a> </li> </ul> </li> </ul> </nav> </div> </header> <section id="NavigationAndMainView"> <div class="inner"> <div class="subview"> <nav id="TableOfContents" class="internal-link"> </nav> </div> <section id="MainContentView" class="mainview"> <header class="header"> <nav class="breadcrumb-view"> <ul> <li> <a href="https://www.ddbj.nig.ac.jp/index-e.html">Home</a> </li> <li> <a href="https://www.ddbj.nig.ac.jp/ddbj/index-e.html">ddbj</a> </li> <li><a>MSS - Mass Submission System</a></li> </ul> </nav> <h1 class="title">MSS - Mass Submission System</h1> </header> <main class="md-content"> <div class="attention"> <p><span class="red">Submission of research data from human subjects</span></p> <p>For all data from human subjects researches submitted to DDBJ, it is submitter’s responsibility to ensure that the dignity and the right of participant (human subject) is protected in accordance with all applicable laws, regulations and policies of submitter’s institute. In principle, make sure to remove any direct personal identifiers of human subjects from your submissions. Before submission, read “<a href="/policies-e.html#submission-of-human-data">Submission of research data from human subjects</a>”.</p> </div> <h2 id="outline">Overview</h2> <p>Mass Submission System (MSS) is the service to accept relatively large scale nucleotide sequence data (not reads) through sending text files. The nucleotide sequence data belonging to <span class="red">either of the following cases</span> should be submitted via MSS, because they are not acceptable via the <a href="/ddbj/web-submission-e.html">NSSS: DDBJ Nucleotide Sequence Submission System</a>. Please note the points other than number or length of your data.</p> <h4 id="a-either-of-the-following-categories-or-amounts-of-sequence-data">a) Either of the following categories or amounts of sequence data</h4> <div class="attention"> <ul> <li><a href="/ddbj/est-e.html">EST</a>, <a href="/ddbj/tsa-e.html">TSA</a>, <a href="/ddbj/htc-e.html">HTC</a>, <a href="/ddbj/gss-e.html">GSS</a>, <a href="/ddbj/htg-e.html">HTG</a>, <a href="/ddbj/wgs-e.html">WGS</a>, <a href="/ddbj/tls-e.html">TLS</a>, <a href="/ddbj/tpa-e.html">TPA</a> <ul> <li>See <a href="/documents/data-categories-e.html#division">Categories for Sequence Data</a> in detail.</li> </ul> </li> <li>Submission with long sequences, greater than 500 kb in its length</li> <li>Complex submission containing many features for one sequence, more than 30 features</li> <li>Submission consists of large number of sequences, greater than 100 in total</li> </ul> </div> <h4 id="b-regardless-finished-or-draft-level-sequence-data-of-whole-length-scale-replicons">b) Regardless finished or draft level, sequence data of whole-length scale replicons</h4> <div class="attention"> <ul> <li>(Nuclear) genome</li> <li>Organelle genome</li> <li>Chromosome</li> <li>Virus/Phage genome/segments</li> <li>Plasmid</li> </ul> </div> <h4 id="c-sequence-data-to-be-described-bioproject-or-biosample-in-dblink">c) Sequence data to be described BioProject or BioSample in DBLINK</h4> <p>When you need to use <a href="/ddbj/flat-file-e.html#DBLINK">DBLINK</a> to link BioProject or BioSample, the following cases are included, but not limited to them.</p> <div class="attention"> <ul> <li>Sequence data from metagenome analyses, environmental profilings, and so on</li> <li>Sequence data of targeted genes to be linked each other</li> <li>When you are planning to submit or have submitted whole genome scale data obtained from the same samples. <ul> <li>Required to submit prokaryotic 16S rRNA gene for phylogenic report</li> <li>Advanced paper submission of any other targeted gene(s)/cluster region(s)</li> </ul> </li> </ul> </div> <ul> <li>Basically, if none of the above applies to your data, DDBJ recommends using the <a href="/ddbj/web-submission-e.html">NSSS: DDBJ Nucleotide Sequence Submission System</a>.</li> <li>If you are to submit reads from sequencers, see <a href="/dra/index-e.html">DRA, DDBJ Sequence Read Archive</a>.</li> </ul> <h2 id="flow">Registration process in MSS</h2> <style type="text/css"> .mssflow, .mssflowend {position: relative; display: inline-block; margin: 5px 5px; padding: 0.2em 0.5em; border: solid 2px #FFC091; border-radius: 8px; box-shadow: 0 3px 4px rgba(0, 0, 0, 0.32);} .mssflow::after { display: inline-block; margin: 0 0 0 -15px; border: 8px solid transparent; border-left-color: #FFC300; content: ""; position: relative; top: 4px; left: 28px; } table tr:hover { background-color: #FEC091; } </style> <!-- **Recomend:** A. Apply for the submission after you have prepared the submission files --> <div class="mssflow">Create the submission files</div> <div class="mssflow"><a href="https://mss.ddbj.nig.ac.jp/">Login to https://mss.ddbj.nig.ac.jp/ with D-way account</a></div> <div class="mssflow">Fill MSS form & upload the submission files</div> <div class="mssflow">DDBJ creats a new Mass-ID, sends autoreply</div> <div class="mssflow">Review, inquiring and answering by email</div> <div class="mssflow">You can obtain accession numbers</div> <div class="mssflowend">Release the data, or hold until specified date</div> <p></p> <!-- B. Apply for the submission, and then you prepare the submission files <div class="mssflow"><a href="https://mss.ddbj.nig.ac.jp/">Login to https://mss.ddbj.nig.ac.jp/ with D-way account</a></div> <div class="mssflow">Fill MSS form</div> <div class="mssflow">DDBJ creats a new Mass-ID, sends autoreply</div> <div class="mssflow">Create the submission files</div> <div class="mssflow">Upload the submission files</div> <div class="mssflow">Review, inquiring and answering by email</div> <div class="mssflow">You can obtain accession numbers</div> <div class="mssflowend">Release the data, or hold until specified date</div> --> <!-- <img src="/assets/images/ddbj/MSS-flow_20200916-en.png" alt="The Flow of MSS" title="" class=""> --> <h2 id="prep">Preparation of the submission files</h2> <h3 id="prep-1">Required files for the registration</h3> <ul> <li>Sequence file <ul> <li>The text file that contains all nucleotide sequences in FASTA-like format. see <a href="/ddbj/file-format-e.html#sequence">Submission file format:Sequence file</a>.</li> </ul> </li> <li>Annotation file <ul> <li>The tab delimited text file that contains metadata (submitters, reference) and annotation (Feature/Qualifier) see <a href="/ddbj/file-format-e.html#annotation">Submission file format:Annotation file</a></li> <li>For the prokaryote genome, you <a href="/ddbj/mss-e.html#mssdfast">can create the files by using DFAST(DDBJ Fast Annotation and Submission Tool)</a>.</li> </ul> </li> <li>AGP file(only in case of <a href="/ddbj/con-e.html">CON</a> entries) <ul> <li><span class="red"><strong>[Caution] DDBJ currently terminated accepting new submissions.</strong> </span></li> <li>The tab delimited text file to construct CON sequence that contains the order, orientation, and type of each piece entry. If nucleotide sequence can be assembled from an AGP file, you do not need to send a sequence file. see <a href="/ddbj/file-format-e.html#agp">Submission file format:AGP file</a></li> </ul> </li> </ul> <h3 id="prep-2">Getting BioProject & BioSample ID</h3> <ul> <li>According to the type of the data, you must obtain <a href="https://ddbj.nig.ac.jp/D-way/">BioProject ID and BioSample ID (also reserve locus_tag prefix)</a> to prepare the submission files.</li> <li>In principle, it is not acceptable to change the taken locus_tag prefix, so please be careful when registering the prefix.</li> <li>See the <a href="#mssprjtable">table</a> to know what data type needs BioProject and BioSample.</li> </ul> <h3 id="prep-3">Sample & Documentation</h3> <ul> <li><a href="/ddbj/file-format-e.html#sample">Sample annotations</a></li> <li><a href="https://www.ddbj.nig.ac.jp/ddbj/file-format-e.html">User guide of annotation file</a></li> <li>For prokaryote genome, we strongly recommend that you should use <a href="https://dfast.ddbj.nig.ac.jp/"><strong>DFAST (DDBJ Fast Annotation and Submission Tool)</strong></a>. <ul> <li>See <a href="/ddbj/mss-e.html#mssdfast">DFAST: creating the submission files</a> and obtain the submission files.</li> </ul> </li> <li>For <a href="/ddbj/genome-e.html">whole genome-scale sequence</a>, it is optional to describe biological features except source and assembly_gap. However, in case of a novel species that have not been reported so far, it is required to describe feature annotation against at least one genome as a representative.</li> <li>When you submit a genome with annotation, it is required to reserve <a href="/ddbj/qualifiers-e.html#locus_tag">locus_tag prefix</a> at the registration of <a href="/biosample/index-e.html">BioSample</a>.</li> <li>For <a href="/ddbj/tsa-e.html">TSA</a> data, it is optional (basically unnecessary) to describe biological features except source and assembly_gap.</li> <li>In case of <a href="/ddbj/est-e.html">EST</a>, you can not describe any biological features except source.</li> <li>See the <a href="#mssprjtable">table</a> to know what data type needs annotation.</li> </ul> <h3 id="prep-4">Tool for checking the submission files</h3> <p>Before submitting to DDBJ, <strong>the files must be checked with software tools</strong> provided from DDBJ.</p> <ul> <li>UME (Utilities for MSS file Error check) The tool checks the syntax, format and amino acid translation of CDS features in <a href="/ddbj/file-format-e.html#sequence">sequence file</a> and <a href="/ddbj/file-format-e.html#annotation">annotation file</a>. It includes both Parser and transChecker. OS: Windows, Linux/macOS Details:<a href="/ddbj/ume-e.html">UME User’s Manual</a></li> <li>Parser The tool checks the syntax and format of <a href="/ddbj/file-format-e.html#sequence">sequence file</a> and <a href="/ddbj/file-format.html#annotation">annotation file</a>. OS: Linux Details:<a href="/ddbj/parser-e.html">Parser User’s Manual</a></li> <li>transChecker The tool validates the amino acid translation of CDS features (protein-coding sequence) in <a href="/ddbj/file-format.html#annotation">annotation file</a> and <a href="/ddbj/file-format-e.html#sequence">sequence file</a>. OS: Linux Details:<a href="/ddbj/transchecker-e.html">transChecker User’s Manual</a></li> </ul> <div class="attention"> <p>Download: <a href="/ddbj/mss-tool-e.html">Validation tools for MSS data files</a></p> </div> <ul> <li>Validation tools for data files do not have any function to make files for your submission. So, please make your submission files by using text editor, spreadsheet software, or some application in your PC, appropriately.</li> <li>Syntax errors due to using undefined characters, contamination of control codes, and so on would cause a major obstacle during processing submitted data, which may result in significant delay of issuing accession numbers.</li> <li>When you have to describe CDS (protein coding sequences) as one of <a href="/ddbj/file-format-e.html#biological_feature">Biological feature</a> for the annotation of your sequence, you must check the amino acid translation of CDS features by using UME or transChecker tool before submitting to DDBJ.</li> <li>Before installing the validation tools, see <a href="/ddbj/mss-tool-e.html#license">End-user license agreement</a>.</li> </ul> <h2 id="account">Creating account</h2> <ul> <li>If you have not obtained D-way account, create your account by clicking “Register for a new account” on <a href="https://ddbj.nig.ac.jp/D-way/">D-way</a> (see <a href="/account-e.html">HELP</a>).</li> <li>If you would like to use SCP/SFTP for transferring the files to DDBJ, you also need to <a href="/account-e.html#register-public-key">register a public key</a> on the account setting page on D-way. See “<a href="/upload-e.html">Data upload</a>” for detailed descriptions how to transfer the files.</li> </ul> <h2 id="submission">Applying for the registration</h2> <p>Please apply for your submission through <a href="https://mss.ddbj.nig.ac.jp/">“Application form for MSS”</a>. You can include multiple submission to the same application only if the submission data meet the requirements shown below. You need to separate the submission if any one of the files does not meet the requirements.</p> <p>You can upload the files in the application form if you have prepared the submission files. If you have not created the submission files and could not upload the files during the MSS application, please upload the submission file later. You will see the description for uploading the files in the reply email.</p> <p>A Mass-ID is issued just after you complete the application. The Mass-ID (e.g. [DDBJ:NSUB000001]) is included within the subject name of the email. Please keep the subject when you reply to the email sent from DDBJ.</p> <dl> <dt>Requirements that you can include the same application</dt> <dd>· All submission files have the same contact person</dd> <dd>· All submission files have the same data type</dd> <dd>· All submission files have the same hold date</dd> </dl> <ul> <li>Example cases that are allowed for the same application <ul> <li>Draft genomes consist of twenty Bacterial strains ➞ WGS: Whole Genome Shotgun</li> <li>Finished level genome sequences from three isolates of eukaryote genomes ➞ GNM: Finished Level Genome sequence, non-WGS</li> <li>Assembled transcribed sequences among the same species. It comprises multiple sets from different organisms. ➞ TSA: Transcriptome Shotgun Assembly</li> </ul> </li> </ul> <h3 id="submission-1">Example case where you must apply for the registration more than two times</h3> <dl> <dt>a. Draft genome of chromosome, and complete plasmid sequences from a Bacterial strain</dt> <dd>You must separate into two submissions. <ul> <li>Draft genome ➞ WGS: Whole Genome Shotgun</li> <li>Complete plasmid sequence(s) ➞ MISC: Sequences that are not included in above types</li> </ul> </dd> <dt>b. Draft genome sequences of chromosomes from a eukaryotic isolate, and complete organelle genome</dt> <dd>You must separate into two submissions. <ul> <li>Draft genome ➞ WGS: Whole Genome Shotgun</li> <li>Complete organelle sequence(s) ➞ MISC: Sequences that are not included in above types</li> </ul> </dd> <dt>c. Draft genome sequences of chromosomes from a eukaryotic isolate, and assembled transcript sequences in large-scale</dt> <dd>You must separate into two submissions. <ul> <li>Draft genome ➞ WGS: Whole Genome Shotgun</li> <li>Assembled transcripts ➞ TSA: Transcriptome Shotgun Assembly</li> </ul> </dd> </dl> <h2 id="sendfiles">How to upload the submission files</h2> <p>Submitters can transfer the submission files from MSS form by either one of the methods indicated below.</p> <ul> <li>Uploading from browser</li> <li>Specifying the DFAST job ID</li> <li>Loading the files which have been transferred to SFTP server <ul> <li>Select this method when the total submission files exceed more than 10 Gbytes in uncompressed size. You need <a href="/account-e.html#generate-key-pair">public/private key pair</a> to use SFTP. First, <a href="/account-e.html#register-public-key">register a public key</a> to your account, and then upload the files according to <a href="/upload-e.html">“Data upload”</a>.</li> <li>Read the description below</li> </ul> </li> </ul> <div class="attention"> <p><NOTE> Do not send submission files as email attachment unless there is some particular reason.</p> </div> <h3 id="sftpmss">File format for uploading to SFTP server</h3> <ul> <li>The destination directory is /mass</li> <li>The mass directory is a target to import the files when MSS Application Form is used. Therefore only the the submission files should be placed here.</li> <li>MSS form reads the files recursively from the subdirectories under mass/.</li> <li>There are some rules for the submission file name. As to the compressed files, the files in the compressed archive should be subject to the rule. <ul> <li>File extension of the annotation file should be either one of <span style="background-color: #e8e8e8;">.ann</span>, <span style="background-color: #e8e8e8;">.annt</span>, <span style="background-color: #e8e8e8;">.tsv</span>, or <span style="background-color: #e8e8e8;">.ann.txt</span></li> <li>File extension of the nucleotide sequence file should be either one of <span style="background-color: #e8e8e8;">.fasta</span>, <span style="background-color: #e8e8e8;">.seq.fa</span>, <span style="background-color: #e8e8e8;">.fa</span>, <span style="background-color: #e8e8e8;">.fna</span>, or <span style="background-color: #e8e8e8;">.seq</span></li> <li>An annotation file and a nucleotide sequence file must comprise a pair. The system determines as a pair whose filenames without the extension has the same name. <ul> <li>Excluding the case of re-submitting the submission files as the request from DDBJ curator.</li> </ul> </li> <li>Use alphanumeric and part of symbolic (excluding space, backtick, angle brackets “<>”, and parentheses “()”) characters for the file name. Do not include multibyte character such as Japanese font.</li> </ul> </li> <li>MSS Application Form can import the files from compressed archive. The following compression types are available. <ul> <li>gzip, bzip2, xz, lzip, lzma, lzop, zstd, compress e.g. 20230322-1.tar.gz 20230322-2.tar.bz2 20230322-3.tar.xz 20230322-4.zip 20230322-5.tar.lzma 20230322-6.tar.lzo 20230322-7.tar.zst 20230322-8.tar.Z</li> </ul> </li> </ul> <h2 id="review">Review of the submission files</h2> <p>Upload the submission files to DDBJ after you have checked the submission files by using <a href="/ddbj/mss-e.html#prep-4">the file checking tool</a>.</p> <p>DDBJ reviews submission files and then informs the submitter of some correction requests and/or inquiries. If there is no problem with the submitted the files, DDBJ will register the submitted data to our database and send the accession number(s) to the contact person and/or submitters by an email.</p> <p><em>Optional: Before preparing the entire sequence and annotation files, you can send a part of your data as a test submission, and then ask DDBJ whether the submission files are correctly created or not.</em></p> <h2 id="publication">Publication of the data</h2> <p>If you specified immediate release in the submission process, the submitted data are open to public as soon as possible. Whereas, if you specified <a href="/ddbj/submission-e.html#hold-date">the hold date</a>, the data will be released based on the principle of <a href="https://www.ddbj.nig.ac.jp/documents/data-release-policy-e.html">“Hold-Until-Published” data release</a>. The registered data are distributed in a flat file format converted according to a DDBJ defined rule, depending on the contents of sequence and annotation. Please refer to <a href="https://docs.google.com/spreadsheets/d/15gLGL5FMV8gRt46ezc2Gmb-R1NbYsIGMssB0MyHkcwE/edit#gid=961825804">the relationships between annotation files and flat files</a>.</p> <h2 id="mssprjtable">Requirement of BioProject, BioSample ID</h2> <ul> <li>Genome</li> </ul> <table> <thead> <tr> <th>Your submission</th> <th>BioProject</th> <th>BioSample</th> <th>Annotation with<br /> biological feature</th> <th>locus_tag</th> <th>Need DRA</th> <th>You should select</th> </tr> </thead> <tbody> <tr> <td>Draft genome w/ annotation</td> <td>M</td> <td>M</td> <td>M</td> <td>M</td> <td>OPT</td> <td>WGS</td> </tr> <tr> <td>Draft genome w/o annotation</td> <td>M</td> <td>M</td> <td>NR</td> <td>NR</td> <td>OPT</td> <td>WGS</td> </tr> <tr> <td>Finished level genome sequence, non-WGS</td> <td>M</td> <td>M</td> <td>M</td> <td>M</td> <td>OPT</td> <td>GNM</td> </tr> <tr> <td>Metagenome-Assembled Genome w/ annotation</td> <td>M</td> <td>M</td> <td>M</td> <td>M</td> <td>M</td> <td>MAG</td> </tr> <tr> <td>Metagenome-Assembled Genome w/o annotation</td> <td>M</td> <td>M</td> <td>NR</td> <td>NR</td> <td>M</td> <td>MAG</td> </tr> <tr> <td>Single Amplified Genome w/ annotation</td> <td>M</td> <td>M</td> <td>M</td> <td>M</td> <td>OPT</td> <td>SAG</td> </tr> <tr> <td>Single Amplified Genome w/o annotation</td> <td>M</td> <td>M</td> <td>NR</td> <td>NR</td> <td>OPT</td> <td>SAG</td> </tr> <tr> <td>High Throughput Genomic Sequences</td> <td>M</td> <td>M</td> <td>OPT</td> <td>NR</td> <td>OPT</td> <td>HTG</td> </tr> <tr> <td>Transcriptome Shotgun Assembly</td> <td>M</td> <td>M</td> <td>OPT</td> <td>NR</td> <td>M</td> <td>TSA</td> </tr> <tr> <td>High Throughput cDNA Sequences</td> <td>M</td> <td>M</td> <td>OPT</td> <td>NR</td> <td>OPT</td> <td>HTC</td> </tr> <tr> <td>Expressed Sequence Tags</td> <td>M</td> <td>M</td> <td>NR</td> <td>NR</td> <td>OPT</td> <td>EST</td> </tr> <tr> <td>Virus/Phage genome</td> <td>NR</td> <td>NR</td> <td>OPT</td> <td>NR</td> <td>OPT</td> <td>MISC</td> </tr> <tr> <td>Plasmid genome only</td> <td>NR</td> <td>NR</td> <td>OPT</td> <td>NR</td> <td>OPT</td> <td>MISC</td> </tr> <tr> <td>Organelle genome only</td> <td>NR</td> <td>NR</td> <td>OPT</td> <td>NR</td> <td>OPT</td> <td>MISC</td> </tr> <tr> <td>Finished Level Genome + Plasmid</td> <td>M</td> <td>M</td> <td>M</td> <td>M</td> <td>OPT</td> <td>GNM, MISC for each submission</td> </tr> <tr> <td>Finished Level Genome + Organelle</td> <td>M</td> <td>M</td> <td>M</td> <td>M</td> <td>OPT</td> <td>GNM, MISC for each submission</td> </tr> </tbody> </table> <p>M, Mandatory; NR, Not required; OPT, Optional</p> <ul> <li>Transcriptome</li> </ul> <table> <thead> <tr> <th>Your submission</th> <th>BioProject</th> <th>BioSample</th> <th>Annotation with<br /> biological feature</th> <th>locus_tag</th> <th>Need DRA</th> <th>You should select</th> </tr> </thead> <tbody> <tr> <td>Transcriptome Shotgun Assembly</td> <td>M</td> <td>M</td> <td>OPT</td> <td>NR</td> <td>M</td> <td>TSA</td> </tr> <tr> <td>High Throughput cDNA Sequences</td> <td>M</td> <td>M</td> <td>OPT</td> <td>NR</td> <td>OPT</td> <td>HTC</td> </tr> <tr> <td>Expressed Sequence Tags</td> <td>M</td> <td>M</td> <td>NR</td> <td>NR</td> <td>OPT</td> <td>EST</td> </tr> </tbody> </table> <p>M, Mandatory; NR, Not required; OPT, Optional</p> <ul> <li>Targeted Locus Study</li> </ul> <table> <thead> <tr> <th>Your submission</th> <th>BioProject</th> <th>BioSample</th> <th>Annotation with<br /> biological feature</th> <th>locus_tag</th> <th>Need DRA</th> <th>You should select</th> </tr> </thead> <tbody> <tr> <td>Targeted Locus Study</td> <td>M</td> <td>M</td> <td>M</td> <td>NR</td> <td>OPT</td> <td>TLS</td> </tr> </tbody> </table> <p>M, Mandatory; NR, Not required; OPT, Optional</p> <h2 id="mssdfast">DFAST for the submission of prokaryote genomes</h2> <h3 id="mssdfast-1">DFAST(DDBJ Fast Annotation and Submission Tool)</h3> <p><a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a> is a rapid annotation pipeline service for prokaryote genomes, which also generates the annotation files that can be directly submitted to DDBJ. We strongly recommend that the submitters use <a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a> for the registration of the prokaryote genomes to Annotated/Assembled Sequences database.</p> <h3 id="mssdfast-2">Registration procedure for the prokaryote genome</h3> <ol> <li>You need <a href="/account-e.html">D-way account</a> which has been obtained through <a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a> in order to register the prokaryote genome and the annotation into DDBJ. Registration of <a href="https://www.ddbj.nig.ac.jp/bioproject/index-e.html">BioProject</a>, <a href="https://www.ddbj.nig.ac.jp/biosample/index-e.html">BioSample</a> and <a href="https://www.ddbj.nig.ac.jp/ddbj/locus_tag-e.html">locus_tag prefix</a> when biological feature are descriebed are required in advance.</li> <li>If you login to <a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a> with <a href="/account-e.html">D-way account</a>, you can manage the <a href="https://dfast.ddbj.nig.ac.jp/help_login">jobs analyzed</a> in <a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a>. If you have not obtained the login account, see <a href="/account-e.html#account-for-bioproject-biosample-submissions">“Create a D-way account in the website”</a> to get a new account.</li> </ol> <h3 id="mssdfast-3">How to submit the data obtained in DFAST</h3> <ol> <li>Login to <a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a> with your account. First, upload the fasta file in “job submission page”, and start the job to analyze the genome. At this stage, you can obtain a job ID. When the job is finished, click “DDBJ submission” tab on the page. The annotation and sequence files, which are needed for MSS submission, are created after you fill necessary data (e.g. BioProject ID, BioSample ID, locus_tag prefix, and other metadata) into the form in metadata section.(*1) Finally, click “Format Check” to do the syntax check of the files.</li> <li>Submitting by DFAST job ID <ol> <li>Copy the target job ID (format: <span style="font-family:Arial;">########-####-####-####-############</span>)</li> </ol> </li> <li>Submitting the files downloaded from DFAST <ol> <li>In the <a href="https://dfast.ddbj.nig.ac.jp/help_login">job management page</a>, add checkmark to the job number that you would like to submit to DDBJ.</li> <li>Select “MSS” for the file format type, and click “DOWNLOAD” to download the submission files. Please check the meta information carefully. If you encounter a warning, check again and correct the metadata that you have filled (*2). If you would like to edit the annotation and meta data on a text file, download the files and open them by text editor.</li> </ol> </li> <li>Apply for the submission through <a href="https://mss.ddbj.nig.ac.jp/">“Application form for MSS”</a>. According to the process shown in <a href="/ddbj/mss-e.html#flow">“The Flow of MSS”</a>, send the submission files to DDBJ.</li> </ol> <p>*1 You can use <a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a> and obtain the result of genome annotation without logging in. After you login to DFAST, you can import the job into your account by the function of “Job History” on the menu bar if you remember the job ID.</p> <p>*2 The function for checking the metadata in <a href="https://dfast.ddbj.nig.ac.jp/">DFAST</a> is simple. You may be asked to correct the files by DDBJ curators after you submit the data.</p> </main> <aside class="related-pages"> <h2 class="caption">Related pages</h2> <div class="navigation"> <nav> <ul> <li> <a href="/ddbj/file-format-e.html">Submission File Format</a> </li> <li> <a href="/ddbj/mss-tool-e.html">Validation tools for MSS data files</a> </li> <li> <a href="/ddbj/ume-e.html">UME User’s manual</a> </li> <li> <a href="/ddbj/parser-e.html">Parser User’s Manual</a> </li> <li> <a href="/ddbj/transchecker-e.html">transChecker User’s Manual</a> </li> <li> <a href="/ddbj/validator-e.html">Validator error message</a> </li> <li> <a href="/ddbj/mss-form-e.html">Application form for MSS</a> </li> </ul> </nav> </div> </aside> </section> </div> </section> </div> <footer></footer> <div id="back-top"></div> </body> </html>