CINXE.COM

Examples and use cases

<!DOCTYPE html> <html lang="en-gb"> <head> <title>Examples and use cases</title> <!--[if lte IE 7]><link rel="stylesheet" type="text/css" media="all" href="/minified/98adc89a8b4418c03a4c81d5bf76272d.ie7.css"/><![endif]--><!--[if gt IE 7]><link rel="stylesheet" type="text/css" media="all" href="/minified/98adc89a8b4418c03a4c81d5bf76272d.css"/><![endif]--><!--[if !IE]><!--><link rel="stylesheet" type="text/css" media="all" href="/minified/98adc89a8b4418c03a4c81d5bf76272d.css"/><!--<![endif]--><!--[if lte IE 7]><link rel="stylesheet" type="text/css" media="all" href="/minified/9bafd8bb591fb39bcb766b0160434c45.image.ie7.css"/><![endif]--><!--[if gt IE 7]><link rel="stylesheet" type="text/css" media="all" href="/minified/9bafd8bb591fb39bcb766b0160434c45.image.css"/><![endif]--><!--[if !IE]><!--><link rel="stylesheet" type="text/css" media="all" href="/minified/9bafd8bb591fb39bcb766b0160434c45.image.css"/><!--<![endif]--> <link rel="icon" type="image/png" href="/i/ensembl-favicon.png" /> <link href="/apple-touch-icon.png" rel="apple-touch-icon" type="image/png" /> <link type="application/opensearchdescription+xml" rel="search" title="Ensembl (All)" href="//useast.ensembl.org/opensearch/all.xml" /> <meta name="google-site-verification" content="tG3WW0bg9gNaIHH002gJl-fV2k0bzf-ZxC8pBqXTaNo" /> <meta name="viewport" content="target-densitydpi=device-dpi, width=device-width, initial-scale=1.0, maximum-scale=2.0, user-scalable=yes" /> <script>var prefetch = ['/minified/8b49a46e34da8ff984905ba1470c2f54.png','/minified/9cf07063d9febd1e9344ca4a6cc02cb1.png','/minified/ff8270b50aa958c83f1fe0c29e61a75a.png','/minified/95cdc75b8720f22b080d415a4915e7a1.png','/minified/f37021b24ed3c44d7252707733a6c8e3.png','/minified/79a5fb49f716554891451d08438ee679.png','/minified/d2faa6d6cba3f9fe2b841c9ca08bb867.jpg'];</script> </head> <body class="ie ie7 ie67 static" data-pace="2" id="ensembl-webpage"> <div id="min_width_container"> <div id="min_width_holder"> <!-- Announcement Banner --> <!-- /Announcement Banner --> <div id="masthead" class="js_panel"> <input type="hidden" class="panel_type" value="Masthead" /> <div class="logo_holder"><a href="/"><div class="logo-header print_hide" title="Ensembl mirror">&nbsp;</div></a><img src="/i/e-ensembl_print.gif" alt="Ensembl mirror" title="Ensembl mirror" class="screen_hide_inline" style="width:170px;height:45px" /></div> <div class="mh print_hide"> <div class="account_holder"><div class="_account_holder"><div class="account-loading">Loading&hellip;</div><form action="/Ajax/accounts_dropdown"></form></div></div> <div class="tools_holder"><ul class="tools"><li><a class="constant" href="/Multi/Tools/Blast">BLAST/BLAT</a></li><li><a class="constant" href="/info/docs/tools/vep/">VEP</a></li><li><a class="constant" href="/info/docs/tools/index.html">Tools</a></li><li><a class="constant" href="/biomart/martview">BioMart</a></li><li><a class="constant" rel="nofollow" href="/info/data/">Downloads</a></li><li><a class="constant" rel="nofollow" href="/info/">Help &amp; Docs</a></li><li class="last"><a class="constant" target="_blank" href="https://www.ensembl.info">Blog</a></li></ul><div class="more"><a href="#">More <span class="arrow">&#9660;</span></a></div></div> <div class="search_holder print_hide"> <div id="searchPanel" class="js_panel"> <input type="hidden" class="panel_type" value="SearchBox" /> <form action="/Multi/Psychic"> <div class="search print_hide"> <div class="sites button"> <img class="search_image no-sprite badge-16" src="/i/search/ensembl.gif" alt="" /> <img src="/i/search/down.gif" style="width:7px" alt="" /> <input type="hidden" name="site" value="ensembl_all" /> </div> <div> <label class="hidden" for="se_q">Search terms</label> <input class="query inactive" id="se_q" type="text" name="q" value="Search all species&hellip;" data-role="none" onkeydown="if (event.keyCode == 13) { $(this).closest('form').submit(); return false; }" /> </div> <div class="button"><img src="/i/16/search.png" alt="Search&nbsp;&raquo;" onClick="$(this).closest('form').submit()" /></div> </div> <div class="site_menu hidden"> <div class="ensembl_all"><img class="no-sprite" src="/i/search/ensembl.gif" class="badge-16" alt="Search all species"/>Search all species<input type="hidden" value="Search all species&hellip;" /></div> <div class="ensembl_genomes"><img class="no-sprite" src="/i/search/ensembl_genomes.gif" class="badge-16" alt="Search Ensembl genomes"/>Search Ensembl genomes<input type="hidden" value="Search Ensembl genomes&hellip;" /></div> <div class="vega"><img class="no-sprite" src="/i/search/vega.gif" class="badge-16" alt="Search Vega"/>Search Vega<input type="hidden" value="Search Vega&hellip;" /></div> <div class="ebi"><img class="no-sprite" src="/i/search/ebi.gif" class="badge-16" alt="Search EMBL-EBI"/>Search EMBL-EBI<input type="hidden" value="Search EMBL-EBI&hellip;" /></div> <div class="sanger"><img class="no-sprite" src="/i/search/sanger.gif" class="badge-16" alt="Search Sanger"/>Search Sanger<input type="hidden" value="Search Sanger&hellip;" /></div> </div> </form> </div> </div> </div> <div class="tabs_holder print_hide"><ul class="tabs"><li class=" short_tab" style="display:none"><a href="/info/website/" title="Using this website">Using</a></li><li class=" short_tab" style="display:none"><a href="/info/genome/" title="Annotation and prediction">Annotation</a></li><li class=" short_tab" style="display:none"><a href="/info/data/" title="Data access">Data</a></li><li class=" active short_tab" style="display:none"><a href="/info/docs/" title="API &amp; software">API</a></li><li class=" short_tab" style="display:none"><a href="/info/about/" title="About us">About</a></li><li class=" long_tab" style="display:block"><a href="/info/website/">Using this website</a></li><li class=" long_tab" style="display:block"><a href="/info/genome/">Annotation and prediction</a></li><li class=" long_tab" style="display:block"><a href="/info/data/">Data access</a></li><li class=" active long_tab" style="display:block"><a href="/info/docs/">API &amp; software</a></li><li class=" long_tab" style="display:block"><a href="/info/about/">About us</a></li></ul></div> </div> <div id="main_holder"> <div id="page_nav_wrapper"> <div id="page_nav" class="nav print_hide js_panel floating"> <input type="hidden" class="panel_type" value="LocalContext" /> <div class="header">In this section</div> <ul class="local_context"> <li class="top_level"><img src="/i/leaf.gif" class="toggle vep_tutorial.html" alt=""><a href="vep_tutorial.html"><b>Tutorial</b></a></li><li class="top_level"><img src="/i/leaf.gif" class="toggle vep_download.html" alt=""><a href="vep_download.html"><b>Download and install</b></a></li><li class="top_level"><img src="/i/leaf.gif" class="toggle vep_options.html" alt=""><a href="vep_options.html"><b>Running VEP</b></a></li><li class="top_level"><img src="/i/leaf.gif" class="toggle vep_cache.html" alt=""><a href="vep_cache.html"><b>Annotation sources</b></a></li><li class="top_level"><img src="/i/leaf.gif" class="toggle vep_filter.html" alt=""><a href="vep_filter.html"><b>Filtering results</b></a></li><li class="top_level"><img src="/i/leaf.gif" class="toggle vep_custom.html" alt=""><a href="vep_custom.html"><b>Custom annotations</b></a></li><li class="top_level"><img src="/i/leaf.gif" class="toggle vep_plugins.html" alt=""><a href="vep_plugins.html"><b>Plugins</b></a></li><li class="top_level"><img src="/i/leaf.gif" class="toggle vep_example.html" alt=""><a href="vep_example.html"><b>Examples and use cases</b></a></li><li class="last"><img src="/i/leaf.gif" class="toggle vep_other.html" alt=""><a href="vep_other.html"><b>Other information</b></a></li> </ul> <div class="subheader">On this page</div> <ul class="local_context" style="border-width:0"> <li class="top_level"><img src="/i/leaf.gif"><a href="#examples">Example commands</a></li><li class="top_level"><img src="/i/leaf.gif"><a href="#gnomad">gnomAD</a></li><li class="top_level"><img src="/i/leaf.gif"><a href="#gerp">Conservation scores</a></li><li class="top_level"><img src="/i/leaf.gif"><a href="#dbNSFP">dbNSFP</a></li><li class="top_level"><img src="/i/leaf.gif"><a href="#StructVar">Structural variants</a></li><li class="top_level"><img src="/i/leaf.gif"><a href="#pangenomes">Pangenome assemblies</a></li><li class="last"><img src="/i/leaf.gif"><a href="#citations">Citations and VEP users</a></li> </ul> <div class="js_panel" style="margin:16px 0 0 8px"> <input type="hidden" class="panel_type" value="SearchBox" /> <form method="get" action="/Multi/Psychic" class="clear search-form"><fieldset><input type="hidden" name="site" value="ensembl" /><input type="hidden" name="facet_feature_type" value="Documentation" /><div class="form-field ff-multi"><div class="ff-right ff-inline"><input value="Search documentation&#8230;" name="q" type="text" id="q" size="20" class="ftext inactive input optional _string query" /><input class="fbutton" id="6XGj7mtF_1" type="submit" value="Go" /></div></div></fieldset></form> </div> <p class="invisible">.</p> </div> </div> <div id="main"> <ul class="breadcrumbs print_hide"><li><a class="home" href="//useast.ensembl.org">Home</a></li><li><a href="/info/">Help &amp; Documentation</a></li><li><a href="/info/docs/" title="API & Software">API & Software</a></li><li><a href="/info/docs/tools/" title="Ensembl Tools">Ensembl Tools</a></li><li><a href="/info/docs/tools/vep/" title="Ensembl Variant Effect Predictor (VEP)">Ensembl Variant Effect Predictor (VEP)</a></li><li><a href="/info/docs/tools/vep/script/" title="VEP command line">VEP command line</a></li><li class="last">Examples and use cases</li></ul> <div id="content"><div id="static"> <div> <div style="float:right"><img src="/img/vep_logo.png"/></div> <h1 id="top"><span style="color:#006;padding-right:15px">Variant Effect Predictor</span><span style="color:#666"><img src="/i/16/user.png"/> Examples and use cases</span></h1> <hr/> <h2 id="examples">Example commands</h2> <ul> <li> <p>Read input from <b>STDIN</b>, output to <b>STDOUT</b></p> <pre class="code sh_sh">./vep --cache -o stdout</pre> </li> <li> <p>Add <b>regulatory</b> region <b>consequences</b></p> <pre class="code sh_sh">./vep --cache -i variants.txt --regulatory</pre> </li> <li> <p>Input file <kbd>variants.vcf.txt</kbd>, input file <b>format VCF</b>, add <b>gene symbol</b> identifiers</p> <pre class="code sh_sh">./vep --cache -i variants.vcf.txt --format vcf --symbol</pre> </li> <li> <p><b>Filter</b> out <b>common variants</b> based on 1000 Genomes data</p> <pre class="code sh_sh">./vep --cache -i variants.txt --filter_common</pre> </li> <li> <p><b>Force overwrite</b> of output file <kbd>variants_output.txt</kbd>, check for existing <b>co-located variants</b>, output only <b>coding sequence</b> consequences, output <b>HGVS names</b></p> <pre class="code sh_sh">./vep --cache -i variants.txt -o variants_output.txt --force --check_existing --coding_only --hgvs</pre> </li> <li> <p>Specify <b>DB connection parameters</b> in registry file <kbd>ensembl.registry</kbd>, add <b>SIFT</b> score and prediction, <b>PolyPhen</b> prediction</p> <pre class="code sh_sh">./vep --database -i variants.txt --registry ensembl.registry --sift b --polyphen p</pre> </li> <li> <p>Connect to <b>Ensembl Genomes</b> db server for <i>Arabidopsis thaliana</i></p> <pre class="code sh_sh">./vep --database -i variants.txt --genomes --species arabidopsis_thaliana</pre> </li> <li> <p>Load config from <b>ini file</b>, run in <b>quiet mode</b></p> <pre class="code sh_sh">./vep --config vep.ini -i variants.txt -q</pre> </li> <li> <p>Use <b>cache</b> in <kbd>/home/vep/mycache/</kbd>, use <b>gzcat</b> instead of zcat</p> <pre class="code sh_sh">./vep --cache --dir /home/vep/mycache/ -i variants.txt --compress gzcat</pre> </li> <li> <p>Add custom position-based <b>phenotype</b> annotation from remote <b>BED file</b></p> <pre class="code sh_sh">./vep --cache -i variants.vcf --custom file=ftp://ftp.myhost.org/data/phenotypes.bed.gz,short_name=phenotype</pre> </li> <li> <p>Use the <b>plugin</b> named MyPlugin, output only the variation name, feature, consequence type and MyPluginOutput <b>fields</b></p> <pre class="code sh_sh">./vep --cache -i variants.vcf --plugin MyPlugin --fields Uploaded_variation,Feature,Consequence,MyPluginOutput</pre> </li> <li> <p>Right align variants before consequence calculation. For more information, see <a href="/info/docs/tools/vep/script/vep_other.html#shifting" target="_blank">here</a>.</p> <pre class="code sh_sh">./vep --cache -i variants.vcf --shift_3prime 1</pre> </li> <li> <p>Report uploaded allele before minimisation. For more information, see <a href="/info/docs/tools/vep/script/vep_options.html#opt_uploaded_allele" target="_blank">here</a>.</p> <pre class="code sh_sh">./vep --cache -i variants.vcf --uploaded_allele</pre> </li> </ul> <hr/> <h2 id="gnomad">gnomAD</h2> <p> <a href="http://gnomad.broadinstitute.org/" rel="external">gnomAD</a> exome frequency data is included in VEP's cache files from release 90, replacing ExAC; use <a href="vep_options.html#opt_af_gnomade">--af_gnomade</a> to enable using this data. VEP can also retrieve frequency data from the gnomAD genomes set or ExAC via VEP's custom annotation functionality.<p> <p>For the latest gnomAD data, please visit <a href="https://gnomad.broadinstitute.org/downloads" rel="external">gnomAD downloads</a>.<p> <ol> <li>VEP requires Bio::DB::HTS to read data from tabix-indexed VCFs - see <a href="vep_download.html#installer"><img style="vertical-align:bottom" src="/i/16/download.png"/> installation instructions</a></li> <li>Ensembl's FTP site hosts abridged VCF files for gnomAD and ExAC, additionally remapped to GRCh38 using <a href="http://crossmap.sourceforge.net/" rel="external">CrossMap</a>. It is possible for VEP to read these files directly from their remote location, though for optimal performance the VCF and index should be downloaded to a local file system. <ul> <li> <b>GRCh38</b> <ul> <li>gnomAD genomes (r2.1, remapped with CrossMap): <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh38/variation_genotype/gnomad/r2.1/genomes/">[VCFs and tabix indexes]</a> </li> <li>gnomAD exomes (r2.1, remapped with CrossMap): <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh38/variation_genotype/gnomad/r2.1/exomes/">[VCFs and tabix indexes]</a> </li> <li>ExAC (v0.3, remapped using CrossMap): <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh38/variation_genotype/ExAC.0.3.GRCh38.vcf.gz">[VCF]</a> <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh38/variation_genotype/ExAC.0.3.GRCh38.vcf.gz.tbi">[tabix index]</a> </li> </ul> </li> <li> <b>GRCh37</b> <ul> <li>gnomAD genomes (r2.1): <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh37/variation_genotype/">[VCF and tabix indexes]</a> </li> <li>gnomAD exomes (r2.1): <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh37/variation_genotype/">[VCF and tabix indexes]</a> </li> <li>ExAC (v0.3): <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh37/variation_genotype/ExAC.0.3.GRCh37.vcf.gz">[VCF]</a> <a href="https://ftp.ensembl.org/pub/data_files/homo_sapiens/GRCh37/variation_genotype/ExAC.0.3.GRCh37.vcf.gz.tbi">[tabix index]</a> </li> </ul> </li> </ul> </li> <li> <p>Run VEP with the following command (using the GRCh38 input example) to get locations and continental-level allele frequencies:</p> <pre class="code sh_sh">./vep -i examples/homo_sapiens_GRCh38.vcf --cache \ --custom file=gnomad.genomes.r2.0.1.sites.GRCh38.noVEP.vcf.gz,short_name=gnomADg,format=vcf,type=exact,coords=0,fields=AF_AFR%AF_AMR%AF_ASJ%AF_EAS%AF_FIN%AF_NFE%AF_OTH</pre> <p>You will then see data under field names as described in the VEP output header:</p> <pre class="code sh_sh">## gnomADg : gnomad.genomes.r2.0.1.sites.GRCh38.noVEP.vcf.gz (exact) ## gnomADg_AFR_AF : AFR_AF field from gnomad.genomes.r2.0.1.sites.GRCh38.noVEP.vcf.gz ## gnomADg_AMR_AF : AMR_AF field from gnomad.genomes.r2.0.1.sites.GRCh38.noVEP.vcf.gz ...</pre> where the gnomADg field contains the ID (or coordinates if no ID found) of the variant in the VCF file. Any of the fields in the gnomAD file INFO field can be added by appending them to the list in your VEP command. </li> </ol> <hr/> <h2 id="gerp">Conservation scores</h2> <p> You can use VEP's <a href="vep_custom.html">custom annotation</a> feature to add conservation scores to your output. For example, to add GERP scores, download the bigWig file from the list below, and run VEP with the following flag:</p> <pre class="code sh_sh">./vep --cache -i example.vcf --custom file=All_hg19_RS.bw,short_name=GERP,format=bigwig</pre> <p> Example conservation score files: </p> <div> <div style="float:left"> <b>Human (GRCh38)</b> <ul> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/phastCons7way/" rel="external">phastCons 7-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/phastCons20way/" rel="external">phastCons 20-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/phastCons100way/" rel="external">phastCons 100-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/phyloP7way/" rel="external">phyloP 7-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/phyloP20way/" rel="external">phyloP 20-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/phyloP100way/" rel="external">phyloP 100-way</a> </li> </ul> </div> <div style="float:left;margin-left:40px"> <b>Human (GRCh37)</b> <ul> <li><a href="http://hgdownload.soe.ucsc.edu/gbdb/hg19/bbi/All_hg19_RS.bw" rel="external">GERP</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg19/phastCons46way/" rel="external">phastCons 46-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg19/phastCons100way/" rel="external">phastCons 100-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg19/phyloP46way/" rel="external">phyloP 46-way</a> </li> <li><a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg19/phyloP100way/" rel="external">phyloP 100-way</a> </li> </ul> </div> <div class="clear"></div> </div> <p> All files provided by the UCSC genome browser - files for other species are available from their <a href="http://hgdownload.soe.ucsc.edu/goldenPath/" rel="external">FTP site</a>, though be sure to use the file corresponding to the <a href="/Homo_sapiens/Info/Annotation#assembly">correct assembly</a>. </p> <hr/> <h2 id="dbNSFP">dbNSFP</h2> <p> dbNSFP - <a href="http://www.ncbi.nlm.nih.gov/pubmed/21520341" rel="external">&quot;a lightweight database of human nonsynonymous SNPs and their functional predictions&quot;</a> - provides pathogenicity predictions from many tools (including SIFT, LRT, MutationTaster, FATHMM) across every possible missense substitution in the human proteome. </p> <p> Plugins in VEP sometimes require data processed in specific ways as arguments. Any requirements and usage instructions for each plugin can be found in the <a href="vep_plugins.html">plugin documentation</a>. </p> <p> In the case of the dbNSFP.pm plugin, the data needs to be <a href="https://sites.google.com/site/jpopgen/dbNSFP" rel="external">downloaded</a> and then processed into a format that the plugin can use. Note that there are two distinct branches of the files provided for academic and commercial usage; please use the appropriate files for your use case. </p> <p> After downloading the file, you will need to process it so that tabix can index it correctly. This will take a while as the file is very large! Note that you will need the <a href="http://samtools.sourceforge.net/tabix.shtml" rel="external">tabix</a> utility in your path to use dbNSFP. </p> <pre class="code sh_sh">version=4.5c unzip dbNSFP${version}.zip zcat dbNSFP${version}_variant.chr1.gz | head -n1 > h # GRCh38/hg38 data zgrep -h -v "^#chr" dbNSFP${version}_variant.chr* | sort -k1,1 -k2,2n - | cat h - | bgzip -c > dbNSFP${version}_grch38.gz tabix -s 1 -b 2 -e 2 dbNSFP${version}_grch38.gz # GRCh37/hg19 data zgrep -h -v "^#chr" dbNSFP${version}_variant.chr* | awk '$8 != "." ' | sort -k8,8 -k9,9n - | cat h - | bgzip -c > dbNSFP${version}_grch37.gz tabix -s 8 -b 9 -e 9 dbNSFP${version}_grch37.gz</pre> <p> Then simply download the <a href="https://github.com/Ensembl/VEP_plugins/blob/release/113/dbNSFP.pm" rel="external">dbNSFP.pm plugin</a> and place it either in <b>$HOME/.vep/Plugins/</b> or a path in your <b>$PERL5LIB</b>. When you run VEP with the plugin, you will need to select some of the columns that you wish to retrieve; to list them run VEP with the plugin and the path to the dbNSFP file and no further parameters: </p> <pre class="code sh_sh">./vep --cache --force --plugin dbNSFP,dbNSFP4.5c_grch38.txt.gz 2014-04-04 11:27:05 - Read existing cache info 2014-04-04 11:27:05 - Auto-detected FASTA file in cache directory 2014-04-04 11:27:05 - Checking/creating FASTA index 2014-04-04 11:27:05 - Failed to instantiate plugin dbNSFP: ERROR: No columns selected to fetch. Available columns are: #chr,pos(1-coor),ref,alt,aaref,aaalt,hg18_pos(1-coor),genename,Uniprot_acc, Uniprot_id,Uniprot_aapos,Interpro_domain,cds_strand,refcodon,SLR_test_statistic, codonpos,fold-degenerate,Ancestral_allele,Ensembl_geneid,Ensembl_transcriptid, ...</pre> <p> Note that some of these fields are replicates of those produced by the core VEP code (e.g. <a href="vep_options.html#opt_sift">SIFT</a>, the <a href="vep_options.html#opt_af_1kg">1000 Genomes</a> and <a href="vep_options.html#opt_af_esp">ESP</a> frequencies) - you should use the options to enable these from the VEP code in place of the annotations from dbNSFP as the dbNSFP file covers <b>only</b> missense substitutions. Other fields, such as the conservation scores, may be better served by using genome-wide files as described <a href="#gerp">above</a>. </p> <p> To select fields, just add them as a comma-separated list to your command line:</p> <pre class="code sh_sh">./vep --cache --force --plugin dbNSFP,dbNSFP4.5c_grch38.txt.gz,LRT_score,FATHM_score,MutationTaster_score</pre> <p> One final point to note is that the dbNSFP scores are frozen on a particular Ensembl release's transcript set; check the readme file on their download site to find out exactly which. While in the majority of cases protein sequences don't change between releases, in some circumstances the protein sequence used by VEP in the latest release may differ from the sequence used to calculate the scores in dbNSFP. </p> <hr/> <h2 id="StructVar">Structural variants</h2> <p>VEP can be used to annotate structural variants (SV) with their predicted effect on other genomic features. For more information on SV input format, see <a href="/info/docs/tools/vep/vep_formats.html#sv">here</a>.</p> <h4>Prediction process</h4> <ul> <li> If the INFO keys <code>END</code> or <code>SVLEN</code> are present, the proportion of any overlapping feature covered by the variant is calculated</li> <li> The alternative allele (or <code>SVTYPE</code> in older VCF files) defines the type of structural variant; some types of structural variants are tested for specific consequences:</li> <table class="ss"> <tr> <th>Structural variant type</th> <th>Abbreviation</th> <th>Specific consequences</th> </tr> <tr class="bg1"> <td>Insertion</td> <td>INS</td> <td>Feature elongation</td> </tr> <tr class="bg2"> <td>Deletion</td> <td>DEL</td> <td>Feature truncation</td> </tr> <tr class="bg1"> <td>Duplication</td> <td>DUP</td> <td>Feature amplification/elongation</td> </tr> <tr class="bg2"> <td>Inversion</td> <td>INV</td> <td><i>Not tested for any specific consequence</i></td> </tr> <tr class="bg1"> <td>Copy number variation</td> <td>CNV</td> <td>Feature amplification/elongation (if copy number is 2) or truncation (if copy number is 0)</td> </tr> <tr class="bg2"> <td>Breakpoint variant</td> <td>BND</td> <td>Feature truncation</td> </tr> </table> </ul> <h5> Insertions and deletions</h5> <ul> <li> Supports <a href="/info/docs/tools/vep/vep_formats.html#sv">mobile element insertions/deletions</a>, including ALU, HERV, LINE1 and SVA elements <ul> <li> Currently, mobile element variants are treated as any insertion/deletion </ul> </ul> <h5> Breakpoint variants</h5> <ul> <li> Supports chromosome synonyms in breakends (such as <code>chr4</code> and <code>NC_000004.12</code>) <li> Processes <a href="/info/docs/tools/vep/vep_formats.html#sv">single breakends and multiple, comma-separated alternative breakends</a> <li> Consequences are reported for each breakend; for instance, for a VCF input like <code>1 7936271 . N N[12:58877476[,N[X:10932343[</code>, it will report the consequences for each of the 3 breakends: <ul> <li> <code>N[12:58877476[</code>: consequences for the first alternative breakend near chr12:58877476 <li> <code>N[X:10932343[</code>: consequences for the second alternative breakend near chrX:10932343 <li> <code>N.</code>: consequences for the reference breakend near chr1:7936271 (represented as detailed in the <a href="https://samtools.github.io/hts-specs/VCFv4.4.pdf" rel="external">VCF 4.4 specification, section 5.4.9: Single breakends</a>) </ul> <li> In case of specific breakends not overlaping any reported Ensembl features (such as transcripts and regulatory regions), that specific breakend will <b>NOT</b> be presented in VEP output. </ul> <h4> Reported overlaps</h4> <ul> <li> VEP calculates the length and proportion of each genomic feature overlapped by a structural variant <li> Use the <a href="vep_options.html#opt_overlaps">--overlaps</a> option to enable this when using VCF or tab format. (This is reported by default in standard VEP and JSON format.) <li> The keys <code>bp_overlap</code> and <code>percentage_overlap</code> are used in JSON format and <code>OverlapBP</code> and <code>OverlapPC</code> in other formats. </ul> <h4> Plugin support</h4> <ul> <li> <a href="vep_plugins.html#CADD">CADD plugin</a> <li> <a href="vep_plugins.html#Conservation">Conservation plugin</a> <li> <a href="vep_plugins.html#NearestGene">NearestGene plugin</a> <li> <a href="vep_plugins.html#Phenotypes">Phenotypes plugin</a> <li> <a href="vep_plugins.html#StructuralVariantOverlap">StructuralVariantOverlap plugin</a>: please note that all features of this plugin have been ported to <a href="vep_custom.html">--custom annotation</a>, with additional improvements <li> <a href="vep_plugins.html#TSSDistance">TSSDistance plugin</a> </ul> <h4> Changing memory requirements</h4> <ul> <li> By default, VEP does not annotate variants larger than 10M. If you are using the command line tool, you can use the <a href="vep_options.html#opt_max_sv_size">--max_sv_size</a> option to modify this. <ul> <li> This limit is not associated with breakpoint variants: each breakend in a breakpoint variant is analysed by VEP as a single base (the alternative sequence is currently ignored). </ul> <li>By default, variants are analysed in batches of 5000. Using the <a href="vep_options.html#opt_buffer_size">--buffer_size</a> option to reduce this can reduce memory requirements, especially if your data is sparse. A smaller buffer size is essential when annotating structural variants with regulatory data. </ul> <hr/> <h2 id="pangenomes">Pangenome assemblies</h2> <p> VEP is able to analyse variants in <b>any species or assembly</b> (even if not part of <a href="/info/about/species.html">Ensembl data</a>) by providing your own <a href="vep_cache.html#fasta">FASTA file</a> and <a href="vep_cache.html#gff" target="_blank">GFF/GTF annotation</a>: </p> <pre class="code sh_sh">./vep -i variants.txt -o variants_output.txt --gff data.gff.gz --fasta genome.fa.gz</pre> <p> We also provide data for other assemblies besides those supported in the current Ensembl and Ensembl Genomes sites. </p> <h3 id="HPRC">HPRC assemblies</h3> <p>The <a href="https://humanpangenome.org/" rel="external">Human Pangenome Reference Consortium (HPRC)</a> aims to sequence 350 individuals of diverse ancestries, producing a pangenome of 700 haplotypes by the end of 2024. The first publication (<i><a href="https://www.nature.com/articles/s41586-023-05896-x" rel="external">A draft human pangenome reference</a></i>) describes 47 phased, diploid assemblies from a cohort of genetically diverse individuals.</p> <p>The VEP command-line tool (CLI) can annotate and filter variants called against the latest human assemblies, including the telomere-to-telomere assembly of the CHM13 cell line (T2T-CHM13). We have annotated genes on these human assemblies, based on Ensembl/<a href="https://www.gencodegenes.org/human/release_38.html" rel="external">GENCODE 38</a> genes and transcripts, via a new mapping pipeline as detailed in the Methods section of <i><a href="https://www.nature.com/articles/s41586-023-05896-x" rel="external">A draft human pangenome reference</a></i>. The links to download and visualise the human annotations for HPRC assemblies are summarised in the <a href="https://projects.ensembl.org/hprc/" rel="external">Ensembl HPRC data page</a>.</p> <h4 id="HPRC_run">Running VEP with HPRC assemblies</h4> <p>Currently, VEP can only be run with HPRC assemblies in offline mode, one assembly at a time. There are two ways to use VEP with HPRC assemblies:</p> <ul> <li> Using <b>VEP cache</b> with (recommended) <b>FASTA sequence</b> (the most efficient way) </li> <li> Using <b>GTF annotation</b> with (mandatory) <b>FASTA sequence</b> </li> </ul> <p>In the examples below, we demonstrate annotating variants on <b>T2T-CHM13v2.0</b> (<a href="https://ftp.ensembl.org/pub/rapid-release/species/Homo_sapiens/GCA_009914755.4/" rel="external">GCA_009914755.4</a> assembly). To create a sample VCF to use in the examples below, you can take the first 100 lines from the ClinVar VCF file mapped to T2T-CHM13:</p> <pre class="code sh_sh">clinvar=ftp://ftp.ensembl.org/pub/rapid-release/species/Homo_sapiens/GCA_009914755.4/ensembl/variation/2022_10/vcf/2024_07/clinvar_20240624_GCA_009914755.4.vcf.gz tabix -h $clinvar 1 | head -n 100 > test.vcf</pre> <h5 id="cache">VEP cache</h5> <p><a href="vep_cache.html#cache">VEP cache</a> is a downloadable archive containing all transcript models for an assembly; it may also contain regulatory features and variant data.</p> <p>Let's start by downloading and extracting the VEP cache to the default VEP directory (available for each annotation by clicking in <b>VEP cache</b> in the <a href="https://projects.ensembl.org/hprc/" rel="external">Ensembl HPRC data page</a>). In the case of T2T-CHM13:</p> <pre class="code sh_sh"> cd $HOME/.vep curl -O https://ftp.ensembl.org/pub/rapid-release/species/Homo_sapiens/GCA_009914755.4/ensembl/variation/2022_10/indexed_vep_cache/Homo_sapiens-GCA_009914755.4-2022_10.tar.gz tar xzf Homo_sapiens-GCA_009914755.4-2022_10.tar.gz</pre> <p>This will create the folder <kbd>homo_sapiens_gca009914755v4/107_T2T-CHM13v2.0</kbd> with the gene data required to run VEP. The name of this folder contains relevant information when running VEP:</p> <ul> <li> Species: <kbd>homo_sapiens_gca009914755v4</kbd> </li> <li> Cache version: <kbd>107</kbd> </li> <li> Assembly: <kbd>T2T-CHM13v2.0</kbd> </li> </ul> <p> As well as molecular consequence predictions, many gene/transcript-based <a href="vep_options.html">VEP options</a> are supported for HPRC assemblies: </p> <pre class="code sh_sh">vep -i test.vcf --offline \ --species homo_sapiens_gca009914755v4 \ --cache_version 107 \ --fasta Homo_sapiens-GCA_009914755.4-softmasked.fa.gz \ --domains --symbol --canonical --protein --biotype --uniprot --variant_class</pre> <p> We don't have other annotations, such as RefSeq transcripts or variant information in the cache. </p> <p> To run VEP with the downloaded cache in offline mode, please specify the species (which here includes assembly name) and cache version:</p> <pre class="code sh_sh">vep -i test.vcf --offline --species homo_sapiens_gca009914755v4 --cache_version 107</pre> <h5 id="fasta">FASTA sequence</h5> <p>When using VEP cache, supplying the reference genomic sequence in a FASTA file is optional, but is required to enable the following options:</p> <ul> <li> Create HGVS notations (<a href="vep_options.html#opt_hgvs">--hgvs</a> and <a href="vep_options.html#opt_hgvsg">--hgvsg</a>) </li> <li> Check the reference sequence given in input data (<a href="vep_options.html#opt_check_ref">--check_ref</a>) </li> </ul> <p> Genomic FASTA files can be found in <a href="https://projects.ensembl.org/hprc/" rel="external">Ensembl HPRC data page</a> <b>> FTP dumps > ensembl > genome</b>. FASTA files need to be either uncompressed or compressed with <b>bgzip</b> (recommended) to be compatible with VEP. For instance, to download a compressed FASTA file, uncompress it and then re-compress it with bgzip: </p> <pre class="code sh_sh">curl -O https://ftp.ensembl.org/pub/rapid-release/species/Homo_sapiens/GCA_009914755.4/ensembl/genome/Homo_sapiens-GCA_009914755.4-softmasked.fa.gz gzip -d Homo_sapiens-GCA_009914755.4-softmasked.fa.gz bgzip Homo_sapiens-GCA_009914755.4-softmasked.fa.gz</pre> <p> Afterwards, you can run VEP using cache and the <a href="vep_options.html#opt_fasta">--fasta</a> flag: </p> <pre class="code sh_sh">vep -i test.vcf --offline \ --species homo_sapiens_gca009914755v4 \ --cache_version 107 \ --fasta Homo_sapiens-GCA_009914755.4-softmasked.fa.gz</pre> <p> More information on using FASTA files with VEP is available <a href="vep_cache.html#fasta">here</a>. </p> <h5 id="gtf">GTF and GFF annotation</h5> <p> As an alternative to using cache files, VEP can utilise gene information in appropriately indexed GTF or GFF files. GTF and GFF files can be downloaded from the annotation column in the <a href="https://projects.ensembl.org/hprc/" rel="external">Ensembl HPRC data page</a>. The data needs to be re-sorted in chromosomal order, compressed in <b>bgzip</b> and indexed with <b>tabix</b>. We present here the example for a GTF file: </p> <pre class="code sh_sh">curl -O https://ftp.ensembl.org/pub/rapid-release/species/Homo_sapiens/GCA_009914755.4/ensembl/geneset/2022_07/Homo_sapiens-GCA_009914755.4-2022_07-genes.gtf.gz gzip -d Homo_sapiens-GCA_009914755.4-2022_07-genes.gtf.gz grep -v "#" Homo_sapiens-GCA_009914755.4-2022_07-genes.gtf |\ sort -k1,1 -k4,4n -k5,5n -t$'\t' |\ bgzip -c > Homo_sapiens-GCA_009914755.4-2022_07-genes.gtf.gz tabix Homo_sapiens-GCA_009914755.4-2022_07-genes.gtf.gz</pre> <p> FASTA files are <b>always</b> required when running HPRC data with GTF annotation, as the transcript sequences are not available in the GFF files. </p> <p> Afterwards, you can run VEP using the GTF and FASTA files: </p> <pre class="code sh_sh">vep -i test.vcf \ --gtf Homo_sapiens-GCA_009914755.4-2022_07-genes.gtf.gz \ --fasta Homo_sapiens-GCA_009914755.4-softmasked.fa.gz</pre> <p> Check <a href="vep_cache.html#gff">here</a> for more information on using VEP with GTF and GFF annotation. </p> <h3 id="PolyPhen_SIFT"> Missense deleteriousness predictions </h3> <p> Although PolyPhen/SIFT scores are not directly available for alternative assemblies by using <a href="vep_options.html#opt_polyphen">--polyphen</a> and <a href="vep_options.html#opt_sift">--sift</a>, they can be retrieved via the <a href="vep_plugins.html#PolyPhen_SIFT">PolyPhen_SIFT plugin</a>. </p> <p> Using our <a href="https://github.com/Ensembl/ensembl-variation/tree/main/nextflow/ProteinFunction" rel="external">ProteinFunction pipeline</a>, we ran <b>PolyPhen-2 2.2.3</b> and <b>SIFT 6.2.1</b> on the proteome sequences for GRCh38 and all HPRC assemblies (the protein FASTA files indicated in <a href="https://projects.ensembl.org/hprc/" rel="external">Ensembl HPRC data page</a>) and stored their results in a single SQLite file: <a href="http://ftp.ensembl.org/pub/current_variation/pangenomes/Human/homo_sapiens_pangenome_PolyPhen_SIFT_20240502.db" rel="external"><kbd>homo_sapiens_pangenome_PolyPhen_SIFT_20240502.db</kbd></a>. <p> Pre-computed scores and predictions can be retrieved by downloading this file and running VEP with the <b>PolyPhen_SIFT plugin</b>: </p> <pre class="code sh_sh">curl -O http://ftp.ensembl.org/pub/current_variation/pangenomes/Human/homo_sapiens_pangenome_PolyPhen_SIFT_20240502.db vep -i test.vcf --offline \ --species homo_sapiens_gca009914755v4 \ --cache_version 107 \ --fasta Homo_sapiens-GCA_009914755.4-softmasked.fa.gz \ --plugin PolyPhen_SIFT,db=human_pangenomes.PolyPhen_SIFT.db</pre> <h3> Matched variant annotations (ClinVar, gnomAD and dbSNP) </h3> <p> We don't have variant data in the VEP caches for the pangenome assemblies, but it can be integrated using the <a href="vep_custom.html">--custom</a> option with data files using the same assembly coordinates. We have lifted-over some key datasets, including ClinVar and gnomAD to the HPRC assemblies (downloadable from the VCF column in <a href="https://projects.ensembl.org/hprc/" rel="external">Ensembl HPRC data page</a>). </p> <pre class="code sh_sh"># Download ClinVar data and respective index (TBI) curl -O https://ftp.ensembl.org/pub/rapid-release/species/Homo_sapiens/GCA_009914755.4/ensembl/variation/2022_10/vcf/2024_07/clinvar_20240624_GCA_009914755.4.vcf.gz curl -O https://ftp.ensembl.org/pub/rapid-release/species/Homo_sapiens/GCA_009914755.4/ensembl/variation/2022_10/vcf/2024_07/clinvar_20240624_GCA_009914755.4.vcf.gz.tbi # Run VEP with ClinVar data vep -i test.vcf --offline \ --species homo_sapiens_gca009914755v4 --cache_version 107 \ --fasta Homo_sapiens-GCA_009914755.4-softmasked.fa.gz \ --custom file=clinvar_20240624_GCA_009914755.4.vcf.gz,short_name=ClinVar,format=vcf,type=exact,coords=0,fields=CLNSIG%CLNREVSTAT%CLNDN</pre> <h3> Additional annotations </h3> <p> Ensembl VEP plugins are a simple way to add new functionality to your analysis. Many require data that is only available for GRCh37 or GRCh38, but others, for example those based on gene attributes or on the fly analysis are compatible with the HGRC assemblies. </p> <p> Here follows VEP plugins that are easily compatible with alternative human assemblies: </p> <style> tr:nth-child(odd) {background-color: #f0f0f0;} td { line-height: 1.25em; word-wrap: break-word; } th { word-wrap: break-word; } .info {width: inherit !important;} </style> <table class="ss" style="table-layout: fixed"><thead> <tr> <th>Plugin</th> <th>Description</th> <th>Plugin data</th> <th>Usage example</th> </tr></thead> <tbody> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/Blosum62.pm" rel="external">Blosum62</a></td> <td>Looks up the BLOSUM 62 substitution matrix score for the reference and alternative amino acids predicted for a missense mutation.</td> <td></td> <td><kbd>--plugin Blosum62</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/DosageSensitivity.pm" rel="external">DosageSensitivity</a></td> <td>Retrieves haploinsufficiency and triplosensitivity probability scores<br>for affected genes (<a href="https://doi.org/10.1016/j.cell.2022.06.036">Collins <i>et al.</i>, 2022</a>).</td> <td><a href="https://zenodo.org/record/6347673/files/Collins_rCNV_2022.dosage_sensitivity_scores.tsv.gz"><kbd>Collins_rCNV_2022.dosage_sensitivity_scores.tsv.gz</kbd></a></td> <td><kbd>--plugin DosageSensitivity,file=Collins_rCNV_2022.dosage_sensitivity_scores.tsv.gz</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/Downstream.pm" rel="external">Downstream</a></td> <td>Predicts downstream effects of a frameshift variant on the protein sequence of a transcript.</td> <td>Requires a FASTA file provided via the <a href="vep_options.html#opt_fasta">--fasta</a> option</td> <td><kbd>--plugin Downstream</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/Draw.pm" rel="external">Draw</a></td> <td>Draws pictures of the transcript model showing the variant location.</td> <td></td> <td><kbd>--plugin Draw</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/GeneSplicer.pm" rel="external">GeneSplicer</a></td> <td>Runs <a href="https://ccb.jhu.edu/software/genesplicer/">GeneSplicer</a> to get splice site predictions.</td> <td>Binary and training data for GeneSplicer (<a href="vep_plugins.html#GeneSplicer">plugin instructions</a>)</td> <td><kbd>--plugin GeneSplicer,binary=genesplicer/bin/linux/genesplicer,training=genesplicer/human</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/GO.pm" rel="external">GO</a></td> <td>Retrieves Gene Ontology (GO) terms associated with genes (for HGRC assemblies, specifically) using custom GFF annotation containing GO terms.</td> <td><a href="https://projects.ensembl.org/hprc/">Ensembl HPRC data page</a> <b>&gt; FTP dumps &gt; ensembl &gt; variation &gt; [date] &gt; gff:</b> <ul> <li> <kbd>*_GO_plugin.gff.gz</kbd> </li> <li> <kbd>*_GO_plugin.gff.gz.tbi</kbd> </li> </ul> </td> <td><kbd>--plugin GO,file=homo_sapiens_gca009914755v4_110_VEP_GO_plugin.gff.gz</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/HGVSIntronOffset.pm" rel="external">HGVSIntronOffset</a></td> <td>Returns HGVS intron start and end offsets. To be used with <a href="https://www.ensembl.org/info/docs/tools/vep/script/vep_options.html#opt_hgvs">--hgvs</a> option.</td> <td></td> <td><kbd>--plugin HGVSIntronOffset</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/LoFtool.pm" rel="external">LoFtool</a></td> <td>Provides a rank of genic intolerance and consequent susceptibility to disease based on the ratio of Loss-of-function (LoF) to synonymous mutations for each gene.</td> <td></td> <td><kbd>--plugin LoFtool</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/MaxEntScan.pm" rel="external">MaxEntScan</a></td> <td>Runs <a href="http://hollywood.mit.edu/burgelab/maxent/Xmaxentscan_scoreseq.html">MaxEntScan</a> to get splice site predictions.</td> <td>Extracted directory from <a href="http://hollywood.mit.edu/burgelab/maxent/download/fordownload.tar.gz">fordownload.tar.gz</td> <td><kbd>--plugin MaxEntScan,/path/to/fordownload</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/NearestExonJB.pm" rel="external">NearestExonJB</a></td> <td>Finds the nearest exon junction boundary to a coding sequence variant.</td> <td></td> <td><kbd>--plugin NearestExonJB</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/NMD.pm" rel="external">NMD</a></td> <td>Predicts if a variant allows the transcript to escape nonsense-mediated mRNA decay based on certain rules.</td> <td></td> <td><kbd>--plugin NMD</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/Phenotypes.pm" rel="external">Phenotypes</a></td> <td>Retrieves overlapping phenotype information.</td> <td><a href="https://projects.ensembl.org/hprc/">Ensembl HPRC data page</a> <b>&gt; FTP dumps &gt; ensembl &gt; variation &gt; [date] &gt; gff:</b> <ul> <li> <kbd>*_phenotypes_plugin.gvf.gz</kbd> </li> <li> <kbd>*_phenotypes_plugin.gvf.gz.tbi</kbd> </li> </ul> </td> <td><kbd>--plugin Phenotypes,file=homo_sapiens_gca009914755v4_110_VEP_phenotypes_plugin.gvf.gz</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/pLI.pm" rel="external">pLI</a></td> <td>Adds the probability of a gene being loss-of-function intolerant (pLI).</td> <td></td> <td><kbd>--plugin pLI</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/PolyPhen_SIFT.pm" rel="external">PolyPhen_SIFT</a></td> <td>Retrieves PolyPhen and SIFT predictions from a SQLite database.</td> <td><a href="http://ftp.ensembl.org/pub/current_variation/pangenomes/Human/homo_sapiens_pangenome_PolyPhen_SIFT_20240502.db"><kbd>homo_sapiens_pangenome_PolyPhen_SIFT_20240502.db</kbd></a></td> <td><kbd>--plugin PolyPhen_SIFT,db=homo_sapiens_pangenome_PolyPhen_SIFT_20240502.db</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/ProteinSeqs.pm" rel="external">ProteinSeqs</a></td> <td>Writes two files with the reference and mutated protein sequences of any proteins found with non-synonymous mutations in the input file.</td> <td></td> <td><kbd>--plugin ProteinSeqs</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/SingleLetterAA.pm" rel="external">SingleLetterAA</a></td> <td>Returns HGVSp string with single amino acid letter codes.</td> <td></td> <td><kbd>--plugin SingleLetterAA</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/SpliceRegion.pm" rel="external">SpliceRegion</a></td> <td>Provides more granular predictions of splicing effects.</td> <td></td> <td><kbd>--plugin SpliceRegion</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/SubsetVCF.pm" rel="external">SubsetVCF</a></td> <td>Retrieves overlapping records from a given VCF file.</td> <td>A VCF file</td> <td><kbd>--plugin SubsetVCF,file=file.vcf.gz,name=myvfc</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/TranscriptAnnotator.pm" rel="external">TranscriptAnnotator</a></td> <td>Annotates variant-transcript pairs based on a given file.</td> <td>Tab-separated annotation file (<a href="vep_plugins.html#TranscriptAnnotator">plugin instructions</a>)</td> <td><kbd>--plugin TranscriptAnnotator,file=annotation.txt.gz</kbd></td> </tr> <tr> <td><a href="https://github.com/Ensembl/VEP_plugins/blob/main/TSSDistance.pm" rel="external">TSSDistance</a></td> <td>Calculates the distance from the transcription start site for upstream variants.</td> <td></td> <td><kbd>--plugin TSSDistance</kbd></td> </tr> </tbody></table> <hr/> <h2 id="citations">Citations and VEP users</h2> <p> VEP is used by many organisations and projects: </p> <ul> <li> VEP forms a part of <a href="http://variantstudio.software.illumina.com/" rel="external">Illumina's VariantStudio</a> software </li> <li> <a href="http://gemini.readthedocs.org/" rel="external">Gemini</a> is a framework for exploring genome variation that uses VEP </li> <li> The <a href="http://decipher.sanger.ac.uk/" rel="external">DECIPHER project</a> uses VEP in its analysis pipelines</li> </ul> <p> Other citations and use cases: </p> <ul> <li> <a href="http://bib.oxfordjournals.org/content/early/2014/03/12/bib.bbu008.full" rel="external">VAX</a> is a suite of plugins for VEP that expands its functionality</li> <li> <a href="http://research-pub.gene.com/pviz/app/vep/" rel="external">pViz</a> is a visualisation tool for VEP results files </li> <li> <a href="http://genomemedicine.com/content/6/3/26/abstract" rel="external">McCarthy <i>et al</i></a> compares VEP to AnnoVar </li> <li> <a href="http://bib.oxfordjournals.org/content/early/2013/01/21/bib.bbs086.full" rel="external">Pabinger <i>et al</i></a> reviews variant analysis software, including VEP</li> <li> VEP is used to provide annotation for the <a href="http://exac.broadinstitute.org/" rel="external">ExAC</a> and <a href="http://gnomad.broadinstitute.org/" rel="external">gnomAD</a> projects </li> </ul> </div> </div></div> </div> <div id="footer"> <div class="column-wrapper"> <div class="column-two left"> <p>Ensembl release 113 - October 2024 &copy; <span class="print_hide"><a href="http://www.ebi.ac.uk/" class="nowrap constant">EMBL-EBI</a></span> <span class="screen_hide_inline">EMBL-EBI<br />http://useast.ensembl.org</span> </p> </div> <div class="column-two right print_hide"><p> <a class="modal_link" id="p_link" href="/Help/Permalink?url=http%3A%2F%2FOct2024.archive.ensembl.org">Permanent link</a> </p></div> <p class="invisible">.</p> </div> <div class="column-wrapper"><hr /><div id="fat-footer"> <div class="column-four left"> <h3>About Us</h3> <p><a href="/info/about" rel="nofollow">About us</a></p> <p><a href="/info/about/contact/" rel="nofollow">Contact us</a></p> <p><a href="/info/about/publications.html" rel="nofollow">Citing Ensembl</a></p> <p><a href="https://www.ensembl.org/info/about/legal/privacy.html" rel="nofollow">Privacy policy</a></p> <p><a href="https://www.ebi.ac.uk/long-term-data-preservation" rel="nofollow">Long-term data preservation</a></p> <p><a href="/info/about/legal/" rel="nofollow">Disclaimer</a></p> </div> <div class="column-four left"> <h3>Get help</h3> <p><a href="/info/website/" rel="nofollow">Using this website</a></p> <p><a href="/info/website/upload" rel="nofollow">Adding custom tracks</a></p> <p><a href="/downloads.html" rel="nofollow">Downloading data</a></p> <p><a href="/info/website/tutorials/" rel="nofollow">Video tutorials</a></p> <p><a href="/info/docs/tools/vep/" rel="nofollow">Variant Effect Predictor (VEP)</a></p> </div> <div class="column-four left"> <h3>Our sister sites</h3> <p><a href="http://bacteria.ensembl.org" rel="nofollow">Ensembl Bacteria</a></p> <p><a href="http://fungi.ensembl.org" rel="nofollow">Ensembl Fungi</a></p> <p><a href="http://plants.ensembl.org" rel="nofollow">Ensembl Plants</a></p> <p><a href="http://protists.ensembl.org" rel="nofollow">Ensembl Protists</a></p> <p><a href="http://metazoa.ensembl.org" rel="nofollow">Ensembl Metazoa</a></p> </div> <div class="column-four left"> <h3>Follow us</h3> <p><a class="media-icon" href="http://www.ensembl.info/" rel="nofollow"> <img alt="[RSS logo]" title="Ensembl blog" src="/i/rss_icon_16.png"></a> <a href="http://www.ensembl.info/" rel="nofollow">Blog</a></p> <p><a class="media-icon" href="http://www.twitter.com/Ensembl" rel="nofollow"> <img alt="[twitter logo]" title="Follow us on Twitter!" src="/i/twitter.png"></a> <a href="http://www.twitter.com/ensembl" rel="nofollow">Twitter</a></p> <p><a class="media-icon" href="http://www.facebook.com/Ensembl.org" rel="nofollow"> <img alt="[Facebook logo]" title="Our Facebook page" src="/i/facebook.png"></a> <a href="http://www.facebook.com/Ensembl.org" rel="nofollow">Facebook</a></p> </div> </div> <p class="invisible">.</p> </div> </div> </div> </div> </div> <form id="core_params" action="#" style="display:none"> <fieldset><input type="hidden" name="db" value="core" /></fieldset> </form> <input type="hidden" id="species_path" name="species_path" value="" /> <input type="hidden" id="gdpr_version" name="gdpr_version" value="2.0.0" /> <input type="hidden" id="gdpr_policy_url" name="gdpr_policy_url" value="https://www.ensembl.org/info/about/legal/privacy.html" /> <input type="hidden" id="gdpr_terms_url" name="gdpr_terms_url" value="https://www.ebi.ac.uk/about/terms-of-use" /> <input type="hidden" id="gdpr_cookie_name" name="gdpr_cookie_name" value="ENSEMBL_PRIVACY_POLICY" /> <input type="hidden" id="species_common_name" name="species_common_name" value="Human" /> <input type="hidden" id="ensembl_image_root" name="ensembl_image_root" value="/i/" /> <input type="hidden" id="max_region_length" name="max_region_length" value="1000100" /> <input type="hidden" id="ensembl_species_image" name="ensembl_species_image" value="/i/species/Homo_sapiens.png" /> <div id="modal_bg"></div> <div id="modal_panel" class="js_panel"> <input type="hidden" class="panel_type" value="ModalContainer" /> <div class="modal_title"> <ul class="tabs"> <li><a class="modal_user_data" href="UserData/ManageData?time=1740140728">Personal Data</a></li> </ul> <div class="modal_caption"></div> <div class="modal_close"></div> </div> <div id="modal_user_data" class="modal_content js_panel fixed_width" style="display:none"></div> <div id="modal_default" class="modal_content js_panel fixed_width" style="display:none"></div> <div class="modal_overlay"><img class="overlay_close" title="Cancel" alt="close" src="/i/close.png" /><div class="overlay_content"></div></div> <div class="modal_overlay_bg"></div> </div> <script type="text/javascript" src="/minified/9239335eaf61b74631521a0b68948ac7.js"></script> <script type="text/javascript" src="/minified/725a4b78c3f6c26655a50fb3dd6b1888.js"></script> <input type="hidden" id="hidden_species_list" name="hidden_species_list" value=Moschus_moschiferus|Sorex_araneus|Salmo_salar|Fundulus_heteroclitus|Sus_scrofa_usmarc|Calidris_pygmaea|Vombatus_ursinus|Ovis_aries_polled_dorset|Periophthalmus_magnuspinnatus|Ursus_americanus|Naja_naja|Seriola_lalandi_dorsalis|Myotis_lucifugus|Tupaia_belangeri|Oncorhynchus_tshawytscha|Otus_sunia|Callithrix_jacchus|Esox_lucius|Physeter_catodon|Rhinopithecus_roxellana|Mus_musculus_CAST_EiJ|Ictalurus_punctatus|Sus_scrofa_ningxiang|Sarcophilus_harrisii|Heterocephalus_glaber_male|Sander_lucioperca|Poecilia_mexicana|Mus_musculus_LP_J|Lepisosteus_oculatus|Hucho_hucho|Mus_musculus_WSB_EiJ|Carlito_syrichta|Oncorhynchus_kisutch|Pan_paniscus|Mus_musculus_A_J|Peromyscus_maniculatus_bairdii|Oryctolagus_cuniculus|Capra_hircus|Suricata_suricatta|Pavo_cristatus|Mus_musculus|Panthera_pardus|Phocoena_sinus|Gasterosteus_aculeatus|Pan_troglodytes|Nomascus_leucogenys|Canis_lupus_familiarisbasenji|Mus_musculus_BALB_cJ|Castor_canadensis|Salmo_salar_european_origin_alta|Manacus_vitellinus|Erinaceus_europaeus|Amphilophus_citrinellus|Cyclopterus_lumpus|Salvator_merianae|Neovison_vison|Carassius_auratus|Cyprinodon_variegatus|Rattus_norvegicus_SHRUtx|Sus_scrofa_ossabaw_miniature|Panthera_leo|Apteryx_owenii|Ovis_aries_chinese_merino|Lonchura_striata_domestica|Ovis_aries_kermani|Ficedula_albicollis|Podarcis_muralis|Salmo_trutta|Cricetulus_griseus_picr|Ovis_aries_white_dorper|Anser_cygnoides|Sus_scrofa_hampshire|Rattus_norvegicus_SHRSPBbbUtx|Echeneis_naucrates|Sus_scrofa_bama_miniature|Oryzias_latipes_hni|Gallus_gallus_GCA_000002315.5|Leptobrachium_leishanense|Amazona_collaria|Mastacembelus_armatus|Gasterosteus_aculeatus_marine_syl|Salmo_salar_north_american_origin_brian|Takifugu_rubripes|Mus_musculus_AKR_J|Mus_musculus_CBA_J|Cynoglossus_semilaevis|Otolemur_garnettii|Phasianus_colchicus|Spermophilus_dauricus|Rhinopithecus_bieti|Stegastes_partitus|Pogona_vitticeps|Mandrillus_leucophaeus|Sus_scrofa_duroc|Oryzias_javanicus|Ciona_intestinalis|Caenorhabditis_elegans|Salarias_fasciatus|Poecilia_formosa|Cyprinus_carpio_huanghe|Maylandia_zebra|Haplochromis_burtoni|Parus_major|Ursus_maritimus|Monopterus_albus|Chlorocebus_sabaeus|Larimichthys_crocea|Buteo_japonicus|Felis_catus|Strix_occidentalis_caurina|Lates_calcarifer|Piliocolobus_tephrosceles|Sus_scrofa_wuzhishan|Bos_mutus|Electrophorus_electricus|Ovis_aries_qiaoke|Amphiprion_percula|Gambusia_affinis|Sus_scrofa_nihs_2020|Monodelphis_domestica|Pelusios_castaneus|Sus_scrofa|Rattus_norvegicus|Bos_taurus|Gallus_gallus_GCA_016700215.2|Equus_asinus|Serinus_canaria|Oreochromis_aureus|Dromaius_novaehollandiae|Geospiza_fortis|Capra_hircus_xinong_saanen_dairy|Sus_scrofa_bamei|Sus_scrofa_meishan|Drosophila_melanogaster|Octodon_degus|Pelodiscus_sinensis|Echinops_telfairi|Sinocyclocheilus_rhinocerous|Theropithecus_gelada|Lynx_canadensis|Tetraodon_nigroviridis|Camelus_dromedarius|Neolamprologus_brichardi|Bos_grunniens|Ovis_aries_east_friesian|Falco_tinnunculus|Pundamilia_nyererei|Anas_zonorhyncha|Gorilla_gorilla|Pygocentrus_nattereri|Bison_bison_bison|Betta_splendens|Cricetulus_griseus_chok1gshd|Oryzias_melastigma|Astatotilapia_calliptera|Cairina_moschata_domestica|Vulpes_vulpes|Myripristis_murdjan|Rhinolophus_ferrumequinum|Callorhinchus_milii|Papio_anubis|Aotus_nancymaae|Neogobius_melanostomus|Urocitellus_parryii|Macaca_nemestrina|Notechis_scutatus|Dicentrarchus_labrax|Sus_scrofa_jinhua|Saccharomyces_cerevisiae|Ovis_aries_rambouillet|Erpetoichthys_calabaricus|Laticauda_laticaudata|Sus_scrofa_tibetan|Canis_lupus_familiarisboxer|Accipiter_nisus|Denticeps_clupeoides|Xiphophorus_couchianus|Oreochromis_niloticus|Sus_scrofa_pietrain|Oryzias_latipes|Parambassis_ranga|Equus_caballus|Catharus_ustulatus|Bubo_bubo|Vicugna_pacos|Seriola_dumerili|Oryzias_latipes_hsok|Sinocyclocheilus_grahami|Microtus_ochrogaster|Sus_scrofa_rongchang|Zonotrichia_albicollis|Erythrura_gouldiae|Strigops_habroptila|Amphiprion_ocellaris|Ochotona_princeps|Capra_hircus_saanen_dairy|Balaenoptera_musculus|Prolemur_simus|Sus_scrofa_landrace|Mus_musculus_C3H_HeJ|Lepidothrix_coronata|Jaculus_jaculus|Paramormyrops_kingsleyae|Mesocricetus_auratus|Macaca_fascicularis|Marmota_marmota_marmota|Ursus_thibetanus_thibetanus|Ornithorhynchus_anatinus|Fukomys_damarensis|Xiphophorus_maculatus|Macaca_mulatta|Sus_scrofa_pb115|Cervus_hanglu_yarkandensis|Pseudonaja_textilis|Sus_scrofa_meishan_GCA_017957985.1|Anas_platyrhynchos_platyrhynchos|Mustela_putorius_furo|Delphinapterus_leucas|Taeniopygia_guttata|Chelydra_serpentina|Eptatretus_burgeri|Aquila_chrysaetos_chrysaetos|Ailuropoda_melanoleuca|Heterocephalus_glaber_female|Mus_caroli|Mus_spretus|Gouania_willdenowi|Choloepus_hoffmanni|Zalophus_californianus|Struthio_camelus_australis|Canis_lupus_familiarisgreatdane|Chrysolophus_pictus|Petromyzon_marinus|Sinocyclocheilus_anshuiensis|Cottoperca_gobio|Mus_musculus_C57BL_6NJ|Canis_lupus_familiaris|Mus_musculus_129S1_SvImJ|Salmo_salar_north_american_atlantic_salmon|Mola_mola|Mus_musculus_NOD_ShiLtJ|Mus_pahari|Gallus_gallus|Gadus_morhua_celtic_sea|Scophthalmus_maximus|Cyprinus_carpio_hebao_red|Cyanistes_caeruleus|Clupea_harengus|Apteryx_rowi|Apteryx_haastii|Sus_scrofa_largewhite|Cebus_imitator|Anas_platyrhynchos|Meriones_unguiculatus|Chinchilla_lanigera|Chelonoidis_abingdonii|Junco_hyemalis|Loxodonta_africana|Homo_sapiens|Crocodylus_porosus|Mus_musculus_PWK_PhJ|Nothobranchius_furzeri|Terrapene_carolina_triunguis|Numida_meleagris|Astyanax_mexicanus_pachon|Panthera_tigris_altaica|Hippocampus_comes|Saimiri_boliviensis_boliviensis|Mus_musculus_FVB_NJ|Cyprinus_carpio_german_mirror|Labrus_bergylta|Camarhynchus_parvulus|Poecilia_latipinna|Canis_lupus_familiarisgermanshepherd|Stachyris_ruficeps|Bos_indicus_hybrid|Ovis_aries_texel|Rattus_norvegicus_WKYBbb|Sparus_aurata|Gopherus_agassizii|Corvus_moneduloides|Cavia_aperea|Cercocebus_atys|Acanthochromis_polyacanthus|Bos_taurus_hybrid|Meleagris_gallopavo|Latimeria_chalumnae|Coturnix_japonica|Ictidomys_tridecemlineatus|Canis_lupus_dingo|Malurus_cyaneus_samueli|Kryptolebias_marmoratus|Danio_rerio|Astyanax_mexicanus|Nothoprocta_perdicaria|Varanus_komodoensis|Mus_spicilegus|Phascolarctos_cinereus|Gopherus_evgoodei|Mus_musculus_NZO_HlLtJ|Xenopus_tropicalis|Ovis_aries_romanov|Pteropus_vampyrus|Sus_scrofa_berkshire|Capra_hircus_blackbengal|Microcebus_murinus|Nannospalax_galili|Ovis_aries_hu|Sphenodon_punctatus|Oryzias_sinensis|Gasterosteus_aculeatus_freshwater_bot|Cavia_porcellus|Zosterops_lateralis_melanops|Calidris_pugnax|Anabas_testudineus|Dipodomys_ordii|Mus_musculus_DBA_2J|Catagonus_wagneri|Scleropages_formosus|Dasypus_novemcinctus|Anser_brachyrhynchus|Melopsittacus_undulatus|Cyprinus_carpio_carpio|Oncorhynchus_mykiss|Monodon_monoceros|Gasterosteus_aculeatus_marine_bam|Propithecus_coquereli|Athene_cunicularia|Ciona_savignyi|Notamacropus_eugenii|Sus_scrofa_euw1|Chrysemys_picta_bellii|Sphaeramia_orbicularis|Procavia_capensis|Pongo_abelii|Sciurus_vulgaris|Cricetulus_griseus_crigri|Tursiops_truncatus|Anolis_carolinensis|Poecilia_reticulata|Gadus_morhua|Colobus_angolensis_palliatus> </body> </html>

Pages: 1 2 3 4 5 6 7 8 9 10