CINXE.COM

KEGG COMPOUND Database

<!DOCTYPE html> <html lang="en"> <head> <meta charset="utf-8"> <meta name="viewport" content="width=device-width, initial-scale=1"> <title>KEGG COMPOUND Database</title> <link rel="stylesheet" href="/css/kegg25.css"> <style> p { margin: 10px; } div.list dt { width: 75px; margin-left: 10px; } .indent ul { margin: 0; padding-left: 0; padding-top: 4px; padding-bottom: 10px; } .indent li { margin-left: 20px; list-style: none; } </style> </head> <body> <div id="topnav" ontouchstart=""> <div class="navlist"> <a href="/kegg/">KEGG</a> </div> <div class="dropdown"> Databases <div class="dropdown-content"> <ul> <li><a href="/kegg/pathway.html">Pathway</a></li> <li><a href="/kegg/brite.html">Brite</a></li> <li><a href="/kegg/ko.html">KO</a></li> <li><a href="/kegg/genes.html">Genes</a></li> <li><a href="/kegg/compound/">Chemical</a></li> <li><a href="/kegg/disease/">Disease</a></li> <li><a href="/kegg/drug/">Drug</a></li> <li><a href="/kegg/medicus.html">Medicus</a></li> </ul> </div> </div> <div class="dropdown"> Tools <div class="dropdown-content"> <ul> <li><a href="/kegg/mapper/">Mapper</a></li> <li><a href="/kegg/webapp/">Web Apps</a></li> <li><a href="/kegg/syntax/">Syntax</a></li> </ul> </div> </div> <div class="dropdown"> Auto annotation <div class="dropdown-content"> <ul> <li><a href="https://www.kegg.jp/blastkoala/">BlastKOALA</a></li> <li><a href="https://www.kegg.jp/ghostkoala/">GhostKOALA</a></li> </ul> </div> </div> <div class="navlist"> <a href="https://www.kanehisa.jp/">Kanehisa Lab</a> </div> </div> <div id="header"> <div class="logo"> <a href="/kegg/"><img src="/Fig/kegg128.gif" alt="KEGG icon"></a> </div> <div class="title"> <h2 class="txt2">KEGG COMPOUND Database</h2> <h5>Chemical substances integrated with genomics</h5> </div> </div> <div id="dbnav" class="bar2"> <ul> <li><a href="/kegg/kegg2.html">KEGG2</a></li> <li><a href="/kegg/pathway.html">PATHWAY</a></li> <li><a href="/kegg/brite.html">BRITE</a></li> <li><a href="/kegg/module.html">MODULE</a></li> <li><a href="/kegg/ko.html">KO</a></li> <li><span class="current">COMPOUND</span></li> <li><a href="/kegg/glycan/">GLYCAN</a></li> <li><a href="/kegg/reaction/">REACTION</a></li> <li><a href="/kegg/reaction/rmodule.html">RModule</a></li> </ul> </div> <div id="content"> <!-- start --> <div class="searchbox"> <div style="float:left;"> <form action="https://www.kegg.jp/kegg-bin/search_ligand"> <span class="small">Search COMPOUND by C number, name and formula</span> <br> <input type="search" name="query" size="40"> <input type="hidden" name="column" value="entry+name+formula"> <input type="hidden" name="DATABASE" value="compound"> <input type="submit" value="Go"> </form> </div> <div style="float:left; margin-left:20px;"> <form action="https://www.kegg.jp/kegg-bin/search_ligand"> <span class="small">or by exact mass (eg. 100, 100-200)</span> <br> <input type="search" name="query" size="20"> <input type="hidden" name="column" value="exactmass"> <input type="hidden" name="DATABASE" value="compound"> <input type="submit" value="Go"> </form> </div> <div style="clear:both;"></div> </div> <div class="main"> <hr class="frame2"> <h4 class="txt2">Chemical Structures</h4> <div class="indent"> <b>KEGG COMPOUND</b> is one of the four original databases, together with KEGG PATHWAY, KEGG GENES and KEGG ENZYME, introduced at the start of the KEGG project in 1995. It is a collection of small molecules, biopolymers, and other chemical substances that are relevant to biological systems. Each entry is identified by the C number, such as <a href="/entry/C00047">C00047</a> for L-lysine, and contains chemical structure and associated information, as well as various links to other KEGG databases and outside databases. Some COMPOUND entries are also represented as GLYCAN and DRUG entries with the "Same as" links. <br><br> While GLYCAN entries are represented as tree structures with monosaccharide codes, COMPOUND entries for peptides and polyketides, such as <a href="/entry/C11996">C11996</a> for methymycin, are represented as sequences using the abbreviation codes for the monomeric units of amino acids and carboxylic acids. <ul> <li><a href="/kegg/catalog/codes2.html">Monosaccharide codes</a></li> <li><a href="/kegg/catalog/codes3.html">Ketide and peptide codes</a></li> </ul> Naturally processed peptides from gene products, such as <a href="/entry/C00873">C00873</a> for human angiotensin I, are sometimes represented in the KEGG COMPOUND database. They are always associated with sequence information using the three-letter amino acid codes, but they may or may not contain the full atomic structure representation. </div> <hr class="frame2"> <h4 class="txt2">Chemical Compound Categories</h4> <div class="indent"> The classifications of representative categories of KEGG COMPOUND, as well as biosynthetic genes, are given in the following BRITE hierarchy files. <div class="list"> <dl> <dt>br08001</dt><dd><a href="/brite/br08001">Compounds with biological roles</a></dd> <dt>br08011</dt><dd><a href="/kegg/tables/br08011.html">Secondary metabolites in Pathway Maps</a></dd> </dl> <div class="clear"></div> <dl> <dt>br08002</dt><dd><a href="/brite/br08002">Lipids</a></dd> <dt>ko01004</dt><dd><a href="/brite/ko01004">Lipid biosynthesis proteins</a></dd> <dt>ko01008</dt><dd><a href="/brite/ko01008">Polyketide biosynthesis proteins</a></dd> </dl> <div class="clear"></div> <dl> <dt>br08003</dt><dd><a href="/brite/br08003">Phytochemical compounds</a></dd> <dt>br08110</dt><dd><a href="/kegg/tables/br08110.html">Dietary phytochemicals</a></dd> </dl> <div class="clear"></div> <dl> <dt>br08020</dt><dd><a href="/kegg/tables/br08020.html">Nucleotide sugars</a></dd> <dt>br08021</dt><dd><a href="/brite/br08021">Glycosides</a></dd> </dl> <div class="clear"></div> <dl> <dt>br08005</dt><dd><a href="/brite/br08005">Bioactive peptides</a></dd> <dt>br08007</dt><dd><a href="/brite/br08007">Natural toxins</a></dd> </dl> <div class="clear"></div> <dl> <dt>br08006</dt><dd><a href="/brite/br08006">Endocrine disrupting compounds</a></dd> <dt>br08007</dt><dd><a href="/brite/br08007">Pesticides</a></dd> </dl> <div class="clear"></div> <dl> <dt>br08308</dt><dd><a href="/kegg/drug/br08308.html">Narcotics and psychotropics in Japan</a></dd> </dl> <div class="clear"></div> </div> </div> <hr class="frame2"> <h4 class="txt2">Chemical Compounds in Pathways and Diseases</h4> <div class="indent"> The role of <b>KEGG COMPOUND</b> has always been to enable links from molecular-levl data to molecular network-level data. Chemical compound entries are constituents of <a href="/kegg/pathway.html">KEGG pathway maps</a>, <a href="/kegg/module.html">KEGG modules</a>, <a href="/kegg/reaction/rmodule.html">reaction modules</a> and <a href="/kegg/network.html">network variation maps</a>. They are used to analyze, for example, metabolome data to uncover higher-level functional features using the <a href="/kegg/mapper/"><b>KEGG Mapper</b></a> tools. <br><br> In addition, chemical compound entries are used to represent disease-associated perturbed networks, such as <a href="/network/nt06014">nt06014</a> for congenital disorders of sphingolipid metabolism in the KEGG NETWORK database. </div> <p><b class="txt2">Biosynthetic codes</b></p> <div class="indent"> The structures of DNA, RNA, and proteins are determined by template-based syntheses of replication, transcription, and translation with the genetic code. In contrast, the structures of glycans, lipids, polyketides, nonribosomal peptides, and various plant secondary metabolites are determined by biosynthetic pathways. Attempts are being made to develop overview pathway maps by defining reaction modules and to understand such biosynthetic codes. <div class="list"> <dl> <dt>map01212</dt><dd><a href="/pathway/map01212">Fatty acid metabolism</a></dd> <dt>rm020_21</dt><dd><a href="/kegg/reaction/rm020_rm021.html">Variation of fatty acid synthesis</a></dd> </dl> <div class="clear"></div> </div> <!-- <ul> <li><a href="bs_fa.html">Fatty acid biosynthesis</a></li> <li><a href="bs_pk.html">Polyketide biosynthesis</a></li> </ul> --> </div> <p><b class="txt2">Biodegradation codes</b></p> <div class="indent"> KEGG COMPOUND also contains xenobiotic compounds, many of which may be degraded by microbial degradation pathways. Here again attempts are made to develop overview pathway maps together with reaction modules. <div class="list"> <dl> <dt>map01220</dt><dd><a href="/pathway/map01220">Degradation of aromatic compounds</a></dd> </dl> <div class="clear"></div> </div> </div> <hr class="frame2"> <h4 class="txt2">Database Search Tools</h4> <div class="indent"> <b>SIMCOMP</b> and <b>SUBCOMP</b> are database search programs for similar chemical structures. SIMCOMP is based on a graph matching to find maximal common subgraphs allowing mismatches, while SUBCOMP is a bit-string based search program to find exactly matching substructures or superstructures. <ul> <li><a href="https://www.genome.jp/tools/simcomp/">SIMCOMP search</a></li> <li><a href="https://www.genome.jp/tools/subcomp/">SUBCOMP search</a></li> </ul> </div> <hr class="frame2"> <b class="txt2">References</b> <ol> <li>Hashimoto, K., Yoshizawa, A.C., Okuda, S., Kuma, K., Goto, S., and Kanehisa, M.; The repertoire of desaturases and elongases reveals fatty acid variations in 56 eukaryotic genomes. J. Lipid Res. 49, 183-191 (2008). [<a href="https://pubmed.ncbi.nlm.nih.gov/17921532">pubmed</a>] </li> <li>Minowa, Y., Araki, M., and Kanehisa, M.; Comprehensive analysis of distinctive polyketide and nonribosomal peptide structural motifs encoded in microbial genomes. J. Mol. Biol. 368, 1500-1517 (2007). [<a href="https://pubmed.ncbi.nlm.nih.gov/17400247">pubmed</a>] </li> <li>Kanehisa, M.; KEGG bioinformatics resource for plant genomics and metabolomics. Methods Mol. Biol. 1374, 55-70 (2016). [<a href="https://pubmed.ncbi.nlm.nih.gov/26519400">pubmed</a>] </li> <li>Hattori, M., Okuno, Y., Goto, S., and Kanehisa, M.; Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. J. Am. Chem. Soc. 125, 11853-11865 (2003). [<a href="https://pubmed.ncbi.nlm.nih.gov/14505407">pubmed</a>] </li> <li>Hattori, M., Tanaka, N., Kanehisa, M., and Goto, S.; SIMCOMP/SUBCOMP: chemical structure search servers for network analyses. Nucleic Acids Res. 38, W652-W656 (2010). [<a href="https://pubmed.ncbi.nlm.nih.gov/20460463">pubmed</a>] </li> </ol> <hr class="frame2"> <i>Last updated: September 8, 2023</i> </div> <!-- end --> </div> </body> </html>

Pages: 1 2 3 4 5 6 7 8 9 10