CINXE.COM
pdb_extract | Online Manual
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head> <title>pdb_extract | Online Manual</title> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> <META NAME="description" CONTENT="pdb_extract is used to extract statistical information from the output files produced by many software for protein structure determination using Xray Crystallography, NMR, and EM methods."> <META NAME="keywords" CONTENT="pdb_extract, PDB deposition, X-ray crystallography, data collection and reduction, molecular replacement, heavy atom phase determination, structure refinement, NMR structure determination, EM structure determination"> <link href="wwpdb_css/site.css" rel="stylesheet" type="text/css" media="all" /> </head> <body> <div class="container"> <header class="container_12"> <div class="logo"> <a href="http://www.wwpdb.org/"><img src="wwpdb_img/wwpdb_logo.gif" alt="pdb_extract Logo" width="187" height="58" /></a> </div> <div class="topnav frht bg-green ui-corner-all pall"> <a href="index.html" class="ui-button ui-widget ui-state-default ui-corner-all ui-button-icon-only" title="Home"><span class="ui-button-text">Home</span></a> <a href="version.html" class="ui-button ui-widget ui-state-default ui-corner-all ui-button-icon-only" title="Version"></span><span class="ui-button-text">Version</span></a> <a href="documentation.html" class="ui-button ui-widget ui-state-default ui-corner-all ui-button-icon-only" title="Documentation"><span class="ui-button-text">Documentation</span></a> <div class="clear"> </div> </div> <div class="headercontent prefix_1"> <h1>pdb_extract</h1> </div> <div class="clear"> </div> <div class="seperator"></div> </header> <div class="content container_12"> <img src="wwpdb_img/logo_pdbex.jpg" style="margin: 5px 0px 5px 5px; float: right; " /> <div> <h1>pdb_extract - Workstation Version Manual</h1> </div> <h4>Extract information from X-ray crystallography, NMR, and EM software applications</h4> (June, 18, 2004; last modified Nov 1, 2022) | (Latest version 4.0) <a name="top"></a> <table border="0" cellspacing="20"> <tbody><tr> <td colspan="2" bgcolor="lightblue" class="bg9CF"><strong><center> Table of Contents </center></strong></td> </tr> <tr><td width="50%" style="width:50%;"> <ul> <li><a href="#what_do"> <b> What does pdb_extract do? </b></a></li> <li><a href="#prog_access"><b>Program access </b></a></li> <li><a href="#instal"><b>Installation</b></a></li> <ul> </li><li><a href="#instal_scr">Installation of source code distribution</a> </li></ul> <li><a href="#run_prog"><b>Run the program (Xray data)</b> </a> <li><b>Some helpful hints to get the LOG (or output) files from various programs</b> <ul> <li><a href="#scl"> Data collection/reduction</a> </li><li><a href="#mr"> Molecular replacement</a> </li><li><a href="#phs"> Heavy atom phasing</a> </li><li><a href="#dm"> Density modification</a> </li><li><a href="#ref"> Final structure refinement</a> </li></ul> </li> </ul> </td> <td width="50%" style="width:50%;"> <ul> <li><a href="#tables"><b>Tables </b></a> <ul> <li><a href="#tables_opt">command options </a></li> <li><a href="#tables_list">Supported crystallographic software lists </a></li> </ul> </li> </ul> <a name="what_do"></a> </td> </tr> </table> <table border="0" cellspacing="20"> <tr><td colspan="2" bgcolor="lightblue"><strong><center> <b>What does pdb_extract do?</b> <a href="#top" style="font-size:smaller;"> (TABLE OF CONTENTS)</a> </center> </strong></td> </tr> <tr><td colspan="2" > <p> <b>pdb_extract</b> is used to extract statistical information from the output files produced by software for protein structure determination using Xray Crystallography, NMR, and EM method. These statistical information will be written into a complete mmCIF file which is ready for PDB deposition. <p> In the case of Xray structure determination, <b>pdb_extract</b> merges all the information into two mmCIF (macromolecular Crystallographic Information File) files. One mmCIF file contains structure factors and the other contains atomic coordinates and statistics extracted from the steps of structure determination (data collection/integration/reduction, heavy atom phasing, molecular replacement, and final structure refinement) for various methods (MR, SAD, MAD, SIR, SIRAS, MIR, MIRAS). These two mmCIF files are ready for PDB deposition. <p> In the case of NMR and EM structure determination, experimental details from header section of PDB file and coordinates are extracted into a mmCIF file. This file along with chemical shift and restraint files are ready for PDB deposition. <p>The current version supports many software packages and hundreds of different output files produced in various of steps. <a href="#tables_list"> Click here </a> to see the supported software lists. <p> The assembled mmCIF files by <b>pdb_extract</b> should be used for <a href="https://deposit.wwpdb.org/deposition/ " >Deposition. </a> <p> <b>The advantage of using pdb_extract:</b> <ul> <li> Faster to prepare your mmCIF file for deposition. Users only provide the output files produced from various software to get all the statistics. <li> Complete and accurate to deposit your file. The statistics (ranging from index to final refinement) can be automatically extracted, so to avoid typos. <li> Great for multiple structural deposition by using the re-usable data template file <li> Both command options and Web interface are provided. It is flexible to use. <li>Collectively, these software tools reduce the human effort required to assemble complete and validated protein structure entries ready for PDB deposition. </ul> <p> <b>IMPORTANT NOTES:</b> <ol> <li> The LOG or output files generated from any software should not be modified. Otherwise, information may not be extracted. <li> If you have several structures ready to be deposited to the PDB site, you need to apply the <b>pdb_extract</b> program to each individual structure, since each structure requires a single PDB ID for deposition. <li> You may have a lot of trials for each step (data processing, heavy atom phasing, or density modification, or final structure refinement), but information extracted from each step should be only from the best trial that leads to next step toward solving your structure. <li> You may use different programs for heavy atom phasing solution. For example, you used program A to locate heavy atom positions and you used program B to refine heavy atom parameters (like x, y, z, occupancy and B factors etc.). Phasing statistics information will be extracted from the output of program B; therefore, <b>pdb_extract</b> should be applied to the output of program B. <li> You may also use different programs for final structure refinement, but <b>pdb_extract</b> should be only applied to the program which leads to your final structure deposition. </ol> </td> </tr> <!-------------- Program access--------------------> <tr><td bgcolor="lightblue"><strong><a name="prog_access"><center> <b>Program access </b><a href="#top">    TOP</a> </center></a></strong></td> </tr> <tr><td> The source code of <b>pdb_extract</b> can be downloaded from the address <a href="https://sw-tools.rcsb.org/apps/PDB_EXTRACT/source.html" target= "other" > https://sw-tools.rcsb.org/apps/PDB_EXTRACT/source.html</a>. The software is available under an Open Source license. <p> <p id="urlloc"> <p> <b>pdb_extract</b> has been integrated into <a href="http://www.ccp4.ac.uk" target= "other" >CCP4</a> and the CCP4i interface(Version 5.0 and above). Users can run <b>pdb_extract</b> under the CCP4 environment. </td> </tr> <!-------------- Installations-------------------> <tr><td bgcolor="lightblue"><strong><a name="instal"><center><b> Installations</b> <a href="#top">    TOP </a></center></a></strong> </td></tr> <tr><td> <p> <b>System Requirements:</b> <li> platform Intel-Linux: <li> Python (>3.6) <li> C/C++ compilers </td> </tr> </td> </tr> <!-------------- Installations of source code distribution-------------------> <tr><td bgcolor="lightblue"><strong><a name="instal_scr"><center><b> Installation of source code distribution</b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <pre> <b>Please refer to the README file in the package</b> </pre> </td> </tr> <!--------------Run the program -------------------> <tr><td bgcolor="lightblue"><strong><a name="run_prog"><center><b> Run the program</b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <pre> <b>Please refer to the README file in the package</b> </pre> </td> </tr> <!-------------- Data collection/reduction-------------------> <tr><td bgcolor="lightblue"><strong><a name="scl"><center><b> Data collection/reduction </b><a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <P> This section is used to collect statistical information from the LOG files generated by the programs for Data Scaling/Merging/Averaging. <P> <b>Important:</b> The log files must be generated from the LAST (or BEST) trial which corresponds to the files used for phasing or molecular replacement. <P> <P> The extracted information may be the following: <pre> * Intensities (or amplitude) and standard deviations * Data completeness (overall, resolution shells) * Redundancy (overall, resolution shells), mosaicity * R-merge, R-sym (overall, resolution shells) * average(I/sigma), (overall, resolution shells) * Total and unique reflections collected. * Resolution range </pre> <hr> <P> <b>  Some helpful hints for getting LOG files from the program of Data Scaling/Merging/Averaging</b><p> Using <A HREF="http://www.lnls.br/infra/linhasluz/denzo-hkl.htm " TARGET="other">HKL/HKL2000/scalepack </A><p> HKL (or HKL2000 or Scalepack) is a package by Otwinowski for data collection/reduction/scaling. You can use the graphical interface or the scalepack script to scale your data. The LOG file (e.g. scale1.log) contains statistics for PDB deposition. <br> The generated LOG file type is 'LOG'. <p> Using <A HREF="http://www.msc.com/protein/dtrek.html" TARGET="other"> D*trek </A><p> D*trek is a package by Jim Pflugrath at Rigaku/MSC for data collection/reduction/scaling. You can use the graphical interface to scale (or merge/average) your data. The LOG file (e.g. scale1.log) containing statistics is from the step of scaling data. <br> The generated LOG file type is 'LOG'. <p> Using <A HREF="http://www.bruker-axs.de/index.html" TARGET="other"> SAINT </A><p> SAINT is a package by Bruker (Siemens Molecular Analytical Research Tool) for data collection/reduction/scaling. The LOG file (e.g. scale1.ls) containing statistics is from the step of scaling data. <br> The generated LOG file type is 'LOG'. <p> Using <A HREF="http://www.ccp4.ac.uk/dist/html/scala.html" TARGET="other"> SCALA </A><p> SCALA/AIMLESS is the CCP4 supported program. It scales together multiple observations of reflections. SCALA generates <b>mmCIF or LOG </b> file containing useful statistics. When you run the programs, you must ask the program to export the data harvest file (mmCIF type). The mmCIF file will be name.scala or name.truncate. Otherwise, it will generate LOG file. <br> The generated LOG file type is 'LOG or mmCIF'. </td> </tr> <!--------------Molecular replacement -------------------> <tr><td bgcolor="lightblue"><strong><a name="mr"><center><b> Molecular replacement </b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <P> This section is used to collect key statistical information from Molecular Replacement. You may first generate a LOG file from the rotation function, then generate a LOG file from the translation function. You can upload the two LOG files into this section for data extraction. You can also upload one LOG file which is generated from MR. <P> <b>Important:</b> The log files must be generated from the LAST (or BEST) trial which corresponds to the files used for density modification or refinement. <P> <hr> The extracted information may be the following: <pre> * Low and high resolution used in rotation and translation. * Rotation and translation methods * Reflection cut off criteria, reflection completeness. * Correlation coefficients for I or F between observed and calculated. * R_factor, packing information, and model details. </pre> <hr> <b><center>Some helpful hints for getting LOG files from the program molecular replacement</center></b><p> <p><b> Using <A HREF="http://cns.csb.yale.edu/v1.1/ " TARGET="other"> CNS/CNX/XPLOR </A></b><p> CNS can be used to do molecular replacement. After you finish the translation search, you can get a log file called translation.list which contains all the information of molecular replacement. <p><b>Using <A HREF="http://www.ccp4.ac.uk/dist/html/INDEX.html " TARGET="other"> Amore (CCP4)</A> </b><p> Amore is a program for molecular replacement. It is distributed in the CCP4 package. After rotation and translation search, you will generate two log files rotation.log and translation.log. You may extract information from both log files <p> If you run the program in one script, you may generate one LOG file. Upload this LOG file to the web interface. <p><b>Using <A HREF="http://www.ccp4.ac.uk/dist/html/INDEX.html " TARGET="other"> Molrep(CCP4)</A> </b><p> Molrep is a program for molecular replacement. It is distributed in the CCP4 package. When you run the script, you can specify a LOG file name (e.g. molrep.log). All the statistic information will be recorded in the log file. <p><b> Using <A HREF="http://www.msg.ucsf.edu/local/programs/epmr/epmr.html " TARGET="other"> EPMR</A> </b><p> EPMR is a command line program for molecular replacement. When you run the program, please give a log file name like the following Epmr [options] files > epmr.log All the statisticial information will be written in the log file. <p> <b> Using <A HREF="http://www-structmed.cimr.cam.ac.uk/phaser/ " TARGET="other"> Phaser</A> </b><p> Phaser was developed by Randy Read's group at the University of Cambridge. It is a program for phasing macromolecular crystal structures with maximum likelihood methods. The program generates a LOG file which can be uploaded to the web interface for data extraction. </td> </tr> <!--------------Heavy atom phasing -------------------> <tr><td bgcolor="lightblue"><strong><a name="phs"><center><b> Heavy atom phasing </b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <P> Heavy atom phasing is performed at an earlier stage of structure determination. The log files generated from phasing contain important statistical information which should be deposited to the Protein Data Bank. <P> From heavy atom phasing, you may have LOG files and heavy atom coordinate file. <P> <pre> The phasing methods are the followings: * MR molecular replacement. * SAD single anomalous dispersion. * MAD multiple anomalous dispersion. * SIR single isomorphous replacement. * SIRAS single isomorphous replacement with anomalous scattering. * MIR multiple isomorphous replacement. * MIRAS multiple isomorphous replacement with anomalous scattering. </pre> <P> <b>Important:</b> The log files must be generated from the LAST (or BEST) trial which corresponds to the files used for density modification or refinement. <P> <hr> <pre> The following items may be extracted: * Wavelength, f_prime, f_double_prime, resolution range * FOM (acentric, centric, overall, resolution shells) * R-Cullis (acentric, centric, overall, resolution shells) * R-Kraut (acentric, centric, overall, resolution shells) * Phasing power (acentric, centric, overall, resolution shells) * Number of heavy atom sites, heavy atom type. * Heavy atom location method. * Heavy atom B-factor, occupancies, and xyz coordinates. </pre> <hr> <b> <center>Some helpful hints for getting the output files generated by various programs </center> </b><p> <p><b>Using <A HREF="http://www.solve.lanl.gov/ " TARGET="other"> SOLVE (version 2.00 and above):</A> </b><p> SOLVE is a program for finding heavy atom location and refining heavy atom parameters. The statistical information is written to a file <b>solve.prt</b> (default name used by the program). The heavy atom coordinates are written to a file <b>ha.pdb</b>. <p> <b>Note:</b> You may upload the two file names <b>solve.prt</b> (file type: LOG) and <b>ha.pdb</b> (file type: PDB). <p> <p><b> Using <A HREF="http://cns.csb.yale.edu/v1.1/ " TARGET="other"> CNS/CNX/XPLOR </A></b><p> <p> CNS is a complete software system for protein crystallography. The scripts for heavy atom location and phasing refinement are mad_phase.inp or ir_phase.inp. When you run these scripts, you will get output files like phase_final.summary, phase_final.sdb or mad_phase.fp. <p> The output file phase_final.summary has all the phasing statistics.<br> The output file phase_final.sdb has all the heavy atom coordinates, occupancies and B factors. <br> The output file mad_phase.fp has refined f_prime and f_double_prime. <br> <p> (Note: The refined heavy atom coordinates, B factors and occupancies can be found in a file like phase_final.sdb. If you prefer to convert to the PDB format, you can run the script sdb_to_pdb.inp. You will get a file phase_final.pdb with PDB format.) <p> <b>Note:</b> You may input at most three files (as shown above) for extracting phase information. <p> <p> <p><b>Using <A HREF="http://www.ccp4.ac.uk/dist/html/INDEX.html " TARGET="other"> MLPHARE (CCP4)</A> </b><p> <p> MLPHARE is a program in the CCP4 suite. It is used for refining heavy atom parameters. <p> If you use the CCP4i graphical interface or the script mode, you need to ask the program to write a harvesting file. Select the data havest button, when you use the CCP4i interface. Do not use the key word NOHARV, when you use script. After you finished running this program, you will get a file (e.g. name.mlphare) which is in mmCIF format. It contains all the information for heavy atom phasing refinement. <p> For extracting the wavelength information, you need to run program REVISE in the CCP4 (version 4.0-4.2.2). You may get a file (e.g. prephadata.log) <p> <b>Note:</b> You may input at most two files (as shown above) for extracting phase information. <p> <p> <p><b>Using <A HREF="http://babinet.globalphasing.com/sharp/ " TARGET="other">SHARP (version 1.3.x and 2.0 and above): </A> </b><p> <p> SHARP is a program for finding heavy atom positions and refining heavy atom parameters. When you run SHARP or autoSHARP, the log files which have useful information are normally in the directory sharpfiles/logfiles_local/dirs, where dirs are all the subdirectories for your various structures. Please note that the location of generated log files may depend on how the program is installed! <p> SHARP produces many output files. <br> <pre> For version 1.3.x: <i>Heavy.pdb</i> contains the heavy atom coordinates. <i>FOMstats.html</i> contains figure of merit statistics. <i>Otherstat.html</i> contains Rcullis, Rkraut, phasing power. For version 2.0 and above: <i>Heavy.pdb </i> contains the heavy atom coordinates. <i>FOMstats.html</i> contains figure of merit statistics. <i>RCullis_?.html</i> contains Rcullis. <i>PhasingPower_?.html</i> contains phasing power </pre> <p> The easiest way to obtain these files is to run the program from the SUSHI interface. Review all the log files from the internet browser and save the files as plain text files. <p> <b>Note:</b> You may input at most four files (as shown above) for extracting phase information. <p> <p><b>Using <A HREF="http://www.hwi.buffalo.edu/SnB/ " TARGET="other">SnB (version 2.0 and above):</A> </b><p> <p> SnB has no heavy atom parameter refinement, and it has no corresponding statistics. SnB gives the heavy atom or substructure coordinates (e.g. <i>heavy.pdb</i>) in PDB format. <p> <b>Note:</b> You may input only one file (as shown above) for phasing extraction. <p> <p><b>Using <A HREF="http://www.hwi.buffalo.edu/BnP/ " TARGET="other">BnP (version 0.93 and above):</A> </b><p> <p> BnP is a combination of program SnB and Phases. The heavy atom positions are located by SnB and the heavy atom parameters will be refined by Phases. <p> The log file (e.g. auto.log) can be found from the directory ~/PHASES/*. Log file normally contains phasing power for each phasing set. <p> The file is in LOG format. <p> <b>Note:</b> You may input at most one file (as shown above) for extracting phase information. <p> <p><b>Using <A HREF="http://shelx.uni-ac.gwdg.de/SHELX/ " TARGET="other">SHELXD or SHELXS (version 97):</A> </b><p> <p> Heavy atom or substructure coordinates are produced in PDB format (e.g. <i>heavy.pdb</i>). <p> <b>Note:</b> You may input at most one file (as shown above) for extracting phase information. <p> </td> </tr> <!-------------- Density modification-------------------> <tr><td bgcolor="lightblue"><strong><a name="dm"><center><b> Density modification </b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <P>Density modification is normally performed after obtaining phases. If you do density modification in your structure determination, statistics information is needed for PDB deposition. <P> If density modification is not done in a separate step, you may skip this step, since you do not have a log file specifically for density modification. <P> <b>Important:</b> The log files must be generated from the LAST (or BEST) trial which corresponds to the file used for refinement. <P> <hr> <pre> The following items may be extracted: * Density modification method. * FOM after density modification (overall, resolution shells) * Solvent mask determination method. * Structure solution software. </pre> <hr> <b><center>Some helpful hints for getting the output files from each program:</center></b> <p> <p><b>Using <A HREF="http://www.solve.lanl.gov/ " TARGET="other"> RESOLVE (version 2.00 and above):</A> </b><p> <p> RESOLVE is a density modification program in the SOLVE/RESOLVE package. Normally it runs together with SOLVE, but one can run it separately. When you run RESOLVE, you will get a log file like resolve.log. <p> Only one log file (resolve.log) is needed for extraction. File type is LOG. <p><b> Using <A HREF="http://cns.csb.yale.edu/v1.1/ " TARGET="other"> CNS/CNX/XPLOR </A></b><p> <p>The CNS user may need to run the input script like density_modify.inp. You will get a log file called density_modify.list. <p> Only one log file (density_modify.list) is needed for extraction. File type is LOG. <p> <p><b>Using <A HREF="http://www.ccp4.ac.uk/dist/html/INDEX.html " TARGET="other"> DM (CCP4)</A> </b><p> <p> DM is a density modification program in the CCP4 suit. When you run DM either by using the CCP4i graphic interface or the script, you will get a log file like dm.log. <p> Only one log file (dm.log) is needed for extraction. File type is LOG. <p><b>Using <A HREF="http://www.ccp4.ac.uk/dist/html/INDEX.html " TARGET="other"> SOLOMON (CCP4)</A> </b><p> <p> SOLOMON is also a another density modification program in the CCP4 suite. When you run DM either by using the CCP4i graphic interface or the script, you will get a log file like Solomon.log. <p> Only one log file (Solomon.log) is needed for extraction. File type is LOG. <p> </td> </tr> <!-------------- Final structure refinement-------------------> <tr><td bgcolor="lightblue"><strong><a name="ref"><center><b> Final structure refinement </b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <P> Structure refinement is performed at the end of structure determination. The atom coordinates are generated in PDB or mmCIF format and the statistics are generated in log files. The <b> <b>pdb_extract</b> </b>program is applied to extract statistical information: <P>Since statistics can be carried at the header section of PDB file, you may not provide any LOG files for some programs like CNS, REFMAC5. <P> <b>Important:</b> The log file and the coordinate file must be generated from the LAST (or BEST) trial which corresponds to the file that is used for deposition to the PDB. <P> <hr> <pre> The following items may be extracted: * Resolution range (highest res. shell) * Number of reflections used in refinement, and in R-Free set. * R-factor (overall, resolution shells) * Number of atoms refined * Cell parameters and space group. * The xyz coordinates of all the atoms. * RMS Bond Distances, Bond Angles, Chiral Volume, Torsion Angles * Isotropic temperature factor restraints * Non-crystallographic symmetry restraints * Solvent model used * Overall Average Isotropic B Factor * Overall Anisotropic B Factor * Overall Isotropic B Factor * Topology/parameter data used to refine deposited model * Refinement software </pre> <hr> <b><center>Some helpful hints for getting the output files from each program:</center></b> <p> <p><b>Using <A HREF="http://www.ccp4.ac.uk/dist/html/INDEX.html " TARGET="other"> REFMAC5 (CCP4): </A> </b><p> <p> REFMAC5 is a program for structure refinement used in the CCP4 suite. If you run this program using CCP4i or the script, you can get a PDB file with all the refinement information at the header section. <p> You may directly deposit this PDB file. <p> <p> <p><b> Using <A HREF="http://cns.csb.yale.edu/v1.1/ " TARGET="other"> CNS/CNX/XPLOR </A></b><p> <p> CNS/CNX/XPLOR is a program for final structure refinement. It exports coordinate file in both PDB and mmCIF format. You need the script deposit_mmcif.inp to generate the mmCIF format. <p> The mmCIF file carries more statistical information than the PDB file. Authors are encouraged to deposit the mmCIF file, otherwise authors may need to manually fill in more information. <p> You may not have to give any LOG file generated from CNS/CNX/XPLOR. <p> <p><b>Using <A HREF="http://shelx.uni-ac.gwdg.de/SHELX/ " TARGET="other">SHELXL (version 97):</A> </b><p> <p> SHELXL is a sub_program in the SHELX package. It is used for structure refinement. After you finish structure refinement, you need to run the shelxpro interactive program and use option B. After going through the shelxpro, you will get a PDB file (e.g. name.pdb) with header information. <p> <p><b>Using <A HREF=" http://www.uoxray.uoregon.edu/tnt/welcome.html" TARGET="other">TNT (version 5f):</A> </b><p> <p> TNT is a crystal structure refinement program. Data from this program can be extracted from the output PDB file and some LOG files. You can use the to_pdb command to convert coordinates in TNT format (name.cor) to the PDB format (name.pdb). <p> The command is: to_pdb name.cor <p> After finishing refinement, you must use command rfactor to generate a log file (e.g. rfactor.log) which contains the refinement statistics. <p> The command is: rfactor name.cor > rfactor.log <p> To extract the symmetry information, user must provide the symmetry file (e.g. p6122.dat). This information is in the control file name.tnt <p> <p><b>Using <A HREF="http://www.embl-hamburg.de/ARP/" TARGET="other"> ARP/wARP:</A> </b><p> <p> ARP/wARP is a automatic program for model building and refinement. REFMAC5 is used for the structure refinement step. <p> The new version (6.0 or above) can use CCP4i as graphic interface. You can run this program either by CCP4i or by using script. You will get a log file (for example warpNtrace_refine.log). You also get a PDB file like warpNtrace.pdb. <p> Note: If the coordinate file warpNtrace.pdb is directly used for deposition, you can use this option. Otherwise, use other program for final refinement. <p><b>Using <A HREF="http://www.phenix-online.org/" TARGET="other"> PHENIX </A> </b><p> <p> PHENIX is a new software suite for the automated determination of macromolecular structures using X-ray crystallography and other methods. <p> The PDB file generated by phenix.refine has the non-standard 'REMARK' and the standard 'REMARK 3'. It is also OK to keep the non-standard REMARK for deposion. <p> Note: Sometimes, the MTZ file from PHENIX only contains 2Fo-Fc. Before deposition, you must make sure that the amplitude (Fo) or Intensity (I) is included in the MTZ file. <p> </td> </tr> <!--------------tables -------------------> <tr><td bgcolor="lightblue"><strong><a name="tables"><center><b> Tables </b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> Below are the two Tables. One is for all the command options and the other is for the software supported by pdb_extract. </td> </tr> <!-------------- command options-------------------> <tr><td bgcolor="lightblue"><strong><a name="tables_opt"><center><b> <a href="#top">    TOP</a> command options </b> </center></a></strong></td> </tr> <tr><td> <center><table cellpadding=5 border bgcolor="lightyellow"> <CAPTION ALIGN=top> command line options consist of three executable components of pdb_extract. <br><b>pdb_extract</b> is used to capture the details of molecular replacement, heavy atom phasing, density modification and structure refinement. <br> <b>pdb_extract_sf</b> is used to convert all other structure factor format to mmCIF format for PDB deposition, <br> <b> extract</b> is used to generate a data template file (data_template.text) and a script file (log_script.inp). </CAPTION> <!--- --> <tr> <td colspan=3 align=middle> <b>pdb_extract</b> [OPTION]... [FILE]... </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b>Option </b> </td> <td colspan=2> <FONT SIZE=3 ><b><center> Arguments followed by each option</center> </b> </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b>-o </b> </td> <td colspan=2> <FONT SIZE=3 > output file name (default name is pdb_extract.mmcif) </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -e </b> </td> <td colspan=2> <FONT SIZE=3 > one of experimental methods <br> (MR| SAD | MAD | SIR | MIR | SIRAS | MIRAS) </FONT> </td></tr> <!---index ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -i </b> </td> <td colspan=2> <FONT SIZE=3 > one of programs for indexing<br> [HKL | DENZO | DTREK | MOSFLM] </FONT> </td></tr> <!--- scaling for refinement data ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -s </b> </td> <td colspan=2> <FONT SIZE=3 > one of programs for reflection data scaling (used for refinement)<br> [SCALA |AIMLESS HKL | SCALEPACK | DTREK | SAINT | 3DSCALE | XSCALE | XENGEN | PROSCALE] </FONT> </td></tr> <!---MR ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -m </b> </td> <td colspan=2> <FONT SIZE=3 > one of programs for molecular replacement<br> [AMORE | CNS | XPLOR | EPMR | MOLREP | BEAST | PHASER | COMO] </FONT> </td></tr> <!---Phasing ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -p </b> </td> <td colspan=2> <FONT SIZE=3 > one of programs for heavy atom phasing<br> [CNS | XPLOR | MLPHARE | SOLVE | SHELX | SNB | BnP | BP3 | SHARP | PHASER | PHASES | WARP] </FONT> </td></tr> <!---REFINE ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -r </b> </td> <td colspan=2> <FONT SIZE=3 > one of programs for final structure refinement<br> [CNS | XPLOR | REFMAC5 | SHELX | TNT | BUSTER | PROLSQ | NUCLSQ | RESTRAIN | PHENIX | MAIN] </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -ipdb </b> </td> <td colspan=2> <FONT SIZE=3 > the input file with PDB format. </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -icif </b> </td> <td colspan=2> <FONT SIZE=3 > the input file with mmCIF format. </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -ient </b> </td> <td colspan=2> <FONT SIZE=3 > the input file data_template.text. (for complete sequence.)<br> (It is generated by '</b>extract</b> -pdb pdbfile') </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -idat </b> </td> <td colspan=2> <FONT SIZE=3 > the reflection data file to get < I/SigmaI > (optional) </FONT> </td></tr> <!--- ------------- --> <!---+++++++++++++ pdb_extract_sf +++++++++++++------------ --> <tr> <td colspan=3 align=middle><FONT SIZE=3 > <b>pdb_extract_sf</b> [OPTION]... [FILE]... </td></tr> <tr> <td colspan=2> <FONT SIZE=3> <b> -o </b> </td> <td colspan=2> <FONT SIZE=3 > output file name (default name is pdb_extract_sf.mmcif). </FONT> </td></tr> <!---data type (refine) ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -rt </b> </td> <td colspan=2> <FONT SIZE=3 > data type (I or F) in the reflection data file <br> (used for final structure refinement!) </FONT> </td></tr> <!--- data format (refine) ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -rp </b> </td> <td colspan=2> <FONT SIZE=3 > One of data formats <br> (CNS | mmCIF | SHELX | TNT | HKL/Scalepack | DTrek | SAINT | OTHER )<br> (used for final structure refinement!) </FONT> </td></tr> <!--- data type (phase) ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -dt </b> </td> <td colspan=2> <FONT SIZE=3 > data type (I or F) after data reduction at beam line.<br> (used for phase determination!) </FONT> </td></tr> <!--- data format (phase) ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -dp </b> </td> <td colspan=2> <FONT SIZE=3 > One of programs for data reduction ( HKL/Scalepack, DTrek, SAINT | OTHER ).<br> (used for phase determination!) </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -c </b> </td> <td colspan=2> <FONT SIZE=3 > crystal number (like 1, 2, 3 ...) used for diffraction. </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -w </b> </td> <td colspan=2> <FONT SIZE=3 > wavelength number (like 1, 2, 3 ...) used for diffraction. </FONT> </td></tr> <!--- ------------- --> <tr> <td colspan=2> <FONT SIZE=3> <b> -idat </b> </td> <td colspan=2> <FONT SIZE=3 > data file name used for phasing or structure refinement. </FONT> </td></tr> <!--- ------------- --> <!--- ------------- --> </TABLE> <!-------------- Crystallographic software lists-------------------> <tr><td bgcolor="lightblue"><strong><a name="tables_list"><center><b> Supported crystallographic software lists </b> <a href="#top">    TOP</a></center></a></strong></td> </tr> <tr><td> <ul> <li> PHENIX, REFMAC, CNS/CNX/XPLOR, BUSTER </li> <li> AIMLESS, HKL-2000, HKL-3000, XSCALE, XDS, SCALA, SCALEPACK, XIA2, DIALS, POINTLESS, d*TREK </li> </ul> </td> </tr> </table> </div> <footer class="container_12 ui-corner-all"> <p>© <a href="http://home.rcsb.org">RCSB PDB</a></p> </footer> </div> <script type="text/javascript"> // At end to ensure loaded var protocol = location.protocol; var slashes = protocol.concat("//"); var host = slashes.concat(window.location.hostname); document.getElementById("urlloc").innerHTML= 'The web interface can be accessed at <a href="' + host + '" target= "other" >' + host + '</a>'; document.getElementById("urlloc2").innerHTML= 'Use the <b>pdb_extract</b> <a href="' + host + '" target="other"> Web interface</a>'; </script> </BODY> </html>