SYNOPSIS

concavity [options] \s-1PDBFILE\s0 \s-1OUTPUT_NAME\s0

DESCRIPTION

ConCavity predicts protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

ConCavity takes as input a \s-1PDB\s0 format protein structure \s-1PDBFILE\s0 and optionally files that characterize the evolutionary sequence conservation of the chains in the structure file.

The following result files are produced by default:

  • Residue ligand binding predictions for each chain (*.scores).

  • Residue ligand binding predictions in a \s-1PDB\s0 format file (residue scores placed in the temp. factor field, *_residue.pdb).

  • Pocket prediction locations in a \s-1DX\s0 format file (*.dx).

  • PyMOL script to visualize the predictions (*.pml).

To visualize the predictions in PyMol (it if is installed on your system), load the script by typing \*(L"pymol 1G6C_test1.pml\*(R" at the prompt or by loading it through the pymol interface.

The \s-1PDB\s0 and \s-1DX\s0 files can be input into other molecular viewers if preferred. Several additional output formats are available; see below. Note that the residue numbering in the .scores files may not match that of the \s-1PDB\s0 file.

The ConCavity approach proceeds in three conceptual steps: grid creation, pocket extraction, and residue mapping (see Methods in paper). First, the structural and evolutionary properties of the protein are used to create a regular 3D grid surrounding the protein in which the score associated with each grid point represents an estimated likelihood that it overlaps a bound ligand atom. Second, groups of contiguous, high-scoring grid points are clustered to extract pockets that adhere to given shape and size constraints. Finally, every protein residue is scored with an estimate of how likely it is to bind to a ligand based on its proximity to extracted pockets.

Each of the algorithms described for these steps is implemented in concavity. See the examples.

REFERENCES

Capra \s-1JA\s0, Laskowski \s-1RA\s0, Thornton \s-1JM\s0, Singh M, and Funkhouser \s-1TA\s0(2009) Predicting Protein Ligand Binding Sites by Combining Evolutionary Sequence Conservation and 3D Structure. PLoS Comput Biol, 5(12).

OPTIONS

\s-1PDBFILE\s0 is a protein structure file in \s-1PDB\s0 format. \s-1OUTPUT_NAME\s0 becomes part of the output file names and may not contain \*(L"/\*(R". Output is written to the current directory.

Input

-conservation \s-1PATH\s0

If the \*(L"-conservation\*(R" option is not given, then conservation information is not considered. Note that there are separate conservation files for each protein chain in the structure, and the input to the -conservation option is the prefix of these files. Pre-computed conservation files available for almost the entire \s-1PQS\s0 on the ConCavity web site. If you'd like to compute sequence conservation values for your own alignments, we recommend the \s-1JSD\s0 algorithm: <http://compbio.cs.princeton.edu/conservation/>, available as score_conservation\|(1) from the conservation-code package.

Grid Creation

-grid_method ligsite|surfnet|pocketfinder|custom
-resolution int int int

Set the grid resolution.

-spacing float

Set the grid spacing.

Pocket Extraction

-extraction_method search|topn|custom
-extraction_threshold_range_cutoff \s-1FLOAT\s0

Stop the iterative search method when the diameter of the binary search window is less than -extraction_threshold_range_cutoff * upper_threshold. Recommended value: 1e-6. Default: 0.

Residue Mapping

-res_map_method blur|dist|dist-thresh|custom

Each of these algorithms is described in the text, and each has a number of additional parameters that change their behavior. The \*(L"custom\*(R" option allows you to set the values of all parameters for each step yourself. The presets (e.g. ligsite, search, blur) may override values you set on the command line, so use \*(L"custom\*(R" to have complete control.

Output

There are also several output format options. Pocket prediction grid values can be output in the following formats:

-print_grid_dx 0|1

\s-1DX\s0 format. This is 1 by default.

-print_grid_pdb 0|1

\s-1PDB\s0 format. The residue predictions are output as a \s-1PDB\s0 file with the residue scores mapped to the temp. factor field and pocket numbers to the residue sequence field.

-print_grid_txt 0|1

Raw text.

-v

Verbose mode.

EXAMPLES

Note: you may have to copy and uncompress the example data files before running the following examples.

1.

This will run concavity with default values (equivalent to ConCavity^L in the paper) on the structure 1G6C.pdb and consider the conservation values found in conservation_data/. This set of predictions will be called \*(L"test1\*(R". This produces the following default result files in the current directory:

 concavity -conservation /usr/share/doc/concavity/examples/conservation_data/1G6C /usr/share/doc/concavity/examples/1G6C.pdb test1
2.

For example to score the structure 1G6C.pdb with ConCavity_Pocketfinder, Search, and Blur, you'd type: concavity -conservation /usr/share/doc/concavity/examples/conservation_data/1G6C -grid_method pocketfinder -extraction_method search -res_map_method blur /usr/share/doc/concavity/examples/1G6C.pdb cc-pocketfinder_search_blur

NOTES

The authors primarily use PyMol and Chimera for visualization, but the range of output formats means you should be able to import the data into most structural analysis program. Let us know if there are other output formats you'd like to see.

RELATED TO concavity…

Concavity Homepage <http://compbio.cs.princeton.edu/concavity/>
score_conservation\|(1)