COACH-D: protein-ligand binding site prediction with refiend ligand pose by molecular docking

COACH-D

Help

About COACH-D
Submit a job to COACH-D
Download the complete set of results
Output explanation
How to cite COACH-D?
Need more help?

About COACH-D

The input of COACH-D 2.0 can either be the sequence or the structure of the query protein. Users can also submit a ligand for docking.
As shown in Figure 1, the COACH-D 2.0 workflow is as follows:

(1) If the sequence of the query protein (monomer only) is provided, the I-TASSER Suite or AlphaFold2 will be used to predict the query protein structure.
(2.1) If the structure is a monomer, it will be processed by five individual methods (TM-SITE, S-SITE, COFACTOR, FINDSITE and ConCavity ) to predict the protein-ligand
binding pockets and residues. Four of these methods make predictions by searching the query structure/sequence through the Q-BioLiP and BioLiP template libraries.
(2.2) If the structure is a multimer, it will be processed by the quaternary-based method Q-SITE.
(3) Both template-searched ligands and user-submitted ligand are docked into the predicted binding pockets using the efficient docking algorithm AutoDock Vina.

Figure 1. The overall architecture of COACH-D 2.0

Submit a job to COACH-D

Job submission guide:

(1) Paste or upload a query protein. The supported input formats include: Monomeric structure/sequence & Multimeric structure.
(2) Provide the ligand file(optional).
(3) Choose whether to keep your results private.
(4) Submit and reset.
Notice: Due to computing resource limitation, we now allow no more than 10 running/pending jobs per user at the same time.

Figure 2. The submission page of COACH-D 2.0

Download the complete set of results

There are separate download links for the following:

(1) The submitted protein
(2) The submitted ligand (optional)
(3) A zip file containing the main results

Figure 3. The download link of COACH-D 2.0

Output explanation

The COACH-D 2.0 results are usually summarized in a web page that consists of three sections:
    (1) Visualization of each predicted protein-ligand complex structure.
    (2) Top five predicted protein-ligand complex structures.
    (3) Summary table of potential binding ligands.

(1) Visualization of each predicted protein-ligand complex structure.
         This section displays the predicted binding of five pockets to the ligand.
         Predicted complex structure: Visualization of predicted complex structure.
         Current complex Information: The detailed information of the predicted complex on the left side.

Figure 4. Visualization of each predicted protein-ligand complex structure

(2) Top five predicted protein-ligand complex structures.

This section summarizes the top five predictions in a table.

Rank : The predictions are ranked by the C-Score.
Pose^u: The ligand binding pose submitted by user.
Pose^t: The ligand binding pose obtained from the PDB template.
C-Score: The confidence score of the prediction. C-Score ranges [0-1], with a higher score indicating greater reliability.
Cluster Size : The number of protein template in a cluster.
P Template : The representative protein template in a cluster.
L Template: The representative ligand template in a cluster.
TM-score: The structural similarity between the query and template proteins, normalized by the length of the query protein.
Predicted binding residues: The consensus prediction based on spatial clustering of the predicted binding sites from individual methods.

Figure 5. Top five predicted protein-ligand complex structures

(3) Summary table of potential binding ligands.

This section provides a table summarizing the information about potential ligands predicted to bind the query protein.

Rank: The ligands are ranked by their frequencies in the cluster.
Ligand: Click on the ligand ID to view more information in Q-BioLiP database and click on the download icon to get the corresponding SDF file.
Frequency: The total number of times the ligand occurs in the cluster.
PDB template: The PDB structures of the ligand in the cluster. At most 20 templates are listed for each ligand.
Visualization: Visualizaiton that can be rotated and zoomed in/out with the mouse (powered by 3Dmol).
Others: Other ligands in the cluster. The numbers in brackets are the frequencies. At most 100 ligands are listed.

Figure 6. Summary table of potential binding ligands

How to cite COACH-D?

Please cite the following article when you use the COACH-D 2.0 server:
Q Wu, Z Peng, Y Zhang, J Yang, COACH-D: improved protein-ligand binding site prediction with refined ligand-binding poses through molecular docking, Nucleic Acids
Research, 46: W438–W442 (2018).

Need more help?

If you have more questions or comments about the server, please email yangjy@sdu.edu.cn.