The UniDoc domain prediction modeling are generally summarized in a webpage, the link of which is sent to the users after the decomposing is completed (
). This page includes a detailed explanation on the data listed on the UniDoc output page.
About UniDoc
The input to UniDoc is protein sequence or 3D structure, as shown in Figure 1, the UniDoc works as follows.
(1) When protein structure is submitted, the distance matrix is extracted from 3D structure. Then, we use the hierarchical clustering to decompose the protein.
(2) When protein sequence is submitted, the distance matrix predicted by our recent deep learning based structure prediction algorithm trRosetta. Then, Then, we use the hierarchical clustering to decompose the protein.
Figure 1. The flowchart of the UniDoc algorithm.