CATH

CATH protein structure classification database

CATH is a hierarchical classification of protein domain structures that clusters proteins at four major levels: Class (C), Architecture (A), Topology (T) and Homologous superfamily (H). The boundaries and assignments for each protein domain are determined by a combination of automated and manual procedures, including computational techniques, empirical and statistical evidence, literature review and expert analysis.
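
The four levels correspond to the dot-separated numbers of a CATH node ID such as 1.10.10.10 (the example node listed under the example queries). The following is a minimal sketch, assuming a node ID holds one to four dot-separated integers in C.A.T.H order; the names `CathNode` and `parse_cath_node` are illustrative and not part of any CATH API.

```python
from typing import NamedTuple, Optional


class CathNode(NamedTuple):
    """One node in the hierarchy; deeper levels are None when absent."""
    class_: int                         # C: Class
    architecture: Optional[int] = None  # A: Architecture
    topology: Optional[int] = None      # T: Topology
    superfamily: Optional[int] = None   # H: Homologous superfamily


def parse_cath_node(node_id: str) -> CathNode:
    """Split a dotted node ID into its C, A, T and H numbers.

    Assumption: the ID has 1 to 4 dot-separated non-negative integers.
    """
    parts = node_id.strip().split(".")
    if not 1 <= len(parts) <= 4:
        raise ValueError(f"expected 1 to 4 levels, got {len(parts)}: {node_id!r}")
    if not all(part.isdigit() for part in parts):
        raise ValueError(f"non-numeric level in node ID: {node_id!r}")
    return CathNode(*(int(part) for part in parts))


node = parse_cath_node("1.10.10.10")
print(node.class_, node.architecture, node.topology, node.superfamily)
```

A partial ID such as "1.10" would leave `topology` and `superfamily` as `None`, which is one way to represent a node higher up the hierarchy.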

ID: CATH
Home: http://www.cathdb.info/
EDAM topic: Protein domains
EDAM topic: Sequence clustering
Taxon: all
SOAP: http://api.cathdb.info/api/soap/dataservices/wsdl

Available data

Data | Format | Query | Link
CATH domain report | HTML | CATH domain ID | http://www.cathdb.info/domain/%s
CATH domain report | XML | CATH domain ID | http://www.cathdb.info/domain/%s?view=xml
Protein features (domains) {CATH chain} | HTML | Polypeptide chain ID | http://www.cathdb.info/chain/%s
Protein features (domains) {PDB} | HTML | PDB ID | http://www.cathdb.info/PDB/%s
Protein features (domains) {PDB} | XML | PDB ID | http://www.cathdb.info/PDB/%s?view=xml
CATH node | HTML | CATH node ID | http://www.cathdb.info/cathnode/%s
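
In each link, `%s` is a placeholder for the query identifier. A minimal sketch of the substitution follows, assuming the templates are used exactly as listed. The dictionary keys and the `build_url` helper are hypothetical names chosen for illustration, and the code only builds strings: it makes no network request and does not check that the URLs still resolve.

```python
from urllib.parse import quote

# URL templates copied from the table above; the (kind, format) keys
# are illustrative labels, not identifiers defined by CATH.
URL_TEMPLATES = {
    ("domain", "html"): "http://www.cathdb.info/domain/%s",
    ("domain", "xml"): "http://www.cathdb.info/domain/%s?view=xml",
    ("chain", "html"): "http://www.cathdb.info/chain/%s",
    ("pdb", "html"): "http://www.cathdb.info/PDB/%s",
    ("pdb", "xml"): "http://www.cathdb.info/PDB/%s?view=xml",
    ("cathnode", "html"): "http://www.cathdb.info/cathnode/%s",
}


def build_url(kind: str, fmt: str, identifier: str) -> str:
    """Fill the %s placeholder of the matching template.

    The identifier is percent-encoded so unexpected characters cannot
    alter the path; letters, digits and dots pass through unchanged.
    """
    template = URL_TEMPLATES[(kind, fmt)]
    return template % quote(identifier, safe="")


print(build_url("domain", "xml", "1cukA01"))
print(build_url("cathnode", "html", "1.10.10.10"))
```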

Example queries

Data | Format | Query | Example
CATH domain report | HTML | CATH domain ID | 1cukA01
CATH domain report | XML | CATH domain ID | 1cukA01
Protein features (domains) {CATH chain} | HTML | Polypeptide chain ID | 1cukA
Protein features (domains) {PDB} | HTML | PDB ID | 1cuk
Protein features (domains) {PDB} | XML | PDB ID | 1cuk
CATH node | HTML | CATH node ID | 1.10.10.10
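
The example identifiers nest: the domain ID 1cukA01 extends the chain ID 1cukA, which in turn extends the PDB ID 1cuk. The sketch below splits a domain ID along those lines. It assumes a four-character PDB code, one chain character and a two-digit domain number, a layout inferred from the examples rather than stated in this entry; `DomainId` and `split_domain_id` are hypothetical names.

```python
import re
from typing import NamedTuple


class DomainId(NamedTuple):
    pdb_id: str         # e.g. "1cuk"
    chain_id: str       # e.g. "1cukA" (PDB code plus chain character)
    domain_number: int  # e.g. 1


# Assumed layout: digit + 3 alphanumerics, 1 chain character, 2 digits.
_DOMAIN_RE = re.compile(r"^([0-9][A-Za-z0-9]{3})([A-Za-z0-9])([0-9]{2})$")


def split_domain_id(domain_id: str) -> DomainId:
    """Derive the PDB ID and chain ID embedded in a domain ID."""
    match = _DOMAIN_RE.match(domain_id)
    if match is None:
        raise ValueError(f"unrecognised domain ID layout: {domain_id!r}")
    pdb_code, chain_char, number = match.groups()
    return DomainId(pdb_code, pdb_code + chain_char, int(number))


print(split_domain_id("1cukA01"))
```

Under these assumptions, one domain ID is enough to derive the query values for the chain and PDB reports as well.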