CATH is a classification of protein structures downloaded from the Protein Data Bank. We group protein domains
into superfamilies when there is sufficient evidence they have diverged from a common ancestor. The files contained in this dataset correspond to the version 4.2 release of the CATH classification.