ProDom 2003.1 Statistics

Sequence and Family Data

Proteins used to build ProDom 2003.1
(non-fragmentary sequences from SwissProt (Rel 40.41) + TREMBL (Rel 22.10), Jan 31, 2003): 556964
The clustering process was initiated with domains from the SCOP database, release 1.61.
Domain families with at least 2 sequences: 144444 (+4.4% since 2002.1)
Domain families: 391935 (+7.3% since 2002.1)
PDB links: 13391
Prosite links (Patterns, Profiles and Prefiles): 1467
Pfam-A links: 5709
InterPro links: 6833


Global statistics

Distribution of the number of domains per sequence

NOTE - The sequences with more than 50 domains are not shown.


Distribution of the radius of gyration

NOTES -

  1. The families with radius of gyration higher than 200 PAM are not shown
  2. Only families with at least 2 sequences are used here


Distribution of the diameter

NOTES -

  1. The families with diameter higher than 500 PAM are not shown
  2. Only families with at least 2 sequences are used here
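
This page does not define the two measures plotted above. The following is a minimal sketch only, assuming that the radius of gyration is the root-mean-square PAM distance from each domain to the family consensus, and that the diameter is the largest PAM distance between two domains of the same family. The function names and the sample distances are hypothetical. Only the cutoffs (200 and 500 PAM) and the 2-sequence minimum come from the notes.

```python
import math

def radius_of_gyration(dist_to_consensus):
    # Assumed definition: RMS of the PAM distances from each domain to the consensus.
    n = len(dist_to_consensus)
    return math.sqrt(sum(d * d for d in dist_to_consensus) / n)

def diameter(pairwise_distances):
    # Assumed definition: largest PAM distance between any two domains of the family.
    return max(pairwise_distances)

def is_plotted(n_sequences, value, cutoff):
    # Filters stated in the notes: at least 2 sequences, value not above the cutoff.
    return n_sequences >= 2 and value <= cutoff

# Made-up distances for a hypothetical three-domain family.
to_consensus = [30.0, 40.0, 50.0]
pairwise = [55.0, 70.0, 85.0]

rg = radius_of_gyration(to_consensus)
dia = diameter(pairwise)
print(round(rg, 2), dia)
print(is_plotted(3, rg, 200), is_plotted(3, dia, 500))
```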


ProDom-SG statistics

Number of ProDom families selected: 137077
(82300 have no related family)
Number of ProDom families directly linked to PDB: 5710
Number of ProDom families linked to PDB via a related family: 8099
Number of ProDom families devoid of structure: 123268
Number of mono-domain proteins: 86775
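
The three structure categories above appear to partition the selected families, since their counts add up to the stated total. A short arithmetic check using only the figures listed in this section (the variable names are illustrative):

```python
# Counts copied from the ProDom-SG statistics above.
selected = 137077
direct_pdb = 5710
via_related = 8099
no_structure = 123268

# Families with some structural link, direct or through a related family.
with_structure = direct_pdb + via_related

# The three categories should account for every selected family.
total = with_structure + no_structure

# Share of selected families that are devoid of structure.
fraction_no_structure = no_structure / selected

print(total == selected)
print(with_structure)
print(f"{fraction_no_structure:.1%}")
```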