ProDom 2002.1 Statistics

Sequence and Family Data

Proteins used to build ProDom 2002.1:
non fragmentary sequences from
SwissProt (Rel 40.18) + TREMBL (Rel 20.6) - May 22, 2002
481952
The clustering process was initiated with domains from the SCOP database. rel. 1.59
domain families with at least 2 sequences138322
(+27 % since 2001.3)
domain families365172
(+19% since 2001.3)


Global statistics

Distribution of the number of domains per sequence

NOTE - The sequences with more than 50 domains are not shown.


Distribution of the radius of gyration

NOTES -

  1. The families with radius of gyration higher than 200 PAM are not shown
  2. Only families with at least 2 sequences are used here


distribution of the diameter

NOTES -

  1. The families with diameter higher than 500 PAM are not shown
  2. Only families with at least 2 sequences are used here


ProDom-SG statistics

Number of ProDom families selectionned: 137077
(82300 have no related family)
Number of ProDom families directly linked to PDB: 5710
Number of ProDom families linked to PDB via a related family: 8099
Number of ProDom families devoid of structure: 123268
Number of mono-domain proteins:86775