Corpora: protein name list

George Demetriou g.demetriou at dcs.shef.ac.uk
Thu Nov 1 15:24:22 UTC 2001


You can find lists of protein names at the following sites:

SCOP database:
http://scop.mrc-lmb.cam.ac.uk/scop/

CATH database:

http://www.biochem.ucl.ac.uk/bsm/cath/index.html

Protein Data Bank:

ftp://ftp.rcsb.org/pub/pdb/

Enzyme names from Expasy:

ftp://www.expasy.ch/databases/enzyme

Also, lists of protein names we have used in the PASTA project
(http://www.dcs.shef.ac.uk/nlp/pasta/) can be made available on request.
Some of the terms in the lexicons were derived from the above sources
but were supplemented with protein names extracted from texts.


George Demetriou


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
                      Dr George Demetriou

Dept. of Computer Science               Room: 219
The University of Sheffield             Tel: +44 (0) 114 2221894
Regent Court                            FAX: +44 (0) 114 2221810
211 Portobello Street                   e-mail: demetri at dcs.shef.ac.uk
Sheffield, S1 4DP, UK
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

> Dear Colleagues,
>
> I am collecting protein name lists for bioinformatics research.
> Does anyone know of a public protein name list?
>
> Thanks.
>
> Hsin-Hsi Chen
> National Taiwan University


