Corpora: protein name list
George Demetriou
g.demetriou at dcs.shef.ac.uk
Thu Nov 1 15:24:22 UTC 2001
You can find lists of protein names in the following sites:
SCOP database:
http://scop.mrc-lmb.cam.ac.uk/scop/
CATH database:
http://www.biochem.ucl.ac.uk/bsm/cath/index.html
Protein Data Bank:
ftp://ftp.rcsb.org/pub/pdb/
Enzyme names from Expasy:
ftp://www.expasy.ch/databases/enzyme
Also, lists of protein names we have used in the PASTA project
(http://www.dcs.shef.ac.uk/nlp/pasta/) can be made available on request.
Some of the terms in the lexicons were derived from the above sources
but were supplemented with protein names extracted from texts.
George Demetriou
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Dr George Demetriou
Dept. of Computer Science Room: 219
The University of Sheffield Tel: +44 (0) 114 2221894
Regent Court FAX: +44 (0) 114 2221810
211 Portobello Street e-mail: demetri at dcs.shef.ac.uk
Sheffield, S1 4DP, UK
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
> Dear Colleagues,
>
> I am collecting protein name list for bioinformatics research.
> Does anyone know of public protein name list?
>
> Thanks.
>
> Hsin-Hsi Chen
> National Taiwan Unversity
More information about the Corpora
mailing list