[Corpora-List] Comparing files

Otto Lassen otto at lassen.mail.dk
Sat Nov 15 20:28:49 UTC 2003


Hi
That could be done in any language:
1. sort then two lists
2. compare them word for word
3. output words which are not in both lists
Regards
Otto Lassen

At 21:54 15-11-2003 +0100, you wrote:
>Hi,
>
>I'm doing a project that involves comparing two very large word lists
>(~40.000 and 70.000 words). What I need to find out, is which words are on
>one list and not on the other (and/or vice versa).
>Can anyone give me a hint as to how to do this? (I was thinking; maybe a
>perl script?)
>
>Any help will be greatly appreciated.
>Best,
>Tine Lassen

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20031115/69c15965/attachment.htm>


More information about the Corpora mailing list