[Corpora-List] Wordsmith concordance

z.xiao at lancaster.ac.uk z.xiao at lancaster.ac.uk
Wed Dec 18 13:12:06 UTC 2002


In message <3DFF01D0.7677.48A18E at localhost> "Anne Harrap" <aharrap at brookes.ac.uk> writes:
> Does anyone else get a lot of duplicated entries when doing a
> concordance in Wordsmith?
>
> Not sure if this is a bug or we are doing something wrong...
>

I had this problem some time ago. One possible reason is that the text file(s) loaded into the concordancer is tab-delimited (e.g. the concordances downloaded from the BNCweb). I am not sure why WordWmith is incompatible with this type of text, but it is.

If you are working with concordances downloaded from the BNCweb, another problem is that some lines may contain incomplete POS tag marks (i.e. only tag start mark < or tag stop mark >). In this case, if you do not select the box for 'Tags to ignore' (WordSmith settings), there will also be a problem.

Richard

-------------------------
Dr Zhonghua Xiao
Department of Lingusitics
Lancaster University
LA1 4YT



More information about the Corpora mailing list