[Corpora-List] Re. Concordancer for Chinese (Summary of reply)

Kiril Simov kivs at bultreebank.org
Tue Oct 8 11:14:53 UTC 2002


Dear Linda,

I am not a specialist in Chinese, but if your documents are Unicode-based
and can be represented as XML documents, you could try our system,
CLaRK. It is an XML-based system for corpus development, and it
includes a Unicode XML editor, the XPath language for navigation in
XML documents, an XSLT engine for transformation of XML documents,
cascaded regular grammars, constraints over XML documents,
tokenizers, a concordance tool, and Extract, Remove and other tools.
The system is freely available at:

http://www.bultreebank.org/clark/index.html
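
To illustrate the general idea of a concordance over Unicode XML text,
here is a minimal Python sketch. It is not CLaRK's implementation; the
function name kwic, the sample document and the context width are all
assumptions made for the example. Since Chinese text has no spaces, the
keyword is matched as a plain string of characters, so a multi-character
"word" is found without any prior word segmentation.

```python
import re
import xml.etree.ElementTree as ET


def kwic(xml_text, keyword, width=5):
    """Return (left, match, right) triples for each occurrence of keyword.

    Hypothetical helper: context is measured in characters, not words.
    """
    root = ET.fromstring(xml_text)
    # Join all text nodes; nothing is inserted between them because
    # Chinese text is not space-delimited.
    text = "".join(root.itertext())
    hits = []
    # re.escape makes the keyword match literally, as a string of characters.
    for m in re.finditer(re.escape(keyword), text):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        hits.append((left, m.group(), right))
    return hits


# Made-up sample document, used only for illustration.
sample = "<doc><s>我喜欢语言学。</s><s>语言学很有趣。</s></doc>"

for left, hit, right in kwic(sample, "语言学", width=3):
    print(left, "[" + hit + "]", right)
```

A search of this kind only finds the strings you ask for. Deciding where
words begin and end in running text would still need a tokenizer or a
segmenter.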

With best regards,

Kiril

-----------------------------------------------------------------
Kiril Simov
BulTreeBank Project
Linguistic Modelling Laboratory, CLPP,
Bulgarian Academy of Sciences
Acad. G.Bonchev St. 25A
1113 Sofia, Bulgaria
E-mail: kivs at bultreebank.org
Web: http://www.bultreebank.org/
-----------------------------------------------------------------
----- Original Message -----
From: "Linda Lin" <eclindal at polyu.edu.hk>
To: "Josephine Lo" <ENJOSELO at cityu.edu.hk>; <CORPORA at HD.UIB.NO>
Cc: "john flowerdew" <ENJOHNF at cityu.edu.hk>
Sent: Monday, October 07, 2002 12:15 PM
Subject: [Corpora-List] Re. Concordancer for Chinese (Summary of reply)


> Dear All
>
> Thanks for your information about concordancers for Chinese. I have a
> question regarding the use of these concordancers. Do the recommended
> concordancers, such as MonoConc Pro, recognize only individual
> characters, or can they in fact process actual "words", i.e. strings of
> characters?
>
>
> Thanks.
>
> Linda
>
> ----- Original Message -----
> From: Josephine Lo <ENJOSELO at cityu.edu.hk>
> To: <CORPORA at HD.UIB.NO>
> Sent: Wednesday, October 02, 2002 10:01 AM
> Subject: [Corpora-List] Concordancer for Chinese (Summary of reply)
>
>
> Some time ago I asked for recommendations on concordancers that work with
> Chinese characters. Thanks for the responses from the following linguists:
>
> Michael Barlow:
> MonoConc Pro should work if you are using Chinese Windows. You
> would have to use the regex search option in advanced search due to
> the lack of spaces. You should try the demo at athel.com
>
> Lou Burnard:
> Any concordancer should be able to work with Chinese characters, but it
> depends rather on how the characters are encoded.
>
> We are working on a version of Sara which is able to operate on Unicode,
> and have been testing it against a Chinese file, which seems to work OK.
>
> Rafal L. Górski:
> Try ConcApp http://vlc.polyu.edu.hk/PUB/concapp/. It is freeware.
>
> Antoinette Renouf
>
> Simon G. J. Smith:
> The CKIP corpus, which you can link to from www.sinica.edu.tw, is
> web-based and lets you do concordances. This is not, however, software
> that can be used to process your own texts.
>
> Scott Piao:
> I have put a downloadable Java tool, including a multilingual
> concordancer, on this webpage:
> http://www.lancs.ac.uk/staff/piaosl/research/download/download.htm
>
> It has a graphical interface and is easy to use. To run this tool,
> you'll need to install the Java Runtime Environment (JRE) first.
>
>
>
>
>


