Corpora: CL at Russian Congress
MOCKBA
rykov at narod.ru
Tue Apr 10 07:01:09 UTC 2001
Hello! There was Russian Language Congress in Moscow about a month ago. Maybe someone will be interested to read a pair of lines about it.
I made a little report about corpus design basic principles- mainly to divert attention to the problem and to inspire discussion. I almost managed to do it - especially that my report was the only one at the Congress. Anatoly Shaikevich (he has a paper printed in IJCL) supported it, saying the best designed corpus he knows is the Czech one. So - I am grateful Dr Chermak for the peper about it sent me a month ago. the latter corpus - the better design. I like also CoSIH design which description Shlomo Izre'el was so kind to send me.
Still - CL rays lighted the audience. There was report about Russian verb "moch' " . It has two meanings - "may" and "can' (usual headache for Russians speaking English). And the report was about diachronic shift in these two meanings usage. And afterwards the first question was - which corpus you used to prove your investigation!
There was an interesting presentation of ENOURMOUS number of Russian PC readable texts - classics - anything. The guy was from our Northern Republic - homeland of a famous US scientist Pitirim Sorokin - former idol of leftist movement.
--
Vladimir Rykov, PhD in Comp Linguistics, Linguistic Institute RAS
More information about the Corpora
mailing list