6.763, FYI: Dutch newspaper corpus, Terminology management system

The Linguist List linguist at tam2000.tamu.edu
Sat Jun 3 05:12:38 UTC 1995


----------------------------------------------------------------------
LINGUIST List:  Vol-6-763. Sat 03 Jun 1995. ISSN: 1068-4875. Lines: 118
 
Subject: 6.763, FYI: Dutch newspaper corpus, Terminology management system
 
Moderators: Anthony Rodrigues Aristar: Texas A&M U. <aristar at tam2000.tamu.edu>
            Helen Dry: Eastern Michigan U. <hdry at emunix.emich.edu>
 
Assoc. Editor: Ljuba Veselinova <lveselin at emunix.emich.edu>
Asst. Editors: Ron Reck <rreck at emunix.emich.edu>
               Ann Dizdar <dizdar at tam2000.tamu.edu>
               Annemarie Valdez <avaldez at emunix.emich.edu>
 
-------------------------Directory-------------------------------------
 
1)
Date: Wed, 24 May 1995 10:41:45 +0100 (MET)
From: ROB at rulxho.LeidenUniv.nl
Subject: Announcement INL 27 Million Words Dutch Newspaper Corpus 1995
 
2)
Date:          Mon, 8 May 1995 10:28:41 +0000
From: "Milde Jordaan-Weiss 012 314-6165" (VT13 at acts2.pwv.gov.za)
Subject:       REQUEST FOR INFORMATION
 
-------------------------Messages--------------------------------------
1)
Date: Wed, 24 May 1995 10:41:45 +0100 (MET)
From: ROB at rulxho.LeidenUniv.nl
Subject: Announcement INL 27 Million Words Dutch Newspaper Corpus 1995
 
INSTITUUT VOOR NEDERLANDSE LEXICOLOGIE
 
 
On-line access to 27 million Words Dutch Newspaper Corpus for
non-commercial purposes.
 
The Institute for Dutch Lexicology INL offers you the possibility to
consult a text corpus of over 27 million words of Dutch newspaper text, by
the international computer network. In 1994, a 5 Million Words Corpus with
diversified composition has been made accessible in a similar way.
 
The retrieval system is essentially the same as that for the 5 Million
Words Corpus 1994. It allows you to search for single words or for word
patterns, including some predefined syntactic patterns that can be changed
by the user. Searches concern the levels of word form, part of speech
(POS), and head word, both separately and in combination by use of Boolean
operators and proximity searches. During the search, data concerning
frequency and distribution over the texts are provided at several levels.
The output most often is a list of items, or a series of concordances
(words in context) with a variable, user-defined textual context. Sorting
facilities may support your analysis of the output data. With some
limitations due to copyright, the output of your searches can be transfered
to your own computer by e-mail. It is not allowed to transfer complete
texts or substantial text parts.
 
Most of the data has not been corrected, neither on the level of the text,
nor on the level of POS and headword. POS and headword have automatically
been assigned to the word forms in the electronic text by lingware
developed at the INL.
 
The provider of the texts has given permission for use of the materials for(B
 
Please note that for an optimal use of the retrieval system, the use of a
VT 220 (or higher) terminal, or an appropriate terminal-emulator (e.g.
Kermit) is recommended.
 
In order to get access to this corpus, an individual user agreement has to
be signed. An electronic user agreement form can be obtained from our
mailserver Mailserv at Rulxho.Leidenuniv.NL. Type in the body of your e-mail
message: SEND [27MLN95]AGREEMNT.USE. For access to the 5 Million Words
Corpus 1994, a separate user agreement is required, which can be obtained
from the same mailserver, by the message SEND [5MLN94]AGREEMNT.USE .
 
Please make a hard copy of the agreement form, sign it, keep a copy
yourself, and return a signed copy to: Institute for Dutch Lexicology INL,
P.O. Box 9515, 2300 RA Leiden. Fax: 31 71 27 2115.
 
After receipt of the signed user agreement, you will be informed about your
username and password.
 
If you need additional information, please send an e-mail message to
Helpdesk at Rulxho.Leidenuniv.NL, or send a fax to Mrs. dr. J.G. Kruyt.
 
--------------------------------------------------------------------------
2)
Date:          Mon, 8 May 1995 10:28:41 +0000
From: "Milde Jordaan-Weiss 012 314-6165" (VT13 at acts2.pwv.gov.za)
Subject:       REQUEST FOR INFORMATION
 
Dear colleague
 
The National Terminology Services (NTS) of South Africa is looking
for a Terminology Management System which will be able to accommodate
all 11 official languages. Some of the African Languages have special
diacritics not yet available in commercial software.
 
Please pass this on to anyone who might be interested. The rather
bulky USER REQUIREMENT SPECIFICATION will be e-mailed to interested
parties as soon as they request it.
 
Please take note of the closing date of 29 May.
 
We appreciate your help in this matter.
 
Yours sincerely
 
Ms Milde Jordaan-Weiss
 
Ms Milde Jordaan-Weiss
National Terminology Services
Department of Arts, Culture, Science and Technology
Private Bag X894
0001 PRETORIA
REPUBLIC OF SOUTH AFRICA
 
Tel +27 12 314-6165{Fax +27 12 325-4943
 
--------------------------------------------------------------------------
LINGUIST List: Vol-6-763.



More information about the LINGUIST mailing list