Indonesian corpus

Brian MacWhinney macw at cmu.edu
Fri Apr 25 20:03:12 UTC 2008


Margaret,
     There are  2417587 word tokens and 28900 word types in the  
Indonesian corpus.
The command is
freq +d4 +u +re *.cha

-- Brian MacWhinney

On Apr 25, 2008, at 12:36 AM, Margaret Fleck wrote:

>
> Can you give a ballpark estimate for the number of words in the
> transcriptions?   (This information is useful for those of us doing
> computational algorithms.)
>
> Margaret
>
> Uri Tadmor <uritadmor at gmail.com> wrote:
>
> I'd just like to add that in addition to a transcription in
> conventional (romanized) orthography, each utterance is also
> phonetically transcribed, glossed, and translated into English. So
> you don't need any prior familiarity with Indonesian in order to use
> the database. Each of the 8 children was recorded at average 10-day
> intervals for 2 to 4 years, so the database is ideal for longitudinal
> studies. I hope you have as much fun working with it as we've had
> compiling it. Thanks so much, Brian, for your help and encouragement
> which made it possible for us to post the database on CHILDES.
>
> If you have any question or comment about the Indonesian child
> language database, please feel free to contact me at
>
> uri at cbn.net.id
>
> Uri Tadmor
>
> On Apr 14, 3:07 am, Brian MacWhinney wrote:
> > Dear Info-CHILDES,
> > I am happy to announce the addition to CHILDES of a very large
> > corpus of data on the acquisition of Jakarta Indonesian  
> contributed by
> > David GIl and Uri Tadmor of the MPI-EVA in Leipzig. The study tracks
> > eight children with an age range, varying by child, from 1;6 up to
> > 8;9. This is the first corpus from an Austronesian language and its
> > addition to CHILDES is most welcome. The readme file is attached.
> >
> > --Brian MacWhinney
> >
> > jakarta.pdf
> > 146KDownload
> >
> >
> Info-CHILDES members:
>
>
>
>
> >


--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups "Info-CHILDES" group.
To post to this group, send email to info-childes at googlegroups.com
To unsubscribe from this group, send email to info-childes-unsubscribe at googlegroups.com
For more options, visit this group at http://groups.google.com/group/info-childes?hl=en
-~----------~----~----~----~------~----~------~--~---

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/info-childes/attachments/20080425/b338cd72/attachment.htm>


More information about the Info-childes mailing list