[RNLD] ELAN tiers and types

Aidan Wilson aidan.wilson at UNIMELB.EDU.AU
Wed Sep 12 22:33:47 UTC 2012


Thanks Andrea, and Tom, Jeremy, John, Sally and Joe for your advice and your 
template files. I think the best way forward for me now is to try to mamnually 
edit my eaf files and nest the tiers, as creating new tiers and trying to 
re-enter the annotations I think will be too long and painful.

-- 
Aidan Wilson

School of Languages and Linguistics
The University of Melbourne

+61428 458 969
aidan.wilson at unimelb.edu.au

On Thu, 13 Sep 2012, Andrea L. Berez wrote:

> Hi All,
> 
> Entering the conversation a bit late because of time zones, but I have some ELAN training materials that anyone is welcome to download and use.
> They (attempt to) explain the relationship between types and tiers and have exercises for building files with increasingly-complex tier structures.
> The materials are a few years old and based on an earlier version of a ELAN, but the basics have not changed.
> 
> https://www.sugarsync.com/pf/D7795324_2102134_63719
> 
> HTH,Andrea
> --
> Andrea L. Berez
> Assistant Professor, Department of Linguistics
> University of Hawai'i at MānoaDirector, Kaipuleohone UH Digital Ethnographic Archive
> Technology reviews editor, Language Documentation & Conservation
> http://www2.hawaii.edu/~aberez
> 
> 
> 
> On Wed, Sep 12, 2012 at 12:57 AM, Honeyman Tom <t.honeyman at gmail.com> wrote:
>       Hi Aidan,
> 
> The short answer is Nick Thieberger recently posted to this list a collection of templates produced by Andrea Berez:
> 
> http://www.rnld.org/software
> 
> or
> 
> http://paradisec.org.au/elansampletemplates.zip
> 
> You could use these and modify them.
> 
> Alternatively, I create tiers in the following way:
> 
> First, here is an etf template file that I use to start a new transcription:
> 
> https://www.sugarsync.com/pf/D138718_8778877_6506503
> 
> To change it for your needs, open it in a text editor and search for all instances of "Tom" and replace with the name of the person entering
> the data (e.g you). Replace "XXX" with the name of the language you are transcribing, and "YYY" with language of your translation. Replace
> "English" with the language you are making utterance level comments in, if it isn't English. Save the file.
> 
> Now replace all instance of "Unknown" with the name of the person you are transcribing. Save a copy of the file in a templates folder
> somewhere and rename it with the speaker's name. Repeat the process for each speaker you are transcribing. In this way I amass a template
> file for each speaker that I have transcribed, and so when I want to start a new transcription I simply import the all the relevant template
> files in with my audio file and hey presto away I go.
> 
> This template provides per speaker tiers for transcription, translation and comments. The transcription is time aligned, while the
> translation and comments are nested underneath. There is also a single tier for time-aligned comments (i.e. general comments that don't align
> with speakers' utterances).
> 
> I use tier names that are compatible with exporting to toolbox. "tx" is from transcriptions, "ft" is for translations, "cm" is for comments.
> The @ + participant name bit at the end of each tier name is a trick to aid exporting to toolbox. If you're not exporting to toolbox, then
> rename these as you like (e.g. make them more verbose), but remember you can't have separate tiers for each speaker that have the same name,
> so it's best to use a similar strategy of including the speaker's name in the tier name.
> 
> Naming and establishing tiers and types consistently across a corpus is really important once the corpus grows. It will allow you to search
> across the corpus and to narrow your searches to just one tier, or type, using the extremely powerful "Structured Search Multiple EAF". Say
> for instance you wanted to limit your search to a particular speaker, you would search for values within a particular tier across multiple
> files. Say you wanted to search in your translations for all speakers, then you would search within the type "YYY translation" (where YYY is
> the language you entered). Always establishing tiers and types using templates will really help to ensure the consistency which will enable
> these kinds of searches.
> 
> For your particular needs I would create two more tiers using the same per-speaker naming strategy, and create separate types for those tiers
> as well, both using the stereotype "symbolic association" (this is really important for the nesting). Then save the file as a template, and
> create all the individual versions for each speaker.
> 
> I translate in both Tok Pisin and English, so I have separate tiers with separate types for both of those. I also have a "tidied text" tier
> which is less representative of the original, but, for instance replaces accidental code-switching (at my consultants' request), and tidies
> up in other ways. I have a close phonetic transcription tier for when I transcribe word lists, again with a separate type. I only mention
> these extra tiers because it really helps to add them from the start, as going back and adding them afterwards is a pain.
> 
> Hope this helps,
> 
> Cheers,
> Tom
> 
> 
> On 12/09/2012, at 6:06 PM, Aidan Wilson <aidan.wilson at unimelb.edu.au> wrote:
>
>       Hi all,
>
>       I'm having immense trouble with a collection of transcripts I'm building at the moment. I have never been successful in creating
>       hierarchies of tiers. Ideally, I want to have a hierarchy like this:
>
>       -[initials]
>       –transcription
>       -morpheme gloss
>       -free translation
>       -action
>
>       for each participant. In reading through the manuals and so forth, I've created linguistic types like 'group' (for the outermost
>       parent tier), transcription, morpheme gloss and so on, but what's happening when I create my tiers is that by trying to select a
>       parent tier, the 'linguistic type' pull-down menu disappears and the 'add' button goes grey. It only allow me to create tiers (or
>       change existing tiers) if I don't plan on nesting them, it seems.
>
>       Has someone got a link to instructions, or even better, a working template than I can reverse-engineer?
>
>       What I'm hoping to do is go through my transcripts with a text editor and manually nest them by editing the xml (unless I can
>       figure out how to retrospectively nest them), but I need to know the structure of the eaf file.
>
>       This is bringing me to tears, as it were.
>
>       --
>       Aidan Wilson
>
>       Dept of Linguistics and Applied Linguistics
>       The University of Melbourne
>
>       +61428 458 969
>       aidan.wilson at unimelb.edu.au
> 
> 
> 
> 
>


More information about the Resource-network-linguistic-diversity mailing list