[Lingtyp] Transcribing interjections
    Ekaterina Georgieva 
    ekaterina.georgieva at nytud.hu
       
    Tue Feb 22 10:59:27 UTC 2022
    
    
  
Dear Eline (and everyone interested),
I did annotate discourse and extralinguistic elements, such as  
hesitation, breathing, laughter, lipsmack, etc.,
in the ELAN transcription (no separate tier) of Udmurt (Uralic)  
recordings that were later published in a sound-aligned corpus  
(http://udmurt.web-corpora.net/index_en.html). For example, <H:xx> was  
used for filled hesitations (where xx stands for the actual phonetic  
realization of the hesitation). I paid less attention to the actual  
realization (for example, "aa" or "aaa").
The full list of conventions are listed in this paper:
Arkhangelskiy, Timofey & Ekaterina Georgieva. 2018. Sound-aligned  
corpus of Udmurt dialectal texts. In: Pirinen, Tommi A. (ed.),  
Proceedings of the 4th International Workshop for Computational  
Linguistics for Uralic Languages (IWCLUL 2018), 26–38.  
Stroudsburg(PA): Association for Computational Linguistics.
https://aclanthology.org/W18-0203.pdf
I remember that I borrowed some of them from the description of a  
corpus building project of German, but unfortunately, I can't find the  
references now.
I hope this helps.
Best regards,
Ekaterina
Quoting Eline Visser <eelienu at protonmail.com>:
> Hi,
>
> I'd like to get in contact with fieldworkers who have annotated or  
> are currently annotating their language corpus for interjections.  
> I'm specifically interested in the following issues:
>
> - How did you transcribe interjections technically? For example, did  
> you use a separate tier in ELAN, did you use FLEx?
> - How did you go about pairing forms and functions? There are many  
> ways to pronounce something like 'eh' or 'hmm'. How fast could you  
> adopt a normalised spelling? (I.e. establish which variants are  
> actually contrastive and which are not.)
> - How easy/problematic was it for you to pair forms to functions?
>
> I'm in the process of doing this for a part of my Kalamang corpus,  
> and these are some of the issues I have encountered so far, so I'd  
> like to exchange thoughts.
>
> You can contact me directly, or reply to the list, whatever you prefer.
>
> Eline
    
    
More information about the Lingtyp
mailing list