[Corpora-List] Phrase extraction
    Antti Arppe 
    aarppe at ling.helsinki.fi
       
    Tue Oct 25 09:32:49 UTC 2005
    
    
  
On Mon, 24 Oct 2005, Helge Thomas Karset Hellerud wrote:
> PoS (Part of Speech) tagging is often used to extract phrases from text
> (like Noun Phrases). But that approach assumes you have a PoS tagger
> available. My document collection is in Norwegian, but I don't have a
> Norwegian tagger.
>
> 1) Is there a way to create a simple PoS tagger to recognize verbs,
> nouns and adjectives (in Norwegian)?
Before creating your own tagger, have you or your department 
considered getting/licensing Multitagger (a PoS tagger for Norwegian 
created by the Universitetet i Oslo / Textlaboratoriet / Janne Bondi 
Johannessen) or an academic version of Connexor's dependency parser 
(Machinese) for Norwegian?
 	-Antti Arppe
    
    
More information about the Corpora
mailing list