pos-errors.cut file

Brian MacWhinney macw at cmu.edu
Mon Oct 9 22:29:24 UTC 2006


Dear Anna,
      Typically pos-errors.cut is empty.  It only shows errors if
there is some major mismatch of categories between MOR and POST.  For
example, if you train POST on one set of categories and then redo MOR
with changed categories without retraining with POSTTRAIN, you could
get errors.  But usually it is empty.  The file that more often
contains errors is the errors file produced by POSTTRAIN.  Again, these
errors can arise when the MOR grammar changes over time and the %trn
lines in the training data files are not updated to match.
    One way of evaluating POST is to take a file with gold-standard
codes on the %trn line, run MOR and POST on it, and then use the
TRNFIX program to detect disagreements between the %mor and %trn lines.
However, if %trn and %mor are both wrong in the same way, those
errors will not be caught.  Also, %trn itself is sometimes wrong,
although in theory it never should be.  You can run FREQ on the files
output by TRNFIX to count the deviations, and then divide that count by
the number of words in the file.  That gives an error rate, and one
minus the error rate is something close to accuracy, subject to the
two exceptions noted above.
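
    To make the arithmetic concrete, here is a rough sketch in Python.
It is not TRNFIX or FREQ and does not reproduce their output; it only
illustrates the same idea of counting disagreements and dividing by the
number of tokens.  It assumes each %mor and %trn tier sits on a single
line, that tokens are separated by whitespace, and that the two tiers
line up position by position.  The function names and the sample data
are made up for illustration.

```python
def tier_tokens(line):
    """Return the whitespace-separated tokens after the tier label (e.g. '%mor:')."""
    _, _, body = line.partition(":")
    return body.split()


def count_disagreements(chat_text):
    """Return (mismatches, total_tokens) over all utterances with both tiers.

    Assumption: %mor and %trn are aligned token by token, so a mismatch is
    simply a position where the two tokens differ. Extra or missing tokens
    on either tier are counted as mismatches too.
    """
    mismatches = 0
    total = 0
    tiers = {}
    # A sentinel main line at the end flushes the last utterance.
    for line in chat_text.splitlines() + ["*END:"]:
        if line.startswith("*"):
            if "%mor" in tiers and "%trn" in tiers:
                mor, trn = tiers["%mor"], tiers["%trn"]
                total += max(len(mor), len(trn))
                mismatches += sum(m != t for m, t in zip(mor, trn))
                mismatches += abs(len(mor) - len(trn))
            tiers = {}
        elif line.startswith(("%mor:", "%trn:")):
            tiers[line[:4]] = tier_tokens(line)
    return mismatches, total


def approximate_accuracy(mismatches, total):
    """One minus the error rate; 0.0 if there is nothing to compare."""
    return 1.0 - mismatches / total if total else 0.0


# Made-up sample: the second utterance has one disagreement (n|run vs v|run).
sample = "\n".join([
    "*CHI:\tI want cookie .",
    "%mor:\tpro|I v|want n|cookie .",
    "%trn:\tpro|I v|want n|cookie .",
    "*CHI:\tthat run .",
    "%mor:\tpro|that n|run .",
    "%trn:\tpro|that v|run .",
])

if __name__ == "__main__":
    wrong, n = count_disagreements(sample)
    print(wrong, n, round(approximate_accuracy(wrong, n), 3))
```

Note that this sketch counts the final punctuation mark as a token, so
the denominator is slightly larger than a pure word count.  As said
above, the figure is only as good as the %trn line: errors shared by
%trn and %mor are invisible to it.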

--Brian MacWhinney

On Oct 9, 2006, at 7:28 AM, Anna Gudmundson wrote:

>
> Dear info chibolts,
>
> I just wonder what kind of information is supposed to show up in  
> the pos-errors.cut file, i.e.
> the file that results from running the POST command. I ask the  
> question because every time I
> run POST, nothing is written to that file, i.e. it always turns out  
> to be empty even though I
> know there should be several errors. What kind of errors should be  
> reported in that file, and
> if there are no errors, does the program still output something  
> like “no errors found”?
>
> I also have a second question that concerns the evaluation of the  
> POST program. Is there a
> way to calculate some kind of accuracy rate, like for example  
> precision and recall. I have a
> vague memory of having seen a percentage value in some output file  
> (could it have been in
> the pos-error file maybe?)
>
> I would be so grateful if someone could help me.
>
> Thanks in advance
> Anna Gudmundson


