[Corpora-List] Email address identification
Ron Artstein
artstein at essex.ac.uk
Fri Jul 20 17:15:13 UTC 2007
Asked on behalf of a sociologist colleague:
Does anyone know of a tool that runs through a corpus of email
correspondence and identifies different email addresses which
belong to the same person?
Obviously this can't be done with 100% certainty, and the ideal
tool for my colleague's purpose should err on the side of unifying
addresses. For example, if the tool is uncertain whether the
following two people are the same:
Andy Smith <a.smith at company.com>
"Smith, A. C." <acs37 at service.net>
then it should assume the two email addresses belong to the same
person (until there is evidence to the contrary).
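
To make the desired behaviour concrete, here is a minimal Python sketch of
such a "merge when in doubt" policy. It is an illustration only, not an
existing tool. The function names (name_key, maybe_same_person, unify) are
hypothetical, and so are the matching rules: same surname plus
non-conflicting first initials is my assumption about what could count as
"uncertain". The archive's " at " is written as "@" in the sample data, and
the third entry is invented for contrast.

```python
def name_key(display_name):
    """Return (surname, first_initial), lowercased, from a display name.

    Handles "First Last" and "Last, F. M." forms. Hypothetical heuristic.
    """
    name = display_name.strip().strip('"')
    if "," in name:
        surname, _, given = name.partition(",")
    else:
        parts = name.split()
        if not parts:
            return ("", "")
        surname, given = parts[-1], " ".join(parts[:-1])
    surname = surname.strip().lower()
    given = given.strip().lower()
    return surname, (given[0] if given else "")


def maybe_same_person(name_a, name_b):
    """Liberal test: surnames match and first initials do not conflict."""
    (s1, i1), (s2, i2) = name_key(name_a), name_key(name_b)
    if not s1 or s1 != s2:
        return False
    # A missing initial is not evidence to the contrary, so it matches.
    return not i1 or not i2 or i1 == i2


def unify(entries):
    """Cluster (display_name, address) pairs into sets of addresses."""
    parent = list(range(len(entries)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(entries)):
        for j in range(i + 1, len(entries)):
            same_addr = entries[i][1].lower() == entries[j][1].lower()
            if same_addr or maybe_same_person(entries[i][0], entries[j][0]):
                parent[find(j)] = find(i)

    groups = {}
    for i, (_, addr) in enumerate(entries):
        groups.setdefault(find(i), set()).add(addr.lower())
    return list(groups.values())


entries = [
    ("Andy Smith", "a.smith@company.com"),
    ('"Smith, A. C."', "acs37@service.net"),
    ("Bob Jones", "bob@elsewhere.org"),  # invented entry for contrast
]
for group in unify(entries):
    print(sorted(group))
```

In this sketch the two Smith addresses end up in one group because the
surnames agree and both first initials are "a". A conflicting initial (say
"Bob Smith") would count as evidence to the contrary and keep the addresses
apart. Merging is transitive, which pushes further towards unification.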
Thanks, -Ron.
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora