[Corpora-List] Source code corpora
Darren Pearce
darren.pearce at gmail.com
Thu Nov 20 17:50:52 UTC 2008
Not forgetting Google Project Hosting as well (
http://code.google.com/hosting/). :-)
On Thu, Nov 20, 2008 at 5:09 PM, Alexandre Rafalovitch
<arafalov at gmail.com>wrote:
> Wouldn't any source code repository with version control system give
> you that automatically? They all tell you exactly which code was
> contributed and by whom.
>
> E.g. SourceForge, Apache or Linux Kernel collections.
>
> http://www.koders.com/ might be a good way to search, if you are
> trying to narrow down to a particular area.
>
> Regards,
> Alex.
> Personal blog: http://blog.outerthoughts.com/
> Research group: http://www.clt.mq.edu.au/Research/
>
>
>
> On Thu, Nov 20, 2008 at 1:28 AM, <sdb at cs.rmit.edu.au> wrote:
> > Dear colleages,
> >
> > My research relates to authorship attribution of source code (that is,
> > determining the owner of anonymous work samples based upon other work
> > samples where authors are known).
> >
> > I'm looking for recommendations for source code corpora for this task
> > for any programming language. For the corpora to be useful, authorship
> > has to be identified.
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
--
----------------------------------------------------------------------
:Darren :Pearce
----------------------------------------------------------------------
*** Shop & Donate: http://buy.at/campuskids ***
----------------------------------------------------------------------
darrenp at dcs.bbk.ac.uk
Postdoctoral Researcher
London Knowledge Lab, University of London
----------------------------------------------------------------------
darrenp at sussex.ac.uk
Visiting Research Fellow
Informatics, University of Sussex
http://www.informatics.sussex.ac.uk/users/darrenp/
----------------------------------------------------------------------
darren.pearce at gmail.com
http://www.linkedin.com/in/darrenpearce
----------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20081120/1d59b4dd/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list