I would suggest the Ulm corpus, by Erhard Mergenthaler: Mergenthaler, Erhard. Textbank systems : computer science applied in the field of psychoanalysis / Erhard Mergenthaler ; [translator, Michael Wilson]. Berlin ; New York : Springer-Verlag, c1985. Series title: Lecture notes in medical informatics ; 27. -Jane Edwards