Humanist Discussion Group, Vol. 36, No. 17.
Department of Digital Humanities, University of Cologne
Hosted by DH-Cologne
www.dhhumanist.org
Submit to: humanist@dhhumanist.org

Date: 2022-05-16 13:46:31+00:00
From: Gabor Toth <gabor.toth@maximilianeum.de>
Subject: Re: [Humanist] 36.12: working with unnormalised historical texts

Dear Crystal,

Many thanks for your detailed answer, and congratulations on working all this out; it sounds great.

Could you please explain what you mean by the following two points?

1. "I hand coded the entries for part of speech using a simplified Penn Tree Bank system, marked known irregulars for hand processing, and built out the other possible variations algorithmically."

I understand the hand-coding part, but I am not sure about the steps that follow.

2. "The result was a first draft of over 3 million forms"

That sounds like a very big number. By "forms", do you mean types?

Cheers,
Gabor

_______________________________________________
Unsubscribe at: http://dhhumanist.org/Restricted
List posts to: humanist@dhhumanist.org
List info and archives at: http://dhhumanist.org
Listmember interface at: http://dhhumanist.org/Restricted/
Subscribe at: http://dhhumanist.org/membership_form.php