<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">Dear Colleagues,<br><br>We are happy to announce the release of the CzeSL-SGT corpus ("Czech as a Second Language with Spelling, Grammar and Tags"). The 1 mil. corpus includes 8,617 short essays, written by 1,965 foreign students of Czech with 54 different first languages. <br><br>Most texts are equipped with metadata about the author and the text (30 items). Word forms are tagged by word class, morphological categories and lemmas. Some forms are corrected by an automatic proofreader and the resulting texts are tagged again. Original and corrected forms are compared and error labels assigned. All the annotation is done automatically. <br><br>The corpus is available for on-line searching using a web interface (<a href="https://kontext.korpus.cz/run.cgi/first?corpname=czesl-sgt">https://kontext.korpus.cz/run.cgi/first?corpname=czesl-sgt</a>) and for download as the entire data set (<a href="http://hdl.handle.net/11858/00-097C-0000-0023-95B1-E">http://hdl.handle.net/11858/00-097C-0000-0023-95B1-E</a>). See <a href="http://utkl.ff.cuni.cz/learncorp/">http://utkl.ff.cuni.cz/learncorp/</a> for more details and links.<br><br>Please let us know about any issues. We'll be happy to answer questions and grateful for any comments.<br><br>On behalf of the team<br><br>Alexandr Rosen, Charles University, Prague<br></body></html>