AUTOMATIC CONTEXTUAL TEXT CORRECTION USING THE LINGUISTIC HABITS GRAPH LHG
DOI:
https://doi.org/10.7494/csci.2009.10.3.37Keywords:
automatic text correction, graph LHGAbstract
Automatic text correction is an essential problem of today text processors and editors. Thispaper introduces a novel algorithm for automation of contextual text correction using a LinguisticHabit Graph (LHG) also introduced in this paper. A specialist internet crawler hasbeen constructed for searching through web sites in order to build a Linguistic Habit Graphafter text corpuses gathered in polish web sites. The achieved correction results on a basis ofthis algorithm using this LHG were compared with commercial programs which also enableto make text correction: Microsoft Word 2007, Open Office Writer 3.0 and search engineGoogle. The achieved results of text correction were much better than correction made bythese commercial tools.Downloads
References
Mykowiecka A.: Inzynieria lingwistyczna. Komputerowe przetwarzanie tekstów w jezyku naturalnym. Wydawnictwo Polsko-Japonskiej Wyzszej Szkoły Technik Komputerowych, 2007
Miró J., Rosselló F.: Czy w Unii Europejskiej mówiono po polsku?. Magazyn Delta, 05, 2004
Gawrysiak P.: Modelowanie jezyka. Politechnika Warszawska, 2006
Statistical Inference: n-gram Models over Sparse Data: http://mi007.wikispaces.com/file/view/rozdzial6.pdf, 2009
Debowski Ł.: Prawo Zipfa – próby objasnien. Instytut Podstaw Informatyki PAN, 2005
Microsoft Office Word 2007 2009: Opis programu Word.
http://office.microsoft.com/pl-pl/word/HA101650321045.aspx
OpenOffice.org Writer 2009: Opis programu. http://pl.openoffice.org/
Marciniak M.: MS Office kontra OpenOffice. PC Word 2000
KGLK Krakowska Grupa Lingwistyki Komputerowej: Słownik Frekwencyjny Jezyka Polskiego, 2009