TEXT SUMMARIZING IN POLISH
DOI:
https://doi.org/10.7494/csci.2005.7.4.31Keywords:
natural language processing, text summarizingAbstract
The aim of this article is to describe an existing implementation of a text summarizer forPolish, to analyze the results and propose the possibilities of further development. Theproblem of text summarizing has been already addressed by science but until now there hasbeen no implementation designed for Polish. The implemented algorithm is based on existingdevelopments in the field but it also includes some improvements. It has been optimized fornewspaper texts ranging from approx. 10 to 50 sentences. Evaluation has shown that it worksbetter than known generic summarization tools when applied to Polish.Downloads
References
”summarize” (entry) in Merriam-Webster Online Thesaurus, 15 Jun 2005, http://www.m-w.com/cgi-bin/thesaurus
Van Dijk T. A.: Some Aspects of Text Grammars. A Study in Theoretical Linguistics and Poetics, Mouton, The Hague, 1972
Dalianis H., Hassel M., Smedt de K., Liseth A., Lech T.C., Wedekind J.: Porting and evaluation of automatic summarization. In Holmboe H. (ed.), Nordisk Sprogteknologi 2003. Arbog for Nordisk Sprakteknologisk, Forskningsprogram
–2004, pp. 107–121.
Dalianis H., Hassel M., Wedekind J., Haltrup D., Smedt de K., Lech T. C.: Automatic text summarization for the Scandinavian languages. In Holmboe H. (ed.), Nordisk Sprogteknologi, 2002. Arbog for Nordisk Sprakteknologisk Forskningsprogram 2000–2004, pp. 153–163.
Mazdak N.: FarsiSum – a Persian text summarizer. Master thesis, Department of Linguistics, Stockholm University, 2004
Pachantouris G.: GreekSum – A Greek Text Summarizer. Master Thesis, Department of Computer and Systems Sciences, KTH – Stockholm University 2005
Lin C. Y.: Training a Selection Function for Extraction. In the 8th International Conference on Information and Knowledge Management (CIKM 99), Kansa City, Missouri, 1999
Luhn H. P.: The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development, 1959, pp. 159–165
Edmundson H. P.: New Methods in Automatic Extraction. Journal of the ACM 16(2), 1969, pp. 264–285.
Hassel M.: Evaluation of automatic text summarization - a practical implementation. Licentiate thesis Stockholm, NADA-KTH, 2004
Dalianis H.: SweSum - A Text Summarizer for Swedish. http://www.dsv.su.se/%7Ehercules/papers/Textsumsummary.html, 2000.
Dalianis H.: Aggregation in Natural Language Generation. Journal of Computational Intelligence, Vol. 15, No. 4, 1999, pp. 384–414.
Smedt de K., Liseth A., Hassel M., Dalianis H.: How short is good? An evaluation of automatic summarization. In Holmboe, H. (ed.) Nordisk Sprogteknologi 2004, pp. 267-287
Gajecki M.: Serwer lekskalny jezyka polskiego. Computer Science, Rocznik AGH, 2001