TEXT SUMMARIZING IN POLISH

Emilia Branny, Marek Gajęcki

Abstract


The aim of this article is to describe an existing implementation of a text summarizer forPolish, to analyze the results and propose the possibilities of further development. Theproblem of text summarizing has been already addressed by science but until now there hasbeen no implementation designed for Polish. The implemented algorithm is based on existingdevelopments in the field but it also includes some improvements. It has been optimized fornewspaper texts ranging from approx. 10 to 50 sentences. Evaluation has shown that it worksbetter than known generic summarization tools when applied to Polish.

Keywords


natural language processing; text summarizing

Full Text:

PDF

References


”summarize” (entry) in Merriam-Webster Online Thesaurus, 15 Jun 2005, http://www.m-w.com/cgi-bin/thesaurus

Van Dijk T. A.: Some Aspects of Text Grammars. A Study in Theoretical Linguistics and Poetics, Mouton, The Hague, 1972

Dalianis H., Hassel M., Smedt de K., Liseth A., Lech T.C., Wedekind J.: Porting and evaluation of automatic summarization. In Holmboe H. (ed.), Nordisk Sprogteknologi 2003. Arbog for Nordisk Sprakteknologisk, Forskningsprogram

–2004, pp. 107–121.

Dalianis H., Hassel M., Wedekind J., Haltrup D., Smedt de K., Lech T. C.: Automatic text summarization for the Scandinavian languages. In Holmboe H. (ed.), Nordisk Sprogteknologi, 2002. Arbog for Nordisk Sprakteknologisk Forskningsprogram 2000–2004, pp. 153–163.

Mazdak N.: FarsiSum – a Persian text summarizer. Master thesis, Department of Linguistics, Stockholm University, 2004

Pachantouris G.: GreekSum – A Greek Text Summarizer. Master Thesis, Department of Computer and Systems Sciences, KTH – Stockholm University 2005

Lin C. Y.: Training a Selection Function for Extraction. In the 8th International Conference on Information and Knowledge Management (CIKM 99), Kansa City, Missouri, 1999

Luhn H. P.: The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development, 1959, pp. 159–165

Edmundson H. P.: New Methods in Automatic Extraction. Journal of the ACM 16(2), 1969, pp. 264–285.

Hassel M.: Evaluation of automatic text summarization - a practical implementation. Licentiate thesis Stockholm, NADA-KTH, 2004

Dalianis H.: SweSum - A Text Summarizer for Swedish. http://www.dsv.su.se/%7Ehercules/papers/Textsumsummary.html, 2000.

Dalianis H.: Aggregation in Natural Language Generation. Journal of Computational Intelligence, Vol. 15, No. 4, 1999, pp. 384–414.

Smedt de K., Liseth A., Hassel M., Dalianis H.: How short is good? An evaluation of automatic summarization. In Holmboe, H. (ed.) Nordisk Sprogteknologi 2004, pp. 267-287

Gajecki M.: Serwer lekskalny jezyka polskiego. Computer Science, Rocznik AGH, 2001




DOI: https://doi.org/10.7494/csci.2005.7.4.31

Refbacks

  • There are currently no refbacks.