作者
Vishal Gupta, Gurpreet Singh Lehal
发表日期
2012/12
研讨会论文
Proceedings of COLING 2012: Demonstration Papers
页码范围
191-198
简介
Text Summarization is condensing the source text into shorter form and retaining its information content and overall meaning. Punjabi text Summarization system is text extraction based summarization system which is used to summarize the Punjabi text by retaining relevant sentences based on statistical and linguistic features of text. Punjabi text summarization system is available online at website: http://pts. learnpunjabi. org/default. aspx It comprises of two main phases: 1) Pre Processing 2) Processing. Pre Processing is structured representation of original Punjabi text. Pre processing phase includes Punjabi words boundary identification, Punjabi sentences boundary identification, Punjabi stop words elimination, Punjabi language stemmer for nouns and proper names, applying input restrictions and elimination of duplicate sentences. In processing phase, sentence features are calculated and final score of each sentence is determined using feature-weight equation. Top ranked sentences in proper order are selected for final summary. This demo paper concentrates on Automatic Punjabi Text Extractive Summarization System.
引用总数
2013201420152016201720182019202020212022202358221113732
学术搜索中的文章
V Gupta, GS Lehal - Proceedings of COLING 2012: Demonstration Papers, 2012