Exploring Graph Bushy Paths to Improve Statistical Multilingual Automatic Text Summarization
Abstract
Statistical extractive summarization is one of the most exploited approach in automatic text summarization due to its generation speed, implementation easiness and multilingual property. We want to improve statistical sentence scoring by exploring a simple, yet powerful, property of graphs called bushy paths represented by the number of node’s neighbors. A graph of similarities is constructed in order to select candidate sentences. Statistical features such as sentence position, sentence length, term frequency and sentences similarities are used to get a primary score for each candidate sentence. The graph is used again to enhance the primary score by using bushy paths property. Also, we tried to exploit the graph in order to enhance summary’s coherence. We experimented our method using MultiLing’15 workshop’s corpora for multilingual single document summarization. Using graph properties can improve statistical scoring without loosing the multilingualism of the method.
Domains
Computer Science [cs]Origin | Files produced by the author(s) |
---|
Loading...