Multi document summarization based on cross-document relation using voting technique

Bibliographic Details
Main Authors: Kumar, Yogan Jaya; Salim, Naomie; Abuobieda, Albaraa; Tawfik, Ameer
Format: Conference or Workshop Item
Published: 2013
Subjects:
Online Access: http://eprints.utm.my/51184/
Description
Summary: News articles available through online search often present readers with a large collection of texts. For news stories in particular, a reader's search usually returns multiple articles from different news sources reporting on the same event. In this work, we first identify cross-document relations from un-annotated texts using a Genetic-CBR approach. Following that, we develop a new sentence scoring model based on a voting technique over the identified cross-document relations. Our experiments show that incorporating the proposed methods in the summarization process yields substantial improvement over the mainstream methods. The performance of all methods was evaluated using ROUGE, a standard evaluation metric used in text summarization.
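
For readers unfamiliar with ROUGE, the following is a minimal sketch of the ROUGE-N recall idea: the fraction of n-grams in a reference summary that also appear in a candidate summary. It is an illustration only and not the evaluation setup used in the paper. The function names `ngrams` and `rouge_n_recall` are hypothetical, and lowercased whitespace tokenization is an assumption; the standard ROUGE toolkit offers further options such as stemming and stop-word removal.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, reference, n=1):
    """Clipped n-gram overlap divided by the number of reference n-grams.

    Assumption: simple lowercased whitespace tokenization, chosen only
    to keep the sketch short.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Each reference n-gram is matched at most as often as it occurs
    # in the candidate (clipped counts).
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    candidate = "the cat sat"
    print(rouge_n_recall(candidate, reference, n=1))
    print(rouge_n_recall(candidate, reference, n=2))
```

A higher score means the candidate summary covers more of the reference summary's n-grams; comparing such scores across systems is how summarization methods are typically ranked.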