Tensor Term Indexing: An application of HOSVD for Document Summarization

dc.contributor.authorManna, Sukanya
dc.contributor.authorPetres, Zoltan
dc.contributor.authorGedeon, Tamas (Tom)
dc.coverage.spatialEgypt
dc.date.accessioned2015-12-10T22:25:06Z
dc.date.createdOctober 21-25 2009
dc.date.issued2009
dc.date.updated2016-02-24T10:17:42Z
dc.description.abstractIn this paper, a new method for text summarization is proposed by using an extended version of the Tensor Term Importance (TTI) model. This method summarizes documents by extracting important sentences from a document. It improves the per document summarization efficiency by incorporating additional information of the whole document set referring to the same topic (or coherent documents). The basic idea of this approach is to represent the whole document set in a uniform form, in the term-sentence-document tensor, and to use higher-order singular value decomposition (HOSVD) to highlight the important terms in each document. Here, we present two different methods of summarization. In the first method, the sentences having the highly weighted terms are extracted as the important sentences representing the document. The important sentences identified by selecting those that contains more from the important terms. The second model uses a so-called super sentence and uses that to extract other sentences having high similarity with it. Unlike in Latent Semantic Analysis (LSA) where SVD is applied for compressing the sparse term-document matrix and defining latent semantic links between terms, in TTI SVD is used to reduce noise and to highlight the important term-document relations in the document. Our evaluation results show that our TTI based methods are more similar to human generated summaries than other automated summarizers which work on single documents at a time.
dc.identifier.isbn9781424453818
dc.identifier.urihttp://hdl.handle.net/1885/53331
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE Inc)
dc.relation.ispartofseriesInternational Symposium on Computational Intelligence and Intelligent Informatics (ISCII 2009)
dc.sourceProceedings of the 4th International Symposium on Computational Intelligence Informatics (ISCII 2009)
dc.source.urihttp://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?asf_arn=null&asf_iid=null&asf_pun=5339523&asf_in=null&asf_rpp=null&asf_iv=null&asf_sp=null&asf_pn=1
dc.subjectKeywords: Basic idea; Document matrices; Document summarization; Evaluation results; Extended versions; Extracting important sentences; Higher order singular value decomposition; Latent Semantic Analysis; Latent semantics; Term importance; Text summarization; Artif
dc.titleTensor Term Indexing: An application of HOSVD for Document Summarization
dc.typeConference paper
local.bibliographicCitation.lastpage141
local.bibliographicCitation.startpage135
local.contributor.affiliationManna, Sukanya, College of Engineering and Computer Science, ANU
local.contributor.affiliationPetres, Zoltan, College of Engineering and Computer Science, ANU
local.contributor.affiliationGedeon, Tamas (Tom), College of Engineering and Computer Science, ANU
local.contributor.authoruidManna, Sukanya, u4321410
local.contributor.authoruidPetres, Zoltan, a276450
local.contributor.authoruidGedeon, Tamas (Tom), u4088783
local.description.embargo2037-12-31
local.description.notesImported from ARIES
local.description.refereedYes
local.identifier.absfor080704 - Information Retrieval and Web Search
local.identifier.ariespublicationu3594520xPUB271
local.identifier.doi10.1109/ISCIII.2009.5342266
local.identifier.scopusID2-s2.0-72449174243
local.type.statusPublished Version

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
01_Manna_Tensor_Term_Indexing:_An_2009.pdf
Size:
221.17 KB
Format:
Adobe Portable Document Format