Semantic Hierarchical Document Signature For Determining Sentence Similarity

dc.contributor.authorManna, Sukanya
dc.contributor.authorGedeon, Tamas (Tom)
dc.coverage.spatialBarcelona Spain
dc.date.accessioned2015-12-10T22:30:48Z
dc.date.available2015-12-10T22:30:48Z
dc.date.createdJuly 18-23 2010
dc.date.issued2010
dc.date.updated2016-02-24T10:17:59Z
dc.description.abstractIn this paper, we present a new approach that incorporates semantic information from a document, in the form of Hierarchical Document Signature (HDS), to measure semantic similarity between sentences. Due to variability of expressions of natural language, it is very essential to exploit the semantic properties of a document to accurately identify semantically similar sentences since sentences conveying the same fact or concept may be composed lexically and syntactically different. Inversely, sentences which are lexically common may not necessarily convey the same meaning. This poses a significant impact on many text mining applications performance where sentence-level judgment is involved. Our HDS uses the natural hierarchy of the document and represents it in a modularized form of document level to sentence level, sentence to word level; aggregating similarity components at the lower levels and propagating them to the next higher level to produce the final similarity between sentences. The evaluation of our HDS model has shown that it resembles the decision making process as done by human to a greater extent than different vector space models which only uses 'bag of words' concept.
dc.identifier.isbn9781424469208
dc.identifier.urihttp://hdl.handle.net/1885/55247
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE Inc)
dc.relation.ispartofseriesIEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2010)
dc.sourceProceedings of the 19th international conference on Fuzzy Systems
dc.subjectKeywords: Bag of words; Decision making process; Hierarchical document; Modularized; Natural languages; New approaches; Semantic information; Semantic properties; Semantic similarity; Sentence level; Sentence similarity; Significant impacts; Text mining; Vector spa
dc.titleSemantic Hierarchical Document Signature For Determining Sentence Similarity
dc.typeConference paper
local.contributor.affiliationManna, Sukanya, College of Engineering and Computer Science, ANU
local.contributor.affiliationGedeon, Tamas (Tom), College of Engineering and Computer Science, ANU
local.contributor.authoremailu4088783@anu.edu.au
local.contributor.authoruidManna, Sukanya, u4321410
local.contributor.authoruidGedeon, Tamas (Tom), u4088783
local.description.notesImported from ARIES
local.description.refereedYes
local.identifier.absfor080107 - Natural Language Processing
local.identifier.absseo970108 - Expanding Knowledge in the Information and Computing Sciences
local.identifier.ariespublicationU3594520xPUB323
local.identifier.doi10.1109/FUZZY.2010.5584332
local.identifier.scopusID2-s2.0-78549259684
local.identifier.uidSubmittedByU3594520
local.type.statusPublished Version

Downloads