Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

HDSKG: Harvesting Domain Specific Knowledge Graph from Content of Webpages

dc.contributor.authorZhao, Xuejiao
dc.contributor.authorXing, Zhenchang
dc.contributor.authorKabir, Muhammad Ashad
dc.contributor.authorSawada, Naoya
dc.contributor.authorLi, Jing
dc.contributor.authorLin, Shang-Wei
dc.contributor.editorBavota, G.
dc.contributor.editorPinzger, M.
dc.contributor.editorMarcus, A.
dc.coverage.spatialKlagenfurt, Austria
dc.date.accessioned2021-07-01T01:46:13Z
dc.date.createdFebruary 20-24 2017
dc.date.issued2017
dc.date.updated2020-11-23T10:36:51Z
dc.description.abstractKnowledge graph is useful for many different domains like search result ranking, recommendation, exploratory search, etc. It integrates structural information of concepts across multiple information sources, and links these concepts together. The extraction of domain specific relation triples (subject, verb phrase, object) is one of the important techniques for domain specific knowledge graph construction. In this research, an automatic method named HDSKG is proposed to discover domain specific concepts and their relation triples from the content of webpages. We incorporate the dependency parser with rule-based method to chunk the relations triple candidates, then we extract advanced features of these candidate relation triples to estimate the domain relevance by a machine learning algorithm. For the evaluation of our method, we apply HDSKG to Stack Overflow (a Q&A website about computer programming). As a result, we construct a knowledge graph of software engineering domain with 35279 relation triples, 44800 concepts, and 9660 unique verb phrases. The experimental results show that both the precision and recall of HDSKG (0.78 and 0.7 respectively) is much higher than the openIE (0.11 and 0.6 respectively). The performance is particularly efficient in the case of complex sentences. Further more, with the self-training technique we used in the classifier, HDSKG can be applied to other domain easily with less training data.en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.isbn9781509055012en_AU
dc.identifier.urihttp://hdl.handle.net/1885/238483
dc.language.isoen_AUen_AU
dc.publisherIEEEen_AU
dc.relation.ispartofseries24th IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2017en_AU
dc.rights© 2017 IEEEen_AU
dc.sourceSANER 2017 - 24th IEEE International Conference on Software Analysis, Evolution, and Reengineeringen_AU
dc.source.urihttps://ieeexplore.ieee.org/document/7884609en_AU
dc.subjectKnowledge Graphen_AU
dc.subjectStructural Information Extractionen_AU
dc.subjectopenIEen_AU
dc.subjectStack Overflowen_AU
dc.subjectDependency Parseen_AU
dc.titleHDSKG: Harvesting Domain Specific Knowledge Graph from Content of Webpagesen_AU
dc.typeConference paperen_AU
local.bibliographicCitation.lastpage67en_AU
local.bibliographicCitation.startpage56en_AU
local.contributor.affiliationZhao, Xuejiao, Nanyang Technological Universityen_AU
local.contributor.affiliationXing, Zhenchang, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationKabir, Muhammad Ashad, Charles Sturt Universityen_AU
local.contributor.affiliationSawada, Naoya, NTT Communications Corporationen_AU
local.contributor.affiliationLi, Jing, Nanyang Technological Universityen_AU
local.contributor.affiliationLin, Shang-Wei, Nanyang Technological Universityen_AU
local.contributor.authoruidXing, Zhenchang, u1023389en_AU
local.description.embargo2099-12-31
local.description.notesImported from ARIESen_AU
local.description.refereedYes
local.identifier.absfor080199 - Artificial Intelligence and Image Processing not elsewhere classifieden_AU
local.identifier.ariespublicationa383154xPUB5894en_AU
local.identifier.doi10.1109/SANER.2017.7884609en_AU
local.identifier.scopusID2-s2.0-85018390307
local.publisher.urlhttps://ieeexplore.ieee.orgen_AU
local.type.statusPublished Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
01_Zhao_HDSKG%3A_Harvesting_Domain_2017.pdf
Size:
343.94 KB
Format:
Adobe Portable Document Format
abcd