MEL: Metadata Extractor & Loader

dc.contributor.author: Rodríguez Méndez, Sergio J. [en]
dc.contributor.author: Omran, Pouya G. [en]
dc.contributor.author: Haller, Armin [en]
dc.contributor.author: Taylor, Kerry [en]
dc.date.accessioned: 2026-01-01T15:42:49Z
dc.date.available: 2026-01-01T15:42:49Z
dc.date.issued: 2021 [en]
dc.description.abstract: The metadata and content-based information extraction tasks from heterogeneous file sets are pre-processing steps of many Knowledge Graph Construction Pipelines (KGCP). These tasks often take longer than necessary due to the lack of proper tools that integrate several complementary extraction methods and properties to get a rich output set. This paper presents MEL, a Python-based tool that implements a set of methods to extract metadata and content-based information from unstructured information encoded in different source document formats. The results are generated as JSON files, which can: (a) optionally be stored in a document store, and (b) easily be mapped to RDF using a variety of tools such as J2RM. MEL supports more than 20 different file types, making it a versatile tool that aids pre-processing tasks as part of a KGCP based on comprehensive configurable settings. [en]
dc.description.status: Peer-reviewed [en]
dc.format.extent: 5 [en]
dc.identifier.other: ORCID:/0000-0001-7203-8399/work/166032755 [en]
dc.identifier.scopus: 85117698070 [en]
dc.identifier.uri: https://hdl.handle.net/1885/733801523
dc.language.iso: en [en]
dc.relation.ispartofseries: 2021 International Semantic Web Conference Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice, ISWC-Posters-Demos-Industry 2021 [en]
dc.rights: Publisher Copyright: © 2021 CEUR-WS. All rights reserved. [en]
dc.subject: Data Analysis Pipeline [en]
dc.subject: Data Pre processing [en]
dc.subject: Information Extraction [en]
dc.subject: Knowledge Graph Construction [en]
dc.subject: Metadata Extraction [en]
dc.title: MEL: Metadata Extractor & Loader [en]
dc.type: Conference paper [en]
dspace.entity.type: Publication [en]
local.contributor.affiliation: Rodríguez Méndez, Sergio J.; School of Computing, ANU College of Systems and Society, The Australian National University [en]
local.contributor.affiliation: Omran, Pouya G.; School of Computing, ANU College of Systems and Society, The Australian National University [en]
local.contributor.affiliation: Haller, Armin; Research School of Management, ANU College of Business & Economics, The Australian National University [en]
local.contributor.affiliation: Taylor, Kerry; School of Computing, ANU College of Systems and Society, The Australian National University [en]
local.identifier.ariespublication: a383154xPUB24251 [en]
local.identifier.pure: 9acb1580-4e32-4697-b890-3c87c0894d83 [en]
local.identifier.url: https://www.scopus.com/pages/publications/85117698070 [en]
local.identifier.url: https://ceur-ws.org/Vol-2980/ [en]
local.type.status: Published [en]