Zero- and Few-Shots Knowledge Graph Triplet Extraction with Large Language Models
| dc.contributor.author | Papaluca, Andrea | en |
| dc.contributor.author | Krefl, Daniel | en |
| dc.contributor.author | Rodríguez Méndez, Sergio J. | en |
| dc.contributor.author | Lensky, Artem | en |
| dc.contributor.author | Suominen, Hanna | en |
| dc.date.accessioned | 2025-05-31T01:28:34Z | |
| dc.date.available | 2025-05-31T01:28:34Z | |
| dc.date.issued | 2024 | en |
| dc.description.abstract | In this work, we tested the Triplet Extraction (TE) capabilities of a variety of Large Language Models (LLMs) of different sizes in the Zero- and Few-Shots settings. In detail, we proposed a pipeline that dynamically gathers contextual information from a Knowledge Base (KB), both in the form of context triplets and of (sentence, triplets) pairs as examples, and provides it to the LLM through a prompt. The additional context allowed the LLMs to be competitive with all the older fully trained baselines based on the Bidirectional Long Short-Term Memory (BiLSTM) Network architecture. We further conducted a detailed analysis of the quality of the gathered KB context, finding it to be strongly correlated with the final TE performance of the model. In contrast, the size of the model appeared to only logarithmically improve the TE capabilities of the LLMs. We release the code on GitHub for reproducibility. | en |
| dc.description.sponsorship | Andrea Papaluca was supported by an Australian Government Research Training Program International Scholarship. Artem Lensky was partially supported by the Commonwealth Department of Defence, Defence Science and Technology Group. | en |
| dc.description.status | Peer-reviewed | en |
| dc.format.extent | 12 | en |
| dc.identifier.isbn | 9798891761476 | en |
| dc.identifier.other | ORCID:/0000-0001-7203-8399/work/171153707 | en |
| dc.identifier.scopus | 85204482864 | en |
| dc.identifier.uri | http://www.scopus.com/inward/record.url?scp=85204482864&partnerID=8YFLogxK | en |
| dc.identifier.uri | https://hdl.handle.net/1885/733755750 | |
| dc.language.iso | en | en |
| dc.publisher | Association for Computational Linguistics (ACL) | en |
| dc.relation.ispartof | KaLLM 2024 - 1st Workshop on Knowledge Graphs and Large Language Models, Proceedings of the Workshop | en |
| dc.relation.ispartofseries | 1st Workshop on Knowledge Graphs and Large Language Models, KaLLM 2024 | en |
| dc.relation.ispartofseries | KaLLM 2024 - 1st Workshop on Knowledge Graphs and Large Language Models, Proceedings of the Workshop | en |
| dc.rights | Publisher Copyright: ©2024 Association for Computational Linguistics. | en |
| dc.title | Zero- and Few-Shots Knowledge Graph Triplet Extraction with Large Language Models | en |
| dc.type | Conference paper | en |
| dspace.entity.type | Publication | en |
| local.bibliographicCitation.lastpage | 23 | en |
| local.bibliographicCitation.startpage | 12 | en |
| local.contributor.affiliation | Papaluca, Andrea; Australian National University | en |
| local.contributor.affiliation | Rodríguez Méndez, Sergio J.; School of Computing, ANU College of Systems and Society, The Australian National University | en |
| local.contributor.affiliation | Lensky, Artem; University of New South Wales | en |
| local.contributor.affiliation | Suominen, Hanna; School of Computing, ANU College of Systems and Society, The Australian National University | en |
| local.identifier.pure | f61e6154-68f4-40b2-95ac-6692bd7fc261 | en |
| local.identifier.url | https://www.scopus.com/pages/publications/85204482864 | en |
| local.type.status | Published | en |
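
The abstract describes a pipeline that gathers context triplets and (sentence, triplets) example pairs from a KB and passes them to the LLM through a prompt. The following is a minimal sketch of that idea, not the authors' released code. The names `KBEntry`, `retrieve_examples`, `retrieve_context_triplets`, `build_prompt`, and `DEMO_KB`, the prompt wording, and the token-overlap (Jaccard) similarity are all assumptions made for illustration. The record does not say how the paper's retriever ranks KB entries.

```python
from dataclasses import dataclass

Triplet = tuple[str, str, str]  # (subject, predicate, object)


@dataclass(frozen=True)
class KBEntry:
    """Hypothetical KB record: a sentence with its annotated triplets."""
    sentence: str
    triplets: tuple[Triplet, ...]


def _tokens(text: str) -> set[str]:
    # Lowercased word set with surrounding punctuation stripped.
    return {w.strip(".,;:!?()").lower() for w in text.split()} - {""}


def similarity(a: str, b: str) -> float:
    # Jaccard overlap: an assumed stand-in for the actual retriever.
    ta, tb = _tokens(a), _tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def retrieve_examples(kb: list[KBEntry], sentence: str, k: int) -> list[KBEntry]:
    # The k (sentence, triplets) pairs whose sentence is closest to the input.
    return sorted(kb, key=lambda e: similarity(e.sentence, sentence), reverse=True)[:k]


def retrieve_context_triplets(kb: list[KBEntry], sentence: str, k: int) -> list[Triplet]:
    # The k KB triplets whose flattened text is closest to the input.
    all_triplets = [t for entry in kb for t in entry.triplets]
    return sorted(all_triplets, key=lambda t: similarity(" ".join(t), sentence), reverse=True)[:k]


def build_prompt(kb: list[KBEntry], sentence: str, n_examples: int = 2, n_context: int = 3) -> str:
    # Assemble instruction, KB context, few-shot examples, and the query.
    # n_examples=0 and n_context=0 gives a plain zero-shot prompt.
    lines = ["Extract knowledge graph triplets (subject, predicate, object) from the text."]
    context = retrieve_context_triplets(kb, sentence, n_context)
    if context:
        lines.append("Context triplets:")
        lines.extend(f"- {t}" for t in context)
    for example in retrieve_examples(kb, sentence, n_examples):
        lines.append(f"Text: {example.sentence}")
        lines.append(f"Triplets: {list(example.triplets)}")
    lines.append(f"Text: {sentence}")
    lines.append("Triplets:")
    return "\n".join(lines)


# Toy KB, invented for this sketch only.
DEMO_KB = [
    KBEntry("Albert Einstein was born in Ulm.",
            (("Albert Einstein", "place of birth", "Ulm"),)),
    KBEntry("Paris is the capital of France.",
            (("Paris", "capital of", "France"),)),
    KBEntry("Marie Curie won the Nobel Prize in Physics.",
            (("Marie Curie", "award received", "Nobel Prize in Physics"),)),
]
```

Calling `build_prompt(DEMO_KB, "Marie Curie was born in Warsaw.")` returns a string that can be sent to any LLM client. Retrieval and prompt assembly are kept in separate functions so that the quality of the gathered KB context, which the abstract reports as strongly correlated with TE performance, can be inspected on its own.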