Pairwise FastText Classifier for Entity Disambiguation
Date
2016
Authors
Yu, Cheng
Chu, Bing
Ram, Rohit
Aichinger, James
Qu, Lizhen
Suominen, Hanna
Journal Title
Journal ISSN
Volume Title
Publisher
Association for Computational Linguistics
Abstract
For the Australasian Language Technology Association (ALTA) 2016 Shared Task, we devised Pairwise FastText Classifier (PFC), an efficient embedding-based text classifier, and used it for entity disambiguation. Compared with a few baseline algorithms, PFC achieved a higher F1 score at 0.72 (under the team name BCJR). To generalise the model, we also created a method to bootstrap the training set deterministically without human labelling and at no financial cost. By releasing PFC and the dataset augmentation software to the public1, we hope to invite more collaboration.
Description
Keywords
Citation
Collections
Source
Proceedings of Australasian Language Technology Association Workshop 2016 Workshop
Type
Conference paper
Book Title
Entity type
Access Statement
Open Access
License Rights
DOI
Restricted until
Downloads
File
Description