Pairwise FastText Classifier for Entity Disambiguation

Date

2016

Authors

Yu, Cheng
Chu, Bing
Ram, Rohit
Aichinger, James
Qu, Lizhen
Suominen, Hanna

Journal Title

Journal ISSN

Volume Title

Publisher

Association for Computational Linguistics

Abstract

For the Australasian Language Technology Association (ALTA) 2016 Shared Task, we devised Pairwise FastText Classifier (PFC), an efficient embedding-based text classifier, and used it for entity disambiguation. Compared with a few baseline algorithms, PFC achieved a higher F1 score at 0.72 (under the team name BCJR). To generalise the model, we also created a method to bootstrap the training set deterministically without human labelling and at no financial cost. By releasing PFC and the dataset augmentation software to the public1, we hope to invite more collaboration.

Description

Keywords

Citation

Source

Proceedings of Australasian Language Technology Association Workshop 2016 Workshop

Type

Conference paper

Book Title

Entity type

Access Statement

Open Access

License Rights

DOI

Restricted until