Pairwise FastText Classifier for Entity Disambiguation
For the Australasian Language Technology Association (ALTA) 2016 Shared Task, we devised Pairwise FastText Classifier (PFC), an efficient embedding-based text classifier, and used it for entity disambiguation. Compared with a few baseline algorithms, PFC achieved a higher F1 score at 0.72 (under the team name BCJR). To generalise the model, we also created a method to bootstrap the training set deterministically without human labelling and at no financial cost. By releasing PFC and the dataset...[Show more]
|Collections||ANU Research Publications|
|Source:||Proceedings of Australasian Language Technology Association Workshop 2016 Workshop|
|Access Rights:||Open Access|
|01_Yu_Pairwise_FastText_Classifier_2016.pdf||764.79 kB||Adobe PDF|
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.