Corpus based classification of text in Australian contracts
Authors
Curtotti, Michael
McCreath, Eric
Publisher
Australasian Language Technology Association
Abstract
Written contracts are a fundamental framework for commercial and cooperative transactions and relationships. Limited research has been published on the application of machine learning and natural language processing (NLP) to contracts.
In this paper we report the classification of components of contract texts using machine learning and hand-coded methods.
Authors studying a range of domains have found that combining machine learning with rule-based approaches increases the accuracy of machine learning. We find similar results, which suggest that hand-coded classification rules are worth leveraging in machine learning. Combining machine learning and rule-based approaches, we attained an average accuracy of 83.48% on a multiclass labelling task on 20 contracts, an improvement over machine learning alone.
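One way to picture combining hand-coded rules with machine learning is to feed rule outputs to a learner as extra features. The sketch below is illustrative only and is not the authors' method. The abstract does not name a classifier, a rule set, or a label scheme, so the Naive Bayes model, the three regex-style rules, the labels (heading, clause, definition), and the toy training lines are all assumptions made up for this example.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical hand-coded rules (assumed for illustration): each emits a
# pseudo-token when it fires on a line of contract text.
RULES = [
    ("RULE_ALLCAPS", lambda s: s.isupper()),
    ("RULE_NUMBERED", lambda s: re.match(r"^\d+(\.\d+)*\s", s) is not None),
    ("RULE_MEANS", lambda s: " means " in s.lower()),
]

def features(line, use_rules=True):
    """Bag-of-words tokens, optionally extended with rule pseudo-tokens."""
    tokens = re.findall(r"[a-z]+", line.lower())
    if use_rules:
        tokens += [name for name, test in RULES if test(line)]
    return tokens

class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (a stand-in learner)."""

    def fit(self, lines, labels, use_rules=True):
        self.use_rules = use_rules
        self.class_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        for line, label in zip(lines, labels):
            self.token_counts[label].update(features(line, use_rules))
        self.vocab = {t for c in self.token_counts.values() for t in c}
        return self

    def predict(self, line):
        total = sum(self.class_counts.values())
        best, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.token_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)
            for tok in features(line, self.use_rules):
                if tok in self.vocab:  # tokens never seen in training are skipped
                    score += math.log((counts[tok] + 1) / denom)
            if score > best_score:
                best, best_score = label, score
        return best

# Toy training data, invented for this sketch (not from the paper's corpus).
TRAIN = [
    ("DEFINITIONS", "heading"),
    ("PAYMENT", "heading"),
    ("1.1 The Supplier must deliver the goods", "clause"),
    ("2.3 The Customer must pay the fee", "clause"),
    ("Goods means the items listed in the schedule", "definition"),
    ("Fee means the amount payable under this agreement", "definition"),
]

model = NaiveBayes().fit(*zip(*TRAIN))
print(model.predict("3.2 Termination on notice"))
```

In this toy setup the line "3.2 Termination on notice" shares no words with the training data, so the only evidence available to the classifier is the numbering rule firing. Passing `use_rules=False` to `fit` gives a words-only baseline for comparison.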
Citation
Curtotti, M. & McCreath, E. (2010). Corpus based classification of text in Australian contracts. In N. Indurkhya & S. Zwarts (Eds.), Proceedings of the Australasian Language Technology Association Workshop 2010 (pp. 18-26). Melbourne, Vic.: ALTA
Source
Proceedings of the Australasian Language Technology Association Workshop (ALTA 2010)