Corpus based classification of text in Australian contracts
| dc.contributor.author | Curtotti, Michael | |
| dc.contributor.author | McCreath, Eric | |
| dc.coverage.spatial | Melbourne Australia | |
| dc.date.accessioned | 2010-12-22T05:09:02Z | en_AU |
| dc.date.accessioned | 2011-04-19T01:17:09Z | |
| dc.date.available | 2010-12-22T05:09:02Z | en_AU |
| dc.date.available | 2011-04-19T01:17:09Z | |
| dc.date.created | December 9-10 2010 | |
| dc.date.issued | 2010 | en_AU |
| dc.date.updated | 2015-12-09T10:59:17Z | |
| dc.description.abstract | Written contracts are a fundamental framework for commercial and cooperative transactions and relationships. Limited research has been published on the application of machine learning and natural language processing (NLP) to contracts. In this paper we report the classification of components of contract texts using machine learning and hand-coded methods. Authors studying a range of domains have found that combining machine learning and rule based approaches increases accuracy of machine learning. We find similar results which suggest the utility of considering leveraging hand coded classification rules for machine learning. We attained an average accuracy of 83.48% on a multiclass labelling task on 20 contracts combining machine learning and rule based approaches, increasing performance over machine learning alone. | |
| dc.format | 9 pages | |
| dc.identifier.citation | Curtotti, M. & McCreath, E. (2010). Corpus based classification of text in Australian contracts. In N. Indurkhya & S. Zwarts (Eds), Proceedings of the Australasian Language Technology Association Workshop 2010 (pp.18-26). Melbourne, Vic.: ALTA | |
| dc.identifier.isbn | 1834-7037 | |
| dc.identifier.issn | 1834-7037 | en_AU |
| dc.identifier.uri | http://hdl.handle.net/10440/1263 | en_AU |
| dc.publisher | Australasian Language Technology Association | |
| dc.relation.ispartofseries | Australasian Language Technology Association Workshop (ALTA 2010) | |
| dc.rights | Authors own the copyright. Permission granted to archive the paper and make it publicly available - from author's email dated 22/12/10 | |
| dc.source | Proceedings of the Australasian Language Technology Association Workshop (ALTA 2010) | |
| dc.source.uri | http://www.alta.asn.au/events/alta2010/proceedings/ALTA2010.pdf | en_AU |
| dc.source.uri | http://cs.anu.edu.au/people/Michael.Curtotti/papers/alta2010contractclassification.pdf | en_AU |
| dc.subject | contract | |
| dc.subject | natural language processing | |
| dc.subject | artificial intelligence and law | |
| dc.subject | machine learning | |
| dc.subject | classification | |
| dc.subject | corpus linguistics | |
| dc.title | Corpus based classification of text in Australian contracts | |
| dc.type | Conference paper | |
| local.bibliographicCitation.lastpage | 26 | |
| local.bibliographicCitation.startpage | 18 | |
| local.contributor.affiliation | Curtotti, Michael, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | McCreath, Eric, College of Engineering and Computer Science, ANU | |
| local.contributor.authoruid | u3752363 | en_AU |
| local.contributor.authoruid | u4033585 | en_AU |
| local.description.notes | Workshop held 9-10 December 2010, University of Melbourne, Melbourne, Australia | en_AU |
| local.description.refereed | Yes | |
| local.identifier.absfor | 080109 - Pattern Recognition and Data Mining | |
| local.identifier.absfor | 080107 - Natural Language Processing | |
| local.identifier.absseo | 890201 - Application Software Packages (excl. Computer Games) | |
| local.identifier.absseo | 940406 - Legal Processes | |
| local.identifier.ariespublication | U3594520xPUB402 | |
| local.publisher.url | http://www.alta.asn.au/ | en_AU |
| local.type.status | Accepted Version | en_AU |
Downloads
Original bundle
1 - 1 of 1