Corpus based classification of text in Australian contracts

dc.contributor.authorCurtotti, Michael
dc.contributor.authorMcCreath, Eric
dc.coverage.spatialMelbourne Australia
dc.date.accessioned2010-12-22T05:09:02Zen_AU
dc.date.accessioned2011-04-19T01:17:09Z
dc.date.available2010-12-22T05:09:02Zen_AU
dc.date.available2011-04-19T01:17:09Z
dc.date.createdDecember 9-10 2010
dc.date.issued2010en_AU
dc.date.updated2015-12-09T10:59:17Z
dc.description.abstractWritten contracts are a fundamental framework for commercial and cooperative transactions and relationships. Limited research has been published on the application of machine learning and natural language processing (NLP) to contracts. In this paper we report the classification of components of contract texts using machine learning and hand-coded methods. Authors studying a range of domains have found that combining machine learning and rule based approaches increases accuracy of machine learning. We find similar results which suggest the utility of considering leveraging hand coded classification rules for machine learning. We attained an average accuracy of 83.48% on a multiclass labelling task on 20 contracts combining machine learning and rule based approaches, increasing performance over machine learning alone.
dc.format9 pages
dc.identifier.citationCurtotti, M. & McCreath, E. (2010). Corpus based classification of text in Australian contracts. In N. Indurkhya & S. Zwarts (Eds), Proceedings of the Australasian Language Technology Association Workshop 2010 (pp.18-26). Melbourne, Vic.: ALTA
dc.identifier.isbn1834-7037
dc.identifier.issn1834-7037en_AU
dc.identifier.urihttp://hdl.handle.net/10440/1263en_AU
dc.publisherAustralasian Language Technology Association
dc.relation.ispartofseriesAustralasian Language Technology Association Workshop (ALTA 2010)
dc.rightsAuthors own the copyright. Permission granted to archive the paper and make it publicly available - from author's email dated 22/12/10
dc.sourceProceedings of the Australasian Language Technology Association Workshop (ALTA 2010)
dc.source.urihttp://www.alta.asn.au/events/alta2010/proceedings/ALTA2010.pdfen_AU
dc.source.urihttp://cs.anu.edu.au/people/Michael.Curtotti/papers/alta2010contractclassification.pdfen_AU
dc.subjectcontract
dc.subjectnatural language processing
dc.subjectartificial intelligence and law
dc.subjectmachine learning
dc.subjectclassification
dc.subjectcorpus linguistics
dc.titleCorpus based classification of text in Australian contracts
dc.typeConference paper
local.bibliographicCitation.lastpage26
local.bibliographicCitation.startpage18
local.contributor.affiliationCurtotti, Michael, College of Engineering and Computer Science, ANU
local.contributor.affiliationMcCreath, Eric, College of Engineering and Computer Science, ANU
local.contributor.authoruidu3752363en_AU
local.contributor.authoruidu4033585en_AU
local.description.notesWorkshop held 9-10 December 2010, University of Melbourne, Melbourne, Australiaen_AU
local.description.refereedYes
local.identifier.absfor080109 - Pattern Recognition and Data Mining
local.identifier.absfor080107 - Natural Language Processing
local.identifier.absseo890201 - Application Software Packages (excl. Computer Games)
local.identifier.absseo940406 - Legal Processes
local.identifier.ariespublicationU3594520xPUB402
local.publisher.urlhttp://www.alta.asn.au/en_AU
local.type.statusAccepted Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Curtotti_Corpus2010.pdf
Size:
144.26 KB
Format:
Adobe Portable Document Format