Skip navigation
Skip navigation

On authorship attribution via Markov chains and sequence kernels

Sanderson, Conrad; Guenter, Simon

Description

We investigate the use of recently proposed character and word sequence kernels for the task of authorship attribution and compare their performance with two probabilistic approaches based on Markov chains of characters and words. Several configurations of the sequence kernels are studied using a relatively large dataset, where each author covered several topics. Utilising Moffat smoothing, the two probabilistic approaches obtain similar performance, which in turn is comparable to that of...[Show more]

CollectionsANU Research Publications
Date published: 2006
Type: Conference paper
URI: http://hdl.handle.net/1885/27940
Source: Proceedings of the 18th International Conference on Pattern Recognition
DOI: 10.1109/ICPR.2006.899

Download

File Description SizeFormat Image
01_Sanderson_On_authorship_attribution_via_2006.pdf97.58 kBAdobe PDF    Request a copy
02_Sanderson_On_authorship_attribution_via_2006.pdf166.17 kBAdobe PDF    Request a copy
03_Sanderson_On_authorship_attribution_via_2006.pdf128.74 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  23 August 2018/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator