Skip navigation
Skip navigation

Data mining methodological weaknesses and suggested fixes

Maindonald, John

Description

Predictive accuracy claims should give explicit descriptions of the steps followed, with access to the code used. This allows referees and readers to check for common traps, and to repeat the same steps on other data. Feature selection and/or model selection and/or tuning must be independent of the test data. For use of cross-validation, such steps must be repeated at each fold. Even then, such accuracy assessments have the limitation that the target population, to which results will be...[Show more]

CollectionsANU Research Publications
Date published: 2006
Type: Conference paper
URI: http://hdl.handle.net/1885/26185
Source: Proceedings of the fifth Australasian Data Mining Conference (AusDM2006)

Download

File Description SizeFormat Image
01_Maindonald_Data_mining_methodological_2006.pdf465.67 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  12 November 2018/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator