Data mining methodological weaknesses and suggested fixes
Predictive accuracy claims should give explicit descriptions of the steps followed, with access to the code used. This allows referees and readers to check for common traps, and to repeat the same steps on other data. Feature selection and/or model selection and/or tuning must be independent of the test data. For use of cross-validation, such steps must be repeated at each fold. Even then, such accuracy assessments have the limitation that the target population, to which results will be...[Show more]
|Collections||ANU Research Publications|
|Source:||Proceedings of the fifth Australasian Data Mining Conference (AusDM2006)|
|01_Maindonald_Data_mining_methodological_2006.pdf||465.67 kB||Adobe PDF||Request a copy|
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.