
An improved multiclass LogitBoost using Adaptive-One-Vs-One

Sun, Peng; Reid, Mark; Zhou, Jie

Description

LogitBoost is a popular Boosting variant that can be applied to either binary or multi-class classification. From a statistical viewpoint, LogitBoost can be seen as additive tree regression that minimizes the Logistic loss. Even in this setting, devising a sound multi-class LogitBoost remains non-trivial compared with devising its binary counterpart. The difficulties are due to two important factors arising in the multiclass Logistic loss. The first is the invariant property implied by the...

dc.contributor.author: Sun, Peng
dc.contributor.author: Reid, Mark
dc.contributor.author: Zhou, Jie
dc.date.accessioned: 2015-12-13T22:34:15Z
dc.identifier.issn: 0885-6125
dc.identifier.uri: http://hdl.handle.net/1885/76042
dc.description.abstract: LogitBoost is a popular Boosting variant that can be applied to either binary or multi-class classification. From a statistical viewpoint, LogitBoost can be seen as additive tree regression that minimizes the Logistic loss. Even in this setting, devising a sound multi-class LogitBoost remains non-trivial compared with devising its binary counterpart. The difficulties are due to two important factors arising in the multiclass Logistic loss. The first is the invariant property implied by the Logistic loss, which makes the optimal classifier output non-unique: adding a constant to each component of the output vector does not change the loss value. The second is the density of the Hessian matrices that arise when computing tree node split gain and node value fittings. Oversimplification of this learning problem can lead to degraded performance. For example, the original LogitBoost algorithm is outperformed by ABC-LogitBoost thanks to the latter’s more careful treatment of the above two factors. In this paper we propose new techniques to address the two main difficulties in the multiclass LogitBoost setting: (1) we adopt a vector tree model (i.e. each node value is a vector) where the unique classifier output is guaranteed by adding a sum-to-zero constraint, and (2) we use an adaptive block coordinate descent that exploits the dense Hessian when computing tree split gain and node values. Higher classification accuracy and faster convergence rates are observed for a range of public data sets when compared to both the original and the ABC-LogitBoost implementations. We also discuss another way to cope with LogitBoost’s dense Hessian matrix. We derive a loss similar to the multi-class Logistic loss but which guarantees a diagonal Hessian matrix. While this makes the optimization (by Newton descent) easier, we unfortunately observe degraded performance for this modification. We argue that working with the dense Hessian is likely unavoidable, therefore making techniques like those proposed in this paper necessary for efficient implementations.
dc.publisher: Kluwer Academic Publishers
dc.source: Machine Learning
dc.subject: Keywords: Classification (of information); Convex optimization; Forestry; Matrix algebra; Supervised learning; Block coordinate descents; Boosting; Classification accuracy; Efficient implementation; Ensemble; LogitBoost; Multi-class classification; Multiclass logit Boosting; Convex optimization; Ensemble; LogitBoost; Supervised learning
dc.title: An improved multiclass LogitBoost using Adaptive-One-Vs-One
dc.type: Journal article
local.description.notes: Imported from ARIES
local.identifier.citationvolume: 97
dc.date.issued: 2014
local.identifier.absfor: 080100 - ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING
local.identifier.ariespublication: U3488905xPUB4952
local.type.status: Published Version
local.contributor.affiliation: Sun, Peng, Tsinghua University
local.contributor.affiliation: Reid, Mark, College of Engineering and Computer Science, ANU
local.contributor.affiliation: Zhou, Jie, Tsinghua University
local.description.embargo: 2037-12-31
local.bibliographicCitation.issue: 3
local.bibliographicCitation.startpage: 295
local.bibliographicCitation.lastpage: 326
local.identifier.doi: 10.1007/s10994-014-5434-3
local.identifier.absseo: 970108 - Expanding Knowledge in the Information and Computing Sciences
dc.date.updated: 2019-05-19T08:21:15Z
local.identifier.scopusID: 2-s2.0-84910111533
local.identifier.thomsonID: 000344173100003
Collections: ANU Research Publications
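
The two difficulties named in the abstract can be checked numerically. The sketch below is not the paper's algorithm. It assumes the standard softmax form of the multiclass Logistic loss, and the function names, the 3-class score vector and the shift of 3.7 are all illustrative choices. It shows that adding a constant to every score leaves the loss unchanged, that the per-example Hessian diag(p) - p p^T has no zero off-diagonal entries, and that subtracting the mean score is one simple way to pick a sum-to-zero representative.

```python
import math

def softmax(scores):
    """Class probabilities from a score vector (numerically stabilised)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def logistic_loss(scores, label):
    """Multiclass logistic loss: negative log-probability of the true class."""
    return -math.log(softmax(scores)[label])

def hessian(scores):
    """Hessian of the loss with respect to the scores: diag(p) - p p^T."""
    p = softmax(scores)
    k = len(p)
    return [[(p[i] if i == j else 0.0) - p[i] * p[j] for j in range(k)]
            for i in range(k)]

def center(scores):
    """Impose a sum-to-zero constraint by subtracting the mean score."""
    mean = sum(scores) / len(scores)
    return [s - mean for s in scores]

scores = [2.0, -1.0, 0.5]             # hypothetical 3-class score vector
label = 0                             # hypothetical true class
shifted = [s + 3.7 for s in scores]   # same scores plus an arbitrary constant

loss = logistic_loss(scores, label)
loss_shifted = logistic_loss(shifted, label)   # equal to loss up to rounding
H = hessian(scores)                            # dense: every entry is non-zero
centered = center(shifted)                     # same vector as center(scores)
```

Because every row of H sums to zero, the Hessian is singular along the all-ones direction, which is the same invariance seen from the second-order side. The centering step is only one way to remove that degree of freedom and is used here purely for illustration.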

Download

File                                    Size    Format
01_Sun_An_improved_multiclass_2014.pdf  2.6 MB  Adobe PDF
02_Sun_An_improved_multiclass_2014.pdf  2.6 MB  Adobe PDF


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
