What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?

Tu, Weijie; Deng, Weijian; Zheng, Liang; Gedeon, Tom

dc.contributor.author	Tu, Weijie	en
dc.contributor.author	Deng, Weijian	en
dc.contributor.author	Zheng, Liang	en
dc.contributor.author	Gedeon, Tom	en
dc.date.accessioned	2025-05-23T11:24:09Z
dc.date.available	2025-05-23T11:24:09Z
dc.date.issued	2024	en
dc.description.abstract	This work aims to develop a measure that can accurately rank the performance of various classifiers when they are tested on unlabeled out-of-distribution (OOD) data. We commence by demonstrating that conventional uncertainty metrics, notably the maximum Softmax prediction probability, possess inherent utility in forecasting model generalization across certain OOD contexts. Building on this insight, we introduce a new measure called Softmax Correlation (SoftmaxCorr). It calculates the cosine similarity between a class-class correlation matrix, constructed from Softmax output vectors across an unlabeled test dataset, and a predefined reference matrix that embodies ideal class correlations. A high resemblance of predictions to the reference matrix signals that the model delivers confident and uniform predictions across all categories, reflecting minimal uncertainty and confusion. Through rigorous evaluation across a suite of datasets, including ImageNet, CIFAR-10, and WILDS, we affirm the predictive validity of SoftmaxCorr in accurately forecasting model performance within both in-distribution (ID) and OOD settings. Furthermore, we discuss the limitations of our proposed measure and suggest avenues for future research.	en
dc.description.status	Peer-reviewed	en
dc.identifier.scopus	85213130774	en
dc.identifier.uri	http://www.scopus.com/inward/record.url?scp=85213130774&partnerID=8YFLogxK	en
dc.identifier.uri	https://hdl.handle.net/1885/733752168
dc.language.iso	en	en
dc.rights	Publisher Copyright: © 2024, Transactions on Machine Learning Research. All rights reserved.	en
dc.source	Transactions on Machine Learning Research	en
dc.title	What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?	en
dc.type	Journal article	en
dspace.entity.type	Publication	en
local.contributor.affiliation	Tu, Weijie; ANU College of Systems and Society, The Australian National University	en
local.contributor.affiliation	Deng, Weijian; School of Computing, ANU College of Systems and Society, The Australian National University	en
local.contributor.affiliation	Zheng, Liang; School of Computing, ANU College of Systems and Society, The Australian National University	en
local.contributor.affiliation	Gedeon, Tom; School of Computing, ANU College of Systems and Society, The Australian National University	en
local.identifier.citationvolume	2024	en
local.identifier.pure	ffa5fd91-6c25-4163-b1a5-f1622bbc88f7	en
local.identifier.url	https://www.scopus.com/pages/publications/85213130774	en
local.type.status	Published	en
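
Illustrative sketch (not part of the repository record): the abstract describes SoftmaxCorr as the cosine similarity between a class-class correlation matrix, built from Softmax output vectors over an unlabeled test set, and a predefined reference matrix. The record does not specify the reference matrix or the normalisation, so the code below assumes an identity reference and an average of outer products; the function names `softmax`, `class_correlation`, and `softmax_corr` are hypothetical and chosen only for this example. It should be read as one possible interpretation under those assumptions, not as the authors' implementation.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def class_correlation(probs):
    """K x K matrix averaging the outer products p p^T of N softmax vectors.

    ASSUMPTION: averaging over N is our choice; the abstract only says the
    matrix is constructed from softmax outputs across the test set.
    """
    n = len(probs)
    k = len(probs[0])
    corr = [[0.0] * k for _ in range(k)]
    for p in probs:
        for a in range(k):
            for b in range(k):
                corr[a][b] += p[a] * p[b] / n
    return corr


def softmax_corr(probs, reference=None):
    """Cosine similarity between the class-class matrix and a reference matrix.

    ASSUMPTION: the reference defaults to the K x K identity matrix, i.e.
    confident predictions spread evenly over classes with no confusion.
    """
    corr = class_correlation(probs)
    k = len(corr)
    if reference is None:
        reference = [[1.0 if a == b else 0.0 for b in range(k)] for a in range(k)]
    dot = sum(corr[a][b] * reference[a][b] for a in range(k) for b in range(k))
    norm_c = math.sqrt(sum(v * v for row in corr for v in row))
    norm_r = math.sqrt(sum(v * v for row in reference for v in row))
    return dot / (norm_c * norm_r)


# Confident predictions spread evenly over three classes.
balanced = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# Maximally uncertain predictions.
uncertain = [[1 / 3, 1 / 3, 1 / 3]] * 3
# Confident predictions that collapse onto a single class.
collapsed = [[1.0, 0.0, 0.0]] * 3

for name, probs in [("balanced", balanced), ("uncertain", uncertain), ("collapsed", collapsed)]:
    print(name, round(softmax_corr(probs), 3))
```

Under these assumptions the balanced, confident case scores highest, while both uncertain predictions and predictions concentrated on one class score lower, which matches the abstract's reading that a high score reflects confident and uniform predictions across categories.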

Collections

ANU Research Publications
