Skip navigation
Skip navigation

Making Subsequence Time Series Clustering Meaningful

Chen, Jason Robert

Description

Recently, the startling claim was made that sequential time series clustering is meaningless. This has important consequences for a significant amount of work in the literature, since such a claim invalidates this work's contribution. In this paper, we show that sequential time series clustering is not meaningless, and that the problem highlighted in these works stem from their use of the Euclidean distance metric as the distance measure in the subsequence vector space. As a solution, we...[Show more]

dc.contributor.authorChen, Jason Robert
dc.coverage.spatialHouston USA
dc.date.accessioned2015-12-13T22:49:21Z
dc.date.createdNovember 27 2005
dc.identifier.isbn1550-4786
dc.identifier.urihttp://hdl.handle.net/1885/80501
dc.description.abstractRecently, the startling claim was made that sequential time series clustering is meaningless. This has important consequences for a significant amount of work in the literature, since such a claim invalidates this work's contribution. In this paper, we show that sequential time series clustering is not meaningless, and that the problem highlighted in these works stem from their use of the Euclidean distance metric as the distance measure in the subsequence vector space. As a solution, we consider quite a general class of time series, and propose a regime based on two types of similarity that can exist between subsequence vectors, which give rise naturally to an alternative distance measure to Euclidean distance in the subsequence vector space. We show that, using this alternative distance measure, sequential time series clustering can indeed be meaningful. We repeat a key experiment in the work on which the "meaningless" claim was based, and show that our method leads to a successful clustering outcome.
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE Inc)
dc.relation.ispartofseriesIEEE International Conference on Data Mining (ICDM 2005)
dc.sourceProceedings Fifth IEEE International Conference on Data Mining
dc.source.urihttp://www.cacs.louisiana.edu/~icdm05/
dc.subjectKeywords: Clustering algorithms; Data mining; Euclidean distance metric; Vector space; Time series analysis
dc.titleMaking Subsequence Time Series Clustering Meaningful
dc.typeConference paper
local.description.notesImported from ARIES
local.description.refereedYes
dc.date.issued2005
local.identifier.absfor080109 - Pattern Recognition and Data Mining
local.identifier.ariespublicationMigratedxPub8770
local.type.statusPublished Version
local.contributor.affiliationChen, Jason Robert, College of Engineering and Computer Science, ANU
local.description.embargo2037-12-31
local.bibliographicCitation.startpage114
local.bibliographicCitation.lastpage121
local.identifier.doi10.1109/ICDM.2005.91
dc.date.updated2015-12-11T10:34:35Z
local.identifier.scopusID2-s2.0-33750343311
CollectionsANU Research Publications

Download

File Description SizeFormat Image
01_Chen_Making_Subsequence_Time_Series_2005.pdf257.42 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator