Semi-Markov models for sequence segmentation

dc.contributor.authorShi, Qinfengen
dc.contributor.authorAltun, Yaseminen
dc.contributor.authorSmola, Alexen
dc.contributor.authorVishwanathan, S. V.N.en
dc.date.accessioned2026-01-01T08:42:13Z
dc.date.available2026-01-01T08:42:13Z
dc.date.issued2007en
dc.description.abstractIn this paper, we study the problem of automatically segmenting written text into paragraphs. This is inherently a sequence labeling problem, however, previous approaches ignore this dependency. We propose a novel approach for automatic paragraph segmentation, namely training Semi-Markov models discriminatively using a Max-Margin method. This method allows us to model the sequential nature of the problem and to incorporate features of a whole paragraph, such as paragraph coherence which cannot be used in previous models. Experimental evaluation on four text corpora shows improvement over the previous state-of-the art method on this task.en
dc.description.statusPeer-revieweden
dc.format.extent9en
dc.identifier.scopus78649917412en
dc.identifier.urihttps://hdl.handle.net/1885/733799217
dc.language.isoenen
dc.relation.ispartofseries2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL 2007en
dc.titleSemi-Markov models for sequence segmentationen
dc.typeConference paperen
dspace.entity.typePublicationen
local.bibliographicCitation.lastpage648en
local.bibliographicCitation.startpage640en
local.contributor.affiliationShi, Qinfeng; School of Computing, ANU College of Systems and Society, The Australian National Universityen
local.contributor.affiliationAltun, Yasemin; Toyota Technological Institute at Chicagoen
local.contributor.affiliationSmola, Alex; School of Computing, ANU College of Systems and Society, The Australian National Universityen
local.contributor.affiliationVishwanathan, S. V.N.; School of Computing, ANU College of Systems and Society, The Australian National Universityen
local.identifier.ariespublicationu8803936xPUB190en
local.identifier.pure74a70eed-10eb-4560-96ea-a8330f41270fen
local.identifier.urlhttps://www.scopus.com/pages/publications/78649917412en
local.type.statusPublisheden

Downloads