Semi-Markov models for sequence segmentation
dc.contributor.author | Shi, Qinfeng | |
dc.contributor.author | Altun, Yasemin | |
dc.contributor.author | Smola, Alexander | |
dc.contributor.author | Vishwanathan, S | |
dc.coverage.spatial | Prague Czech Republic | |
dc.date.accessioned | 2015-12-10T22:12:31Z | |
dc.date.created | June 28-30 2007 | |
dc.date.issued | 2007 | |
dc.date.updated | 2016-02-24T11:43:33Z | |
dc.description.abstract | In this paper, we study the problem of automatically segmenting written text into paragraphs. This is inherently a sequence labeling problem, however, previous approaches ignore this dependency. We propose a novel approach for automatic paragraph segmentation, namely training Semi-Markov models discriminatively using a Max-Margin method. This method allows us to model the sequential nature of the problem and to incorporate features of a whole paragraph, such as paragraph coherence which cannot be used in previous models. Experimental evaluation on four text corpora shows improvement over the previous state-of-the art method on this task. | |
dc.identifier.uri | http://hdl.handle.net/1885/49701 | |
dc.publisher | OmniPress | |
dc.relation.ispartofseries | Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2007) | |
dc.source | Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2007) | |
dc.source.uri | http://www.aclweb.org/anthology-new/D/D07/D07-1.pdf | |
dc.subject | Keywords: Experimental evaluation; Paragraph segmentation; Semi Markov model; Sequence Labeling; State of the art; Text corpora; Written texts; Computational linguistics; Markov processes; Natural language processing systems | |
dc.title | Semi-Markov models for sequence segmentation | |
dc.type | Conference paper | |
local.bibliographicCitation.lastpage | 648 | |
local.bibliographicCitation.startpage | 640 | |
local.contributor.affiliation | Shi, Qinfeng, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Altun, Yasemin, Toyota Technological Institute at Chicago | |
local.contributor.affiliation | Smola, Alexander, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Vishwanathan, S, College of Engineering and Computer Science, ANU | |
local.contributor.authoremail | repository.admin@anu.edu.au | |
local.contributor.authoruid | Shi, Qinfeng, u4265690 | |
local.contributor.authoruid | Smola, Alexander, u4039398 | |
local.contributor.authoruid | Vishwanathan, S, a204054 | |
local.description.embargo | 2037-12-31 | |
local.description.notes | Imported from ARIES | |
local.description.refereed | Yes | |
local.identifier.absfor | 080109 - Pattern Recognition and Data Mining | |
local.identifier.ariespublication | u8803936xPUB190 | |
local.identifier.scopusID | 2-s2.0-78649917412 | |
local.identifier.uidSubmittedBy | u8803936 | |
local.type.status | Published Version |
Downloads
Original bundle
1 - 3 of 3
No Thumbnail Available
- Name:
- 01_Shi_Semi-Markov_models_for_2007.pdf
- Size:
- 806.88 KB
- Format:
- Adobe Portable Document Format
No Thumbnail Available
- Name:
- 02_Shi_Semi-Markov_models_for_2007.pdf
- Size:
- 29.43 KB
- Format:
- Adobe Portable Document Format
No Thumbnail Available
- Name:
- 03_Shi_Semi-Markov_models_for_2007.pdf
- Size:
- 355.14 KB
- Format:
- Adobe Portable Document Format