Skip navigation
Skip navigation

Text segmentation and Chinese site search

Zhou, Liyuan; Hawking, David; Thomas, Paul

Description

Automatic segmentation and overlapping bigrams are the most common methods for overcoming the lack of explicit word boundaries in Chinese text. Past studies have compared their effectiveness, but findings have been equivocal and site search has been little studied. We compare representatives of the two approaches using a 465,000 page crawl and test queries applicable to the university context. 503 pairs of result sets were judged by 56 Chinese students. Although there are differences on certain...[Show more]

CollectionsANU Research Publications
Date published: 2015
Type: Conference paper
URI: http://hdl.handle.net/1885/103834
Source: Text segmentation and Chinese site search
DOI: 10.1145/2838931.2838940

Download

File Description SizeFormat Image
01_Zhou_Text_segmentation_and_Chinese_2015.pdf315.37 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  23 August 2018/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator