Fuzzy Pseudo-thesaurus Based Clustering of a Folkloristic Corpus

Szaszko, S; Koczy, Laszlo T.; Gedeon, Tamas (Tom)

Description

Automatic thesaurus extraction is essential for modern information retrieval. We develop a method for building a fuzzy pseudo-thesaurus based on word-pair co-occurrence in documents. This study shows that taking into account the Word Frequency Degree, counted over the whole corpus, makes the resulting pseudo-thesaurus usable. Parameters were found with which most of the extracted word pairs were validated as related by a human expert. Among the extracted pairs and groups of words the...

Collections: ANU Research Publications
Date published: 2005
Type: Conference paper
URI: http://hdl.handle.net/1885/33797
Source: Proceedings of the 2005 IEEE International Conference on Fuzzy Systems

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
