Newman, David; Bonilla, Edwin; Buntine, Wray
Topic models have the potential to improve search and browsing by extracting useful semantic themes from web pages and other text documents. When learned topics are coherent and interpretable, they can be valuable for faceted browsing, results set diversity analysis, and document retrieval. However, when dealing with small collections or noisy text (e.g. web search result snippets or blog posts), learned topics can be less coherent, less interpretable, and less useful. To overcome this, we...[Show more]
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.