The usefulness of web spam
Jones, Timothy; Thomas, Paul; Sankaranarayana, Ramesh S; Hawking, David
Description
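Spam comprises at least 60% of the public web, and search engine companies invest considerable effort in rejecting these apparently useless pages. But how bad are spam pages in search results? Can spam be dealt with as a side-effect of dealing with page utility, or is the relationship more complex? Thirty-four volunteer judges rated selected individual documents first on usefulness to a specified task and then on degree of "spamminess". Our results show that the relationship between spamminess and utility is far from clear cut; judges found that an important proportion of spam documents were useful. We conclude that evaluation should consider both utility and spamminess, as separate factors; and that search engines should not summarily discard spam pages but should take their utility into account as well.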
Field | Value | |
---|---|---|
dc.contributor.author | Jones, Timothy | |
dc.contributor.author | Thomas, Paul | |
dc.contributor.author | Sankaranarayana, Ramesh S | |
dc.contributor.author | Hawking, David | |
dc.coverage.spatial | Canberra Australia | |
dc.date.accessioned | 2015-12-07T22:18:41Z | |
dc.date.created | December 2 2011 | |
dc.identifier.isbn | 9781921426926 | |
dc.identifier.uri | http://hdl.handle.net/1885/18934 | |
dc.description.abstract | Spam comprises at least 60% of the public web, and search engine companies invest considerable effort in rejecting these apparently useless pages. But how bad are spam pages in search results? Can spam be dealt with as a side-effect of dealing with page utility, or is the relationship more complex? Thirty-four volunteer judges rated selected individual documents first on usefulness to a specified task and then on degree of "spamminess". Our results show that the relationship between spamminess and utility is far from clear cut; judges found that an important proportion of spam documents were useful. We conclude that evaluation should consider both utility and spamminess, as separate factors; and that search engines should not summarily discard spam pages but should take their utility into account as well. | |
dc.publisher | RMIT University | |
dc.relation.ispartofseries | Australasian Document Computing Symposium (ADCS 2011) | |
dc.source | Proceedings of the Sixteenth Australasian Document Computing Symposium | |
dc.source.uri | http://www.cs.rmit.edu.au/adcs2011/ | |
dc.subject | Keywords: CAN-SPAM; Engine companies; Search results; Side effect; User study; Web document; Search engines; World Wide Web; User Studies Involving Documents; Web Documents | |
dc.title | The usefulness of web spam | |
dc.type | Conference paper | |
local.description.notes | Imported from ARIES | |
local.description.refereed | Yes | |
dc.date.issued | 2011 | |
local.identifier.absfor | 080704 - Information Retrieval and Web Search | |
local.identifier.ariespublication | u4313336xPUB6 | |
local.type.status | Published Version | |
local.contributor.affiliation | Jones, Timothy, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Thomas, Paul, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Sankaranarayana, Ramesh S, College of Engineering and Computer Science, ANU | |
local.contributor.affiliation | Hawking, David, College of Engineering and Computer Science, ANU | |
local.description.embargo | 2037-12-31 | |
local.bibliographicCitation.startpage | 2 | |
local.bibliographicCitation.lastpage | 5 | |
local.identifier.absseo | 890301 - Electronic Information Storage and Retrieval Services | |
dc.date.updated | 2016-02-24T10:54:10Z | |
local.identifier.scopusID | 2-s2.0-84872841127 | |
Collections | ANU Research Publications |
Download
File | Description | Size | Format | Image |
---|---|---|---|---|
01_Jones_The_usefulness_of_web_spam_2011.pdf | | 81.26 kB | Adobe PDF | |
02_Jones_The_usefulness_of_web_spam_2011.pdf | | 775.03 kB | Adobe PDF | |
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.