Research Article Open Access

Normalized Web Distance Based Web Query Classification

S. Lovelyn Rose and K. R. Chandran

Abstract

Problem statement: The task is to classify a given web query into a set of 67 target categories, which are ranked by their degree of similarity to the query. Approach: The feature set consists of the intermediate categories that a directory search engine returns for the query. These intermediate categories are mapped to the required target categories using direct mapping and the Normalized Web Distance (NWD). The target categories are then ranked using three parameters of the intermediate categories: position, frequency, and a combination of frequency and position. Results: The third parameter, the combination of frequency and position, gave better results, and using up to 40 search result pages gave better results. Conclusion: With NWD as the similarity measure, precision and recall were found to increase by 10% over previous methods.
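The abstract does not restate the NWD formula. As a minimal sketch, the standard definition from page counts is NWD(x, y) = (max{log f(x), log f(y)} - log f(x, y)) / (log N - min{log f(x), log f(y)}), where f(x) is the number of pages containing x, f(x, y) the number containing both terms, and N the total number of indexed pages. The function names, the toy counts, and the `closest_target` helper below are assumptions made for illustration; they are not the authors' implementation and the counts are not data from the paper.

```python
import math


def normalized_web_distance(fx, fy, fxy, n):
    """Standard NWD from page counts (illustrative helper, not the paper's code).

    fx, fy : pages containing term x and term y respectively
    fxy    : pages containing both terms
    n      : total number of indexed pages (a normalizing constant)
    """
    if fx <= 0 or fy <= 0:
        raise ValueError("each term must occur on at least one page")
    if fxy <= 0:
        # Terms never co-occur: treat the distance as infinite.
        return math.inf
    lx, ly, lxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(lx, ly) - lxy) / (math.log(n) - min(lx, ly))


def closest_target(intermediate, targets, count, pair_count, n):
    """Pick the target category with the smallest NWD to an intermediate one.

    count and pair_count are hypothetical lookup tables of page counts;
    in practice they would come from search engine hit counts.
    """
    return min(
        targets,
        key=lambda t: normalized_web_distance(
            count[intermediate], count[t], pair_count.get((intermediate, t), 0), n
        ),
    )


# Toy counts, invented purely for illustration.
count = {"Laptops": 1000, "Computers/Hardware": 100, "Sports/Tennis": 100}
pair_count = {
    ("Laptops", "Computers/Hardware"): 50,
    ("Laptops", "Sports/Tennis"): 2,
}
best = closest_target(
    "Laptops", ["Computers/Hardware", "Sports/Tennis"], count, pair_count, 10**6
)
```

A smaller NWD means the two terms co-occur more often relative to how often each appears alone, so the intermediate category is mapped to the target with the minimum distance.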

Journal of Computer Science
Volume 8 No. 5, 2012, 804-808

DOI: https://doi.org/10.3844/jcssp.2012.804.808

Submitted On: 3 January 2012 Published On: 9 March 2012

How to Cite: Rose, S. L. & Chandran, K. R. (2012). Normalized Web Distance Based Web Query Classification. Journal of Computer Science, 8(5), 804-808. https://doi.org/10.3844/jcssp.2012.804.808


Keywords

  • Automatic web query classification
  • Directory search
  • Query log
  • NWD