Performance Evaluation of Search Engines Using Enhanced Vector Space Model
- Babasaheb Bhimrao Ambedkar University, Lucknow, India
Abstract
The vector space model computes a continuous degree of similarity between a query and the retrieved documents and then ranks the documents in decreasing order of cosine (similarity) value. The similarity value is computed with the cosine function, which requires the weight of each term in the documents to be computed using a weighting scheme; computing these weights is a complex process. The model also sometimes fails to compute a similarity score: first, when there is only one document in the corpus and the query terms match that document, and second, when the number of documents containing the query terms equals the total number of documents retrieved. To address these problems and improve performance, we propose an enhanced approach to computing the cosine (similarity) value by enhancing the vector space model. We analyze and implement the proposed method in a performance evaluation of three search engines: Google, Yahoo and MSN. To verify the method, we compared its results with manually computed relevance scores and found that our evaluations match the manual method.
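The two failure cases named in the abstract can be reproduced with a small sketch of a conventional vector space model. This is not the enhanced method of the paper. It assumes a common textbook weighting (raw term frequency multiplied by `idf = log(N / df)`) and whitespace tokenization, and the function name `tfidf_cosine` is hypothetical. Under these assumptions every query term receives zero weight in both cases, so the cosine denominator is zero and the score is undefined (returned here as `None`).

```python
import math
from collections import Counter


def tfidf_cosine(query, docs):
    """Score each document against the query with tf-idf cosine similarity.

    Assumed weighting (not taken from the paper): weight = tf * log(N / df).
    Returns one score per document, or None where the cosine is undefined
    because one of the two vectors has zero length.
    """
    n_docs = len(docs)
    tokenized = [d.lower().split() for d in docs]
    q_terms = query.lower().split()
    vocab = sorted(set(q_terms).union(*tokenized))

    # Document frequency: number of documents that contain each term.
    df = {t: sum(1 for tokens in tokenized if t in tokens) for t in vocab}
    # idf is 0 whenever a term occurs in every document (df == N).
    idf = {t: math.log(n_docs / df[t]) if df[t] else 0.0 for t in vocab}

    def vector(tokens):
        tf = Counter(tokens)
        return [tf[t] * idf[t] for t in vocab]

    q_vec = vector(q_terms)
    q_norm = math.sqrt(sum(w * w for w in q_vec))

    scores = []
    for tokens in tokenized:
        d_vec = vector(tokens)
        d_norm = math.sqrt(sum(w * w for w in d_vec))
        dot = sum(q * d for q, d in zip(q_vec, d_vec))
        denom = q_norm * d_norm
        scores.append(dot / denom if denom > 0 else None)
    return scores


# Case 1: a single document that matches the query (N = df = 1).
single = tfidf_cosine("vector model", ["vector space model"])

# Case 2: every retrieved document contains the query terms (df = N).
all_match = tfidf_cosine(
    "search engine",
    ["search engine ranking", "search engine evaluation"],
)

# Ordinary case: the query terms occur in only one of three documents.
normal = tfidf_cosine(
    "search ranking",
    ["search engine ranking", "vector space model", "cosine similarity"],
)
```

In the ordinary case only the first document gets a non-zero score, while both failure cases produce no usable score at all, which is the gap an enhanced similarity computation has to close.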
DOI: https://doi.org/10.3844/jcssp.2015.692.698
Copyright: © 2015 Jitendra Nath Singh and Sanjay K. Dwivedi. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords
- Information Retrieval
- Term Frequency
- Cosine Value
- IDF
- Vector Space Model