Research Article Open Access

Prototype-Based Sample Selection for Active Hashing

Cheong Hee Park1
  • 1 Chungnam National University, Korea

Abstract

Several hashing-based methods for Approximate Nearest Neighbors (ANN) search in a large data set have been proposed recently. In particular, semi-supervised hashing utilizes semantic similarity given for a small fraction of pairwise data samples and active hashing aims to improve the performance for ANN search by relying on an expert for the labeling of the most informative points. In this study, we present an active hashing method by prototype-based sample selection. Knowing semantic similarities between cluster prototypes can help extracting relations among the points in the corresponding clusters. For expert labeling, we select prototypes from clusters which do not contain any data points with labeled information so that all areas can be covered effectively. Experimental results demonstrate that the proposed active hashing method improves the performance for ANN search.

Journal of Computer Science
Volume 11 No. 7, 2015, 839-844

DOI: https://doi.org/10.3844/jcssp.2015.839.844

Submitted On: 2 September 2014 Published On: 2 November 2015

How to Cite: Park, C. H. (2015). Prototype-Based Sample Selection for Active Hashing. Journal of Computer Science, 11(7), 839-844. https://doi.org/10.3844/jcssp.2015.839.844

  • 3,155 Views
  • 2,156 Downloads
  • 0 Citations

Download

Keywords

  • Active Hashing
  • Approximate Nearest Neighbors (ANN) Search
  • Hierarchical Clustering
  • Prototype-Based Sample Selection
  • Semi-Supervised Hashing