A GPU-Accelerated Two Stage Visual Matching Pipeline for Image and Video Retrieval

Publication from Digital

Fassold H., Stiegler H., Rosner J., Thaler M., Bailer W.

13th International Workshop on Content-Based Multimedia Indexing (CBMI), Prague, Czech Republic, 6/2015


We propose a two-stage visual matching pipeline: a first stage filters candidates using VLAD signatures, and a second stage reranks the top results by raw matching of SIFT descriptors. This makes it possible to adjust the trade-off between the high computational cost of matching local descriptors and the insufficient accuracy of compact signatures in many application scenarios. We describe GPU-accelerated extraction and matching algorithms for SIFT, which yield a speedup factor of at least 4. The VLAD filtering stage reduces the number of images/frames for which local descriptors need to be matched, speeding up retrieval by an additional factor of 12-13 without sacrificing mean average precision compared to full raw descriptor matching.
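
The filter-then-rerank idea in the abstract can be illustrated with a minimal CPU sketch in NumPy. It is not the authors' GPU implementation. It assumes that local descriptors (for example SIFT) have already been extracted as one array per image, that a small codebook of cluster centres is available, and that the second stage scores candidates by counting matches that pass a nearest-neighbour ratio test. The function names (`vlad_signature`, `raw_match_score`, `two_stage_search`), the `ratio` threshold and the `top_k` shortlist size are all hypothetical choices made for illustration.

```python
import numpy as np


def vlad_signature(descriptors, codebook):
    """Aggregate local descriptors into one L2-normalised VLAD vector."""
    # Squared distance from every descriptor to every codebook centre.
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    nearest = d2.argmin(axis=1)
    k, dim = codebook.shape
    vlad = np.zeros((k, dim))
    for i in range(k):
        members = descriptors[nearest == i]
        if len(members):
            # Sum of residuals between the descriptors and their centre.
            vlad[i] = (members - codebook[i]).sum(axis=0)
    vlad = vlad.ravel()
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad


def raw_match_score(query_desc, cand_desc, ratio=0.8):
    """Count query descriptors whose best match passes a ratio test.

    Assumes the candidate has at least two descriptors.
    """
    d2 = ((query_desc[:, None, :] - cand_desc[None, :, :]) ** 2).sum(axis=2)
    d = np.sqrt(d2)
    # The two smallest distances in each row (best and second-best match).
    two_best = np.partition(d, 1, axis=1)[:, :2]
    return int(np.sum(two_best[:, 0] < ratio * two_best[:, 1]))


def two_stage_search(query_desc, db_descs, codebook, top_k=5):
    """Stage 1: shortlist by VLAD similarity. Stage 2: rerank by raw matching."""
    # In practice the database signatures would be precomputed once.
    db_vlads = np.stack([vlad_signature(d, codebook) for d in db_descs])
    sims = db_vlads @ vlad_signature(query_desc, codebook)
    shortlist = np.argsort(-sims)[:top_k]
    # Only the shortlisted images pay for the expensive descriptor matching.
    scores = np.array([raw_match_score(query_desc, db_descs[i]) for i in shortlist])
    order = np.argsort(-scores, kind="stable")
    return [int(shortlist[j]) for j in order]


if __name__ == "__main__":
    # Synthetic stand-in data: random vectors instead of real SIFT descriptors.
    rng = np.random.default_rng(0)
    codebook = rng.normal(size=(8, 32))
    database = [rng.normal(size=(50, 32)) for _ in range(20)]
    query = database[7] + 0.01 * rng.normal(size=(50, 32))
    print(two_stage_search(query, database, codebook, top_k=5))
```

In this sketch the shortlist size `top_k` is the knob that adjusts the trade-off: a larger shortlist spends more time on raw descriptor matching, while a smaller one relies more heavily on the compact signatures.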