A Randomized Approximate Nearest Neighbors Algorithm

Seminar:

Applied Mathematics

Event time:

Tuesday, November 16, 2010 - 11:15am to 12:15pm

Location:

LOM 206

Speaker:

Andrei Osipov

Speaker affiliation:

Applied Mathematics, Yale University

Event description:

We present a randomized algorithm for the approximate nearest neighbor problem in ddimensional
Euclidean space. Given N points {xj} in Rd, the algorithm attempts to find
k nearest neighbors for each of xj , where k is a user-specified integer parameter. The
algorithm is iterative, and its CPU time requirements are proportional to T Â·N Â· (dÂ· (log d)+
k Â· (d + log k) Â· (logN)) + N Â· k2 Â· (d + log k), with T the number of iterations performed.
The memory requirements of the procedure are of the order N Â· (d + k).
A byproduct of the scheme is a data structure, permitting a rapid search for the k nearest
neighbors among {xj} for an arbitrary point x 2 Rd. The cost of each such query is
proportional to T Â· (d Â· (log d) + log(N/k) Â· k Â· (d + log k)), and the memory requirements for
the requisite data structure are of the order N Â· (d + k) + T Â· (d + N).
The algorithm utilizes random rotations and a basic divide-and-conquer scheme, followed
by a local graph search. We analyze the schemeâ€™s behavior for certain types of distributions
of {xj}, and illustrate its performance via several numerical examples.
A Randomized Approximate Nearest Neighbors
Algorithm.
Peter W. Jones, Andrei Osipov, Vladimir Rokhlin