faiss

Commit Graph

Author	SHA1	Message	Date
Matthijs Douze	c7fd8d7ac3	improved range search evaluation functions Summary: For range search evaluation, this diff adds optimized functions for ground-truth generation (on GPU). Reviewed By: beauby Differential Revision: D25822716 fbshipit-source-id: c5278dfad0510d24c2a5c87c1f8c81161fa8c5d3	2021-01-11 08:12:10 -08:00
Matthijs Douze	3dd7ba8ff9	Add range search accuracy evaluation Summary: Added a few functions in contrib to: - run range searches by batches on the query or the database side - emulate range search on GPU: search on GPU with k=1024, if the farthest neighbor is still within range, re-perform search on CPU - as reference implementations for precision-recall on range search datasets - optimized code to plot precision-recall plots (ie. sweep over thresholds) The new functions are mainly in a new `evaluation.py` Reviewed By: wickedfoo Differential Revision: D25627619 fbshipit-source-id: 58f90654c32c925557d7bbf8083efbb710712e03	2020-12-17 17:17:09 -08:00
Matthijs Douze	c5975cda72	PQ4 fast scan benchmarks (#1555 ) Summary: Code + scripts for Faiss benchmarks around the Fast scan codes. Pull Request resolved: https://github.com/facebookresearch/faiss/pull/1555 Test Plan: buck test //faiss/tests/:test_refine Reviewed By: wickedfoo Differential Revision: D25546505 Pulled By: mdouze fbshipit-source-id: 902486b7f47e36221a2671d124df8c114f25db58	2020-12-16 01:18:58 -08:00
Matthijs Douze	6d0bc58db6	Implementation of PQ4 search with SIMD instructions (#1542 ) Summary: IndexPQ and IndexIVFPQ implementations with AVX shuffle instructions. The training and computing of the codes does not change wrt. the original PQ versions but the code layout is "packed" so that it can be used efficiently by the SIMD computation kernels. The main changes are: - new IndexPQFastScan and IndexIVFPQFastScan objects - simdib.h for an abstraction above the AVX2 intrinsics - BlockInvertedLists for invlists that are 32-byte aligned and where codes are not sequential - pq4_fast_scan.h/.cpp: for packing codes and look-up tables + optmized distance comptuation kernels - simd_result_hander.h: SIMD version of result collection in heaps / reservoirs Misc changes: - added contrib.inspect_tools to access fields in C++ objects - moved .h and .cpp code for inverted lists to an invlists/ subdirectory, and made a .h/.cpp for InvertedListsIOHook - added a new inverted lists type with 32-byte aligned codes (for consumption by SIMD) - moved Windows-specific intrinsics to platfrom_macros.h Pull Request resolved: https://github.com/facebookresearch/faiss/pull/1542 Test Plan: ``` buck test mode/opt -j 4 //faiss/tests/:test_fast_scan_ivf //faiss/tests/:test_fast_scan buck test mode/opt //faiss/manifold/... ``` Reviewed By: wickedfoo Differential Revision: D25175439 Pulled By: mdouze fbshipit-source-id: ad1a40c0df8c10f4b364bdec7172e43d71b56c34	2020-12-03 10:06:38 -08:00
Jeff Johnson	8d776e6453	PyTorch tensor / Faiss index interoperability (#1484 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/1484 This diff allows for native usage of PyTorch tensors for Faiss indexes on both CPU and GPU. It is currently only implemented in this diff for things that inherit from `faiss.Index`, which covers the non-binary indices, and it patches the same functions on `faiss.Index` that were also covered by `__init__.py` for numpy interoperability. There must be uniformity among the inputs: if any array input is a Torch tensor, then all array inputs must be Torch tensors. Similarly, if any array input is a numpy ndarray, then all array inputs must be numpy ndarrays. If `faiss.contrib.torch_utils` is imported, it ensures that `import faiss` has already been performed to patch all of the functions using the base `__init__.py` numpy wrappers, and then patches the following functions again: ``` add add_with_ids assign train search remove_ids reconstruct reconstruct_n range_search update_vectors search_and_reconstruct sa_encode sa_decode ``` to allow usage of PyTorch CPU tensors, and additionally PyTorch GPU tensors if the index being used is on the GPU. numpy functionality is still available when `faiss.contrib.torch_utils` is imported; we pass through to the original patched numpy function when we detect numpy inputs. In addition, to allow for better (asynchronous) GPU usage without requiring the CPU to be involved, all of these functions which construct tensors/arrays for output now take optional arguments for storage (numpy or torch.Tensor) to be provided that will contain the output data. `range_search` is the only exception to this, as the size of the output data is indeterminate. The eventual GPU implementation will likely require the user to provide a maximum cap on the output size, and allow that to be passed instead. If the optional pre-allocated output values are presented by the user, they are used; otherwise, new return ndarray / Tensors are constructed as before and used for the return. If this feature were not provided on the GPU, then every execution would be completely serial as we would depend upon the CPU to allocate GPU memory before every operation. Instead, now this can function much like NN graph execution on the GPU, assuming that all of the data requirements are pre-allocated, so the execution will run at the full speed of the GPU and not be stalled sequentially launching kernels. This diff also exposes the `GpuResources` shared_ptr object owned by a GPU index. This is required for pytorch GPU so that we can perform proper stream ordering in Faiss with respect to the current pytorch stream. So, Faiss indices now perform more or less as any NN operation in Torch does. Note, however, that a Faiss index has its own setting on current device, and if the pytorch GPU tensor inputs are resident on a different device than what the Faiss index expects, a cross-device copy will be initiated. I may choose to make this an error in the future and require matching device to device. This diff also found a bug when passing GPU data directly to `train()` for `GpuIndexIVFFlat` and `GpuIndexIVFScalarQuantizer`, as I guess we never tested passing GPU data directly to these functions before. `GpuIndexIVFPQ` was doing the right thing however. The assign function is now also implemented on the GPU as well, and is now marked `const` to be in line with the `search` function. Also added better checking of non-contiguous inputs for both Torch tensors and numpy ndarrays. Updated the `knn_gpu` function with a base implementation always present that allows for usage of numpy arrays, which is overridden when `torch_utils` is imported to allow torch usage. This supports row/column major layout, float32/float16 data and int64/int32 indices for both numpy and torch. Reviewed By: mdouze Differential Revision: D24299400 fbshipit-source-id: b4f117b9c120bd1ad83e7702087051ab7b303b29	2020-10-23 22:24:22 -07:00
Matthijs Douze	f2369fcc82	benchmark SSD IndexIVF Summary: This is some code for benchmakring the SSD reads. Reviewed By: MDSilber Differential Revision: D24457715 fbshipit-source-id: 475668e4dc450dc4652ef8828111335c236bfa44	2020-10-21 18:21:39 -07:00
Matthijs Douze	92306e3a69	Synthetic dataset with inner product option Summary: The synthetic dataset can now have IP groundtruth Reviewed By: wickedfoo Differential Revision: D24219860 fbshipit-source-id: 42e094479311135e932821ac0a97ed0fb237bf78	2020-10-20 03:46:26 -07:00
Lucas Hosseini	70eaa9b1a3	Add missing copyright headers. (#1460 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/1460 Reviewed By: wickedfoo Differential Revision: D24278804 Pulled By: beauby fbshipit-source-id: 5ea96ceb63be76a34f1eb4da03972159342cd5b6	2020-10-13 11:15:59 -07:00
Jeff Johnson	e796f4f9df	Improve Faiss / PyTorch GPU interoperability Summary: PyTorch GPU in general is free to use whatever stream it currently wants, based on `torch.cuda.current_stream()`. Due to C++/python language barrier issues, we couldn't previously pass the actual `cudaStream_t` that is currently in use on a given device from PyTorch C++ to Faiss C++ via python. This diff adds conversion functions to convert a Python integer representing a pointer to a `cudaStream_t` (which is itself a `CUstream_st*`), so we can pass the stream specified in `torch.cuda.current_stream()` to `StandardGpuResources::setDefaultStream`. We thus guarantee that all Faiss work is ordered on the same stream that is in use in PyTorch. For use in Python, there is now the `faiss.contrib.pytorch_tensors.using_stream` context object which automatically sets and unsets the current PyTorch stream within Faiss. This takes a `StandardGpuResources` object in Python, and an optional `torch.cuda.Stream` if one wants to use a different stream, otherwise it uses the current one. This is how it is used: ``` # Create a non-default stream s = torch.cuda.Stream() # Have Torch use it with torch.cuda.stream(s): # Have Faiss use the same stream as the above with faiss.contrib.pytorch_tensors.using_stream(res): # Do some work on the GPU faiss.bfKnn(res, args) ``` `using_stream` uses the same pattern as the Pytorch `torch.cuda.stream` object. This replaces any brute-force GPU/CPU synchronization work that was necessary before. Other changes in this diff: - cleans up the config objects in the GpuIndex subclasses, to distinguish between read-only parameters that can only be set upon index construction, versus those that can be changed at runtime. - StandardGpuResources now more properly distinguishes between user-supplied streams (like the PyTorch one) which will not be destroyed upon resources destruction, versus internal streams. - `search_index_pytorch` now needs to take a `StandardGpuResources` object as well, there is no way to get this from an index instance otherwise (or at least, I would have to return a `shared_ptr`, in which case we should just update the Python SWIG stuff to use `shared_ptr` for GpuResources or something. Reviewed By: mdouze Differential Revision: D24260026 fbshipit-source-id: b18bb0eb34eb012584b1c923088228776c10b720	2020-10-13 09:11:19 -07:00
Matthijs Douze	8b05434a50	Remove useless function Summary: Removed an unused function that caused compile errors in some configurations. Added contrib function (exhaustive_search.knn) to compute the k nearest neighbors without constructing an index. Renamed the equivalent GPU function as exhaustive_search.knn_gpu (it does not make much sense to mention numpy in the name as all functions take numpy arguments by default). Reviewed By: beauby Differential Revision: D24215427 fbshipit-source-id: 6d8e1eafa7c57593304b7b76f83b3015e4d2a2bb	2020-10-09 07:57:04 -07:00
Jeff Johnson	0412d761e5	GPU brute-force kNN can take int32 indices (#1445 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/1445 As requested in https://github.com/facebookresearch/faiss/issues/1304, `bfKnn` can now produce int32 indices for output. The native kernels themselves for brute-force kNN only operate on int32 indices in any case, so this is faster. Also added a SWIG definition for float16 numpy arrays. As there is not a native half type, the reverse definition is undefined, so this is only really used for taking float16 data (e.g., from numpy) as input when in Python. Added a `knn_numpy_gpu` wrapper as well that handles calling the `bfKnn` GPU implementation using CPU numpy arrays. This handles transposition and f32/f16/i32 data types as needed. Reviewed By: mdouze Differential Revision: D24152296 fbshipit-source-id: caa7daea23438cf26aa248e380f0dab2b6b907fd	2020-10-08 17:50:42 -07:00
Matthijs Douze	8dfd51d3a7	Moved pytorch interop code to contrib Summary: The pytorch interop code was in a test until now. However, it is better if people can rely on it to be updated when the API is updated. Therefore, we move it into contrib. Also added a README.md Reviewed By: wickedfoo Differential Revision: D23392962 fbshipit-source-id: 9b7c0e388a7ea3c0b73dc0018322138f49191673	2020-08-28 17:50:49 -07:00
Matthijs Douze	f849680777	Dataset access in contrib Summary: This diff adds an object for a few useful dataset in faiss.contrib. This includes synthetic datasets and the classic ones. It is intended to work on: - the FAIR cluster - gluster - manifold Reviewed By: wickedfoo Differential Revision: D23378763 fbshipit-source-id: 2437a7be9e712fd5ad1bccbe523cc1c936f7ab35	2020-08-27 19:19:33 -07:00
Lucas Hosseini	cd38e82f0c	Facebook sync 2020-07-31 (#1308 )	2020-08-03 22:15:02 +02:00

14 Commits (f351a83ef6cbe3a7b8ef6c57c7da004b23f5be56)