faiss

mirror of https://github.com/facebookresearch/faiss.git synced 2025-06-03 21:54:02 +08:00

Author	SHA1	Message	Date
Matthijs Douze	039409d950	split off RQ encoding steps to another file (#3011 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3011 After Alexandr's optimizations the ResidualQuantizer code has become harder to read. Split off the quantization code to a separate .h / .cpp to make it clearer. Reviewed By: pemazare Differential Revision: D48448614 fbshipit-source-id: c90d572ea3afe12a7a7e5092f88710e8eceaa2d1	2023-09-01 07:06:14 -07:00
Matthijs Douze	67d87275f8	Clean up batch comments + obey IO_FLAG_SKIP_PRECOMPUTE_TABLE (#3013 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3013 To avoid OOM when loading some RCQs, don't precompute cross product tables when io_flags contains bit IO_FLAG_SKIP_PRECOMPUTE_TABLE Reviewed By: pemazare Differential Revision: D48448616 fbshipit-source-id: a261259f1fb583aa358d6b6c42d9b851e9729247	2023-09-01 07:06:14 -07:00
Matthijs Douze	82352dd453	make nbits configurable for graph indices based on PQ (#3031 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3031 As requested in https://github.com/facebookresearch/faiss/issues/3027 Indeed, PQ sizes with nbits > 8 are good tradeoffs, so it is interesting to support them. Reviewed By: pemazare Differential Revision: D48860659 fbshipit-source-id: 6f3c642e0902e1523bef36db6be3af3688d529a5	2023-09-01 02:37:33 -07:00
Matthijs Douze	5c4bd3feb3	Cleanup clustering code (#3030 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3030 Added default arguments to the .h file (for some reason I forgot this file when migrating default args). Logging a hash value in MatrixStats, useful to check if two runs really really run on the same matrix... Reviewed By: pemazare Differential Revision: D48834343 fbshipit-source-id: 7c1948464e66ada1f462f4486f7cf3159bbf9dfd	2023-08-31 01:11:45 -07:00
Corey J. Nolet	3888f9bb11	Using expanded distance forms in `RaftFlatIndex.cu` (#3021 ) Summary: This is a minor bug that comes with a perf impact. The classic FAISS `FlatIndex` always uses expanded form of distance computation even though an argument `exactDistances` is provided. `RaftFlatIndex` was using this argument to determine whether the computation should be exhaustive. This PR includes one additional change to eagerly initialize the `cublas_handle` on the `device_resources` instance when it's created. Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3021 Reviewed By: pemazare Differential Revision: D48739660 Pulled By: mdouze fbshipit-source-id: a361334eb243df86c169c69d24bb10fed8876ee9	2023-08-30 09:05:59 -07:00
Richard Barnes	fef49a6307	Del `(object)` from 50 inc faic/experiments/blip_finetune/transform/randaugment.py Summary: Python3 makes the use of `(object)` in class inheritance unnecessary. Let's modernize our code by eliminating this. Reviewed By: palmje Differential Revision: D48718370 fbshipit-source-id: 6794156f7dd835cca8e12b65067f95b6991a218c	2023-08-27 22:20:41 -07:00
Gergely Szilvasy	c00fe254e4	faiss-gpu-raft, fix dispatch test (#3017 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3017 Somehow this got mixed up with the deleted install-cmake.sh Reviewed By: mlomeli1 Differential Revision: D48496314 fbshipit-source-id: 851a5f9222d74e7681c8dcddded83dd8f9945591	2023-08-19 10:26:36 -07:00
Gergely Szilvasy	a02b37dccf	relax test_lut rtol (#3016 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3016 To resolve test failure for raft nightly. Reviewed By: mdouze Differential Revision: D48465199 fbshipit-source-id: 6bfa3c585e6c3d540e1f58a6351e61d96a54e0c0	2023-08-18 03:47:19 -07:00
Matthijs Douze	69cb877683	Fix memory leak for ParameterSpace objects (#3007 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3007 There is a complicated interaction between SWIG and the python wrappers where the ownership of ParameterSpace arguments was stolen from Python. This diff adds a test, fixes that behavior and fixes the referenced_objects construction Reviewed By: mlomeli1 Differential Revision: D48404252 fbshipit-source-id: 8afa9e6c15d11451c27864223e33ed1187817224	2023-08-17 12:51:29 -07:00
Gergely Szilvasy	e3731f7886	faiss-gpu-raft, the missing bits (#3009 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3009 1. Added the nightly build trigger, duh! 2. Run test_partitioning in fbcode Reviewed By: mdouze Differential Revision: D48425784 fbshipit-source-id: 58db0bd86d2673507b5d5ce2cb8b890713f9d919	2023-08-17 03:05:03 -07:00
qmc20234	88b7255830	fix argument error (#2965 ) Summary: the argument in IndexIVFPQ constructor should be pq.M, not code_size Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2965 Reviewed By: algoriddle Differential Revision: D48024513 Pulled By: mdouze fbshipit-source-id: d5cb92a32bbcb647ee12a4bc6b026059c20740db	2023-08-16 12:21:33 -07:00
Gergely Szilvasy	2768fb38b2	faiss-gpu-raft package (#2992 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2992 Reviewed By: mdouze Differential Revision: D48391366 Pulled By: algoriddle fbshipit-source-id: 94b7f62afc8a09a9feaea47bf60e5358d89fcde5	2023-08-16 09:30:41 -07:00
Maria Lomeli	c09992bc8a	Back out "Better NaN handling" (#3006 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3006 Original commit changeset: 99e7786582e9 Original Phabricator Diff: D48031390 Reviewed By: algoriddle Differential Revision: D48353221 fbshipit-source-id: fd326f2a45d20f68507ca39a33a325528651b37d	2023-08-15 09:32:01 -07:00
Fernando Gasperi	e3deb71cdb	Enable for faiss tests (#3002 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3002 title Reviewed By: jbardini Differential Revision: D48266242 fbshipit-source-id: b53e186f1954916a90dc8dbba67963f40d0aead7	2023-08-14 08:03:40 -07:00
Gergely Szilvasy	ef7e945b4d	remove avx2 from raft cmake contbuild Summary: Unnecessary for contbuild and doubles the build time. Reviewed By: mlomeli1 Differential Revision: D48148734 fbshipit-source-id: ca44a1e328ce6980c8a867a33ce311fe6eeb90e0	2023-08-08 11:44:14 -07:00
Matthijs Douze	687457b2f4	Access graph structure for NSG (#2984 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2984 It is not entirely trivial to access the NSG graph structure from Python (although it is a fixed size N-by-K matrix of vector ids). This diff adds an inspect_tools function to do that. Reviewed By: algoriddle Differential Revision: D48026775 fbshipit-source-id: 94cd7be7f656bcd333d62586531f287ea8e052e5	2023-08-04 06:55:24 -07:00
Gergely Szilvasy	da16d9d3ca	simplify raft build (#2983 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2983 Reviewed By: mdouze Differential Revision: D48063550 Pulled By: algoriddle fbshipit-source-id: c67e13cec97f4de8cc30cae47186593dbe0bdadb	2023-08-04 06:52:07 -07:00
Matthijs Douze	a3fbf2d61c	Better NaN handling (#2986 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2986 A NaN vector is a vector with at least one NaN (not-a-number) entry. After discussion in the Faiss team we decided that: - training should throw an exception on NaN vectors - added NaN vectors should be ignored (never returned) - searched NaN vectors should return only -1s This diff implements this for a few common index types + adds relevant tests. Reviewed By: algoriddle Differential Revision: D48031390 fbshipit-source-id: 99e7786582e91950e3a53c1d8bcffdd00b6afd24	2023-08-04 06:51:06 -07:00
generatedunixname89002005325676	a4ddb18605	Daily `arc lint --take CLANGFORMAT` Reviewed By: 0x1eaf Differential Revision: D47985815 fbshipit-source-id: 47bbe26ec689ac5521fe94ab52d174c60ded2ba5	2023-08-02 07:34:56 -07:00
Maria	35dac924d1	Added version to nightly install (#2982 ) Summary: The gpu nightly package install command did not install v1.7.4, see [P801820926](https://www.internalfb.com/intern/paste/P801820926) Adding the version fixes this issue, see [P801849181](https://www.internalfb.com/intern/paste/P801849181) Funnily enough, faiss-cpu nightly command works fine, see [P801848411](https://www.internalfb.com/intern/paste/P801848411) Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2982 Reviewed By: mdouze Differential Revision: D47952190 Pulled By: mlomeli1 fbshipit-source-id: 2185197e0a513c7da441d791c0b373f06f570f62	2023-08-01 12:14:35 -07:00
Alexandr Guzhva	5a95d47858	Upgrade AVX2 code for SQ8 (#2942 ) Summary: More efficient code for SQ8 for AVX2. For clang-15, improves the number of instructions per cycle (IPC) from 2.49 to 3.20 Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2942 Reviewed By: algoriddle Differential Revision: D47946167 Pulled By: mdouze fbshipit-source-id: da864bac8d452f2eb111ca356e54a8a69cd03dbf	2023-08-01 06:08:44 -07:00
youcheng huang	0aae4d3eec	fix hnsw shrink_neighbor_list comment (#2980 ) Summary: This pr is to fix the issue https://github.com/facebookresearch/faiss/issues/2978 . Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2980 Reviewed By: mdouze Differential Revision: D47950592 Pulled By: mlomeli1 fbshipit-source-id: 32ef06c3775f7234a5a4bb4dab36c176edea2d1f	2023-08-01 05:01:30 -07:00
Corey J. Nolet	7bf714928c	Adding `libraft` dependency to speed up compile times with `USE_RAFT` (#2958 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2958 Reviewed By: mlomeli1, mdouze Differential Revision: D47678341 Pulled By: algoriddle fbshipit-source-id: 2ab2d0e8349498faa0fc59ac9800da29a201c766	2023-07-31 07:37:27 -07:00
Gergely Szilvasy	726143d056	install libraft for cmake build (#2968 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2968 Reviewed By: mlomeli1, mdouze Differential Revision: D47677660 Pulled By: algoriddle fbshipit-source-id: 8fad8323ea3c0a264149c76fc9519d9c63346d00	2023-07-31 07:37:27 -07:00
Gergely Szilvasy	821a401ae9	CodeSet for deduping large datasets (#2949 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2949 A more scalable alternative to `np.unique` for deduping large datasets with a quantized code. Reviewed By: mlomeli1 Differential Revision: D47443953 fbshipit-source-id: 4a1554d4d4200b5fa657e9d8b7395bba9856a8e3	2023-07-19 10:05:46 -07:00
Matthijs Douze	43d86e3073	Relax IVF AQ FastScan (#2940 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2940 This test fails on some occasions. After investigation it turns out this is due to non-reproducible behavior in IndexIVFFastScan::search_implem_14 with a parallel loop, where there are ties in the results (i.e. the resulting distances are the same but not the ids). As a workaround I relaxed the test slightly. + a fix in the checksum function. Reviewed By: algoriddle Differential Revision: D47229086 fbshipit-source-id: 55e53bcfe47cf33041cc7fd5691b5de65067ce0f	2023-07-05 21:51:12 -07:00
Maria	a757806ae9	added blas=1.0=mkl to INSTALL (#2939 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2939 Reviewed By: algoriddle Differential Revision: D47229098 Pulled By: mlomeli1 fbshipit-source-id: 91761499d9cd13ecafe12186ddbd80224c2e7410	2023-07-05 10:05:19 -07:00
Sid Jha	d48e777412	Fix import (#2936 ) Summary: Previous import does not exist. Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2936 Reviewed By: mlomeli1 Differential Revision: D47221019 Pulled By: mdouze fbshipit-source-id: 9ceeba229a10dd4b66da3483cc7695b198e1a8d8	2023-07-05 06:59:05 -07:00
Matthijs Douze	1c1d5c808f	Make tests a little less verbose Summary: Useful info on github test runs is buried in spurious logging. Avoid this. Reviewed By: mlomeli1 Differential Revision: D47209139 fbshipit-source-id: b5111c91e2b94f0c3678d599197f8e7094993df1	2023-07-04 07:02:53 -07:00
Richard Barnes	4bfdd4324f	Parallelize kernel compilation in FAISS (#2922 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2922 This parallelizes kernel compilation by taking a template function from much deeper in the stack than was previously the case and generating 128 compilation units rather than the original 8. Reviewed By: mdouze Differential Revision: D46674315 fbshipit-source-id: 830eeaf43dee2c081f735be47c809b28aa3a05f6	2023-06-30 01:30:01 -07:00
Matthijs Douze	a91a2887fe	use dispatcher function to call HammingComputer (#2918 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2918 The HammingComputer class is optimized for several vector sizes. So far it's been the caller's responsibility to instantiate the relevant optimized version. This diff introduces a `dispatch_HammingComputer` function that can be called with a template class that is instantiated for all existing optimized HammingComputers. Reviewed By: algoriddle Differential Revision: D46858553 fbshipit-source-id: 32c31689bba7c0b406b309fc8574c95fa24022ba	2023-06-26 14:06:10 -07:00
Matthijs Douze	a27036aa72	add small benchmark for hamming computers Summary: to measure impact of hamming computer diff Reviewed By: algoriddle Differential Revision: D46913890 fbshipit-source-id: 7b9850205885b9b7c5f394f17a79ba222e7b1e2e	2023-06-26 14:06:10 -07:00
Gergely Szilvasy	391601dc3f	relax test_ivf_train_2level threshold (#2927 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2927 Reviewed By: mlomeli1 Differential Revision: D47017009 fbshipit-source-id: cfa1df4b9632b085d3a61b56d8617bebd7e5aad6	2023-06-26 05:02:47 -07:00
Gergely Szilvasy	1d7c05de5f	raft nightly (#2926 ) Summary: Moving the raft build to a nightly, to remove the noise from the PR contbuilds. Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2926 Reviewed By: mlomeli1 Differential Revision: D47016318 Pulled By: algoriddle fbshipit-source-id: 3c60aa382b9aa68dcadb929e0e4afade13c9123e	2023-06-26 03:10:05 -07:00
Octavian Guzu	9126f863d4	Prevent snprintf vulnerability Summary: With a very big name for a `ParameterRange`, the `snprintf` call from `combination_name` can end up having a negative second parameter, causing a memory overflow, which can lead to a serious security issue. We check that the second parameter is always >= 0 and throw an exception if not. See the new GTEST. Reviewed By: mdouze Differential Revision: D46856956 fbshipit-source-id: 91c657ec028c462d4b808b595811342034e00133	2023-06-23 08:52:20 -07:00
Richard Barnes	8ac4e41983	Switch //faiss/gpu to use templates instead of macros (#2914 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2914 The macros are part of a system to reduce compilation time via separate compilation units. Unfortunately, the parallelization is across C++ template functions instead of NVCC invocations on kernel compilation, which would be much more effective. This diff removes the preprocessor macros and expands them into templates. Compilation time after this diff is given by [this buck2 output](https://www.internalfb.com/buck2/ae9e6b28-a1bd-4d46-8af8-2895e6f182c8) with 1,043s through impl/scan/IVFInterleaved2048.cu Reviewed By: mdouze Differential Revision: D46549341 fbshipit-source-id: 5c3457876fd649e03ebeac89e4d1713f091ee9f5	2023-06-21 08:04:58 -07:00
Gergely Szilvasy	e0741ca5d7	fix for lib/jvm/languages/python/bin/conda no such file (#2917 ) Summary: environment: line 9: /opt/conda/lib/jvm/languages/python/bin/conda: No such file or directory Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2917 Reviewed By: mdouze Differential Revision: D46841321 Pulled By: algoriddle fbshipit-source-id: bdfbc16fbf422406c5195293dd4730f71a261e40	2023-06-21 00:29:51 -07:00
Gergely Szilvasy	f69b1db60a	update installation instructions with notes about mkl and the nvidia channel Reviewed By: mdouze Differential Revision: D46844223 fbshipit-source-id: 1a0862c160f2c9656db68b80475712815ee81daa	2023-06-19 11:47:31 -07:00
Matthijs Douze	07fe2b622f	Binary cloning and GPU range search (#2916 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2916 Overall better support for binary indexes: - cloning (to CPU and GPU), only for BinaryFlat for now - fix bug in reconstruct_n - range_search_max_results Reviewed By: algoriddle Differential Revision: D46755778 fbshipit-source-id: 777ad90aff5c54a77f9685ed6512247a922c6ef5	2023-06-19 06:05:14 -07:00
Gergely Szilvasy	e153cac419	fix the osx nightly build (#2896 ) Summary: Based on comments in https://github.com/conda/conda-build/issues/4498 Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2896 Reviewed By: mdouze Differential Revision: D46802512 Pulled By: algoriddle fbshipit-source-id: 7449b2f0db08fdd793770a44afb659d7ac28e3cd	2023-06-16 13:01:17 -07:00
Gergely Szilvasy	092606b293	bbs producer/consumer threading (#2901 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2901 This diff allows each GPU to work independently, a hot centroid (eg. out-of-distribution queries that hit a centroid heavily) will only block the one GPU that is processing it, others will continue to pick up work independently. Reviewed By: mdouze Differential Revision: D46521298 fbshipit-source-id: 171cb06cce8b2d16b7bd744799b105b3cd525be3	2023-06-14 07:58:44 -07:00
I	d8a6350607	Update docs (C++11 -> C++17) (#2907 ) Summary: following https://github.com/facebookresearch/faiss/issues/2899 This PR doesn't affect the software behavior Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2907 Reviewed By: mdouze Differential Revision: D46720499 Pulled By: algoriddle fbshipit-source-id: 00b47baf526a94449e2b1c9ca5fcd4cf961f6f17	2023-06-14 05:06:15 -07:00
Gergely Szilvasy	6951466b43	raft enabled cmake build (#2898 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2898 Reviewed By: mdouze Differential Revision: D46561295 Pulled By: algoriddle fbshipit-source-id: b9806c0c52acf82124c3b2e0095b1c1979318dcd	2023-06-13 08:43:18 -07:00
Richard Barnes	27ffd14ae4	Use C++17 [[fallthrough]] in faiss/utils/distances_simd.cpp (#2913 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2913 Reviewed By: algoriddle Differential Revision: D46603510 fbshipit-source-id: 374d530d79176ac553b40d5ad04bf83d4920b107	2023-06-12 15:07:08 -07:00
Richard Barnes	100beb8565	Use C++17 [[fallthrough]] in faiss/utils/hamming_distance/avx2-inl.h Reviewed By: mdouze Differential Revision: D46603512 fbshipit-source-id: fa4bab4d24f5c9e2a3506f2a67d3a7db2a01512f	2023-06-12 08:19:22 -07:00
Richard Barnes	463ffd8e28	Indicate that fallthrough is intentional in faiss (#2897 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2897 Reviewed By: algoriddle Differential Revision: D46385243 fbshipit-source-id: f08b16c9db91edca53cdbf0932a990c8c1f9d0db	2023-06-08 12:22:11 -07:00
Taras Tsugrii	8ec166c9fd	Simplify non-optimal points removal. Summary: This version is more concise and doesn't need a new scope to reduce visibility of local variable `i`. Created from CodeHub with https://fburl.com/edit-in-codehub Reviewed By: mdouze Differential Revision: D46431189 fbshipit-source-id: 5bbe8df6014d8e25aeb8d5d15145b703e9651327	2023-06-08 08:50:28 -07:00
Taras Tsugrii	f82298ffe5	Remove unused unordered_map include. (#2900 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2900 This makes builds brittle and slows down builds. Reviewed By: algoriddle Differential Revision: D46445595 fbshipit-source-id: 03a02e274922dd6215e467ead148890d79b3c2f8	2023-06-07 12:39:24 -07:00
Gergely Szilvasy	451f6cdbe5	c++ 17 (#2899 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2899 Reviewed By: mlomeli1 Differential Revision: D46521588 Pulled By: algoriddle fbshipit-source-id: 6ac4b9d7590329317455d35256cab9dc820dfccf	2023-06-07 09:10:11 -07:00
I	9c884225c1	Some changes to simdlib (#2885 ) Summary: - Use an elementwise operation and a reduction once instead of an across-vector comparison twice - Use already implemented supporting functions - Unify the semantics of `operator==` with those of `simd16uint16` - `operator==` of `simd8uint32` and `simd8float32` was implemented in https://github.com/facebookresearch/faiss/issues/2568, but it does not have the same semantics as that of `simd16uint16` (which was implemented a long time ago). To get vector equality as a `bool`, use the `is_same_as` member function. - Change `is_same_as` to accept any vector type as argument for `simdlib_neon` - `is_same_as` already supported any vector type on `simdlib_avx2` and `simdlib_emulated` - Remove unused function `simd16uint16::is_same` on `simdlib_avx2` - Is it a typo for `is_same_as`? In any case it seems unlikely to be used Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2885 Reviewed By: mdouze Differential Revision: D46330666 Pulled By: alexanderguzhva fbshipit-source-id: 0ea14f8e9a8bda78f24a655219dffe3e07fc110f	2023-06-01 07:39:02 -07:00

877 Commits