faiss

mirror of https://github.com/facebookresearch/faiss.git synced 2025-06-03 21:54:02 +08:00

Author	SHA1	Message	Date
Fernando Gasperi	e3deb71cdb	Enable for faiss tests (#3002 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/3002 title Reviewed By: jbardini Differential Revision: D48266242 fbshipit-source-id: b53e186f1954916a90dc8dbba67963f40d0aead7	2023-08-14 08:03:40 -07:00
Alexandr Guzhva	04ba8f97e0	Additional comparison facilities in simdlib (#2783 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2783 Add needed facilities for future top-k needs Reviewed By: mdouze Differential Revision: D44397732 fbshipit-source-id: 001f9baff0bd234e33f7d0a1da6dc6cb990b1844	2023-03-28 14:01:59 -07:00
Alexandr Guzhva	868e17f294	OSS legal requirements (#2698 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2698 Add headers about copyright. Reviewed By: algoriddle Differential Revision: D43085637 fbshipit-source-id: 5a57876b7047097ffe01cd79322674625d9bca34	2023-02-07 14:32:56 -08:00
Alexandr Guzhva	0b74765cca	Speedup exhaustive_L2sqr_blas for AVX2, ARM NEON and AVX512 (#2568 ) Summary: Pull Request resolved: https://github.com/facebookresearch/faiss/pull/2568 Add a fused kernel for exhaustive_L2sqr_blas() call that combines a computation of dot product and the search for the nearest centroid. As a result, no temporary dot product values are written and read in RAM. Speeds up the training of PQx[1] indices for dsub = 1, 2, 4, 8, and the effect is higher for higher values of [1]. AVX512 version provides additional overloads for dsub = 12, 16. The speedup is also beneficial for higher values of pq.cp.max_points_per_centroid (which is 256 by default). Speeds up IVFPQ training as well. AVX512 kernel is not enabled, but I've seen it speeding up the training TWICE versus AVX2 version. So, please feel free to use it by enabling AVX512 manually. Reviewed By: mdouze Differential Revision: D41166766 fbshipit-source-id: 443014e2e59396b3a90b9171fec8c8191052bcf4	2022-11-14 17:01:52 -08:00

4 Commits