History

Lucas Hosseini 6d51766607 Fix unused variables in python Reviewed By: mdouze Differential Revision: D26633983 fbshipit-source-id: 32b9f95ed9647716f65b93f2713a8d5bad6abe78		2021-02-24 11:52:18 -08:00
..
README.md	…
bench_all_ivf.py	PQ4 fast scan benchmarks (#1555 )	2020-12-16 01:18:58 -08:00
bench_kmeans.py	PQ4 fast scan benchmarks (#1555 )	2020-12-16 01:18:58 -08:00
cmp_with_scann.py	Add missing copyright headers. (#1689 )	2021-02-16 09:11:30 -08:00
datasets.py	PQ4 fast scan benchmarks (#1555 )	2020-12-16 01:18:58 -08:00
make_groundtruth.py	PQ4 fast scan benchmarks (#1555 )	2020-12-16 01:18:58 -08:00
parse_bench_all_ivf.py	Fix unused variables in python	2021-02-24 11:52:18 -08:00
run_on_cluster_generic.bash	PQ4 fast scan benchmarks (#1555 )	2020-12-16 01:18:58 -08:00

Benchmark of IVF variants

This is a benchmark of IVF index variants, looking at compression vs. speed vs. accuracy. The results are in this wiki chapter

The code is organized as:

datasets.py: code to access the datafiles, compute the ground-truth and report accuracies
bench_all_ivf.py: evaluate one type of inverted file
run_on_cluster_generic.bash: call bench_all_ivf.py for all tested types of indices. Since the number of experiments is quite large the script is structued so that the benchmark can be run on a cluster.
parse_bench_all_ivf.py: make nice tradeoff plots from all the results.

The code depends on Faiss and can use 1 to 8 GPUs to do the k-means clustering for large vocabularies.

It was run in October 2018 for the results in the wiki.