Official DeiT repository

Data-Efficient architectures and training for Image classification

This repository contains PyTorch evaluation code, training code and pretrained models for the following papers:

DeiT Data-Efficient Image Transformers, ICML 2021 [bib]
@InProceedings{pmlr-v139-touvron21a,
  title =     {Training data-efficient image transformers & distillation through attention},
  author =    {Touvron, Hugo and Cord, Matthieu and Douze, Matthijs and Massa, Francisco and Sablayrolles, Alexandre and Jegou, Herve},
  booktitle = {International Conference on Machine Learning},
  pages =     {10347--10357},
  year =      {2021},
  volume =    {139},
  month =     {July}
}
CaiT (Going deeper with Image Transformers), ICCV 2021 [bib]
@InProceedings{Touvron_2021_ICCV,
    author    = {Touvron, Hugo and Cord, Matthieu and Sablayrolles, Alexandre and Synnaeve, Gabriel and J\'egou, Herv\'e},
    title     = {Going Deeper With Image Transformers},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {32-42}
}
ResMLP (ResMLP: Feedforward networks for image classification with data-efficient training), TPAMI 2022 [bib]
@article{touvron2021resmlp,
  title={ResMLP: Feedforward networks for image classification with data-efficient training},
  author={Hugo Touvron and Piotr Bojanowski and Mathilde Caron and Matthieu Cord and Alaaeldin El-Nouby and Edouard Grave and Gautier Izacard and Armand Joulin and Gabriel Synnaeve and Jakob Verbeek and Herv\'e J\'egou},
  journal={arXiv preprint arXiv:2105.03404},
  year={2021},
}
PatchConvnet (Augmenting Convolutional networks with attention-based aggregation) [bib]
@article{touvron2021patchconvnet,
  title={Augmenting Convolutional networks with attention-based aggregation},
  author={Hugo Touvron and Matthieu Cord and Alaaeldin El-Nouby and Piotr Bojanowski and Armand Joulin and Gabriel Synnaeve and Jakob Verbeek and Herve Jegou},
  journal={arXiv preprint arXiv:2112.13692},
  year={2021},
}
3Things (Three things everyone should know about Vision Transformers), ECCV 2022 [bib]
@article{Touvron2022ThreeTE,
  title={Three things everyone should know about Vision Transformers},
  author={Hugo Touvron and Matthieu Cord and Alaaeldin El-Nouby and Jakob Verbeek and Herve Jegou},
  journal={arXiv preprint arXiv:2203.09795},
  year={2022},
}
DeiT III (DeiT III: Revenge of the ViT), ECCV 2022 [bib]
@article{Touvron2022DeiTIR,
  title={DeiT III: Revenge of the ViT},
  author={Hugo Touvron and Matthieu Cord and Herve Jegou},
  journal={arXiv preprint arXiv:2204.07118},
  year={2022},
}
Cosub (Co-training 2L Submodels for Visual Recognition), CVPR 2023 [bib]
@article{Touvron2022Cotraining2S,
  title={Co-training 2L Submodels for Visual Recognition},
  author={Hugo Touvron and Matthieu Cord and Maxime Oquab and Piotr Bojanowski and Jakob Verbeek and Herv\'e J\'egou},
  journal={arXiv preprint arXiv:2212.04884},
  year={2022},
}
If you find this repository useful, please consider giving it a star and citing the relevant papers.
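
As a rough illustration of how the pretrained models mentioned above might be loaded, the sketch below goes through `torch.hub`, which reads the repository's `hubconf.py`. It assumes that the upstream GitHub repository is `facebookresearch/deit` with a `main` branch, that `torch` and `timm` are installed, and that network access is available. The helper name `load_deit` is hypothetical and not part of the repository.

```python
def load_deit(name="deit_base_patch16_224",
              pretrained=True,
              repo="facebookresearch/deit:main"):
    """Hypothetical helper: fetch a model entry point through torch.hub.

    `repo` is an assumption about where the upstream code lives; point it
    at your own fork or mirror if it differs.
    """
    # Imported lazily so that defining this helper does not require torch.
    import torch

    # torch.hub.load downloads the repo, reads its hubconf.py, and calls
    # the entry point called `name`, forwarding the keyword arguments.
    model = torch.hub.load(repo, name, pretrained=pretrained)

    # Switch to inference mode (disables dropout and similar layers).
    return model.eval()
```

Calling `load_deit()` downloads the code and weights on first use. Other entry points defined in the model files, such as `deit_base_distilled_patch16_384`, can be requested by passing their name, provided `hubconf.py` exposes them.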

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Contributing

We actively welcome your pull requests! Please see CONTRIBUTING.md and CODE_OF_CONDUCT.md for more info.