* support custom op `mmcv::cummax` for onnxruntime in mmcv
* fix clang-format lint error
* support mmcv::cummin, reformat codes
* fix merge from master
* add docs for mmcv::cummax and mmcv::cummin
* format doc
* add assertion for torch version, when exporting `cummax` to onnx
* add more comments for torch version
* handle exporting to onnx in `soft_nms`
* commit for test_onnx
* remove `is_in_onnx_export` in softnms
* add more comments
* fix c++ lint error
* add known issues doc for `cummax`
* fix known issues doc
* supports for onnxruntime custom op `mmcv::MMCVTopPool`
* supports for onnxruntime custom op `mmcv::MMCVCornerPool`, involving TopPool, BottomPool, LeftPool and RightPool
* add unittest for corner_pool
* supports mmcv::CornerPool without memcpy
* add docs for mmcv::CornerPool
* re-add docs for mmcv::CornerPool
* fix output dtype doc
* reformat
* format with pre-commit
* format
* fix lint error, by using google clang-format style for c/c++
* support clipping min_lr in StepLrUpdaterHook
* add docstring for StepLrUpdaterHook
* fix small bugs
* add unit test for StepLrUpdaterHook
* fix linting error
* [Feature] Add PyTorch Profiler to analyze training process
* [Feature] Add PyTorch Profiler to analyze training process
* pring log and fix docstring
* add by_epoch
* modify docstring according to comment
* add torch.cuda.amp to fp16_utils and optimizers
* use with context manager for autocast
* add doc to explain the behavior differences between real amp and ours
* fix docstring
* fix raise error bug in registering multiple names
* fix bug in checking the type of name
* fix lint
* fix unit test for registry
* fix bug in unit test
* add c++ ms_deform_atten
* fix cpp lint
* fix cpp lint
* clang format
* remove cmakefile
* google style
* clang-format precommit
* use clang-format-lint-action
* add transformer base class
* add merge
* add docstr
* add pyargs
* fix according to commments
* resiger module
* change to use basemodule
* add _ between build function
* split the name
* fix according to comments
* fix lint and fix unitest
* fix cpp lint
* fix bug of deformdetr_atten
* fix drop out
* fix residual
* use CUDA_1D_KERNEL_LOOP
* add roi_align_rotated
* code format
* Add align key to roi align rotated
* Add clockwise for rotated roi align
* fix bugs in onnx export
* Add docstring for RoIAlignRotated
* remove cuda unittest
* Reformat c++ code
* add onnx roi align rotated file
* fix unittest
* Add cpu and float64 of cuda support for parrots
* code format
* Add unified header to roi align rotated
Co-authored-by: luopeichao <luopeichao@sensetime.com>
* replace floor/ceil to floorf/ceilf
to eliminate cuda compilation errors in the latest version of Visual Studio 16.9
* fix lint error
* fix lint
* fix lint
* Update .pre-commit-config.yaml
* Update .pre-commit-config.yaml
Co-authored-by: wangruohui <12756472+wangruohui@users.noreply.github.com>
* [Fix] OneCycleLrUpdaterHook interface
* revise according to comments
* revise according to comments
* add test
* fix lint
* revise according to comments
* minors
* add pytest param
* fix lint
* ci
* fix wandb logger drop result bug by delete step param
* add global_step in wandb log to help align train and val step log
* fix wandb hook test unit fail bug
* fix lint issue
* add with_step param of WandbLoggerHook in wandb.py
* add grid sample trt support
* fix align_corners=True
* change 'intep' to 'interp', fix bugs of grid sampler, better test script
* remove unused import
* move source comment to the top of trt_grid_sampler_kernel.cu