168 Commits

Author SHA1 Message Date
q.yao
5fdf00324b
[Fix] add bounds to avoid large resource usage of nms operator on jetson (#1686)
* fix trt nms jetson

* update-for-comment

* clang format
2023-02-01 14:11:43 +08:00
lvhan028
b101a4af65
[Enhancement] remove MMDEPLOY_BUILD_SDK_CXX_API option (#1688)
* remove MMDEPLOY_BUILD_SDK_CXX_API option

* update

* update
2023-01-31 13:29:59 +08:00
Li Zhang
3d425bbb9f
[Feature] Pose tracker C/C++/Python API&demos (#1663)
* add PoseTracker API

* add mahalanobis distance, add det_pose demo

* simplify api

* simplify api

* fix cmake & fix `CropResizePad`

* ignore out of frame bboxes

* clean-up

* fix lint

* add c api docs

* add c++ api docs/comments

* fix gcc7 build

* fix gcc7+opencv3

* fix stupid lint

* fix ci

* add help info & webcam support for C++ pose tracker demo

* add webcam support for Python pose tracker demo

* fix lint

* minor

* minor

* fix MSVC build

* fix python binding

* simplify module adapter

* fix module adapter

* minor fix
2023-01-31 11:24:24 +08:00
AllentDan
093badf90c
fix rknn output index error in SDK (#1683) 2023-01-30 20:50:06 +08:00
q.yao
8a050f10dc
suppress onnx optimizer warning (#1593) 2023-01-20 00:25:50 +08:00
Li Zhang
8bb3fcc6d8
fix 'cvtcolor' error in the preprocessing of single channel images (#1666) 2023-01-20 00:04:42 +08:00
Chen Xin
c458e2a524
[Enhancement] Speedup TopDownAffine by CropResizePad (#1486)
* "use 'CropResizePad' to speed up topdownaffine"

* add missing header
2023-01-13 16:08:29 +08:00
Chen Xin
9a1f4e6145
[Fix] Fix example standalone build for msvc (#1626)
* fix example build for msvc

* move /Zc:__cplusplus to core
2023-01-11 10:55:17 +08:00
Li Zhang
e4ad0d4c45
[Fix] Fix aligned allocations on Android (#1612)
* fix android alignment

* fix typo

* fix size
2023-01-06 18:04:28 +08:00
Chen Xin
c0ca074c11
Fix build error on windows-cuda platform (#1598) 2023-01-03 12:12:20 +08:00
Li Zhang
20e0563682
[Enhancement] Optimize pose tracker (#1460)
* sync master

* suppress overlapped tracks

* add CUDA WarpAffine

* export symbols

* fix linkage

* update pose tracker

* clean-up

* fix MSVC build

* fix MSVC build

* add ffmpeg cli command
2022-12-29 19:12:55 +08:00
hanrui1sensetime
f21dc4e7d3
Cherry-pick #1352 to fix ops unittest seg-fault error (#1556)
* cherry-pick PR1352 to master

* fix test_ops with teardown and skip

* remove useless line

* fix lint

Co-authored-by: q.yao <yaoqian@sensetime.com>
2022-12-29 12:08:40 +08:00
AllentDan
85b7b967ee
[Feature] Support probability output for segmentation (#1379)
* add do_argmax flag

* sdk support

* update doc

* replace do_argmax with with_argmax

* add todo
2022-12-26 15:48:07 +08:00
q.yao
d113a5f1c7
[Refactor] refactor is_available, check_env (#1540)
* refactor is available

* remove try catch in apis

* fix trt check env

* fix ops_info

* update default value

* remove backend list

* optimial pycuda

* update requirement, check env for rknn
2022-12-23 12:06:32 +08:00
Michał Antoszkiewicz
202bf00eb7
Fix 'cannot seek vector iterator' in debug windows build (#1555)
Signed-off-by: Michal Antoszkiewicz <mantoszkiewicz@codeflyers.com>

Signed-off-by: Michal Antoszkiewicz <mantoszkiewicz@codeflyers.com>
2022-12-20 10:13:33 +08:00
Chen Xin
dbc4b26dc1
fix cuda10.2 build (#1539) 2022-12-16 10:15:15 +08:00
GY
05ed8e16ea
update to ppl.nn v0.9.1 and ppl.cv v0.7.1 (#1356) 2022-12-13 14:21:24 +08:00
q.yao
7cb4b9b18a
[Enhancement] Support tvm (#1216)
* finish framework

* add autotvm and auto-scheduler tuner

* add python deploy api

* add SDK net(WIP

* add sdk support

* support det, support vm

* fix vm sdk

* support two stage detector

* add instance seg support

* add docstring

* update docs and ut

* add quantize

* update doc

* update docs

* synchronize stream

* support dlpack

* remove submodule

* fix stride

* add alignment

* support dlpack

* remove submodule

* replace exclusive_scan

* add backend check

* add build script

* fix comment

* add ci

* fix ci

* ci fix2

* update build script

* update ci

* add pytest

* update sed command

* update sed again

* add xgboost

* remove tvm ut

* update ansor runner

* add stream sync

* fix topk

* sync default stream

* fix tvm net

* fix window
2022-12-12 21:19:40 +08:00
q.yao
ac47cad407
[Improvements] Support TorchAllocator as TensorRT Gpu Allocator (#1493)
* add TorchAllocator for TensorRT

* check mdcn input shape
2022-12-12 18:43:59 +08:00
Chen Xin
52fd4fe9f3
[Fix] Remove cudnn dependency for transform 'mmaction2::format_shape' (#1509)
* fix format shape

* merge common code

* use throw_exception

* udpate code format
2022-12-12 14:34:15 +08:00
Chen Xin
0830acb40c
[FIX] Fix csharp net48 and batch inference (#1370)
* fix csharp net48

* add missing file

* update

* fix batch inference

* update demo

* update

* update version
2022-11-29 11:48:36 +08:00
hanrui1sensetime
9ea8610133
[Fix] fix ncnn torch 1.12 master (#1430)
* fix ncnn torch 1.12 master

* remove debug line

* add docstring
2022-11-28 17:34:39 +08:00
Li Zhang
d77aeaa480
[Refactor] Decouple preprocess operation and transformation (#1353)
* refactor SDK registry

* fix lint

* decouple transform logic and operations

* data management

* improve data management

* improve data management

* context management

* fix ResizeOCR

* fix operation fallback logic

* fix MSVC build

* clean-up

* sync master

* fix lint

* Normalize - add `to_float`, merge `cvtcolor` operations

* fix macOS build

* rename

* cleanup

* fix lint

* fix macOS build

* fix MSVC build

* support elena

* fix

* fix

* optimize normalize

* fix

* fix MSVC build

* simplify

* profiler

* use `throw_exception`

* misc

* fix typo
2022-11-28 14:46:05 +08:00
Chen Xin
3d1c135297
[Enhancement] refactor profiler (#1403)
* reduce profile node name

* add profiler for pipeline

* add profiler for cond

* udpate
2022-11-28 10:44:54 +08:00
Li Zhang
6468ef180d
[Fix] Relax module adapter template constraints (#1366)
* relax module adapter constraint

* remove forwarding `operator()`
2022-11-27 11:58:09 +08:00
Li Zhang
4d4c10a2dc
[Enhancement] Avoid copying dense arrays in Python API (#1349)
* eliminate copying for segmentor

* fix segmentor

* eliminate copying in Python API

* minor fix
2022-11-24 18:23:34 +08:00
AllentDan
301035a06f
[Fix] fix cls head in SDK (#1420)
* fix cls head

* resolve comments
2022-11-24 14:15:34 +08:00
Li Zhang
b5b0dcfcff
[Fix] Support onnxruntime-1.13 (#1407)
* support onnxruntime-1.13

* fix lint
2022-11-22 20:25:44 +08:00
AllentDan
4dd4d4851b
Add rv1126 yolov3 support to sdk (#1280)
* add yolov3 head to SDK

* add yolov5 head to SDK

* fix export-info and lint, add reverse check

* fix lint

* fix export info for yolo heads

* add output_names to partition_config

* fix typo

* config

* normalize config

* fix

* refactor config

* fix lint and doc

* c++ form

* resolve comments

* fix CI

* fix CI

* fix CI

* float strides anchors

* refine pipeline of rknn-int8

* config

* rename func

* refactor

* rknn wrapper dict and fix typo

* rknn wrapper output update,  mmcls use end2end type

* fix typo
2022-11-22 20:16:22 +08:00
Chen Xin
cdb6b46955
Sdk profiler (#1274)
* sdk-profiler

* fix lint

* support lift

* sync net module when profile

* use Scope*

* update use task name

* fix

* use std::unique_ptr<Event>

* remove mmdeploy::graph link for c and transform

* fix

* fix

* fix
2022-11-21 12:52:21 +08:00
Li Zhang
99040d5655
[Refactor] better SDK registry (#1368)
* refactor SDK registry

* fix lint

* fix typo

* sync

* use nested namespace

* rename
2022-11-15 21:06:13 +08:00
Mingcong Han
ff7b8fb176
[FIX] set stream argument when using async memcpy (#1314) 2022-11-09 13:41:41 +08:00
Li Zhang
b49cf42220
[Enhancement] Avoid copying dense arrays in C API (#1261)
* reduce copying dense array in C API

* format

* fix detector

* fix MSVC build

* simplify
2022-11-07 22:01:31 +08:00
lvhan028
625593d6f3
[Feature] Support rv1126 in sdk (#1238)
* tmp

* refine

* update ssd-lite

* tmp

* tmp

* 0.1

* 0.1.1

* rename to base_dense_head

* remove debug code

* wait stream

* update cmakelists

* add some comments

* fix lint

* fix ci error

* fix according reviewer comments

* update params

* fix

* support normalize with to_float being false

* fix lint

* support rv1126 build ci

* support rv1126 build ci

* change debug level

* fix ci

* update

* update doc

* fix circleci error

* update normalize

* update

* check in build script

* change name
2022-11-07 11:13:47 +08:00
Chen Xin
940fffa075
fix some errors (#1309) 2022-11-04 22:38:28 +08:00
Chen Xin
d8e6229dc5
Support mmaction master (#1183)
* cpu format shape

* convert model

* python api

* speedup dataloader

* minor

* add cpp demo

* add visualize

* fix resize param order

* export pipeline.json

* fix three crop

* read SampleFrames from model_cfg

* minor

* lint

* move to a func

* speed up format shape cpu

* use input mat device

* fix comments

* fix comments

* update docs/benchmark

* docs/supported-codebases

* update tests/data

* fix lint

* fix lint
2022-11-04 14:15:36 +08:00
Li Zhang
f2be2abeb5
[Feature] Add Cond node and pose tracker demo (#1186)
* add Cond node

* WIP PoseTracker

* fix pose tracker

* minor fix

* simplify design

* add timing

* sync

* visualize

* remove file check
2022-10-27 14:52:04 +08:00
lvhan028
f051a31e0f
make onnxruntime(gpu) available in SDK (#1253) 2022-10-26 19:23:42 +08:00
AllentDan
3eb60ea584
[Feature] Add RKNN support to SDK (#1145)
* add rknn_net [WIP]

* add cmake

* enable mmcls

* remove toTensor in SDK pipeline

* update doc

* translate to Chinese

* update doc and add tool-chain cmake

* use ::framework

* fix lint

* doc and print log

* data map

* refine install doc

* add rknpu2 workflow

* update gcc yaml

* better cmake file

* update doc link

* use vector instead of array

* better env variable

* use soft link

* release ctx

* name rule
2022-10-18 17:52:31 +08:00
Jiahao Sun
8e634059a1
[Feat] Support Monocular 3D Detection and FCOS3D Deployment (#1047)
* add monodet task

* format monodet

* format monodet

* sort test_monocular_detection_model.py import

* add fcos3d deploy

* change doc support model & fcos3d UT

* fix test monodet UT bug & remove ONNXBEVNMS op
2022-10-18 11:23:39 +08:00
tpoisonooo
ace44ae9d9
improvement(scripts): cross build aarch64 (#1126)
* udpate

* update

* CI(scripts): add auto cross build aarch64

* docs(scripts): add zh_cn doc

* docs(scripts): update

* docs(scripts): update

* fix(tools): update

* docs(zh_cn): update

* fix(scripts): remove gcc-7

* docs(scripts): update result

* udpate

* fix(tools): remove useless option

* docs(en): typo

* Update cross_build_aarch64.md

* Update cross_build_aarch64.md

* fix(tools/scripts): review advices

* fix(tools/scripts): update

* fix(cmake): remove useless option

* Update aarch64-linux-gnu.cmake
2022-10-17 11:15:29 +08:00
Li Zhang
8c164ab3e5
[Fix] Detect filesystem library in CMake (#1190)
* detect filesystem library in CMake

* add filesystem.cmake
2022-10-14 12:52:16 +08:00
Li Zhang
f389a68dd4
[Enhancement] Add a device field for mmdeploy_mat_t (#1176)
* add device field for mmdeploy_mat_t

* fix lint
2022-10-11 13:58:49 +08:00
tpoisonooo
5cdf29339b
feat(tools/scripts): add auto install jetson orin (#1105)
* feat(tools/scripts): add auto install jetson orin

* docs(script): rename file

* fix(script): if clause error

* style(scripts): typo
2022-10-09 15:53:19 +08:00
lvhan028
613acb366c
fix prebuilt error (#1123) 2022-09-28 17:52:09 +08:00
Chen Xin
0720ed8404
fix tracer (#1107) 2022-09-27 15:24:57 +08:00
Li Zhang
bf887cc8bc
[Enhancement] Refactor SDK pipeline (#938)
* unify C API naming

* fix demo and move apis/c/* -> apis/c/mmdeploy/*

* fix lint

* WIP refactor pipeline

* backward compatibility

* working pipeline demo

* add text det-recog demo

* add det-pose demo

* fix build

* fix demo

* add environment interface

* add environment to pass scheduler & model info at runtime

* update demos

* add pipeline API for Python

* fix `FromPyObject`

* fix for opencv-4.2

* environment -> context, improve pipeline

* python model interface

* fix cmake

* fix python & cmake

* context & C++ pipeline API

* minor fix

* improve API

* fix shared libs

* refresh C/Python API

* propagate context

* fix  python demo

* fix

* add namespace

* fix namespace

* fix mis-changed strings

* fix

* fix python api

* rename

* clean-up

* fix pose detector

* clean-up

* clean-up

* clean-up

* fix python API build

* fix CI

* fix lint

* fix lint

* fix lint & demo

* install pipeline.hpp

* fix MSVC shared library build

* fix sample

* fix MSVC monolithic build

* minor fix
2022-09-26 16:11:14 +08:00
q.yao
0caeaf238c
[CustomOps] TensorRT Gather Topk Ops (#1033)
* add gather topk

* add shape inference and document

* fix faster rcnn

* reshape topk

* fix
2022-09-19 13:48:26 +08:00
Li Zhang
b602356181
[Fix] Add isolated option for TorchScript SDK backend (#1002)
* add option for TorchScript SDK backend

* add doc

* format
2022-09-06 12:24:07 +08:00
Chen Xin
6b01a2e649
[Feature] Add option to fuse transform. (#741)
* add collect_impl.cpp to cuda device

* add dummy compute node wich device elena

* add compiler & dynamic library loader

* add code to compile with gen code(elena)

* move folder

* fix lint

* add tracer module

* add license

* update type id

* add fuse kernel registry

* remove compilier & dynamic_library

* update fuse kernel interface

* Add elena-mmdeploy project in 3rd-party

* Fix README.md

* fix cmake file

* Support cuda device and clang format all file

* Add cudaStreamSynchronize for cudafree

* fix cudaStreamSynchronize

* rename to __tracer__

* remove unused code

* update kernel

* update extract elena script

* update gitignore

* fix ci

* Change the crop_size to crop_h and crop_w in arglist

* update Tracer

* remove cond

* avoid allocate memory

* add build.sh for elena

* remove code

* update test

* Support bilinear resize with float input

* Rename elena-mmdeploy to delete

* Introduce public submodule

* use get_ref

* update elena

* update tools

* update tools

* update fuse transform docs

* add fuse transform doc link to get_started

* fix shape in crop

* remove fuse_transform_ == true check

* remove fuse_transform_ member

* remove elena_int.h

* doesn't dump transform_static.json

* update tracer

* update CVFusion to remove compile warning

* remove mmcv version > 1.5.1 dep

* fix tests

* update docs

* add elena use option

* remove submodule of CVFusion

* update doc

* use auto

* use throw_exception(eEntryNotFound);

* update

Co-authored-by: cx <cx@ubuntu20.04>
Co-authored-by: miraclezqc <969226879@qq.com>
2022-09-05 20:29:18 +08:00