# load_and_run for arm64-v8a

## Build MegEngine load_and_run

NOTICE: the build depends on the Android NDK. After downloading it, configure the environment:

```bash
export NDK_ROOT=path/to/ndk
export ANDROID_NDK_HOME=${NDK_ROOT}
export PATH=${NDK_ROOT}/toolchains/llvm/prebuilt/linux-x86_64/bin/:$PATH
```

Then build (only the v1.0.0 tag has been tested):

```bash
cd $MEGENGINE_HOME
git checkout v1.0.0
./scripts/cmake-build/cross_build_android_arm_inference.sh -a arm64-v8a -r
```
After a successful build, the binaries and libraries are installed to:

```
$MEGENGINE_HOME/build_dir/android/arm64-v8a/Release/install/bin
$MEGENGINE_HOME/build_dir/android/arm64-v8a/Release/install/lib
```
## Build MACE

```bash
cd $MACE_HOME
RUNTIME=GPU bash tools/cmake/cmake-build-arm64-v8a.sh
export SDKPATH=${MACE_HOME}/build/cmake-build/arm64-v8a/install
```

After a successful build, `libmace.so` should be at `$MACE_HOME/build/cmake-build/arm64-v8a/install/lib/libmace.so`.

If `SDKPATH` is not set, it defaults to `./arm64-v8a`.

You can build in debug mode (by adding `DEBUG=1` to the `make` command), which will print more running information.
## Dump the model

```bash
python3 dump_model.py --input path/to/resnet_50.pb --param path/to/resnet_50.data --output resnet_50.mdl --config path/to/resnet_50.yml
```

The `*.pb` file holds the model structure; the `*.data` file holds the model parameters. Check here to learn how to write yml files for MACE.
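The authoritative yml schema is MACE's own deployment-file documentation; as a rough sketch only, a config for the ResNet-50 example above might look like the following (every key, tensor name, and shape here is an assumption recalled from the MACE docs — verify against them before use):

```yaml
# Hypothetical MACE deployment config -- all keys/values are assumptions.
library_name: resnet_50
target_abis: [arm64-v8a]
model_graph_format: file
model_data_format: file
models:
  resnet_50:
    platform: tensorflow
    model_file_path: path/to/resnet_50.pb
    subgraphs:
      - input_tensors: [input]         # assumed input tensor name
        input_shapes: [1,224,224,3]    # assumed NHWC input shape
        output_tensors: [output]       # assumed output tensor name
        output_shapes: [1,1000]
    runtime: gpu
```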
## Run

First, push all the needed files to the target device (for example, to `/data/local/tmp/test/`).

MACE links against `c++_shared` by default, but old AOSP devices do not ship `libc++_shared.so`. On such devices you also need to push that library, which can be found at `${NDK_ROOT}/sources/cxx-stl/llvm-libc++/libs/arm64-v8a/libc++_shared.so`.

Then log in to the device and run:

```bash
cd /path/to/  # for example: /data/local/tmp/test/
MGB_MACE_RUNTIME=GPU MGB_MACE_OPENCL_CACHE_PATH=./ MGB_MACE_LOADER_FORMAT=NCHW LD_LIBRARY_PATH=. ./load_and_run resnet_50.mdl --c-opr-lib libmace_loader.so --input input-bs1.npy
```
- `MGB_MACE_RUNTIME` candidates: `CPU`, `GPU`, `HEXAGON`.
- `MGB_MACE_OPENCL_CACHE_PATH` is the directory the OpenCL binary cache is written to (the cache file name is always `mace_cl_compiled_program.bin`); if the cache file does not exist, it will be created.
- NCHW is the primary data format; if your model is NHWC, set `MGB_MACE_LOADER_FORMAT=NHWC`.
- For the CPU runtime, the default is 1 thread; it can be changed with `MGB_MACE_NR_THREADS=n`.
- Running with the HEXAGON runtime needs more setup; please check here.
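The `--input` flag of `load_and_run` takes a NumPy `.npy` tensor (`input-bs1.npy` in the command above). A minimal sketch for producing one, assuming a batch-1 float32 NCHW input of 3×224×224 — the shape and dtype are assumptions here, so match your model's actual input:

```python
import numpy as np

# Assumed geometry: batch 1, 3 channels, 224x224, NCHW layout.
# Random values are placeholders; use real preprocessed image data
# for meaningful inference results.
x = np.random.rand(1, 3, 224, 224).astype(np.float32)
np.save("input-bs1.npy", x)
```

For an NHWC model (run with `MGB_MACE_LOADER_FORMAT=NHWC`), the channel axis would move last, e.g. shape `(1, 224, 224, 3)`.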
## Tuning

MACE supports tuning on a specific SoC to optimize GPU performance; see the MACE doc. To enable this feature, set the `MGB_MACE_TUNING_PARAM_PATH` env var to the path of the tuning param file. To generate the tuning param file, run once with `MACE_TUNING=1` set and point `MACE_RUN_PARAMETER_PATH` to the file name you want.
```bash
# search for tuning params
MACE_TUNING=1 MACE_RUN_PARAMETER_PATH=opencl/vgg16.tune_param MGB_MACE_RUNTIME=GPU MGB_MACE_OPENCL_PATH=opencl MGB_MACE_LOADER_FORMAT=NCHW ./load_and_run mace/vgg16.mdl --c-opr-lib libmace_loader.so --input 4d.npy

# then run using the tuned params
MGB_MACE_TUNING_PARAM_PATH=opencl/vgg16.tune_param MGB_MACE_RUNTIME=GPU MGB_MACE_OPENCL_PATH=opencl MGB_MACE_LOADER_FORMAT=NCHW ./load_and_run mace/vgg16.mdl --c-opr-lib libmace_loader.so --input 4d.npy
```