FastDeploy provides end-to-end serving deployment built on Triton Inference Server. The underlying backend uses FastDeploy's high-performance Runtime module and integrates FastDeploy's pre- and post-processing modules, enabling serving deployment that is both easy to use and fast.
FastDeploy also provides an easy-to-use Python service deployment method; refer to the PaddleSeg deployment example for its usage.
CPU images only support serving deployment of Paddle/ONNX models on CPUs. Supported inference backends include OpenVINO, Paddle Inference, and ONNX Runtime.

```shell
docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.7-cpu-only-21.10
```
GPU images support serving deployment of Paddle/ONNX models on both GPU and CPU. Supported inference backends include OpenVINO, TensorRT, Paddle Inference, and ONNX Runtime.

```shell
docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.7-gpu-cuda11.4-trt8.5-21.10
```
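After pulling an image, the server can be launched with Docker. The commands below are a minimal sketch, not the project's official instructions: the mounted model-repository path and the `fastdeployserver` launch invocation (which follows Triton Inference Server's `--model-repository` convention) are assumptions, and the health-check endpoint is Triton's standard HTTP API on its default port 8000.

```shell
# Sketch: start serving with the CPU image.
# /path/to/model_repository is an assumed host path holding models
# laid out in Triton's model-repository format.
docker run -it --rm --net=host \
  -v /path/to/model_repository:/models \
  registry.baidubce.com/paddlepaddle/fastdeploy:1.0.7-cpu-only-21.10 \
  fastdeployserver --model-repository=/models

# From the host, check readiness via Triton's standard HTTP endpoint:
curl -v localhost:8000/v2/health/ready
```

For the GPU image, add `--gpus all` to the `docker run` command so the container can access the host's NVIDIA devices.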
Users can also build the image themselves according to their needs by referring to the following documents:
| Task | Model |
|---|---|
| Classification | PaddleClas |
| Detection | PaddleDetection |
| Detection | ultralytics/YOLOv5 |
| NLP | PaddleNLP/ERNIE-3.0 |
| NLP | PaddleNLP/UIE |
| Speech | PaddleSpeech/PP-TTS |
| OCR | PaddleOCR/PP-OCRv3 |