Are you sure you want to delete this task? Once this task is deleted, it cannot be recovered.
unknown 2c59ce36c1 | 1 year ago | |
---|---|---|
.. | ||
configs | 1 year ago | |
data | 1 year ago | |
output/BMN_best | 1 year ago | |
paddlevideo | 1 year ago | |
tools | 1 year ago | |
LICENSE | 1 year ago | |
MANIFEST.in | 1 year ago | |
README.md | 1 year ago | |
README_cn.md | 1 year ago | |
__init__.py | 1 year ago | |
main.py | 1 year ago | |
requirements.txt | 1 year ago | |
run.sh | 1 year ago | |
setup.py | 1 year ago |
简体中文 | English
PaddleVideo is a toolset for video recognition, action localization, and spatio temporal action detection tasks prepared for the industry and academia. This repository provides examples and best practice guildelines for exploring deep learning algorithm in the scene of video area. We devote to support experiments and utilities which can significantly reduce the "time to deploy". By the way, this is also a proficiency verification and implementation of the newest PaddlePaddle 2.0 in the video field.
Various dataset and models
PaddleVideo supports more datasets and models, including Kinetics400, UCF101, YoutTube8M, NTU-RGB+D datasets, and video recognition models, such as TSN, TSM, SlowFast, TimeSformer, AttentionLSTM, ST-GCN and action localization model, like BMN.
Higher performance
PaddleVideo has built-in solutions to improve accuracy on recognition models. PP-TSM, which is based on the standard TSM, already archive the best performance in the 2D recognition network, has the same size of parameters but improve the Top1 Acc to 76.16%.
Faster training strategy
PaddleVideo suppors faster training strategy, such as AMP training, Distributed training, Multigrid method for Slowfast, OP fusion method, Faster reader and so on.
Deployable
PaddleVideo is powered by the Paddle Inference. There is no need to convert the model to ONNX format when deploying it, all you want can be found in this repository.
Applications
PaddleVideo provides some interesting and practical projects that are implemented using video recognition and detection techniques, such as FootballAction and VideoTag.
Field | Model | Dataset | Metrics | ACC% |
---|---|---|---|---|
action recognition | PP-TSM | Kinetics-400 | Top-1 | 76.16 |
action recognition | PP-TSN | Kinetics-400 | Top-1 | 75.06 |
action recognition | AGCN | FSD | Top-1 | 90.66 |
action recognition | ST-GCN | FSD | Top-1 | 86.66 |
action recognition | TimeSformer | Kinetics-400 | Top-1 | 77.29 |
action recognition | SlowFast | Kinetics-400 | Top-1 | 75.84 |
action recognition | TSM | Kinetics-400 | Top-1 | 71.06 |
action recognition | TSN | Kinetics-400 | Top-1 | 69.81 |
action recognition | AttentionLSTM | Youtube-8M | Hit@1 | 89.0 |
action detection | BMN | ActivityNet | AUC | 67.23 |
release/2.1 was released in 20/05/2021. Please refer to release notes for details.
PaddleVideo is released under the Apache 2.0 license.
This poject welcomes contributions and suggestions. Please see our contribution guidelines.
基于飞桨实现乒乓球时序动作定位大赛 :B榜第2名方案 单模型,无TTA,A榜第一,B榜第二
Python Jupyter Notebook Markdown Text Shell
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》