Model Zoo
⚠️ For recent papers along with pre-trained models, training/evaluation recipes, and configuration files, please see [examples](../../../../../examples)
folder. We will update model zoo periodically.⚠️
This file contains the links to all the pre-trained models in CVNets and their configs:
Classification (ImageNet-1k)
MobileViTv1 (Legacy)
Note: These resutls are from CVNets v0.1. We discontinued the support of OpenCV and switched to PIL in v0.2. For MobileViTv1 results, see v0.1.
Model |
Parameters |
Top-1 |
Pretrained weights |
Config file |
MobileViT-XXS |
1.3 M |
69.0 |
Link |
Link |
MobileViT-XS |
2.3 M |
74.7 |
Link |
Link |
MobileViT-S |
5.6 M |
78.3 |
Link |
Link |
MobileViTv2 (256x256)
MobileViTv2 (Trained on 256x256 and Finetuned on 384x384)
MobileViTv2 (Trained on ImageNet-21k and Finetuned on ImageNet-1k 256x256)
Model |
Parameters |
Top-1 |
Pretrained weights |
Config file |
Logs |
MobileViTv2-1.5 |
10.6 M |
81.46 |
Link |
Link |
Link |
MobileViTv2-1.75 |
14.3 M |
81.94 |
Link |
Link |
Link |
MobileViTv2-2.0 |
18.4 M |
82.36 |
Link |
Link |
Link |
MobileViTv2 (Trained on ImageNet-21k, Finetuned on ImageNet-1k 256x256, and Finetuned on ImageNet-1k 384x384)
Model |
Parameters |
Top-1 |
Pretrained weights |
Config file |
Logs |
MobileViTv2-1.5 |
10.6 M |
82.60 |
Link |
Link |
Link |
MobileViTv2-1.75 |
14.3 M |
82.93 |
Link |
Link |
Link |
MobileViTv2-2.0 |
18.4 M |
83.41 |
Link |
Link |
Link |
Object Detection (MS-COCO)
Segmentation
Note: The number of parameters reported does not include the auxiliary branches.
ADE20K Dataset
Pascal VOC 2012 Dataset
Video Classification (Kinetics-400)
Model |
Parameters |
Top-1 |
Pretrained weights |
Config file |
Logs |
MobileViTv1-small-SpatioTemporal |
5.2 M |
68.38 |
Link |
Link |
Link |