Text Recognition (trocr-small-handwritten)

This is the small TrOCR model, fine-tuned on the IAM handwriting dataset. It was introduced in the paper TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models by Li et al. and first released in Microsoft's unilm repository.

The TrOCR model is an encoder-decoder model, consisting of an image Transformer as encoder and a text Transformer as decoder. For this small variant, the image encoder was initialized from the weights of DeiT, while the text decoder was initialized from the weights of UniLM (the larger TrOCR variants use BEiT and RoBERTa instead).

Images are presented to the model as a sequence of fixed-size patches (16x16 pixels each), which are linearly embedded. Absolute position embeddings are added before the sequence is fed to the layers of the Transformer encoder. The Transformer text decoder then generates the output tokens autoregressively, one token at a time.
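To make the patch sequence concrete, here is a minimal pure-Python sketch of the splitting step only. The function name `patchify` and the toy 32x48 grayscale image are illustrative assumptions; the real model works on RGB tensors and applies a learned linear projection to each patch, which is not shown here.

```python
def patchify(image, patch_size=16):
    """Split an H x W image (a list of rows) into flattened square patches.

    Patches are returned in row-major order (left to right, top to bottom).
    This is an illustrative helper, not part of the TrOCR code base.
    """
    height = len(image)
    width = len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be multiples of the patch size")

    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch_size x patch_size block into a single vector.
            patch = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


# Toy grayscale image: 32 rows x 48 columns, each pixel set to its column index.
image = [[col for col in range(48)] for _ in range(32)]
patches = patchify(image)

# One absolute position index per patch; the encoder looks up an embedding
# for each index and adds it to the corresponding patch embedding.
positions = list(range(len(patches)))

print(len(patches), len(patches[0]))  # prints "6 256"
```

The sequence length grows with the image area: assuming the input is resized to 384x384 pixels (an assumption about preprocessing, not stated above), the encoder would see 24 x 24 = 576 patches.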

You can use the raw model for optical character recognition (OCR) on single text-line images.
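A minimal sketch of such a call, assuming the Hugging Face `transformers` library (with PyTorch) and Pillow are installed and that the model weights can be downloaded. The function name `recognize_line` and the command-line usage are illustrative and not part of this repository's service code.

```python
import sys

# Model identifier taken from the model source URL of this README.
MODEL_ID = "microsoft/trocr-small-handwritten"


def recognize_line(image_path, model_id=MODEL_ID):
    """Return the text recognized in a single text-line image."""
    # Heavy dependencies are imported lazily so that importing this module
    # does not require them until recognition is actually requested.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)

    # The processor resizes and normalizes the image into pixel values.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # The decoder generates token ids autoregressively; decode them to text.
    generated_ids = model.generate(pixel_values)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python3 recognize.py path/to/line.png
    print(recognize_line(sys.argv[1]))
```

The model expects a crop containing one line of handwriting; a full page would first need to be segmented into lines by a separate step.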

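Model source: https://hf-mirror.com/microsoft/trocr-small-handwritten

Model application development and deployment

Model serving

This model is packaged as a service with the ServiceBoot microservice engine; see the CubeAI Model Development Guide (《CubeAI模型开发指南》).

Running directly from source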

$ sh pip-install-reqs.sh
$ serviceboot start
or
$ python3 run_model_server.py

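Local containerized deployment

One-click local containerized deployment and execution; see the CubeAI Model Standalone Deployment Guide (《CubeAI模型独立部署指南》) or CubeAI Docker Builder.

Cloud-native network deployment

This model service can be published with one click to the CubeAI (智立方) platform for sharing and deployment; see the CubeAI Model Publishing Guide (《CubeAI模型发布指南》).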

For more CubeAI model services, see the CubeAI Service-Native Model Demonstration Library (《CubeAI服务原生模型示范库》).
