How to load a config in PyOCR - haitenaipants’s blog

A note on how to load a Tesseract config file from PyOCR.

The following environment is assumed.

Windows 10 64-bit

Python 3.6

OpenCV 3.4.1

Tesseract 3.02

PyOCR 0.5

For how to install PyOCR and its basic usage, see the following post.

haitenaipants.hatenablog.com

0) Checking the config data

After installing Tesseract, there should be a directory named ~\Tesseract-OCR\tessdata\configs.

(In my case, it was at C:\Users\<username>\AppData\Local\Tesseract-OCR\tessdata\configs.)
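If it is unclear where the configs directory ended up, a small helper can check a few likely install locations. This is only a sketch: the function name find_configs_dir and the candidate paths are assumptions for illustration and may not match your installation.

```python
import os
from pathlib import Path


def find_configs_dir(candidates):
    """Return the first <candidate>/tessdata/configs that exists, else None."""
    for root in candidates:
        configs = Path(root) / "tessdata" / "configs"
        if configs.is_dir():
            return configs
    return None


# Hypothetical install locations; adjust these to your own machine.
candidates = [
    Path(os.environ.get("LOCALAPPDATA", "")) / "Tesseract-OCR",
    Path(r"C:\Program Files\Tesseract-OCR"),
]
print(find_configs_dir(candidates))
```

If the function prints None, none of the guessed locations contain tessdata\configs, and the directory has to be located by hand.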

Confirm that the digits file in the configs directory contains the following.

tessedit_char_whitelist 0123456789-.
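The file is a plain list of "name value" lines, so the check can also be done from Python. The sketch below parses that format into a dict; parse_tesseract_config is a hypothetical helper written for this note, not part of PyOCR or Tesseract, and it is applied here to a sample string rather than to the real file.

```python
def parse_tesseract_config(text):
    """Parse 'name value' lines of a Tesseract config file into a dict."""
    params = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        parts = line.split(None, 1)
        params[parts[0]] = parts[1] if len(parts) > 1 else ""
    return params


# Sample content matching the digits file shown above.
sample = "tessedit_char_whitelist 0123456789-.\n"
params = parse_tesseract_config(sample)
print(params["tessedit_char_whitelist"])  # prints 0123456789-.
```

To check the installed file instead, read its text with open() from the configs directory and pass it to the same function.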

1) Setting the config

Prepare an image named image1.jpg like the one below.

[Image: image1.jpg]

Run the following code on this image.

The key line is builder.tesseract_configs.append("digits"), which sets the config so that digits is loaded.

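import pyocr
import pyocr.builders
from PIL import Image

# Load the image to run OCR on.
img = Image.open("image1.jpg")

# Use the first OCR tool that PyOCR finds (Tesseract is assumed to be installed).
tools = pyocr.get_available_tools()
if not tools:
    raise RuntimeError("No OCR tool found")
tool = tools[0]

# tesseract_layout=6 treats the image as a single uniform block of text.
builder = pyocr.builders.TextBuilder(tesseract_layout=6)
# Load the "digits" config so that only whitelisted characters are recognized.
builder.tesseract_configs.append("digits")

res = tool.image_to_string(img,
                           lang="eng",
                           builder=builder)

print(res)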

The result is shown below. It contains only digits, which shows that the config was applied successfully.

1234567
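Rather than judging the output by eye, the same check can be written as a small function. This is a minimal sketch: only_whitelisted is a hypothetical helper, and the whitelist string is copied from the digits file above. Whitespace is allowed because OCR output may contain spaces or line breaks.

```python
# Characters allowed by the digits config shown earlier.
WHITELIST = set("0123456789-.")


def only_whitelisted(text, whitelist=WHITELIST):
    """Return True if every non-whitespace character is in the whitelist."""
    return all(ch in whitelist or ch.isspace() for ch in text)


print(only_whitelisted("1234567"))  # prints True
print(only_whitelisted("12a4567"))  # prints False
```

Passing the res value from the OCR code to only_whitelisted would confirm that the config took effect.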

I referred to the following article.

qiita.com
