Base64でエンコードされた画像からのテキスト検出にGoogle Vision APIを使用するにはどうすればよいですか？

私はbase64でエンコードされた画像を持っています。Base64でエンコードされた画像からのテキスト検出にGoogle Vision APIを使用するにはどうすればよいですか？

imageData = 'data:image/png;base64,iVBORw0rrfwfwHReger32QRQWr...'

グーグルクラウドビジョンのPythonライブラリでテキストを検出する方法を教えてください。

from google.cloud import vision 
client = vision.Client() 
imageData = 'data:image/png;base64,iVBORw0rrfwfwHReger32QRQWr...' 
image = client.image(content=imageData) 
texts = image.detect_text() 
print texts[0].description

出典

2017-03-29 Kaushal Meena

Cloud Client library for Python vision as demonstrated hereを使用します。

マイコードは次のようになります。クラウドクライアントライブラリは、あなたのためにすべてのbase64エンコーディングをバックグラウンドで実行します。

virtualenvのクライアントライブラリを使用してフォルダを初期化します。

virtualenv env && source env/bin/activate 
pip install google-cloud-vision

次のスニペットは、画像ファイルにOCR処理を行い、file.png：

import io 
from google.cloud import vision 

vision_client = vision.Client() 

with io.open('file.png', 'rb') as image_file: 
    content = image_file.read() 

image = vision_client.image(content=content) 

texts = image.detect_text() 
print('Texts:') 

for text in texts: 
    print('\n"{}"'.format(text.description)) 

    vertices = (['({},{})'.format(bound.x_coordinate, bound.y_coordinate) 
        for bound in text.bounds.vertices]) 

    print('bounds: {}'.format(','.join(vertices)))

出典

2017-04-14 17:42:04 class

Base64でエンコードされた画像からのテキスト検出にGoogle Vision APIを使用するにはどうすればよいですか？

答えて

関連する問題