GATE Cloud Mutlilingual Image OCR - Items

Item
Groups

approved

GATE Cloud Mutlilingual Image OCR

Service that uses optical character recognition (OCR) to identify text contained within images. This is a multi-lingual service and not restricted to Latin scripts. It works in three stages. First it determines the bounding boxes of related text within the image. Secondly it extracts the text from within each bounding box, before finally determining the language of the extracted text. Two output CSV files are returned, one with one row per image giving the primary detected script for that image, and another with one row per bounding box giving the extracted text

Tags

Data and Resources

To access the resources you must log in

Method Enginemethod-engine

Run this method in the Method Engine
The resource: 'Method Engine' is not accessible as guest user. You must login to access it!

Item URL

https://data.d4science.org/ctlg/ResourceCatalogue/gate_cloud_mutlilingual_image_ocr

Additional Info

Field	Value
Accessibility	Virtual Access
AccessibilityMode	OnLine Access
Availability	On-Line
Basic rights	Other rights
CreationDate	2021-09-06
Creator	Singh, Iknoor, i.singh@sheffield.ac.uk
Field/Scope of use	Non-commercial only
Group	Societal Debates and Misinformation
Owner	Singh, Iknoor, i.singh@sheffield.ac.uk
ProgrammingLanguage	Python
Sublicense rights	No
Territory of use	World Wide
Thematic Cluster	Visual Analytics [VA]
system:type	Method

Management Info

Field	Value
Author	Roberts Ian
Maintainer	Roberts Ian
Version	1
Last Updated	8 September 2023, 18:55 (CEST)
Created	22 June 2023, 18:26 (CEST)