approved
GATE Cloud Mutlilingual Image OCR

Service that uses optical character recognition (OCR) to identify text contained within images. This is a multi-lingual service and not restricted to Latin scripts. It works in three stages. First it determines the bounding boxes of related text within the image. Secondly it extracts the text from within each bounding box, before finally determining the language of the extracted text. Two output CSV files are returned, one with one row per image giving the primary detected script for that image, and another with one row per bounding box giving the extracted text

Tags
Data and Resources
To access the resources you must log in
  • Method Enginemethod-engine

    Run this method in the Method Engine

    The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Virtual Access
AccessibilityMode OnLine Access
Availability On-Line
Basic rights Other rights
CreationDate 2021-09-06
Creator Singh, Iknoor, i.singh@sheffield.ac.uk
Field/Scope of use Non-commercial only
Group Societal Debates and Misinformation
Owner Singh, Iknoor, i.singh@sheffield.ac.uk
ProgrammingLanguage Python
Sublicense rights No
Territory of use World Wide
Thematic Cluster Visual Analytics [VA]
system:type Method
Management Info
Field Value
Author Roberts Ian
Maintainer Roberts Ian
Version 1
Last Updated 8 September 2023, 18:55 (CEST)
Created 22 June 2023, 18:26 (CEST)