OCR With Tesseract

Amuda Adelou shows how to use Tesseract’s Java API to perform character recognition in images:

Extracting text from an image means that you are considering the flowchart imagery that’s processed to extract the text components and then extracting the geometrical shapes components. The text components are extracted with geometrical components, as well. The internal relationship between the components is set up by tracing the flow lines that connect different components. The extracted components are output to metadata (in XML format), which is machine-readable. This metadata can be archived, stored in a knowledge base, or shared with others.

Click through for a demo app and code.

Related Posts

Using Azure Cloud Shell

Jeffrey Verheul shows off a bit of Azure Cloud Shell: Connecting to a database Now that your Cloud Shell is ready to go, you can start using Bash. This means you can also use sqlcmd from within Bash. You can connect to a database with sqlcmd, by using the following command: sqlcmd -S servername.database.windows.net -U […]

Read More

Golang And SQL Server

Mat Hayward-Hill gives us another language to think about: Right now I spend most of my time in Management Studio writing TSQL. And I use PowerShell whenever I need to do something on more than one machine at a time. But now Microsoft is embracing open source should I be thinking the same and learn […]

Read More

Categories

April 2017
MTWTFSS
« Mar May »
 12
3456789
10111213141516
17181920212223
24252627282930