Working with Tesseract OCR – Ubuntu

Note : Until this note is removed, this post is not complete. Please do not follow it yet.

Step 1 : Install Tesseract using the command

sudo apt-get install tesseract-ocr
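
To confirm that the install worked, a quick check along these lines may help. This is a minimal sketch: the function name check_tesseract is made up for illustration, and it only assumes that the tesseract binary ends up on your PATH after Step 1.

```shell
# Hypothetical helper: prints the first line of the version banner
# if tesseract is on the PATH, otherwise prints an install hint.
check_tesseract() {
    if command -v tesseract >/dev/null 2>&1; then
        # Some releases print the version banner on stderr, so merge both streams.
        tesseract --version 2>&1 | head -n 1
        return 0
    fi
    echo "tesseract not found; run: sudo apt-get install tesseract-ocr"
    return 1
}

check_tesseract || echo "Install it before continuing."

# Once it is installed, a typical run looks like this
# (page.png is a hypothetical input file; the text is written to page.txt):
#   tesseract page.png page
```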

Step 2 : Install the following dependencies

sudo apt-get install autoconf automake libtool
sudo apt-get install libpng12-dev
sudo apt-get install libjpeg62-dev
sudo apt-get install libtiff4-dev
sudo apt-get install zlib1g-dev
sudo apt-get install libicu-dev      # (if you plan to make the training tools)
sudo apt-get install libpango1.0-dev # (if you plan to make the training tools)
sudo apt-get install libcairo2-dev   # (if you plan to make the training tools)
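
After installing, it can be useful to see which of these packages are actually present. The sketch below is one way to do that with dpkg; the function name report_packages is invented for this example. Versioned package names such as libpng12-dev, libjpeg62-dev and libtiff4-dev differ between Ubuntu releases, so a "missing" result for those may only mean that your release uses a different name.

```shell
# Hypothetical helper: reports, for each package name given,
# whether dpkg lists it as installed.
report_packages() {
    for pkg in "$@"; do
        if dpkg -s "$pkg" 2>/dev/null | grep -q '^Status: install ok installed'; then
            echo "$pkg: installed"
        else
            echo "$pkg: missing"
        fi
    done
}

report_packages autoconf automake libtool zlib1g-dev \
    libicu-dev libpango1.0-dev libcairo2-dev
```

Any package reported as missing can be installed with the matching sudo apt-get install line above.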

Some useful links :

1. http://blog.cedric.ws/how-to-train-tesseract-301

2. How to add new fonts in the training phase (short but precise)

http://michaeljaylissner.com/posts/2012/02/11/adding-new-fonts-to-tesseract-3-ocr-engine/

3. Training procedure given on the Google Code site

https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3
