

Text Extraction from Tamil and Hindi Document Images using Open Source Optical Character Recognition tools




Keywords: Optical Character Recognition, OCR architecture for Tamil and Hindi document images, Google Docs, Free Online OCR, i2OCR.


Optical Character Recognition (OCR) is a technique for extracting text from document images and converting it into an editable text format. This kind of information retrieval is called recognition-based retrieval; it allows the text to be edited, searched, and stored more efficiently. OCR is used in many applications, such as libraries, organizations, bank cheque processing, number plate recognition, and historical book analysis. Various OCR tools are available for converting document images in different languages. The primary objective of this work is to compare the performance of three OCR tools for extracting text from Tamil and Hindi document images. The OCR tools considered in this analysis are Google Docs, Free Online OCR, and i2OCR. Based on conversion accuracy, Free Online OCR is observed to perform better than the other two tools.
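The abstract does not state how conversion accuracy was measured. As an illustration only, one common way to score an OCR tool is character-level accuracy: the edit distance between the tool's output and a manually typed reference text, divided by the reference length and subtracted from one. The sketch below assumes that definition; the function names `edit_distance` and `character_accuracy` are hypothetical and do not come from the paper. It counts Unicode code points after NFC normalization, so a Tamil or Hindi syllable made of a consonant plus vowel sign counts as more than one character.

```python
import unicodedata


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings, counted in code points."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def character_accuracy(reference: str, ocr_output: str) -> float:
    """Assumed metric: 1 - (edit distance / reference length), floored at 0."""
    # Normalize so that equivalent Unicode sequences compare as equal.
    ref = unicodedata.normalize("NFC", reference)
    out = unicodedata.normalize("NFC", ocr_output)
    if not ref:
        return 1.0 if not out else 0.0
    return max(0.0, 1.0 - edit_distance(ref, out) / len(ref))
```

Under this assumption, each tool's output for the same document image would be scored against the same reference text, and the scores compared across Google Docs, Free Online OCR, and i2OCR.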

Other Details

Published in: Volume 2, Issue 11
Publication Date: 11/1/2016
