Read https://sourceforge.net/projects/tesseracthindi/files/OCRHindi_using_VietOCR_and_Tesseract.pdf/download for how to use vietocr gui for OCR of Hindi and Sanskrit texts using tesseract-ocr
*****
Please see https://github.com/Shreeshrii/
imagessan and imageshin for newer box/tiff pairs, traineddata files, ocr evaluation statistics and ground truth files with images for Sanskrit and Hindi.
*****
Following is OLD information – saved only for archival purposes.
Tesseract OCR 3.02 provides hin.traineddata for recognizing texts in devanagari scripts. However the Hindi training texts, images and box files are not provided, so it is difficult to improve the accuracy by further improving the traineddata. It is noted that recognition is more accurate and faster if the training is done with the same /similar font as used in the text to be OCRed.
See https://sourceforge.net/p/tesseracthindi/wiki/OCR%20for%20Devanagari/ for more details.
Today’s small-to-medium-sized (SMB) businesses and large enterprises are saving on their monthly communications costs by making one simple decision: to switch to a VoIP service solution from their old, outdated Plain Old Telephone Service (POTS). By choosing a new VoIP service, these companies enjoy the flexibility, reliability, call features, and audio quality that only a VoIP service can provide. Plus, they cut their phone bill by up to 70%!
Website | https://tesseracthindi.sourceforge.io/ |
Tags | Projects |
Features |
|