== OCR Software used by ADBC projects ==
*[http://finereader.abbyy.com/corporate/ ABBYY FineReader] - high-performing proprietary OCR software provided by the [http://www.abbyy.com ABBYY] software company. The Professional and Corporate Editions are designed specifically for Microsoft Windows operating systems.
**[[OCR Tips#FineReader_tips|FineReader tips]]
*[http://www.abbyy.com/recognition_server/functionality/?utm_expid=34274949-7&utm_referrer=http%3A%2F%2Fwww.abbyy.com%2Frecognition_server%2Fkey_features%2F ABBYY Recognition Server] - extends the features of FineReader and places them in a server-based scalable platform.
**[[OCR Tips#Recognition_Server|Recognition Server tips]]
*[http://en.wikipedia.org/wiki/GOCR GOCR] (or JOCR) is a free optical character recognition program, initially written by Jörg Schulenburg. It can be used to convert or scan image files (portable pixmap or PCX) into text files.
*[http://en.wikipedia.org/wiki/Ocropus OCRopus] - free document analysis and optical character recognition (OCR) system released under the Apache License, Version 2.0 with a very modular design through the use of plugins.
*[http://en.wikipedia.org/wiki/Omnipage Omnipage] - high-performing proprietary OCR software provided by the [http://www.nuance.com/for-business/by-product/omnipage/index.htm Omnipage software company]. The Professional and Standard Editions are designed specifically for Microsoft Windows operating systems.
**[[Omnipage Features]]
*[http://en.wikipedia.org/wiki/Tesseract_(software) Tesseract] - open source optical character recognition engine available under the Apache License, Version 2.0. The software runs on various operating systems. It is considered to be one of the more accurate OCR engines available under a free software license.
**[http://tesseract-ocr.googlecode.com/svn/trunk/doc/tesseracticdar2007.pdf An Overview of the Tesseract OCR Engine] by Ray Smith at Google Inc.
**[[OCR Tips#Tesseract_tips|Tesseract tips]]
*Xerox OCR engine -