The utility VeryDOC Raster to Text OCR Converter Command Line could help you convert scanned PDF/raster to text in various languages flexibly on Windows platforms.
This application supports recognizing text in images, recognizing text in scanned PDF and converting PDF to plain text professionally. Also it supports converting scanned image files to plain Text files including TIFF, JPG, PNG, BMP, GIF, PCX, TGA, JP2, PNM and MNG, which is able to retain original layout of PDF documents. Moreover, this tool supports English, French, German, Italian, Czech, Danish, Dutch, Norwegian, Polish, Portuguese, Spanish, Swedish, etc. various languages.
If you want to get trial version of this application, welcome to click following icon, then you can get ZIP file, and after you extract its content to your computer, trial version could be yours.
Then, in the next paragraphs, I will show you how to realize converting raster/scanned PDF to text in various languages, if you want to know that, just focus on the followings: 🙂
Step1. Run cmd.exe
Command Prompt window is operating environment of commands on Windows platforms. You can run cmd.exe to open it directly, after you get Run dialog box on screen. After elementary environment is opened, commands could be typed directly. 🙂
Step2. Convert scanned PDF/raster file to text in various languages
Here some parameters need to be typed in command prompt window, when you want to use OCRÂ technology to convert scanned PDF in supported languages:
- -ocr: enable OCR function for scanned PDF file
- -lang <string>: choose the language for OCR engine, e.g., –lang eng, –lang deu, etc.
And here are examples for you to refer to below:
pdf2txtocr.exe -ocr -lang eng C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -lang deu C:\in.pdf C:\out.txt
However, if you want to use this software to convert raster image file, just type commands directly without OCR parameters, for program has been set to help you convert raster image to editable text directly without helping of specific OCR parameters. And if you understand what I mentioned above, here are some examples to help you convert raster file to text:
pdf2txtocr.exe C:\in.tif C:\out.txt
pdf2txtocr.exe C:\in.jpg C:\out.txt
After you follow my steps above to input commands with or without parameters above to convert scanned PDF/PDF or raster image, once you use VeryDOC Raster to Text OCR Converter Command Line, targeting text of editable could be instantly produced from each of them separately. 🙂
Here is end of this article, and I hope it can be helpful to you some time. However, to know more practical functions of this application, just download trial version to read “readme.txt” or just follow us in articles that will be updated later. 🙂 Thank you for reading this article here with me, which is about how to convert scanned PDF/raster image to text in various languages. And at last, here is purchase entrance if you want to get full version of VeryDOC Raster to Text OCR Converter Command Line. 🙂