PDF to Vector Converter

The pdf2vector application was unable to start correctly (oxc0000005), Click OK to close the application, On 2008 server.

I am having problems running pdf2vector on 2008 server

I downloaded the evaluation version from the web page:

https://www.verydoc.com/pdf2vec_cmd.zip

When I unzip it an run it on Windows Server 2008 64bits R2 I get the following error:

VeryPDF PDF2Vector Converter has stopped working
The application was unable to start correctly (oxc0000005). Click OK to close the application.

The pdf2vector application was unable to start correctly (oxc0000005), Click OK to close the application, On 2008 server.

Best regards,
Customer
----------------------------
Please turn off DEP for "pdf2vec.exe" application to try again, please refer to following steps about how to turn off DEP in your system,

1. Click "Start"
2. Select "Control Panel"
3. Select "System"
4. Click the "Advanced" tab
5. In the "Performance" region select "Settings"
6. Click the "Data Execute" tab in the dialog box that opens
7. Select "Turn on DEP for all programs and services except for those I select"
8. Click "Add"
9. The open dialog box will open. Browse and select "pdf2vec.exe" application in your computer,
10. Click "Open"
11. Click "Apply"
12. Click "Ok"
13. Reboot

OK, you should no problem to run "pdf2vec.exe" now, please give it a try.

VeryDOC

Raster to Text OCR Command Line

How to extract text from raster image and save them in text?

   In this article, I will show you how to extract text from raster image file and then save them in text file. The extraction could be done in batch by advanced OCR technology. When saving them in text file, you can also add various page number to text file. When operation, you do not need to open input raster image file as the conversion could be done by from MS Dos Windows by command line.

   The method could be fulfilled under the help of software VeryDOC Raster to Text OCR Converter Command Line, which is a professional tool of converting raster image file to text. In the following part, I will show you how to use this software.

Step 1. Download Raster to Text OCR Converter Command Line

  • The current version of this software is Version: v2.0. And if you use this software just for simply conversion, please download the server version, which allows you to use this software under the whole server.
  • When downloading finishes, there will be a zip file. You need to extract it to some folder then you can call this software from MS Dos Windows.

Step 2. Extract text from raster image file and save it as text document.

  • When you use this software, please refer to the usage and examples.
  • Here is the usage for your reference: Usage:  pdf2txtocr.exe [options] <PDF-file> <Text-file>
  • When you extract text from raster image file, please refer to the following command line templates.
  • pdf2txtocr.exe C:\in.tif C:\out.txt
    By this command line, we can extract text from tiff raster image file. And even if there are many pages in tiff, the extraction could be done fast and accurately.
    pdf2txtocr.exe C:\in.jpg C:\out.txt
    Same with the above command line, by it we can extract text from JPG raster image file.
    pdf2txtocr.exe C:\in.bmp C:\out.txt
    pdf2txtocr.exe C:\in.png C:\out.txt
    When you need to extract text from other raster image file, simply change the input image file formats that would be OK.

    The raster image could be any kind of scan file. If you can scan image to black and white, the extraction effect would be much better.

When do extraction, we often meet some raster image files which are slope, dirty. Those factors will effect conversion effect. In order to fix image, you can process image in advanced by this software. The following parameters are for your reference.
-bitcount <int>     : by this parameter, we can set color depth when render PDF page to image data, it can be set 1, 8, 24, default is 8-bit
-rotate <int>       : this parameter can help you rotate pages before OCR.
-threshold <int>    : by this parameter, we can adjust lightness threshold that used to convert image to B&W

If you need to know more functions of this software, please visit homepage of this software. During the using, if you have any question, please contact us as soon as possible.

Raster to Text OCR Command Line

How to convert raster image to searchable PDF file?

    Sometime we need to convert image to PDF, but when converting finishes and checking PDF file, we will find that the PDF is different with others PDF file, which can not be copied and pasted. In order to solve this problem, VeryDOC will  introduce one way of converting raster image to searchable PDF file. The software I used here is VeryDOC Raster to Text OCR Converter Command Line, which also can be used to convert image to text. In the following part, I will show you how to use this software.

Step 1. Download Raster to Text OCR Converter Command Line

  • Once downloading finishes, there will be a zip file. Please extract it to some folder then you can call the executable file in MS  Dos Windows.
  • There are some help documents, bat file by which you can check conversion effect at once.

Step 2. Convert raster image to searchable PDF file.

  • When you use this software, please read usage and parameter list carefully.
  • Here is the usage for your reference.  Usage:   pdf2txtocr.exe [options] <PDF-file> <Text-file>
  • When converting raster image to searchable PDF, please refer to the following command line templates.
  • pdf2txtocr.exe -ocrmode 1 -threshold 200 -ocr C:\in.tif C:\out.pdf
    pdf2txtocr.exe -ocrmode 2 -rotate 90 -ocr C:\in.jpg C:\out.pdf
    pdf2txtocr.exe -ocrmode 3 -threshold 200 -ocr C:\in.png C:\out.pdf
    pdf2txtocr.exe -ocrmode 4 -rotate 90 -ocr C:\in.bmp C:\out.pdf
    pdf2txtocr.exe -ocrmode 3 -threshold 200 -ocr C:\in.gif C:\out.pdf
    pdf2txtocr.exe -ocrmode 4 -rotate 90 -ocr C:\in.tga C:\out.pdf

This software provides 5 OCR modes, please check related parameters. Please note do not use -ocrmode 0 as this parameter can help you output TEXT file input image file.
-ocrmode <int>      : set OCR mode
    -ocrmode 0: output to text file
    -ocrmode 1: OCR PDF pages and insert new text layer under original PDF pages
    -ocrmode 2: output to plain text based PDF file
    -ocrmode 3: output to OCRed PDF file (BW) with hidden text layer
    -ocrmode 4: output to OCRed PDF file (Color) with hidden text layer

Also before processing PDF, you can adjust input image in advance. Say you can rotate input image, adjust image resolution and so on so forth. Here are some parameters for your reference:

-bitcount <int>     : set color depth when render PDF page to image data, it can be set 1, 8, 24, default is 8-bit
-rotate <int>       : rotate pages before OCR
-threshold <int>    : lightness threshold that used to convert image to B&W
-ocr                : enable OCR function for scanned PDF file

By this software, you can convert most of the raster image file like TIFF, JPG, PNG, BMP, GIF, PCX, TGA, JP2, PNM and MNG to searchable PDF file. During the using, if you have any question, please contact us as soon as possible.

Raster to Text OCR Command Line

How to convert raster to text by command line?

VeryDOC Raster to Text OCR Converter Command Line can be used to convert raster image to text. Raster image could be the following image file formats: TIFF, JPG, PNG, BMP, GIF, PCX, TGA, JP2, PNM and MNG. Meanwhile this software also can help you convert PDF file including image PDF file to text by command line. In the following part, I will show you how to use this software.

Step 1. Download Raster to Text OCR Converter

  • When downloading finishes, there will be a zip file. You need to extract it to some folder then you can call the executable file in MS Dos Windows.
  • This is Windows application and for now it can not be used under Mac or Linux system.

Step 2. Convert raster to text by command line.

  • When you use this software, please refer to the usage and examples.
  • Here is the usage for your reference:  pdf2txtocr.exe [options] <PDF-file> <Text-file>
  • When converting raster to text, please refer to the following command line templates.
  • pdf2txtocr.exe C:\in.tif C:\out.txt
    pdf2txtocr.exe C:\in.jpg C:\out.txt
    pdf2txtocr.exe C:\in.bmp C:\out.txt
    pdf2txtocr.exe C:\in.png C:\out.txt
    pdf2txtocr.exe C:\in.pcx C:\out.txt
    pdf2txtocr.exe C:\in.tga C:\out.txt
    pdf2txtocr.exe C:\in.pnm C:\out.txt
    pdf2txtocr.exe C:\in.mng C:\out.txt
    When converting raster image files to text, simply input full path of input file and output file and you do not need to add any other parameters. When you need to convert image file to text in batch, please use wild character like the following command line templates.
    pdf2txtocr.exe C:\*.tif C:\*.txt
    In order to improve OCR recognition rate, you can convert image to PDF first as when converting raster to PDF, you can adjust image threshold and rotate image in some degree.
    pdf2txtocr.exe -ocrmode 3 -threshold 200 -ocr C:\in.tif C:\out.pdf
    pdf2txtocr.exe -ocrmode 4 -rotate 90 -ocr C:\in.tif C:\out.pdf
    Now let us check related parameters:
    -rotate <int>       : when you need to rotate pages before OCR, please add this parameter.
    -threshold <int>    :when you need to adjust lightness threshold that used to convert image to B&W, please add this parameter.
    -ocr                : this parameter will enable OCR function when converting image file scanned PDF file to text or searchable PDF file.
      -ocrmode <int>      : set OCR mode
    -ocrmode 4: output to OCRed PDF file (Color) with hidden text layer

By this function, you can extract text from raster image file to text. Meanwhile you can convert raster image to searchable PDF file. When output PDF file is PDF file, there are many parameter available for you to choose. If you need to check more functions and parameters of this software, please visit its homepage. During the using if you have any question, please contact us as soon as possible. Now let us check the conversion effect from the following snapshot.

input tiff file
                       This is from input tiff file.

output text from PDF
   This is from output text file.

PDF Margin Crop

How to crop PDF margins by command line?

     In last article, we talked about how to crop PDF by GUI version software of VeryDOC PDF Margin Crop. In the following part, I will show you how to crop PDF by its command line version. This software either can be used as GUI version software or command line version. And by the command line version, you can call it from Visual Basic, C/C++, Delphi, ASP, PHP, C#, .NET, etc. Please check more information on software homepage and in the following part, I will show you how to use the command line software.

Step 1. Install PDF Margin Crop

  • This software is Window application and it is bundled together with GUI version and command line version. When downloading finishes, there will be an exe file. You need to install this software by double clicking the exe file and following installation message. When installation finishes, there will be icon on desktop. Meanwhile in the installation folder you can find the command line executable file.
  • Or you can click Start then go to installation folder, where you can find command line short cut icon.

Step 2. Crop PDF through command line operation.

  • When you use this software, please refer to the usage and command line parameters.
  • Usage:      pdfmc [options] <pdf-file> [<out-pdf>]
  • When you need to crop PDF, please refer to the following command line templates.
    pdfmc.exe C:\in.pdf C:\out.pdf
    When cropping PDF, if you do not add any parameters, this software will crop PDF according to content automatically.
    pdfmc.exe C:\in\*.pdf C:\out\*.pdf
    By wild character, we can crop PDF in batch.
    pdfmc.exe -linewidth 8 C:\in.pdf C:\out.pdf
    By this command line, we can crop PDF and remove black borders which width less than this value 8.
    pdfmc.exe -linewidth 8 -specklesize 20 C:\in.pdf C:\out.pdf
    By this command line, we can crop PDF and remove the speckles which size less than this value.
    pdfmc.exe -linewidth 0 -specklesize 0 C:\in.pdf C:\out.pdf
    By this kind value 0, we can crop PDF and remove all the lines and speckles.
    for %F in (D:\test\*.pdf) do "pdfmc.exe" "%F" "%~dpnF-out.pdf"
    for /r D:\test %F in (*.pdf) do "pdfmc.exe" "%F" "%~dpnF-out.pdf"
    By above command line, we can crop PDF in batch or write bat files.

Now let us check related parameters:

  -skip                : don't overwrite an output file if it already exists
  -margin <string>     : Set page margins to output PDF file
        -margin 10            : Set margin to 10 pt to left
        -margin 10x10         : Set margin to 10 pt to left,top
        -margin 10x10x10      : Set margin to 10 pt to left,top,right
        -margin 10x10x10x10   : Set margin to 10 pt to left,top,right,bottom
        -margin 10pt          : Set margin to 10 pt to left
        -margin 10x10pt       : Set margin to 10 pt to left,top
        -margin 10x10x10pt    : Set margin to 10 pt to left,top,right
        -margin 10x10x10x10pt : Set margin to 10 pt to left,top,right,bottom
        -margin 10mm          : Set margin to 10 mm to left
        -margin 10x10mm       : Set margin to 10 mm to left,top
        -margin 10x10x10mm    : Set margin to 10 mm to left,top,right
        -margin 10x10x10x10mm : Set margin to 10 mm to left,top,right,bottom
        -margin 10in          : Set margin to 10 inch to left
        -margin 10x10in       : Set margin to 10 inch to left,top
        -margin 10x10x10in    : Set margin to 10 inch to left,top,right
        -margin 10x10x10x10in : Set margin to 10 inch to left,top,right,bottom
  -tempdir <string>  : set a folder to store temporary files
  -linewidth <int>     : remove black borders which width less than this value,default is 8
  -specklesize <int>  : remove the speckles which size less than this value, default is 20

By this software, we can crop PDF easily. During the using, if you have any question, please contact us as soon as possible.