OCR

When using the Aquaforest OCR SDK, intermittently you may receive the following message in your application:

System.IO.FileNotFoundException was caught
FileName=C:\WINDOWS\TEMP\AquaforestOcr\xxxx_xx\x_x.hocr
Message=Could not find file ‘C:\WINDOWS\TEMP\AquaforestOcr\xxxx_xx\x_x.hocr’.

This message is generated as a direct result of the source file not being OCR’d, however the particular message is not appropriate in this case.  In order to resolve this issue you need to subscribe to the StatusUpdate which will allow […]

Printer Issue whilst installing Autobahn DX or PDF Junction

The following blog illustrates how to rectify “Printer already exists” issue when attempting to install Autobahn DX or PDF Junction:

Below is a screen shot which is presented to users who are installing Autobahn DX or PDF Junction.

 

 

This issue occurs when the previous version of Autobahn DX or PDF Junction was un-installed but failed to remove the ‘easyPDF SDK 6’ printer.  The failure […]

Microsoft Outlook Setup for Autobahn DX and PDF Junction

This blog illustrates how to make Microsoft Outlook ready for Autobahn DX Server and PDF Junction.

As mentioned in one of the previous blogs, one of the core reason for the failure of the conversion process ‘Convert any document to PDF’ in Autobahn and PDF Junction is caused by pop-up dialogs being displayed from Office products during the PDF conversion.  This blog […]

Microsoft Word Setup for Autobahn DX and PDF Junction

This blog illustrates how to make Microsoft Word ready for Autobahn DX Server and PDF Junction.  The other office products can also be configured just the same way.

One of the core reason for the failure of the conversion process ‘Convert any document to PDF’ in Autobahn and PDF Junction is caused by pop-up dialogs being displayed from Office products during the PDF […]

Autobahn DX Windows Service

Autobahn and BCL easyPDF X Loader Windows Services

These are the heart of the product and control the execution of both scheduled jobs and ad-hoc jobs whether submitted via the Autobahn Manager or via the Autobahn Job API. The services analyse the XML Job Definition files on start-up and when new files are created in the Job Definition directory by the Autobahn Manager, […]

In-Place OCR Processing

By “in-place” processing we generally mean processing PDF documents that have already been added to a document repository or system and need to be turned into searchable PDFs in-place. By contrast “workflow” processing where documents pass through Autobahn DX on their way into a document repository via watched folders.

The job shown below will convert PDFs under the tree C:qat\221008\in to […]

Searchable PDF Explained

This article aims to provide guidance for the creation of searchable PDF files from scanned documents, whether standard TIFF Files or Image-Only PDF files.

 
Types of PDF File
 

PDF Type
Description

Normal
This is the most common type of PDF and is most typically created from a document such as Microsoft Word.  It contains the full text of the page with appropriate coding to define […]

At Aquaforest we are often asked questions such as “I have 1 million documents I need to convert – how long will it take?” or “I need to convert 30,000 documents per day – how many servers will I need?”.  This article gives a straightforward method that can be used to provide broad estimates for conversion times.
Step 1 – Scope […]