PDF Archives - Aquaforest

PDF

Run a QUICK & FREE Audit to check that your documents are Searchable

Discover how many of your digital documents are hidden from your search tools by using the FREE Audit Tool with Aquaforest Searchlight…
Studies have shown that in most organizations over 20% of documents may not be fully text searchable so will not be located by text search or discovery exercises.

You may have hundreds of thousands, if not millions of documents in […]

ESPC Power Automate Blog Articles

The following blog articles can be found on the European Sharepoint, Office 365 & Azure Conference website

 
 
 
 
 
 
 
 
 
 
 
 

1. Get Text from PDF, creating list items from document content
Using the Get Text from PDF action to populate custom Metadata Fields

 

2. Split PDF based on Data Values
Using the Split by Text action to Split a PDF based on the content of the document

  […]

Aquaforest PDF Connector: Using Get Data from PDF Synonyms in JSON

In this article, we will explain how to setup Synonyms in JSON within our “Get Data from PDF” Power Automate PDF Connector.
From the example invoices below, we want to extract the following named Name-Value Pairs, “Invoice No/Inv No” & “Purchase Order/PO No”. The invoices are different layouts & have different entity names for these fields.

 

 

As per the screenshot below the […]

Aquaforest PDF Connector: Get Data from PDF from Image Only & Text Searchable PDFS

In this article, we will outline how to use the Aquaforest PDF Connector for the Power Automate Platform to Get Name-Value Pairs from a mixture of image-only and text-searchable PDFs & populate them into Custom Metadata fields.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

The first step is to define the trigger for our flow, in this example, we are going to Trigger the flow when an item […]

Aquaforest PDF Connector : Get Data from PDF & Populate into Excel

 
In this article, we will outline how to use the Aquaforest PDF Connector for the Power Automate Platform to Get Named Value Pairs from Invoice documents & populate them into Excel.
We are going to extract the following named value pairs, Invoice Number, Name, Document Type & Invoice Value from three different invoices that have totally different layouts (see below).

1.

 

2.
 

 

3.

 
The first step […]

Adding a Kingfisher Job As An Autobahn DX Step

Aquaforest Kingfisher in a PDF data extraction tool that uses the text or barcode information found in PDF pages to perform different operations like:

Renaming PDF Files based on text matching or barcode information
Splitting PDF Files based on text matching or barcode information
Extracting Pages from PDF Files based on text matching or barcode information
Extract Content from PDF Files to txt or […]

Office to PDF printer

There are three different ways of producing MS Office to PDF,

MS Office Native Conversion *recommended option, for most scenarios
MS Office Direct Print
MS Office Extended Print

In most cases, we would recommend using the MS Office Native conversion as it’s the most reliable & probably has the all the features that are required for most scenarios. There are however some scenario’s where […]

Extract, Split and Rename PDFs with Aquaforest Kingfisher

Aquaforest Kingfisher is a sophisticated and powerful tool that is designed to help unlock and organize key business information trapped in PDF documents such as financial records, customer reports, scanned files and payment runs.

A core feature of the product is the ability to OCR PDF files during the conversion which means you can process image only (non-searchable) PDFs.

Below is a […]

Converting Postscript (.ps) files to PDF using Autobahn DX Server

The following steps will enable you to convert postscript (.ps) files to searchable PDF using Autobahn DX.

Download and install the relevant version of GSview using the following links gsv50w64.exe  gsv50w32.exe
Once installed change your .ps file to open with gsview, to do this:

Right click on a post script file and select properties.
Click on the change button.
Select either C:\Program Files\Ghostgum\gsview\gsview64.exe or gsview32.exe

 

 

Open the easyPDF […]

This short blog provides information on how Autobahn DX Server converts office type documents to PDF and what is required from your environment to successfully achieve this.

Autobahn DX uses the Autobahn and BCL easyPDF x Loader services to control the execution of both scheduled jobs and ad-hoc jobs whether submitted via the Autobahn Manager or via the Autobahn Job API.

The […]