Aquaforest PDF Connector : Get Data from PDF & Populate into Excel - Aquaforest



In this article, we outline how to use the Aquaforest Flow Connector to get named value pairs from invoice documents and populate them into Excel.

We are going to extract four named value pairs (Invoice Number, Name, Document Type and Invoice Value) from three invoices that have completely different layouts (see below).








The first step is to define the trigger for our flow. In this example, we trigger the flow when an item is created in SharePoint, then use the Aquaforest “Get Data from PDF” step to retrieve the named value pairs before populating them into an Excel table.

  1. Create a new Automated Flow
    • Give it a name: “Get data from PDF & Populate into Excel”
    • Select your trigger: “When a file is created in a folder”










  2. Specify the location where you are going to put your invoices






  3. Add a step to get the contents of the file




    • Specify the Site Address and the “Identifier”





  4. Add an “Aquaforest - Get Data from PDF” step











  5. Specify the following parameters:

    • File Content: the file content output from the SharePoint step above
    • Expected Keys: Title, Name, Invoice Number and Grand Total
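
To make the idea of expected keys more concrete, the Python sketch below mimics named value pair extraction on plain text. It is only an illustration under stated assumptions: the function `extract_named_values` and the sample invoice text are hypothetical, and this article does not describe how the Aquaforest connector performs extraction internally.

```python
import re

# Keys listed in the "Expected Keys" parameter of the step above.
EXPECTED_KEYS = ["Title", "Name", "Invoice Number", "Grand Total"]


def extract_named_values(text, expected_keys):
    """Return {key: value} for each expected key found as 'Key: value' in text.

    Hypothetical helper for illustration; not the connector's implementation.
    """
    found = {}
    for key in expected_keys:
        # Match the key at the start of a line, followed by ':' and its value.
        pattern = rf"^\s*{re.escape(key)}\s*:\s*(.+?)\s*$"
        match = re.search(pattern, text, flags=re.IGNORECASE | re.MULTILINE)
        if match:
            found[key] = match.group(1)
    return found


# Made-up invoice text standing in for the text layer of a PDF.
sample_text = """\
Title: Invoice
Invoice Number: INV-001
Name: Jane Example
Grand Total: 120.00
"""

pairs = extract_named_values(sample_text, EXPECTED_KEYS)
print(pairs)
```

Keys that do not appear in the text are simply left out of the result, so a later step can decide how to handle missing values.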









  6. In Microsoft Excel, create a spreadsheet containing a table and save it to the SharePoint library.

    1. Give the table a name: “Invoices”
    2. Add the following columns: “Invoice No”, “Title”, “Total”, “Full Name”








  7. Add a new step: “Excel Online – Add a row into a Table”


















  8. Populate the “Add a row into a Table” step as follows:
    1. Location: the location of the spreadsheet
    2. Library: the library containing the spreadsheet
    3. File: the folder and file name of the spreadsheet
    4. Table: the table name (Invoices)
    5. Enter the values from the “Get Data from PDF” step
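
The mapping of extracted values to table columns can be sketched in Python as follows. The pairing of keys to columns (for example, Grand Total to Total) is an assumption inferred from the key and column names used in this article, and `build_row` is a hypothetical helper, not part of Power Automate or the Aquaforest connector.

```python
# Assumed pairing of extracted keys (left) to "Invoices" table columns (right).
KEY_TO_COLUMN = {
    "Invoice Number": "Invoice No",
    "Title": "Title",
    "Grand Total": "Total",
    "Name": "Full Name",
}


def build_row(extracted):
    """Build one table row; keys missing from the PDF become empty cells."""
    return {column: extracted.get(key, "") for key, column in KEY_TO_COLUMN.items()}


# Made-up output of the extraction step for a single invoice.
extracted = {"Title": "Invoice", "Invoice Number": "INV-001", "Grand Total": "120.00"}
row = build_row(extracted)
print(row)
```

In the flow itself, the same pairing is done by selecting each extracted value as dynamic content for the matching column field.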










  9. When you drop the documents into the folder, the flow runs and populates the named value pairs into the prepared Microsoft Excel table, as shown below.





Neil Pitman founded Aquaforest Limited in 2001 and is the chief architect for the company’s PDF and OCR software products, used by thousands of organizations ranging from NASA to the Dutch Ministerie van Justitie. Neil has 30 years’ experience in the software industry in the UK and USA in the areas of database systems, document management and software development tools. He has served on the IDT committees of the British Standards Institution (BSI) and was a co-author of the BSI’s 2007 publication on the Long Term Preservation of Digital Documents.