[Extract form data] Tab


[Extract form data] tab is available only when the SmartWorker option is applied.

Extract the form data contained in a PDF file and outputs it as either a CSV or XML file.

[Specify form PDF] area

[Add] [Select folder]  Select a PDF file from [Select file] to add the selected PDF file to the file list.
Select a folder from [Select folder] to add all PDF files directly under that folder to the file list. Does not apply to subfolders.
Files and folders can also be added to the list by dragging and dropping them.  
[Up] [Down] To change the order, press the [Up][Down] buttons.
[Sort] [...] When you press [...], you can configure the sorting of the file list.
When you press [Sort], the file list is sorted according to the settings made in [...].
[Remove] [Remove All] If there is a file that you want to stop extracting form data from, press [Delete] to remove it from the file list.
To remove all files from the file list, press the [Remove All] button. 

 

Form data is extracted in order from the top of the file list.

[Output format] area

CSV

Output the extracted form data as a comma-separated CSV file.

PDF FileName, PDF Folder, and Result are always output to the CSV file.

CSV : Output to file for each PDF (Values are set in horizontal) Form data set extracted by one PDF is saved as one CSV file.
File name : <Original file name>.csv
1st row : Item name
2nd row : Data
(Ex.) Sample1.csv
PDF FileName, PDF Folder, Result, "txt_1", "txt_2"
Sample1.pdf, C:\test, Success, "ABC", "abc"
CSV : Output a file for each PDF (Values are set in vertical) Form data set extracted by one PDF is saved as one CSV file.
File name : <Original file name>.csv
1st column : Item name
2nd column : Data
(Ex.) Sample1.csv
PDF FileName, Sample1.pdf
PDF Folder, C:\test
Result, Success
"txt_1", "ABC"
"txt_2", "abc"
CSV : Output a file for all PDF by specifying the template PDF Data that matches the field name of the specified PDF is extracted from all PDF files in the file list, and output as a single CSV file. 
file name : <Template PDF file name>.csv
1st row : Item name based on the contents of the specified the Template PDF.
2nd row : the value of the specified the Template PDF.
3rd and subsequent : the values of PDF files in the file list.
(Ex.) Template.pdf
PDF FileName, PDF Folder, Result, "txt_1", "txt_2"
Template.pdf, C:\test, Success, "", ""
Sample1.pdf, C:\test, Success, "ABC", "abc"
Sample2.pdf, C:\test, Success, "DEF", "def"
Sample3.pdf, C:\test, Success, "GHI", "ghi"

XML

XML : Output a file for each PDF  Select [XML: Output a file for each PDF] to output the extracted form data as an XML file.
The content of the output XML file is the same as the XML data that can be obtained using the GetInputXmlData function of SkyPDF ToolsSDK III, a separate product we offer. 

[Template PDF] area

[PDF file at the top of the PDF list is used as the template PDF] PDF file at the top of the PDF list is used as the template PDF. 
[Specify the template PDF file] Press [Browse] and specify the template PDF. 

[Specify Folder] area

[Output to specified folder]  Press the Browse button and select an output directory.
Cannot output to a folder that does not exist or a folder for which you do not have write permission. 
[Output to same folder as original file]  The PDF file will be saved in the same directory as the source file in the file list.
Cannot be selected when [CSV: Output a file for all PDF by specifying the template PDF] is specified. 
[Same as template PDF] Can be selected when [CSV: Output a file for all PDF by specifying the template PDF] is specified.

[If file with same filename exists, add sequential number and output]

When this checkbox is unchecked, if a file with the same filename exists in the output destination, it will be overwritten and saved.

When this checkbox is checked, if a file with the same filename exists in the output destination, a sequential number will be added to the end of the filename and it will be saved as a separate file. 

[Continue processing without stopping if any error occurs]

If an error occurs, the process will be interrupted, at which point processing of the remaining files will also be stopped.

Check this checkbox to continue the conversion process for the next file without stopping even if an error occurs.

The results of the process can be confirmed in the “Status” column of the file list.

[Begin extraction]

Press the [Begin extraction] button to start processing.