The Get Document Text Detection Job Result step retrieves result pages for an asynchronous text detection job. Amazon Textract is an AWS service that automatically reads and extracts text, data, and tables from documents and images.
After the module is installed, a dependency must be created before the step can be used.
Considerations
The asynchronous steps (Start Document Analysis Job, Get Document Analysis Job Result, Start Document Text Detection Job, Get Document Text Detection Job Result) can be used for larger multipage documents in PDF or TIFF format.
Users may need to add a Textract Policy to their AWS user account to authorize access to the API.
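The asynchronous pattern behind these steps can be sketched in Python: a job is started, then its result is requested until the job is no longer in progress. This is a minimal sketch, assuming a boto3 Textract client (or any object exposing the same `get_document_text_detection` method) is passed in. The helper name `wait_for_text_detection`, the polling interval, and the attempt limit are illustrative assumptions and are not part of the step.

```python
import time


def wait_for_text_detection(client, job_id, delay_seconds=5.0, max_attempts=60):
    """Poll until the job leaves IN_PROGRESS, then return the first result page.

    `client` is assumed to behave like boto3.client("textract").
    The delay and attempt limit are illustrative defaults.
    """
    for _ in range(max_attempts):
        page = client.get_document_text_detection(JobId=job_id)
        status = page["JobStatus"]
        if status == "IN_PROGRESS":
            time.sleep(delay_seconds)
            continue
        if status == "FAILED":
            raise RuntimeError(page.get("StatusMessage", "Textract job failed"))
        return page  # SUCCEEDED or PARTIAL_SUCCESS
    raise TimeoutError(f"Job {job_id} still in progress after {max_attempts} polls")
```

Passing the client in, rather than creating it inside the helper, keeps the sketch independent of how credentials and region are configured.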
Properties
Inputs
| Property | Description | Data Type |
| --- | --- | --- |
| Job Id | Takes in the Job ID from a Document Text Detection Job. | List of String |
| Max Results | Allows Users to specify a maximum number of results. The default is 1000. | FileData |
| Next Token | Allows Users to specify an optional next token, used to request the next page of results. | List of String |
| Region | Allows Users to specify a region. | AmazonAWSRegion |
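The way Max Results and Next Token work together can be illustrated with a short Python sketch against the underlying Textract API. It assumes a boto3-style client is supplied by the caller; the function name `iter_result_pages` is hypothetical. Each response may carry a `NextToken`, which is sent back on the following request until no token is returned.

```python
def iter_result_pages(client, job_id, max_results=1000):
    """Yield every result page for a finished job by following NextToken.

    `client` is assumed to behave like boto3.client("textract").
    """
    next_token = None
    while True:
        kwargs = {"JobId": job_id, "MaxResults": max_results}
        if next_token:
            kwargs["NextToken"] = next_token  # omitted on the first request
        page = client.get_document_text_detection(**kwargs)
        yield page
        next_token = page.get("NextToken")
        if not next_token:
            break
```

In a flow, the equivalent is to call the step again with the token returned by the previous call until no token comes back.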
Override Settings
Authentication
If an Access Key ID and Secret Access Key are provided, those credentials are used.
If no overrides are provided, the system/IAM credentials from the existing AWS settings are used.
If overrides are used, both override fields must be provided.
Users must ensure Textract permissions are present in the IAM policy for the selected credentials.
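The credential rules above can be sketched as a small Python helper that prepares the keyword arguments for `boto3.client("textract", ...)`. This is an illustration of the rule, not the step's actual implementation; the function name `build_client_kwargs` is hypothetical, while `region_name`, `aws_access_key_id`, and `aws_secret_access_key` are standard boto3 client parameters.

```python
def build_client_kwargs(region, access_key_id=None, secret_access_key=None):
    """Return keyword arguments for boto3.client("textract", **kwargs)."""
    kwargs = {"region_name": region}
    if access_key_id is None and secret_access_key is None:
        # No overrides: boto3 falls back to its default credential chain
        # (environment, shared config, or an attached IAM role).
        return kwargs
    if not (access_key_id and secret_access_key):
        # Mirrors the rule that both override fields must be supplied together.
        raise ValueError("Provide both Access Key ID and Secret Access Key, or neither")
    kwargs["aws_access_key_id"] = access_key_id
    kwargs["aws_secret_access_key"] = secret_access_key
    return kwargs
```

Rejecting a half-filled pair early gives a clearer error than letting the request fail later with an authentication error.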
Settings
| Property | Description | Data Type |
| --- | --- | --- |
| Throw Error if Null is Not Expected | Enables Users to toggle this setting on/off. | Boolean |
Override Settings
| Property | Description | Data Type |
| --- | --- | --- |
| Access Key ID | Allows Users to specify an Access Key ID for an AWS account. | String |
| Secret Access Key | Allows Users to specify a Secret Access Key for an AWS account. | String |
Outputs
| Property | Description | Data Type |
| --- | --- | --- |
| GetDocumentTextDetectionJobResult1_Ouput | Retrieves the Textract Document result. | TextractDocumentResult |
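As a rough illustration of what a text detection result contains, the Python sketch below pulls the detected lines out of result pages. It assumes the raw Textract API response shape, in which each page holds a `Blocks` list and each block has a `BlockType` and, for text blocks, a `Text` field. How the TextractDocumentResult type exposes these fields is not described here, so that mapping is an assumption, and the function name `extract_lines` is hypothetical.

```python
def extract_lines(pages):
    """Collect the text of every LINE block, in the order returned.

    `pages` is an iterable of dicts shaped like raw Textract responses.
    """
    lines = []
    for page in pages:
        for block in page.get("Blocks", []):
            if block.get("BlockType") == "LINE":
                lines.append(block.get("Text", ""))
    return lines


# Usage with a hand-written page in the assumed response shape.
sample_page = {
    "Blocks": [
        {"BlockType": "PAGE"},
        {"BlockType": "LINE", "Text": "Invoice 1001"},
        {"BlockType": "WORD", "Text": "Invoice"},
    ]
}
print(extract_lines([sample_page]))  # ['Invoice 1001']
```

Filtering on `LINE` avoids repeating text, since the same content also appears word by word in `WORD` blocks.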
Step Changes
| Description | Version | Date | Developer Task |
| --- | --- | --- | --- |
| The AWS module has been updated with 9 additional steps that allow Users to access the Textract API. | | | |