AWS Textract Asynchronous Architecture
Project details
Description
Requirements for this project have been to extract information from PDF documents of which thousands are uploaded per day. As a presetting, AWS Textract had to be used to extract information and tables from uploaded documents. For this purpose and also due to the vast amount of files an asynchronous pipeline has to be constructed which can start and process multiple Textract jobs at the same time without blocking the scheduling of proceeding jobs. The requirements of the client demanded to store and further process the Textract results on the client's infrastructure being separated from the AWS environment. My job was to design and implement the architecture in AWS and Cloud Foundry.
-
Order Date:
10.01.2022 -
Final Date:
12.03.2022 -
Status:
Completed -
Client:
Asset Manager -
Location:
Germany, Frankfurt
Client reviews

Paul Trueman
Working with Artur has been a pleasure. Better yet - I alerted them of a minor issue before going to sleep. The issue was fixed the next morning. I couldn't ask for better support. Thank you Artur! This is easily a 5 star freelancer.

Paul Trueman
Working with Artur has been a pleasure. Better yet - I alerted them of a minor issue before going to sleep. The issue was fixed the next morning. I couldn't ask for better support. Thank you Artur! This is easily a 5 star freelancer.

Paul Trueman
Working with Artur has been a pleasure. Better yet - I alerted them of a minor issue before going to sleep. The issue was fixed the next morning. I couldn't ask for better support. Thank you Artur! This is easily a 5 star freelancer.

Paul Trueman
Working with Artur has been a pleasure. Better yet - I alerted them of a minor issue before going to sleep. The issue was fixed the next morning. I couldn't ask for better support. Thank you Artur! This is easily a 5 star freelancer.