DEPS: Document Processing and Data Extraction Software
Solution Overview
Customer problem
Document management is a time-consuming, error-prone, yet critical task in many sectors, and most companies still handle it with significant manual effort. Unsearchable, unstructured data accounts for 80% of the enterprise document flow, and manual document processing wastes millions. Extracting data by hand not only takes time but also hinders data analysis.
EPAM Solution
DEPS, a data extraction solution, helps business operations units that process documents. The platform simplifies document processing by letting you build custom document workflows. It accurately extracts key information from your scanned or digitized documents, validates the extracted data for reliability, and converts it into a format that is easy to search.
Benefits
Fast feasibility assessment for business
Feedback on various documents without deployment to the client’s environment
End-to-end business flow
The platform has services for the full document lifecycle, from upload to archiving
High operation accuracy
DEPS includes continuously improving AI components to achieve maximum accuracy
Cost efficiency
Various pluggable open source components ensure optimal customization and cost
Speed of deployment
Easy integration with the client’s ecosystem in the cloud, on-premises, or hybrid
Democratization in use
UI portal for labeling, review, model improvement via continuous feedback
Features
User Interface:
- Designed to be continuously enhanced without IT support (auto-training is included for machine learning-based solutions)
- Customer-facing tool (simple web UI) that is customizable and flexible
OCR (Optical Character Recognition) and ICR (Intelligent Character Recognition):
- Out-of-the-box pre-trained models for processing documents such as invoices
- Models can handle low-quality input documents and recognize handwritten data in different languages
- Processes documents with complex data and structure (PDFs, scans, faxes, etc.)
- Supports third-party vendor tools such as Tesseract, ABBYY, AWS Textract, Google Vision API, and MS Azure CS
Data Extraction:
- Documents can also be uploaded via API
- Intelligent processing capabilities with end-to-end data processing and continuous accuracy improvement powered by deep learning and ML algorithms
- Results review and correction with mismatch and error notifications
- Labeling
- Results delivered in different formats (XML, JSON, etc.) together with an issues report
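As a rough illustration of delivering the same result in JSON and XML together with an issues report, here is a minimal sketch. The record layout (`document_id`, `fields`, `issues`) and the helper names are assumptions made for this example, not the actual DEPS output schema.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical extraction result; the keys below are assumptions for illustration.
result = {
    "document_id": "invoice-001",
    "fields": {"vendor": "ACME Ltd", "total": "1250.00"},
    "issues": ["total: low extraction confidence"],
}


def to_json(result):
    """Serialize the result, including its issues list, as JSON text."""
    return json.dumps(result, indent=2, sort_keys=True)


def to_xml(result):
    """Serialize the same result as an XML string."""
    root = ET.Element("document", id=result["document_id"])
    fields = ET.SubElement(root, "fields")
    for name, value in result["fields"].items():
        ET.SubElement(fields, "field", name=name).text = value
    issues = ET.SubElement(root, "issues")
    for message in result["issues"]:
        ET.SubElement(issues, "issue").text = message
    return ET.tostring(root, encoding="unicode")


print(to_json(result))
print(to_xml(result))
```

Keeping the issues list inside the same record means a consumer receives the extracted values and their known problems in one payload, whichever format it requests.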
Use Cases
Use Case for Oil & Gas Industry
Problem Statement
A large volume of scanned documents (PDF files) in hundreds of formats
Solution Proposed
- Advanced ML techniques (bidirectional recurrent neural networks) on top of a processing pipeline that packages all extraction steps
- Supporting workflow for golden dataset creation (original document regions markup and preliminary data labeling)
Achieved Results
Digitized three million documents and achieved a three-times-faster turnaround with 90% accuracy
Use Case for Retail Industry
Problem Statement
Process photographed receipts to collect and analyze the list of purchased items
Solution Proposed
- Custom ML model for automatic data extraction
- Identification of brand, product category, etc., from the item name by validating it against the client’s data storage
- Intuitive UI illustrating confidence level of item recognition
- Integration with customer services via API
Achieved Results
Opened a new market data stream by scanning 10,000 receipts monthly across nine commercial networks and applying an analytics solution
Questions & Answers
Can you integrate with our in-house authentication provider?
Posted on November 5, 2021 by Hans G
DEPS can be easily integrated with any authentication provider that uses the OpenID Connect protocol. Integration with other authentication providers is also possible but requires some development effort. If you prefer to use the default DEPS authentication, it is powered by Keycloak.
Posted on November 5, 2021 by SolutionsHub Support
Can you add a specific file format support or customize document workflow?
Posted on November 4, 2021 by Liza
Absolutely. DEPS works with a variety of formats, including images, spreadsheets, human-readable formats such as PDF, and machine-readable formats. Support for a custom file format can be added. Changes to the document workflow can be implemented as part of customization for the client.
Posted on November 4, 2021 by SolutionsHub Support
What is the final accuracy level of DEPS?
Posted on November 2, 2021 by Alex Mahno
DEPS works with several OCR engines, both free and paid. Depending on your file formats, we’ll be able to propose the optimal extraction engine or combination of engines.
To improve extraction, we use post-processing, autocorrection, validation and dictionary mapping. Documents with low confidence of extraction can be sent for manual review to ensure high accuracy.
Posted on November 3, 2021 by SolutionsHub Support
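The manual-review step mentioned in the answer above can be pictured as a simple confidence threshold. The sketch below is an illustration only: the `ExtractedField` type, the `route` helper, and the 0.85 threshold are hypothetical and are not taken from DEPS.

```python
from dataclasses import dataclass

# Hypothetical threshold; a real deployment would tune it per document type.
REVIEW_THRESHOLD = 0.85


@dataclass
class ExtractedField:
    name: str
    value: str
    confidence: float  # assumed to lie in [0.0, 1.0]


def needs_manual_review(fields, threshold=REVIEW_THRESHOLD):
    """Return True if any field's confidence is below the threshold."""
    return any(f.confidence < threshold for f in fields)


def route(documents, threshold=REVIEW_THRESHOLD):
    """Split document ids into (auto_accepted, manual_review) lists."""
    accepted, review = [], []
    for doc_id, fields in documents.items():
        if needs_manual_review(fields, threshold):
            review.append(doc_id)
        else:
            accepted.append(doc_id)
    return accepted, review


documents = {
    "invoice-001": [ExtractedField("total", "1250.00", 0.98),
                    ExtractedField("date", "2021-11-03", 0.95)],
    "invoice-002": [ExtractedField("total", "87.10", 0.62),
                    ExtractedField("date", "2021-11-02", 0.97)],
}
accepted, review = route(documents)
print("auto-accepted:", accepted)
print("manual review:", review)
```

Routing on the weakest field rather than on an average means a single doubtful value is enough to send the whole document to a reviewer, which favors accuracy over throughput.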
Integrates with
ABBYY OCR SDK
Tesseract
AWS Services
Azure Services
Google Cloud Services
CockroachDB
Tech Requirements
Chrome 92+/Edge/Firefox/Safari
Get the solution in 3 simple steps
We can help you achieve more! Choose the solution that supports your growth and success.
01 Reach Out to Us: Request the solution by submitting a short form
02 Sit Back & Relax: Our experts swiftly process your request and get back to you
03 Start Using the Solution: Dive in and unlock all the benefits