Smart Document Engine — high-performance software for automatic classification, recognition and analysis of documents and forms. The system helps to automate document management workflows and optimize document entry processes. Smart Document Engine quickly and securely scans and extracts the required data from various document types — standard and reporting forms, primary, business, statutory, financial, notarial, legal, insurance and banking documents, as well as standard questionnaires and forms of strict accountability.
The most important feature of Smart Document Engine is the ability to create specialized solutions for automatic processing, classification, recognition and analysis of documents and forms of various complexity required by particular customers. High-performance and accurate document recognition solutions, optimized for a specific data flow and business processes, will allow your organization to reduce costs and ease the workload processing, without compromising data security — the images are not transferred to any third-parties or their services.
Thanks to our state-of-the-art recognition and computer vision algorithms, compact deep neural network models with the Hough transform (HoughNet and HoughEncoder) and energy efficient GreenOCR® text recognition technology developed by our scientists and engineers, Smart Document Engine solves business document recognition tasks even on mobile phones, previously only high-performance servers managed to do it.
Thanks to Smart Engines technologies, a modern mobile phone scans a document stream from a specialized document scanner with a capacity of up to 30 pages per minute, previously only high-performance workstations or servers were able to do it. At the same time, the quality is on a high level and the solution ensures the best text recognition, digital and other document data accuracy. Thus, the recognition of a full-page tax certificate on Galaxy S10 takes less than 3 seconds. At the same time, the system quickly and precisely scans both scans and photos and automatically performs all the actions necessary for classification, data extraction and text recognition, robust to lighting conditions, geometric distortions and poor image quality.
Smart Engines software:
Documents with fixed layout are documents, various copies of which match when deleting the field details. Classic examples of fixed document forms are the medical absence excuse or machine-readable student examination forms. Smart Document Engine includes state-of-the-art algorithms for fast location and typization and of fixed-layout forms in scans, photos and video stream. The technology allows you to instantly detect and scan such documents, as well as verify their types.
Smart Document Engine allows to create solutions for scanning and analysis of documents such as power of attorney, agreements and contracts. With the Smart Document Engine solution it is possible to classify such documents, extract text or other details, verify signatures and stamps, and much more.
Smart Document Engine allows you to classify and recognize both individual images of documents, questionnaires and forms, as well as multi-page documents, or page sequences containing multiple documents. Page sequence processing allows you to simplify the stream scanning process, sort the stream and check the presence of the required documents (completeness check).
Smart Document Engine allows you to accurately and reliably recognize marks and checkboxes made both digitally and with a pen.
For high-quality processing of accounting, tax, banking and other documents, search and recognition of tabular data have been implemented in the Smart Document Engine system. Tables with a fixed structure, relational and non-relational, tables with a variable number of columns and table-like structured data recognition is supported.
While analyzing complex structured documents to identify significant phrases and paragraphs, as well as to detect incorrect changes in printed document texts, Smart Document Engine allows you to check the presence of keywords and keyphrases and monitor the integrity of the targeted paragraphs.
Smart Document Engine allows you to detect, locate and recognize blots, strikethrough, corrections and other markings in a document in order to control its validity and extract additional information.
Smart Document Engine allows you to determine both the document colour as a whole and its individual elements (stamps and signatures), thus, it allows to recognize a black-and-white document copy even when re-scanning or photographing.
Smart Document Engine allows you to extract handwritten fields and signatures, recognize handwritten numerical fields, and verify the presence of handwritten marks and signatures.
Flexible forms are documents, whose elements and details can change their position relative to one another. Classic examples of flexible forms are tax and accounting documents, such as bills, enterprise balance sheets, financial performance reports, payment orders, etc. Smart Document Engine quickly identifies such documents, detects significant details and perform their recognition and analysis.
Smart Document Engine can recognize documents and forms both from images captured from flatbed and sheetfed scanners, as well as from photos and videos captured using smartphones, tablets or other mobile devices.
The technology stack implemented in the Smart Document Engine platform allows you to quickly and accurately recognize single-line and multi-line text fields in more than 100 languages, extract details from dense paragraphs, typewritten and handprinted fields, and much more.
Smart Document Engine is equipped with fast stamp detection, location, and classification modules. In addition to the stamp search and verification, the platform allows you to recognize individual text components of stamps.
Smart Document Engine verifies document data entries, including mandatory text or graphic field entries check, and performs analysis of document zones designed for handwritten or handprinted filling.
Smart Document Engine allows you to detect, locate and verify graphic elements, such as company logos, as well as important document graphic fields, for example, a personal photo attached to a questionnaire.
In addition to the text field recognition, Smart Document Engine allows you to analyze the attributes of text field and individualcharacters, such as font characteristics (presence of serifs, italics and boldness), estimation of uniformity, monotonicity, etc.
The GreenOCR® technology included in Smart Document Engine provides high recognition accuracy of printed texts (OCR), handprinted and handwritten form fields (ICR), as well as label and checkmarks recognition (OMR). The implemented AI-based approach “I extract what I see” does not use linguistic models and high-quality text recognition is achieved by extremely high accuracy of character recognition. Thus, it allows you to achieve high digitization accuracy and avoid errors when the recognition system substitutes the result based on a particular linguistic or statistical language model.
The Smart Document Engine SDK allows you to turn an iOS or Android-based mobile device into an high-quality document scanner.
Automated document scanning in a webcam video stream in real-time for automatic data input in CIP KYC, AML, and other customer identification tasks.
A wide range of software interfaces provides the possibility of native integration into most information systems.
Smart Document Engine SDK allows you to implement scalable AI platforms for automatic data extraction from business document
With the Smart Document Engine SDK, you can add deep document layout analysis and recognition to your infrastructure solutions for back-office automation, as well as to mobile applications to maximally simplify remote automatic document.
Smart Document Engine is delivered as an autonomous document scanner SDK (software development kit) which contains all the necessary precompiled libraries, programming interface documentation and integration examples for various programming languages. Developers are provided with a simple but multifunctional API (application programming interface), which allows to integrate the document recognition solution into the client’s software using languages C++, C#, Java, Python and Objective-C for a wide range of systems: iOS, Android, Linux, Windows, MacOS, Sailfish Mobile, Astra Linux, etc. The following hardware platforms are supported: x86_64, ARM v7, v8 (Aarch32, Aarch64), MIPS. It is possible to connect to the popular RPA frameworks.
Out of the box document templates are tax certificates, payment orders, balance sheets, the profit and loss statement, Texas DL-14A, USA forms of SSA, CMS, IRS and other.
Smart Document Engine can be bundled with Smart ID Engine to provide functionality for entering ID documents data, the solution provides high quality and fast recognition of more than 2484 document types. Recognition of codified objects such as machine-readable zones (MRZ) and barcodes is available through the use of Smart Code Engine.
Easy-to-use software
Our advanced AI algorithms automatically detect the document in the frame, automatically determine its type, find textual and graphical details and perform their recognition. The system is resistant to various geometric distortions, noise, inconsistent lighting, printing defects and low resolution.
Operating Systems
Smart Document Engine supports a wide class of operating systems, including specialized operating systems for personal data processing: Cent OS, Ubuntu, Red Hat Enterprise Linux, SUSE Linux Enterprise Server, Arch Linux and other Linux distributions, MS Windows, macOS, Aurora OS, iOS, Android, Sailfish Mobile OS.
Reliability
To increase the reliability, the AI-based “I extract what I see” approach is implemented. This approach does not involve the use of any dictionaries and grammars and is based on responsible compact networks. In addition to the recognition results, the user can access the confidence rates for each document field and get other information about the recognition process.
Security
Thanks to the achievements of our scientists, all computations are performed on device, data is not transferred to servers, which is confirmed by independent international audit. Data is NOT transferred, NOT stored, Internet access is NOT required, and data processing is performed in the local RAM. The “rule of three NOTs” ensures a high security and privacy level for our customers.
Coverage
The system supports recognition of documents in 100 languages worldwide, Cyrillic and Latin alphabets, and logographic scripts including Korean, Japanese, traditional and simplified Chinese. Besides, the system searches for tables, stamps, signatures, handwritten notes, and recognizes checkboxes and barcodes.
Efficiency
The highest performance is achieved due to computer vision algorithms and compact deep neural networks. The full cycle from the type definition to the recognition of all the details may take only 2 seconds for an A4 document page.
Other document types can be added upon request.
Abkhaz, Afrikaans, Albanian, Armenian, Aymara, Azerbaijanian, Belarusian, Berber (Latin alphabet), Bosnian, Bulgarian, Cantonese, Castilian, Catalan, Chewa (Latin alphabet), Chibarwe (Latin alphabet), Chichewa (Latin alphabet), Comorian (Latin alphabet), Croatian, Czech, Danish, Dutch, English, Estonian, Fiji Hindi (Latin script), Fijian, Filipino, Finnish, French, Georgian, German, Greek, Haitian Creole, Hebrew, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Kazakh, Kinyarwanda, Kirundi, Korean, Kyrgyz, Latvian, Lithuanian, Luxembourgish, Macedonian, Malay (Latin alphabet), Maltese, Mandarin, Maori, Moldovan, Mongolian (Cyrillic alphabet), Nauruan, Ndau, Ndebele, Northern Sotho, Norwegian, Ossetian, Palauan, Polish, Portuguese, Quechua, Romanian, Russian, Serbian, Seychellois Creole, Shangani, Shona, Slovak, Slovene, Somali (Latin alphabet), Sotho, Southern Ndebele, Spanish, Swahili (Latin alphabet), Swazi, Swedish, Thai, Tajik, Tamazight (Latin alphabet), Tetum, Tok Pisin, Tonga, Tsonga, Tswana, Turkish, Turkmen, Ukrainian, Uzbek, Woleaian, Xhosa, Zulu.
Mobile SDK includes React Native and Flutter integration interfaces.
Integration with other programming languages and frameworks (such as Go, Perl, Xamarin, etc.) is ensured by using the API in C
The SDK already contains examples of using Smart Document Engine in Java / Python / C++ / C / Swift / Objective C / C# / Java / PHP.
Argos KYC
Argos KYC partners AI-driven Smart Engines to reinforce identity verification
BioCollections Worldwide
BioCollections Worldwide speeds up patient data intake for COVID-19 testing with Smart Engines
Vodafone Qatar
Vodafone Qatar uses Smart Engines’ technology for scanning ID card of Qatar
Kaspi.kz
Smart Engines rolls out its AI-driven software in super app Kaspi.kz
Send Request
Please fill out the form to get more information about the products,
pricing and trial SDK for Android, iOS, Linux, Windows.