Automatic recognition of business forms and documents

Send Request

Smart Document Engine — automatic analysis and data extraction from business documents for desktop, server and mobile platforms


Smart Document Engine is high-performance software for automatic classification, recognition and analysis of documents and forms. The system helps automate document management workflows and optimize document entry processes. Smart Document Engine quickly and securely scans and extracts the required data from various document types: standard and reporting forms, primary, business, statutory, financial, notarial, legal, insurance and banking documents, as well as standard questionnaires and forms of strict accountability.


The most important feature of Smart Document Engine is the ability to create specialized solutions for automatic processing, classification, recognition and analysis of documents and forms of varying complexity, tailored to a particular customer’s requirements. High-performance, accurate document recognition solutions, optimized for a specific data flow and business process, allow your organization to reduce costs and ease the processing workload without compromising data security: images are not transferred to any third parties or their services.

High-speed and user-friendly document scanner

Thanks to our state-of-the-art recognition and computer vision algorithms, compact deep neural network models with the Hough transform (HoughNet and HoughEncoder) and the energy-efficient GreenOCR® text recognition technology developed by our scientists and engineers, Smart Document Engine solves business document recognition tasks even on mobile phones. Previously, only high-performance servers could do this.


Thanks to Smart Engines technologies, a modern mobile phone can process a document stream from a specialized document scanner at up to 30 pages per minute, a task that previously required high-performance workstations or servers. Quality remains high: the solution delivers accurate recognition of text, digits and other document data. For example, recognition of a full-page tax certificate on a Galaxy S10 takes less than 3 seconds. The system quickly and precisely processes both scans and photos and automatically performs all the actions necessary for classification, data extraction and text recognition, while remaining robust to lighting conditions, geometric distortions and poor image quality.

Private & Secure document scanner

Smart Engines software:


  • Runs 100% on-premise/on-device
  • High-level security standards of personal data processing — GDPR, CCPA, HIPAA
  • Compliance with KYC regulatory requirements in CIP and AML
  • Does NOT transfer the client’s data to third-party services and/or third parties for manual processing
  • Does NOT save or store the client’s data: the recognition process is performed in the local RAM of the device
  • Does NOT require internet access.

Who already uses scanning technologies from Smart Engines

Test Drive our technologies

Try out Smart ID Engine technologies to scan ID cards, passports, driver’s licenses, visas and residence permits in photos and in a real-time video stream.

Document scanner features

Fixed-layout documents are documents whose copies become identical once the filled-in field contents are removed. Classic examples of fixed-layout forms are medical absence excuses and machine-readable student examination forms. Smart Document Engine includes state-of-the-art algorithms for fast location and type identification of fixed-layout forms in scans, photos and video streams. The technology allows you to instantly detect and scan such documents, as well as verify their types.

Smart Document Engine allows you to extract handwritten fields and signatures, recognize handwritten numerical fields, and verify the presence of handwritten marks and signatures.

Flexible forms are documents whose elements and details can change their position relative to one another. Classic examples of flexible forms are tax and accounting documents, such as bills, enterprise balance sheets, financial performance reports, payment orders, etc. Smart Document Engine quickly identifies such documents, detects significant details, and performs their recognition and analysis.

Smart Document Engine can recognize documents and forms both in images captured by flatbed and sheet-fed scanners and in photos and videos captured with smartphones, tablets or other mobile devices.

The technology stack implemented in the Smart Document Engine platform allows you to quickly and accurately recognize single-line and multi-line text fields in more than 100 languages, extract details from dense paragraphs, typewritten and handprinted fields, and much more.

Smart Document Engine is equipped with fast stamp detection, location, and classification modules. In addition to the stamp search and verification, the platform allows you to recognize individual text components of stamps.

Smart Document Engine verifies document data entries, including mandatory text or graphic field entries check, and performs analysis of document zones designed for handwritten or handprinted filling.
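A mandatory-entry check of this kind can be pictured with a minimal sketch. The `Field` structure, the field names and the `missing_mandatory` helper below are hypothetical illustrations and are not part of the Smart Document Engine API.

```python
from dataclasses import dataclass


@dataclass
class Field:
    """One extracted field (hypothetical structure, not the SDK's own type)."""
    name: str
    value: str       # recognized text, empty if nothing was read
    is_filled: bool  # True if the zone contains text or a handwritten mark


def missing_mandatory(fields, mandatory_names):
    """Return the mandatory field names that are absent or left unfilled."""
    filled = {f.name for f in fields if f.is_filled}
    return [name for name in mandatory_names if name not in filled]


# Illustrative data: the signature zone was found but is empty,
# and no "date" field was extracted at all.
fields = [
    Field("taxpayer_name", "JOHN DOE", True),
    Field("signature", "", False),
]
print(missing_mandatory(fields, ["taxpayer_name", "signature", "date"]))
```

A document would pass the check only when the returned list is empty.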

Smart Document Engine allows you to detect, locate and verify graphic elements, such as company logos, as well as important document graphic fields, for example, a personal photo attached to a questionnaire.

In addition to text field recognition, Smart Document Engine allows you to analyze the attributes of text fields and individual characters, such as font characteristics (presence of serifs, italics and boldness), estimation of uniformity, monotonicity, etc.

High-precision GreenOCR®

The GreenOCR® technology included in Smart Document Engine provides high recognition accuracy for printed text (OCR), handprinted and handwritten form fields (ICR), and labels and checkmarks (OMR). The AI-based “I extract what I see” approach does not use linguistic models; high-quality text recognition is achieved through extremely accurate character recognition. This lets you achieve high digitization accuracy and avoid the errors that occur when a recognition system substitutes the result based on a particular linguistic or statistical language model.
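The substitution error described above can be illustrated with a toy sketch. It is not GreenOCR® code: the per-character scores, the vocabulary and both decoder functions are assumptions made for illustration, and the dictionary step uses Python's standard `difflib` as a stand-in for a language model.

```python
import difflib

# Hypothetical per-character classifier scores for a handprinted surname.
char_scores = [
    {"S": 0.98, "5": 0.02},
    {"m": 0.97, "n": 0.03},
    {"y": 0.90, "i": 0.10},
    {"t": 0.99, "f": 0.01},
    {"h": 0.96, "b": 0.04},
]


def decode_as_seen(scores):
    """Keep the most likely character at each position, with no dictionary."""
    return "".join(max(s, key=s.get) for s in scores)


def decode_with_dictionary(scores, vocabulary):
    """Snap the raw string to the closest vocabulary word, if one is close."""
    raw = decode_as_seen(scores)
    matches = difflib.get_close_matches(raw, vocabulary, n=1, cutoff=0.6)
    return matches[0] if matches else raw


raw = decode_as_seen(char_scores)
corrected = decode_with_dictionary(char_scores, ["Smith", "Johnson"])
print(raw, corrected)
```

Here the character-level result keeps the surname as written, while the dictionary step replaces it with a more common word, which is the kind of error a model-free approach avoids.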

Omnichannel Deployment

Mobile SDK

The Smart Document Engine SDK allows you to turn an iOS or Android-based mobile device into a high-quality document scanner.

More about Mobile SDK

In-browser SDK

Automated document scanning in a real-time webcam video stream for automatic data input in CIP, KYC, AML, and other customer identification tasks.

More about Web SDK

Desktop SDK

A wide range of software interfaces provides the possibility of native integration into most information systems.

More about Desktop SDK

Server SDK

The Smart Document Engine SDK allows you to implement scalable AI platforms for automatic data extraction from business documents.

More about Server SDK

What do we provide?

With the Smart Document Engine SDK, you can add deep document layout analysis and recognition to your infrastructure solutions for back-office automation, as well as to mobile applications to maximally simplify remote automatic document.


Smart Document Engine is delivered as a standalone document scanner SDK (software development kit) that contains all the necessary precompiled libraries, programming interface documentation and integration examples for various programming languages. Developers are provided with a simple but multifunctional API (application programming interface) that allows them to integrate the document recognition solution into the client’s software using C++, C#, Java, Python and Objective-C on a wide range of systems: iOS, Android, Linux, Windows, macOS, Sailfish Mobile, Astra Linux, etc. The following hardware platforms are supported: x86_64, ARM v7 and v8 (AArch32, AArch64), and MIPS. Integration with popular RPA frameworks is also possible.


Out-of-the-box document templates include tax certificates, payment orders, balance sheets, profit and loss statements, the Texas DL-14A, US forms of the SSA, CMS and IRS, and others.


Smart Document Engine can be bundled with Smart ID Engine to add ID document data entry; that solution provides high-quality, fast recognition of more than 2484 document types. Recognition of codified objects such as machine-readable zones (MRZ) and barcodes is available through Smart Code Engine.


Easy-to-use software
Our advanced AI algorithms automatically detect the document in the frame, automatically determine its type, find textual and graphical details and perform their recognition. The system is resistant to various geometric distortions, noise, inconsistent lighting, printing defects and low resolution.


Operating Systems
Smart Document Engine supports a wide range of operating systems, including specialized operating systems for personal data processing: CentOS, Ubuntu, Red Hat Enterprise Linux, SUSE Linux Enterprise Server, Arch Linux and other Linux distributions, MS Windows, macOS, Aurora OS, iOS, Android, Sailfish Mobile OS.

To increase the reliability, the AI-based “I extract what I see” approach is implemented. This approach does not involve the use of any dictionaries and grammars and is based on responsible compact networks. In addition to the recognition results, the user can access the confidence rates for each document field and get other information about the recognition process.
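Per-field confidence values are typically used to decide which fields can be accepted automatically and which should go to a human operator. The sketch below assumes a plain dictionary of `(value, confidence)` pairs and an illustrative threshold of 0.9; neither the data layout nor the threshold comes from the Smart Document Engine API.

```python
def split_by_confidence(results, threshold=0.9):
    """Split recognized fields into auto-accepted and needs-review groups.

    `results` maps a field name to a (value, confidence) pair; this layout
    is a hypothetical stand-in for whatever the SDK actually returns.
    """
    accepted, review = {}, {}
    for name, (value, confidence) in results.items():
        target = accepted if confidence >= threshold else review
        target[name] = value
    return accepted, review


# Illustrative recognition output with made-up confidence values.
results = {
    "invoice_number": ("INV-0042", 0.99),
    "total": ("1 250.00", 0.71),
}
accepted, review = split_by_confidence(results)
print("accepted:", accepted)
print("needs review:", review)
```

The right threshold depends on the document flow and on how costly a wrong value is compared with a manual check.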


Thanks to the achievements of our scientists, all computations are performed on the device and data is not transferred to servers, which is confirmed by an independent international audit. Data is NOT transferred, NOT stored, Internet access is NOT required, and data processing is performed in the local RAM. The “rule of three NOTs” ensures a high security and privacy level for our customers.

The system supports recognition of documents in 100 languages worldwide, covering Cyrillic and Latin alphabets and East Asian scripts including Korean, Japanese, and traditional and simplified Chinese. In addition, the system searches for tables, stamps, signatures and handwritten notes, and recognizes checkboxes and barcodes.


High performance is achieved through computer vision algorithms and compact deep neural networks. The full cycle from document type identification to the recognition of all the details may take only 2 seconds for an A4 document page.

Recommended solutions

Technical Specification

Hardware architectures

  • x86
  • x86_64
  • ARMv7-v8 (AArch32 and AArch64)
  • MIPS (MIPS32 and MIPS64)

Mobile operating systems

  • Android (5.1 and up)
  • iOS (10 and up)
  • Sailfish Mobile OS (2.2 and up)
  • Custom versions: upon request

Desktop and server operating systems

  • MS Windows (all officially supported versions)
  • Linux kernel based OS, including Ubuntu, Red Hat (RHEL), Debian, CentOS, SUSE, Astra Linux, Oracle Linux, and others
  • macOS (all officially supported versions)
  • QNX (version 7.0 and up)
  • Solaris (version 11.3 and up)
  • Custom versions: upon request

Recognition languages

Abkhaz, Afrikaans, Albanian, Armenian, Aymara, Azerbaijani, Belarusian, Berber (Latin alphabet), Bosnian, Bulgarian, Cantonese, Castilian, Catalan, Chewa (Latin alphabet), Chibarwe (Latin alphabet), Chichewa (Latin alphabet), Comorian (Latin alphabet), Croatian, Czech, Danish, Dutch, English, Estonian, Fiji Hindi (Latin script), Fijian, Filipino, Finnish, French, Georgian, German, Greek, Haitian Creole, Hebrew, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Kazakh, Kinyarwanda, Kirundi, Korean, Kyrgyz, Latvian, Lithuanian, Luxembourgish, Macedonian, Malay (Latin alphabet), Maltese, Mandarin, Maori, Moldovan, Mongolian (Cyrillic alphabet), Nauruan, Ndau, Ndebele, Northern Sotho, Norwegian, Ossetian, Palauan, Polish, Portuguese, Quechua, Romanian, Russian, Serbian, Seychellois Creole, Shangani, Shona, Slovak, Slovene, Somali (Latin alphabet), Sotho, Southern Ndebele, Spanish, Swahili (Latin alphabet), Swazi, Swedish, Tajik, Tamazight (Latin alphabet), Tetum, Thai, Tok Pisin, Tonga, Tsonga, Tswana, Turkish, Turkmen, Ukrainian, Uzbek, Woleaian, Xhosa, Zulu.

Programming languages

  • Java (version 1.7 and higher)
  • Python (version 3.7.2 and higher)
  • C++ (standard C++ 11 and higher)
  • C (standard C99 and higher)
  • C# (version 6.0 and higher)
  • PHP (version 5 and higher)

Mobile SDK includes React Native and Flutter integration interfaces.

Integration with other programming languages and frameworks (such as Go, Perl, Xamarin, etc.) is ensured through the C API.

The SDK already contains examples of using Smart Document Engine in Java / Python / C++ / C / Swift / Objective-C / C# / PHP.

Send Request

Please fill out the form to get more information about the products, pricing and a trial SDK for Android, iOS, Linux, Windows.

    Our customers


    Raiffeisenbank scans identity documents using Smart Engines technologies

    BioCollections Worldwide

    BioCollections Worldwide speeds up patient data intake for COVID-19 testing with Smart Engines

    Caribbean Airlines

    Caribbean Airlines improves passenger onboarding in its app with passport scanning by Smart Engines


    Tessi integrates Smart Engines AI-based solutions into its business process services
