OpenReader

Intelligent Document Processing with OCR & AI Extraction

Coming Soon

About OpenReader

OpenReader is an enterprise-grade Intelligent Document Processing (IDP) platform built on Drupal 11. It transforms unstructured documents such as invoices, passports, contracts, and receipts into structured, actionable data using a modular pipeline of OCR engines and AI extraction models. Its Core + Extension architecture lets it be extended to new document types and industries.

OpenReader automates the tedious, error-prone work of manual data entry from documents. Upload a PDF, image, or scan, and the system automatically detects the document type, extracts key fields using AI, validates the data, and exports it in the format you need. The modular plugin architecture means new document types can be added without touching core code. OpenReader also integrates with the wider ecosystem: Embassy Portal uses it for passport scanning, OpenStore for invoice processing, and MineRooms for ID verification.
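The detect, extract, validate, and export flow described above can be pictured with a small sketch. This is only an illustration under stated assumptions, not OpenReader's actual code or API: the `EXTRACTORS` registry, the `register` decorator, and the keyword-based `detect_type` are hypothetical stand-ins (a real deployment would use OCR output and an AI classifier), and Python is used purely for brevity.

```python
import json
import re
from typing import Callable, Dict, List

# Hypothetical registry mapping a document type to its field extractor.
EXTRACTORS: Dict[str, Callable[[str], dict]] = {}


def register(doc_type: str):
    """Decorator that adds an extractor without changing the pipeline itself."""
    def wrap(fn: Callable[[str], dict]) -> Callable[[str], dict]:
        EXTRACTORS[doc_type] = fn
        return fn
    return wrap


@register("invoice")
def extract_invoice(text: str) -> dict:
    """Pull two example fields out of already-OCR'd invoice text."""
    vendor = re.search(r"Vendor:\s*(.+)", text)
    total = re.search(r"Total:\s*([\d.]+)", text)
    return {
        "vendor": vendor.group(1).strip() if vendor else None,
        "total": float(total.group(1)) if total else None,
    }


def detect_type(text: str) -> str:
    """Toy keyword check standing in for an AI document classifier."""
    return "invoice" if "invoice" in text.lower() else "unknown"


def validate(fields: dict) -> List[str]:
    """Return the names of fields that could not be extracted."""
    return [name for name, value in fields.items() if value is None]


def process(text: str) -> str:
    """Run detect -> extract -> validate and export the result as JSON."""
    doc_type = detect_type(text)
    extractor = EXTRACTORS.get(doc_type)
    if extractor is None:
        raise ValueError(f"no extractor registered for {doc_type!r}")
    fields = extractor(text)
    return json.dumps(
        {"type": doc_type, "fields": fields, "missing": validate(fields)},
        sort_keys=True,
    )


sample = "INVOICE\nVendor: Acme Ltd\nTotal: 120.50\n"
result = json.loads(process(sample))
```

In this sketch, supporting a new document type means registering one more extractor function; `process` itself stays unchanged, which mirrors the "without touching core code" idea.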

Key Features

Multi-Engine OCR Pipeline (Tesseract, Google Vision, AWS Textract)
AI-Powered Field Extraction & Classification
Automatic Document Type Detection
Modular Core + Extension Architecture
Batch Processing & Queue Management
Confidence Scoring with Human-in-the-Loop Review
Template-Based Extraction Rules
JSON/CSV/API Data Export
Document Version History & Audit Trail
Real-time Processing Dashboard
Multi-language OCR Support (50+ Languages)
RESTful API for External Integration

Use Cases

See how OpenReader fits your industry

Invoice Processing

Automatically extract vendor, amounts, line items, and tax from invoices for accounting integration.

Identity Document Scanning

Extract passport, ID card, and visa data for Embassy Portal and immigration workflows.

Contract Analysis

Parse legal contracts to extract key clauses, dates, parties, and obligations for review.

Technology Stack

Drupal 11
PHP 8.3
Tesseract OCR
Google Vision AI
AWS Textract
Python ML

What Makes OpenReader Different

1. Plugin architecture: add new document types without touching core
2. Multi-engine OCR for maximum accuracy across document types
3. Confidence scoring ensures human review only when needed
4. Event-driven pipeline for extensibility and custom workflows
5. Ecosystem-native: powers document processing across all WeebPal products
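To make the confidence-scoring point concrete, here is a minimal sketch of how low-confidence fields might be routed to a reviewer. It is an assumption-laden illustration rather than OpenReader's implementation: the `ExtractedField` type, the `route` function, and the 0.85 threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import List

# Assumed cutoff; a real system would likely make this configurable per field.
REVIEW_THRESHOLD = 0.85


@dataclass
class ExtractedField:
    name: str
    value: str
    confidence: float  # 0.0 to 1.0, as reported by the extraction step


def route(fields: List[ExtractedField], threshold: float = REVIEW_THRESHOLD) -> dict:
    """Flag only the fields whose confidence falls below the threshold."""
    flagged = [f.name for f in fields if f.confidence < threshold]
    return {
        "status": "needs_review" if flagged else "auto_approved",
        "review_fields": flagged,
    }


passport = [
    ExtractedField("surname", "DOE", 0.98),
    ExtractedField("passport_number", "X1234567", 0.72),
]
decision = route(passport)
```

Here only `passport_number` would be sent to a human reviewer, while a document whose fields all clear the threshold would pass through without manual review.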

For Partners

For Customizers

Customize OpenReader for your market.

Start Customizing

For Operators

Deploy OpenReader for your business.

Start Operating

Interested in OpenReader?

Request a free demo or start customizing today.

Request Demo