Build Your Own OCR Pipeline
Open-source document understanding framework. Complete training scripts, modular architecture, and full control. Train your own models.
Why Doctane?
Complete control over your document processing pipeline
End-to-End OCR
Complete pipeline from text detection to recognition. Handles straight and rotated text seamlessly.
Layout Understanding
Automatic page orientation detection and straightening. Multi-column document support.
Modular Architecture
Plug-and-play model components. Easy to extend with custom models and pipelines.
High Performance
Optimized inference with PyTorch. DDP training support for multi-GPU acceleration.
Multi-Language
Supports English and French; extensible to other languages.
Structured Output
Hierarchical Document objects. Export to JSON and hOCR formats.
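As a rough picture of what a hierarchical document object with JSON export can look like, here is a minimal sketch using standard-library dataclasses. The class names, fields, and the relative-coordinate box convention are assumptions made for illustration, not Doctane's actual schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List, Tuple

# Assumed convention: (xmin, ymin, xmax, ymax) in relative page coordinates.
Box = Tuple[float, float, float, float]


@dataclass
class Word:
    text: str
    confidence: float
    box: Box


@dataclass
class Line:
    words: List[Word] = field(default_factory=list)


@dataclass
class Page:
    index: int
    lines: List[Line] = field(default_factory=list)


@dataclass
class Document:
    pages: List[Page] = field(default_factory=list)

    def to_json(self) -> str:
        # asdict() recurses through the nested dataclasses.
        return json.dumps(asdict(self), indent=2)


word = Word(text="Hello", confidence=0.98, box=(0.1, 0.1, 0.3, 0.15))
doc = Document(pages=[Page(index=0, lines=[Line(words=[word])])])
exported = doc.to_json()
```

An hOCR exporter would walk the same hierarchy and emit nested HTML elements instead of JSON.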
Processing Pipeline
Four-stage pipeline for complete document understanding
Preprocessing
Page orientation detection and optional straightening.
Text Detection
Segmentation-based model identifies text regions.
Text Recognition
Crops fed to recognition model for transcription.
Document Assembly
Structured output with geometry and confidence scores.
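The four stages above can be sketched as one function that chains three pluggable callables. Everything here is a stand-in: `run_pipeline` and the stub stages are hypothetical, and a nested list plays the role of an image so the example runs without any dependencies.

```python
from typing import Callable, Dict, List, Tuple

Image = List[List[int]]          # stand-in for a real image array
Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels


def run_pipeline(
    image: Image,
    orient: Callable[[Image], Image],
    detect: Callable[[Image], List[Box]],
    recognize: Callable[[Image], Tuple[str, float]],
) -> Dict[str, list]:
    page = orient(image)   # 1. preprocessing: fix page orientation
    boxes = detect(page)   # 2. text detection: find text regions
    words = []
    for x0, y0, x1, y1 in boxes:
        # 3. text recognition: crop each region and transcribe it
        crop = [row[x0:x1] for row in page[y0:y1]]
        text, confidence = recognize(crop)
        # 4. document assembly: keep text, geometry, and confidence together
        words.append({"text": text, "confidence": confidence,
                      "box": (x0, y0, x1, y1)})
    return {"words": words}


# Stub stages; real ones would wrap trained models.
blank_page = [[0] * 4 for _ in range(2)]
result = run_pipeline(
    blank_page,
    orient=lambda img: img,
    detect=lambda img: [(0, 0, 2, 1)],
    recognize=lambda crop: ("word", 1.0),
)
```

Passing the stages in as callables keeps the control flow independent of any particular detection or recognition model.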
100+ Supported Models
State-of-the-art architectures for detection and recognition
LinkNet (Detection)
DeepLabV3+ (Detection)
SegFormer (Detection)
UNet (Detection)
UNet++ (Detection)
FPN (Detection)
PSPNet (Detection)
PAN (Detection)
MAnet (Detection)
Faster R-CNN (Detection)
SAR (Recognition)
ViTSTR (Recognition)
CRNN (Recognition)
MASTER (Recognition)
TRBA (Recognition)
ABINet (Recognition)
LSTR (Recognition)
ViTPTR (Recognition)
MATRN (Recognition)
PARSeq (Recognition)
Plus 80+ encoder variants (ResNet, EfficientNet, VGG, MobileNet, DenseNet, etc.)
Get Started
Clone, install, and run in minutes
Clone
Download from GitHub
Install
Run pip install
Launch
Open localhost:8000/app
No Pre-trained Weights
We provide the code only, not model weights. Train your own models using the provided scripts.
Bring Your Infrastructure
Training requires a GPU. Use your own hardware or a cloud provider (AWS/GCP/Azure). DDP is supported.
Featured Projects
Doctane
Multimodal intelligent document analysis and understanding system with OCR and layout understanding.
Exploit2Patch
AI-Powered Vulnerability Intelligence Platform with autonomous CVE research and patch generation.
DL-Studio
Local deep learning development environment with 20+ algorithms, built-in XAI, and web interface.