Document AI
Advanced OCR, document analysis, entity extraction, and AI-powered legal document processing.
Features
OCR Extraction
Advanced optical character recognition with support for multiple languages and document formats.
Document Analysis
AI-powered analysis to extract key information, summarize content, and identify important clauses.
Entity Recognition
Automatically identify parties, dates, amounts, obligations, and deadlines in legal documents.
Risk Analysis
Identify potential legal risks and problematic clauses in contracts and agreements.
Template Generation
Generate documents from customizable templates with variable substitution.
Multi-Format Support
Process PDF, Word, images, and scanned documents with high accuracy.
Use Cases
Contract Review
Automatically extract clauses, identify obligations, and highlight potential issues in contracts.
Document Digitization
Convert scanned legal documents into searchable, structured digital formats.
Due Diligence
Process large volumes of documents to extract key information for M&A transactions.
Legal Research
Extract and organize information from case files and legal briefs.
Upload Document
Drag and drop your document here, or click to browse
Supports PDF, DOCX, PNG, JPG (max 50MB)
Analysis Options
API Endpoints
/api/docai/documents/
Upload document
/api/docai/documents/{id}/run_ocr/
Run OCR
/api/docai/documents/{id}/analyze/
AI analysis
/api/docai/documents/{id}/entities/
Get entities
/api/docai/templates/{id}/generate/
Generate from template
Example Request
curl -X POST "http://localhost:8000/api/docai/documents/" \
-H "Authorization: Bearer <token>" \
-H "Content-Type: multipart/form-data" \
-F "title=Contract.pdf" \
-F "file=@/path/to/document.pdf" \
-F "document_type=contract"
Run Analysis
curl -X POST "http://localhost:8000/api/docai/documents/<doc_id>/analyze/" \
-H "Authorization: Bearer <token>" \
-H "Content-Type: application/json" \
-d '{
"analysis_type": "entity_recognition",
"ai_model": "gemini-1.5-flash"
}'