Crawl AI
Intelligent web crawling for legal resources. Discover, index, and organize legal documents from courts, governments, and international sources.
Features
Legal Source Discovery
Automatically discover and index legal resources from court websites, government databases, and legislation portals.
Multi-Source Crawling
Crawl Korean courts, including the Supreme Court and the Constitutional Court, as well as international tribunals and regulatory agencies.
Content Extraction
Extract case citations, statute references, legal opinions, and regulatory guidelines with high accuracy.
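As a rough illustration of this extraction step, the sketch below pulls case numbers and statute article references out of text with regular expressions. The patterns are simplified assumptions about common Korean citation formats (a four-digit year, a Hangul case-type code, and a serial number; a statute name followed by an article number), and `extract_references` is a hypothetical name, not part of Crawl AI.

```python
import re

# Assumed, simplified patterns; real citation formats have more variants.
CASE_NUMBER = re.compile(r"\d{4}[가-힣]{1,3}\d+")            # e.g. 2019다12345
STATUTE_ARTICLE = re.compile(r"[가-힣]+ 제\d+조(?:의\d+)?")   # e.g. 민법 제750조


def extract_references(text: str) -> dict[str, list[str]]:
    """Return the case numbers and statute article references found in text."""
    return {
        "cases": CASE_NUMBER.findall(text),
        "statutes": STATUTE_ARTICLE.findall(text),
    }


# Sample sentence citing one Supreme Court case and one Civil Act article.
sample = "대법원 2019다12345 판결은 민법 제750조를 적용하였다."
refs = extract_references(sample)
```

A production extractor would likely need court-specific patterns and validation against known case-type codes; regular expressions alone are only a starting point.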
Intelligent Scheduling
Schedule automated crawls to keep your legal database up to date with the latest rulings and amendments.
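One simple way to think about scheduled crawls is as a check on how long ago each source was last crawled. The sketch below assumes a fixed re-crawl interval per run; the function name `due_sources` and the source keys are hypothetical, and the actual scheduler may work differently.

```python
from datetime import datetime, timedelta


def due_sources(last_crawled: dict[str, datetime],
                interval: timedelta,
                now: datetime) -> list[str]:
    """Return the sources whose last crawl is at least `interval` old."""
    return sorted(name for name, ts in last_crawled.items()
                  if now - ts >= interval)


now = datetime(2024, 1, 8, 9, 0)
last = {
    "supreme_court": datetime(2024, 1, 1, 9, 0),         # crawled 7 days ago
    "constitutional_court": datetime(2024, 1, 7, 9, 0),  # crawled 1 day ago
}
# With a 3-day interval, only the source crawled 7 days ago is due.
due = due_sources(last, timedelta(days=3), now)
```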
Duplicate Detection
Content-hash-based deduplication keeps each legal resource unique by discarding documents whose content has already been indexed.
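A minimal sketch of content-hash deduplication follows. The choice of SHA-256 and the whitespace normalization are assumptions made for illustration; the description above does not say which hash or normalization Crawl AI uses.

```python
import hashlib


def content_hash(text: str) -> str:
    """SHA-256 digest of the text after collapsing runs of whitespace."""
    normalized = " ".join(text.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def deduplicate(documents: list[str]) -> list[str]:
    """Keep the first document seen for each distinct content hash."""
    seen: set[str] = set()
    unique: list[str] = []
    for doc in documents:
        digest = content_hash(doc)
        if digest not in seen:
            seen.add(digest)
            unique.append(doc)
    return unique


# The first two entries differ only in spacing, so they hash identically.
docs = ["Ruling A  text", "Ruling A text", "Ruling B text"]
unique_docs = deduplicate(docs)
```

Exact hashing only catches identical content after normalization. Near-duplicates, such as the same ruling with a different page header, would need a fuzzier technique.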
Resource Collections
Organize crawled documents into curated collections for specific cases, matters, or research projects.
Crawl Pipeline
Discover
Find legal sources
Crawl
Extract content
Process
Parse & classify
Index
Make searchable
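The four stages above can be sketched as a chain of functions. Everything here is a hypothetical stand-in: `FAKE_WEB` replaces real HTTP fetching, the classification rule is a toy keyword check, and the index is a plain in-memory word-to-URL map rather than whatever search backend the product uses.

```python
from dataclasses import dataclass


@dataclass
class Document:
    url: str
    text: str
    doc_type: str = "unknown"


# Stand-in for the web: URL -> page body. A real crawler would fetch over HTTP.
FAKE_WEB = {
    "https://example.org/case/1": "Judgment on contract violation",
    "https://example.org/statute/2": "Act on consumer protection",
}


def discover() -> list[str]:
    """Discover: find candidate source URLs."""
    return sorted(FAKE_WEB)


def crawl(urls: list[str]) -> list[Document]:
    """Crawl: extract the content behind each URL."""
    return [Document(url=u, text=FAKE_WEB[u]) for u in urls]


def process(docs: list[Document]) -> list[Document]:
    """Process: parse and classify each document (toy keyword rule)."""
    for d in docs:
        d.doc_type = "case" if "judgment" in d.text.lower() else "statute"
    return docs


def index(docs: list[Document]) -> dict[str, set[str]]:
    """Index: map each lowercase word to the URLs that contain it."""
    idx: dict[str, set[str]] = {}
    for d in docs:
        for word in d.text.lower().split():
            idx.setdefault(word, set()).add(d.url)
    return idx


processed = process(crawl(discover()))
search_index = index(processed)
```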
Document Types
Korean Legal Sources
| Source | Type | Documents | Status |
|---|---|---|---|
| Supreme Court of Korea | Court | 1.2M+ | Active |
| Constitutional Court | Court | 45K+ | Active |
| National Law Information Center | Legislation | 800K+ | Active |
| Korea Legislation Research Institute | Research | 120K+ | Active |
| Financial Services Commission | Regulatory | 25K+ | Active |
International Sources
| Source | Region |
|---|---|
| United Nations Treaty Collection | International |
| World Trade Organization | International |
| International Court of Justice | Hague |
| European Court of Human Rights | Europe |

Start New Crawl Job
Search Crawled Documents
API Endpoints
| Endpoint | Description |
|---|---|
| /api/crawlai/sources/ | List all crawl sources |
| /api/crawlai/jobs/ | Start new crawl job |
| /api/crawlai/documents/search/ | Search crawled documents |
| /api/crawlai/documents/recent/ | Recent crawled documents |

Start Crawl Job
curl -X POST "http://localhost:8000/api/crawlai/jobs/" \
-H "Authorization: Bearer <token>" \
-H "Content-Type: application/json" \
-d '{
"source": "<source_uuid>",
"query": "contract violation",
"max_pages": 100,
"filters": {"document_type": "case"}
}'
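For callers working in Python, the same request could be built with the standard library, as in the sketch below. The URL, headers, and payload fields mirror the curl command above; the helper name `build_crawl_job_request` is hypothetical, and the response format is not documented here, so the commented-out send step assumes a JSON body.

```python
import json
import urllib.request


def build_crawl_job_request(token: str, source_uuid: str) -> urllib.request.Request:
    """Build (but do not send) the POST request that starts a crawl job."""
    payload = {
        "source": source_uuid,
        "query": "contract violation",
        "max_pages": 100,
        "filters": {"document_type": "case"},
    }
    return urllib.request.Request(
        "http://localhost:8000/api/crawlai/jobs/",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholders are kept as in the curl example; substitute real values.
request = build_crawl_job_request("<token>", "<source_uuid>")

# To send it against a running server (assumes the response body is JSON):
# with urllib.request.urlopen(request) as response:
#     print(json.load(response))
```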