Agentic Document Analyser

Articles 11 + 19Infrastructure

Converts unstructured compliance documents — risk assessments, model cards, contracts, audit logs — into structured JSON using Vision-Language Models. Acts as the evidence processing layer for the AiExponent compliance toolchain. Feeds Article 11 technical documentation and Article 19 automatically-generated-log preservation workflows.

On this page

Quick Start

bashdocker compose up

Features

  • Vision-Language Model (Qwen2-VL) for unified layout analysis and OCR in a single pass
  • Detects and classifies document elements: text blocks, headings, tables, figures, form fields, signatures
  • Returns precise bounding boxes for every detected element
  • Parallel page processing for multi-page PDFs
  • Structured JSON output consumable by downstream compliance tools
  • Docker Compose deployment — four microservices, one command

Regulatory Foundation

Regulatory mapping in review. This tool is pre-release; the EU AI Act article mapping will be published before general availability.

Known Limitations

  • Requires Docker Compose; no standalone pip package available.
  • Depends on Fireworks AI API key — no offline/local inference by default.
  • No persistent storage; results are not retained between container restarts.
  • No authentication on the /analyze endpoint — not suitable for public deployment without a reverse proxy.
  • Alpha quality: no production hardening, rate limiting, or database backend yet.

For the most current status, see GitHub issues.

Contributing

Contributions are welcome — Apache 2.0 licensed. See the contributing guide and open issues.

License

Licensed under the Apache License 2.0. Not legal advice. Not a notified body.

The Compound Moat

One tool is a start. The chain is the moat.

Each AiExponent tool produces structured evidence the next tool consumes. Browse the full toolchain — from Article 5 screening through Article 72 post-market monitoring.

See all tools →