Agentic Document Analyser

Name: Agentic Document Analyser
Author: AiExponent

Converts unstructured compliance documents (risk assessments, model cards, contracts, audit logs) into structured JSON using Vision-Language Models. Acts as the evidence processing layer for the AiExponent compliance toolchain. Feeds Article 11 technical documentation and Article 19 automatically-generated-log preservation workflows.

View on GitHub

Quick Start

bashdocker compose up

Features

Vision-Language Model (Qwen2-VL) for unified layout analysis and OCR in a single pass
Detects and classifies document elements: text blocks, headings, tables, figures, form fields, signatures
Returns precise bounding boxes for every detected element
Parallel page processing for multi-page PDFs
Structured JSON output consumable by downstream compliance tools
Docker Compose deployment: four microservices, one command

Regulatory Foundation

Regulatory mapping in review. This tool is pre-release; the EU AI Act article mapping will be published before general availability.

Known Limitations

Requires Docker Compose; no standalone pip package available.
Depends on Fireworks AI API key; no offline/local inference by default.
No persistent storage; results are not retained between container restarts.
No authentication on the /analyze endpoint, so it is not suitable for public deployment without a reverse proxy.
Alpha quality: no production hardening, rate limiting, or database backend yet.

For the most current status, see GitHub issues.

Contributing

Contributions are welcome; everything is Apache 2.0 licensed. See the contributing guide and open issues.

License

Licensed under the Apache License 2.0. Not legal advice. Not a notified body.

AI Exponent LLC is a USA-registered company that operates two brand arms: aiexponent.com (the technology arm shipping these open-source tools) and askajay.ai (an independent advisory practice). The tool above is product, not advisory. The advisory does not require, sell, or recommend purchase of any product on this site.

Evidence flow

One tool covers one article. The full set covers your audit.

Each AiExponent tool emits a named artefact the next tool reads as input. Browse the full toolchain, from Article 5 screening through Article 72 post-market monitoring.

See all tools →