Structured PDF extraction API that turns complex files into consistent JSON.
codex-pdf is the document-facts layer in the printwithsynergy family. It parses uploaded PDFs into a stable schema your apps can consume for automation, QA summaries, and downstream prepress workflows.
- Single extract endpoint with predictable JSON output
- Page-level document facts for automation and QA
- Works alongside lens-pdf visual review
- Simple API contract for browser and backend integrations
- Built for production pipelines with optional token auth
Programmatic PDF assembly — a deterministic API build step for rewriting and generating print-ready PDFs.
compile-pdf is a programmatic PDF assembly utility. Call it as a discrete API build step to rewrite and generate print-ready PDFs. Output is byte-deterministic and carries SHA-256 lineage, so every transform is reproducible and auditable. It plugs into the prepress workflow you already run.
- Deterministic PDF rewrite + page assembly over a simple API
- Same input → same bytes, every operation verifiable
- SHA-256 lineage on every step for full auditability
- Optional marks, imposition, and trapping utilities when you need them
- Run it as a CLI, an in-process library, or an API sidecar
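As a rough illustration of what "same input, same bytes" with SHA-256 lineage can mean in practice, here is a minimal Python sketch. It is not compile-pdf's API: `LineageStep`, `run_with_lineage`, `chain_is_consistent`, and the toy transforms in `STEPS` are hypothetical names invented for this example, and the transforms operate on plain bytes rather than real PDFs.

```python
import hashlib
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Transform = Callable[[bytes], bytes]


@dataclass(frozen=True)
class LineageStep:
    """One recorded transform: what ran, and the hashes on either side of it."""
    operation: str
    input_sha256: str
    output_sha256: str


def sha256_hex(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def run_with_lineage(
    data: bytes, steps: Sequence[Tuple[str, Transform]]
) -> Tuple[bytes, List[LineageStep]]:
    """Apply each transform in order, recording input and output hashes."""
    lineage: List[LineageStep] = []
    for name, transform in steps:
        before = sha256_hex(data)
        data = transform(data)
        lineage.append(LineageStep(name, before, sha256_hex(data)))
    return data, lineage


def chain_is_consistent(lineage: Sequence[LineageStep]) -> bool:
    """Each step's output hash must equal the next step's input hash."""
    return all(
        a.output_sha256 == b.input_sha256 for a, b in zip(lineage, lineage[1:])
    )


# Hypothetical stand-ins for real PDF transforms; both are pure functions.
STEPS: List[Tuple[str, Transform]] = [
    ("append-comment", lambda b: b + b"\n% stamped"),
    ("uppercase", lambda b: b.upper()),
]
```

Running `run_with_lineage(b"%PDF-1.7 demo", STEPS)` twice yields identical bytes and identical lineage records, because every transform is a pure function of its input. An auditor can replay any single step and compare hashes without trusting the rest of the chain.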
Detection-only PDF preflight engine — 500+ checks plus the PDF/X-4 conformance suite.
lint-pdf is the open-source preflight engine. It inspects PDFs against 500+ checks across fonts, color, images, transparency, page geometry, and packaging — plus a 91-check PDF/X-4 conformance suite (ISO 15930-7). Detection-only by design: your originals are never modified.
- 500+ engine checks + PDF/X-4 (ISO 15930-7) conformance suite
- Built-in rulesets for GWG sheetfed, GWG digital, PDF/X-4, packaging
- External imports from PitStop, callas pdfToolbox, Acrobat
- FastAPI service + CLI + Python SDK
- Detection-only: your files are never modified
Embeddable PDF viewer with separations, TAC, layers, and annotation overlays.
lens-pdf is an embeddable web PDF viewer purpose-built for prepress review. It surfaces ink separations, total area coverage (TAC) maps, layer toggles, a single-pixel densitometer, and annotation overlays: the things a prepress operator actually needs when looking at a job.
- Per-channel ink separations (CMYK + spots)
- TAC heatmap and densitometer probe
- Layer toggles + annotation overlays
- Embeddable in any Next.js / React host
- Plugin slots for custom toolbars and panels
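For readers new to the term, total area coverage at a pixel is the sum of the ink percentages of every separation at that pixel. The Python sketch below shows that arithmetic on a tiny hand-built raster. It is not lens-pdf code: the function names, the data layout (rows of per-channel percentages), and the 300% default limit are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Pixel = Sequence[float]            # ink percentage per separation, 0-100 each
Image = Sequence[Sequence[Pixel]]  # rows of pixels


def total_area_coverage(pixel: Pixel) -> float:
    """TAC is the sum of all ink percentages at one pixel (process + spots)."""
    return float(sum(pixel))


def probe(image: Image, x: int, y: int) -> float:
    """Single-pixel densitometer-style reading: TAC at column x, row y."""
    return total_area_coverage(image[y][x])


def tac_map(image: Image) -> List[List[float]]:
    """Per-pixel TAC values, the raw data behind a heatmap."""
    return [[total_area_coverage(p) for p in row] for row in image]


def over_limit(image: Image, limit: float = 300.0) -> List[Tuple[int, int]]:
    """(x, y) coordinates whose TAC exceeds the limit (assumed default: 300%)."""
    return [
        (x, y)
        for y, row in enumerate(image)
        for x, p in enumerate(row)
        if total_area_coverage(p) > limit
    ]
```

A pixel printed at 100% in all four process inks reads 400%, which `over_limit` would flag against the assumed 300% ceiling. The right limit depends on the press, ink, and substrate.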
PDF assay and metadata reporting — surface what's actually inside the file.
assay-pdf reports the structural and content characteristics of a PDF without making a value judgment about them. It covers page boxes, embedded fonts, color spaces, image inventory, security flags, and structure tags: the raw inventory that downstream tools (preflight, accessibility, archival workflows) need to do their jobs.
- Per-page metadata: dimensions, boxes, rotation, content streams
- Font inventory: embedded vs. referenced, subset state, encoding
- Color and image inventory with downsampling detection
- Security and encryption posture
- Structure tag tree for accessibility tooling
WYSIWYG canvas editor for label and packaging artwork — PDF/X-4 output, flexo support, and a full create-to-RIP workflow.
artwork-pdf is an embeddable canvas editor for label and packaging artwork. It outputs PDF/X-4 via Ghostscript, handles flexo distortion compensation per separation, parses CF2/DDES/ARD dielines, and supports Pantone, white ink, and varnish as named separations. Self-hostable via Docker Compose.
- PDF/X-4 output via Ghostscript — print-ready from first render
- Flexo distortion compensation per separation — automatic plate correction
- CF2, DDES, and ARD dieline format support
- Pantone, white ink, and varnish as named separations with preview thumbnails
- Embeddable as @printwithsynergy/artwork-pdf-editor — Docker Compose self-hosting
Stateless imposition-planning solver — step-and-repeat, gang, and true-shape nesting.
sift-pdf is the imposition-planning solver in the printwithsynergy family. Given SKU/job rules and a press/substrate context, it computes the most efficient step-and-repeat, gang, or true-shape nest layout, emits an ImposePlan, and hands it to compile-pdf to write the sheet. It never writes a PDF itself.
- Three solver tiers: uniform grid, CP-SAT gang, true-shape nest
- Stateless + deterministic — same inputs, same plan, content-addressed cache
- Builds on codex geometry primitives (tile_grid, polygon ops)
- Hands off to compile-pdf.impose for the actual PDF write
- API-first: /v1/sift/solve, /suggest, /estimate + CLI
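To make the simplest tier concrete, the Python sketch below plans a uniform step-and-repeat grid: it counts how many identical items fit on a sheet with a gutter between items and a margin around the edge, trying both orientations. This is not sift-pdf's solver or its ImposePlan format. `GridPlan`, `plan_grid`, and every parameter name are hypothetical, and the model ignores grain direction, gripper, bleed, and marks.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GridPlan:
    cols: int
    rows: int
    rotated: bool  # True if the item was turned 90 degrees

    @property
    def count(self) -> int:
        return self.cols * self.rows


def _fit(usable: float, item: float, gutter: float) -> int:
    """Items of size `item` that fit in `usable`, with `gutter` between them.

    n items need n*item + (n-1)*gutter, so n = floor((usable+gutter)/(item+gutter)).
    """
    if item <= 0 or gutter < 0:
        raise ValueError("item must be positive and gutter non-negative")
    if item > usable:
        return 0
    return int((usable + gutter) // (item + gutter))


def plan_grid(
    sheet_w: float, sheet_h: float,
    item_w: float, item_h: float,
    gutter: float = 0.0, margin: float = 0.0,
) -> GridPlan:
    """Pick the orientation that yields the most items; ties keep the item unrotated."""
    usable_w = sheet_w - 2 * margin
    usable_h = sheet_h - 2 * margin
    candidates = [
        GridPlan(_fit(usable_w, w, gutter), _fit(usable_h, h, gutter), rotated)
        for rotated, (w, h) in (
            (False, (item_w, item_h)),
            (True, (item_h, item_w)),
        )
    ]
    # max() returns the first maximal element, so the result is deterministic.
    return max(candidates, key=lambda plan: plan.count)
```

For a 720 x 1020 sheet with a 10-unit margin and 5-unit gutter, a 90 x 140 item fits 7 x 6 = 42 up unrotated versus 4 x 10 = 40 up rotated, so the unrotated plan wins. Because the function depends only on its arguments, the same inputs always give the same plan, which is the property that makes content-addressed caching of plans possible.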
Content-addressed digital-asset plane — versioned blobs, a presigned data plane, and on-prem agent recall.
trove is the content-addressed digital-asset plane for the printwithsynergy stack. It stores versioned assets by content hash, serves a presigned data plane so bytes move directly to and from object storage, and recalls on-prem files through a lightweight agent — every transfer hash-verified end to end.
- Content-addressed, deduplicated blob + versioned asset store
- Presigned data plane — bytes never cross the control channel
- On-prem trove-agent for plant-local file recall over one WebSocket
- Per-tenant isolation enforced with Postgres RLS
- Caches codex-derived facts by content hash
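The idea behind a content-addressed, versioned store can be sketched in a few lines of Python. This is an in-memory toy, not trove's implementation or API: `ContentStore` and its methods are hypothetical, SHA-256 is an assumed choice of hash, and presigned URLs, tenancy, and the on-prem agent are left out entirely.

```python
import hashlib
from typing import Dict, List, Optional


class ContentStore:
    """Toy store: blobs keyed by their SHA-256, assets as ordered lists of hashes."""

    def __init__(self) -> None:
        self._blobs: Dict[str, bytes] = {}
        self._versions: Dict[str, List[str]] = {}

    def put_blob(self, data: bytes) -> str:
        """Store bytes under their own hash; identical content is stored once."""
        digest = hashlib.sha256(data).hexdigest()
        self._blobs.setdefault(digest, data)
        return digest

    def get_blob(self, digest: str) -> bytes:
        """Return the bytes for a hash, re-verifying them before handing them back."""
        data = self._blobs[digest]
        if hashlib.sha256(data).hexdigest() != digest:
            raise ValueError(f"blob {digest} failed hash verification")
        return data

    def commit(self, asset_id: str, data: bytes) -> int:
        """Record a new version of an asset; returns the 1-based version number."""
        history = self._versions.setdefault(asset_id, [])
        history.append(self.put_blob(data))
        return len(history)

    def read(self, asset_id: str, version: Optional[int] = None) -> bytes:
        """Read a specific version, or the latest when no version is given."""
        history = self._versions[asset_id]
        digest = history[-1] if version is None else history[version - 1]
        return self.get_blob(digest)

    def blob_count(self) -> int:
        return len(self._blobs)
```

Committing the same bytes under two different asset IDs stores one blob, which is where deduplication comes from. Verifying the hash on read is a small-scale version of checking every transfer end to end.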