Blog

Insights, tutorials, and updates on high-assurance AI systems, neuro-symbolic programming, and vericoding.

Code Will Become Opaque — And That's Fine

Most software developers haven't twigged the real potential of automated theorem proving with LLMs. If provers are powerful enough, you no longer need to understand the code — only the contracts at the interface.

by gavin
verificationLLMstheorem provingvericodingsoftware engineering

Axiomander: Robots on Rails

Contracts as plain assert statements. Verification via Coq and SMT. Zero imports, zero decorators, zero runtime overhead. Bringing theorem-prover-grade verification to real Python programmers.

by gavin
verificationneuro-symbolicPythonCoqMCP

Ghost Entities: Why LLM Hallucinations in Entity Extraction Are a Serious Downstream Risk

Hallucinated entities and relationships look identical to real ones inside a knowledge graph. We measured how often frontier models inject facts from parametric memory rather than from your documents — and found rates as high as 73% on a single document. Here is why that matters and what to do about it.

by gavin
AI ResearchEntity ExtractionHallucinationsKnowledge GraphsLLMs