Published Accuracy

Everyone in construction tech claims AI. Almost nobody shows the numbers.

We publish ours because you should be able to check.

The Problem with "AI"
"40% efficiency gains! 10x faster workflows! Revolutionary AI!"

— Every pitch deck, ever

"But... can I see the math?"

— You, hopefully

You wouldn't quote a $50M mechanical package without reading the spec. You'd check every section, flag the alternates, and know exactly what you're pricing.

Why would software be any different?

Our Numbers
100K+
AI executions
12
distinct workflows
89%
overall accuracy

What's Effectively Solved

Component Spec Parsing 99%+
Document Classification 97%+
Equipment Extraction 95%+
Equipment Quantity 90.5%

What We're Still Improving

Table Alternates 83.1%
Complex Mech Schedules 81%

The gap between these and the solved tasks tells you where a human still needs to check.

What These Tasks Mean
Document Classification
Is this a spec, drawing, schedule, or addendum? Misclassify a document and everything downstream inherits the error.
Equipment Extraction
What MEP equipment is specified? Miss a chiller buried in an addendum and you miss the project entirely.
Component Specs
Capacities, efficiencies, voltages, RPMs. These determine whether equipment actually meets design intent.
Table Alternates
What substitutions are acceptable? Knowing your options early changes how you price the job and which lines you push.

Each task sounds straightforward. In practice, construction documents are messy. Specs contradict drawings. Addenda override base documents. Equipment schedules use different naming conventions from page to page.

That's why we measure on real project data.

Why We're Publishing This

Receipts > promises

We say equipment extraction runs at 95%. Here are the executions.

We say we've processed tens of thousands of specs. Here's the data.

You can check.

Updated quarterly. Last updated: Q1 2026.

When models improve or we add new workflows, the numbers here change. If accuracy drops on something, that shows up too.

See for yourself

See It On Your Documents

Send us a bid package. See the extractions. Judge the accuracy yourself.

Get Started

Data from production workloads over the past quarter. Accuracy measured against human-verified ground truth. Some things work well. Some don't yet. That's the point of publishing it.