Nearmap AI Gen 6 vs Claude Fable and Gemini

Jun 2026

In the 72 hours that Claude Fable was available, we benchmarked it against Nearmap AI. Here are the results.

Jun 2026

Read the full report →

As general-purpose LLMs have matured, a fair question has emerged: do purpose-built AI models still hold a measurable advantage for defined property tasks?

Nearmap ran a controlled benchmark comparing Nearmap AI Gen 6 against seven generalist models from Google and Anthropic, including Claude Fable 5. The test was run across four property intelligence tasks, including pool detection, roof area, roof count and roof condition.

Nearmap AI Data Layers gif of US swimming pool detections

The benchmark

hand-labelled US residential properties in the test dataset

0.0

F1 score achieved by Nearmap AI Gen 6 on pool detection

properties per second: Nearmap AI Gen 6 throughput

generalist models tested across Google and Anthropic, including Fable

AI Accuracy

The results

Findings indicate a material performance gap across accuracy, failure mode distribution, and operational throughput, with structural causes that are unlikely to be resolved by improvements in general model capability alone.

How property intelligence helps you succeed

Nearmap AI Gen 6 vs Claude Fable and Gemini

Jun 2026

In the 72 hours that Claude Fable was available, we benchmarked it against Nearmap AI. Here are the results.

Jun 2026

Read the full report →

The benchmark

AI Accuracy

The results

How property intelligence helps you succeed

Accuracy

False negatives

False positives

Throughput

Applications

Data & Insights

Solutions

Company

Support

Connect