Evidence
Delta Analysis
Blind evaluation by GPT-5 across 30 analytical dimensions. Configured output, using inference-time cognitive configuration, against default output from the same model on the same prompt.
Overall Score
Average across all dimensions
Semantic Density
Concepts per word
Meta-Reasoning
On meta-cognitive execution
Dimension-by-Dimension Comparison
Selected dimensions. Full 30-dimension matrix in the delta analysis.
Semantic Density Comparison
Unit: concepts per word. Higher = more information per token.
See the full evidence
The raw responses and the blind judge analysis
The full response from a standard, fresh Gemini 3 Deep Think instance. The full response from Gemini 3 Deep Think running the complete NovaThink Cognitive Seed stack of eight meta-cognitive priors that govern global reasoning. And the full delta analysis from a GPT-5 instance acting as a blind judge with no knowledge of which output came from which configuration.
01
Default Response
A fresh, unconfigured Gemini 3 Deep Think instance answers the synthesis prompt.
02
Configured Response
Gemini 3 Deep Think with the full NovaThink Cognitive Seed stack answers the same prompt.
03
Delta Analysis
A blind GPT-5 instance compares both outputs across 30 analytical dimensions.
Want to see how your own AI outputs compare?
Try The Inference Auditor