Open computational mathematics. AI-audited, not peer-reviewed. All code and data open for independent verification.

Audit Log

Every finding is checked claim-by-claim by AI models against published literature and mathematical databases. This is not a substitute for formal peer review — it is an informal error-catching process.
Each review logs which model performed the check.

17 Findings audited
44 Reviews total
164 Issues discovered
91% Issues resolved

What the Badges Mean

Gold literature-supported

3+ published papers corroborate the methods. Validated against published benchmarks.

e.g. Hausdorff digit 1 dominance — validated against Jenkinson-Pollicott, Hensley, and Falk-Nussbaum
Silver literature-supported

1+ published paper plus arXiv coverage. Methods grounded in established literature.

e.g. Spectral gaps — Bourgain-Gamburd-Sarnak property (τ) computationally supported at large scale
Bronze novel observation

Novel observation. Related preprints exist but no direct literature precedent.

e.g. Golden ratio witness — no prior report of this concentration

How It Works

1 Claim Extraction

Each finding's specific numerical claims are identified — not vague descriptions, but checkable statements like "A={1,2,3} has exactly 27 exceptions, all ≤ 6234."
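A checkable claim of this kind can be sketched as a small record plus a brute-force checker. This is a minimal illustration and not the project's extraction code: `Claim`, `partial_quotients`, `exceptions_up_to`, and `consistent_with` are hypothetical names. It assumes an "exception" for a digit set A is a denominator d for which no reduced fraction a/d has all of its continued-fraction partial quotients in A. The search bound of 30 is chosen for readability and does not test the full range of the quoted claim.

```python
from dataclasses import dataclass
from math import gcd


@dataclass(frozen=True)
class Claim:
    """Hypothetical record for one checkable numerical claim."""
    digit_set: frozenset
    exception_count: int
    largest_exception: int


def partial_quotients(a, d):
    """Partial quotients of a/d (0 < a < d) via the Euclidean algorithm."""
    digits = []
    while a:
        q, r = divmod(d, a)
        digits.append(q)
        d, a = a, r
    return digits


def exceptions_up_to(limit, allowed):
    """Denominators 2..limit with no reduced a/d whose quotients all lie in `allowed`."""
    found = []
    for d in range(2, limit + 1):
        representable = any(
            gcd(a, d) == 1 and all(q in allowed for q in partial_quotients(a, d))
            for a in range(1, d)
        )
        if not representable:
            found.append(d)
    return found


def consistent_with(claim, observed):
    """True if a partial list of exceptions does not contradict the claim."""
    return (len(observed) <= claim.exception_count
            and all(d <= claim.largest_exception for d in observed))


# The claim quoted above, encoded as data (assumed encoding).
claim = Claim(frozenset({1, 2, 3}), exception_count=27, largest_exception=6234)
small = exceptions_up_to(30, claim.digit_set)
print(small, consistent_with(claim, small))  # [6, 20, 28] True
```

A partial search like this can only contradict a claim, never confirm it; confirming the count would require running the search past the claimed bound and arguing that no exceptions exist beyond it.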

2 Literature Cross-Reference

Each claim is checked against live academic databases via our MCP server: arXiv, zbMATH, Semantic Scholar, OEIS, LMFDB, and Lean/Mathlib. Not a keyword search — an actual comparison of our numbers against published theorems and bounds.

3 Claim-by-Claim Verdict

Each claim receives: VERIFIED, NEEDS CLARIFICATION, DISPUTED, or UNVERIFIABLE. The reviewer explains reasoning and cites specific papers.

4 Overall Verdict & Certification

ACCEPT, ACCEPT WITH REVISION, REVISE AND RESUBMIT, or REJECT. This is not a substitute for traditional peer review — it is a transparent pre-review process. The review is saved with the reviewer's model identity.
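The two verdict levels and the saved model identity can be pictured as a simple record. The sketch below is an assumption about shape only: the class and field names are hypothetical, the real review schema is not shown on this page, and the example values are placeholders rather than an actual review.

```python
import json
from dataclasses import asdict, dataclass, field
from enum import Enum


class ClaimVerdict(Enum):
    VERIFIED = "VERIFIED"
    NEEDS_CLARIFICATION = "NEEDS CLARIFICATION"
    DISPUTED = "DISPUTED"
    UNVERIFIABLE = "UNVERIFIABLE"


class OverallVerdict(Enum):
    ACCEPT = "ACCEPT"
    ACCEPT_WITH_REVISION = "ACCEPT WITH REVISION"
    REVISE_AND_RESUBMIT = "REVISE AND RESUBMIT"
    REJECT = "REJECT"


@dataclass
class ClaimReview:
    claim: str
    verdict: ClaimVerdict
    reasoning: str
    citations: list = field(default_factory=list)


@dataclass
class Review:
    finding: str
    model: str       # reviewer's model identity
    provider: str
    date: str
    overall: OverallVerdict
    claims: list = field(default_factory=list)

    def to_json(self):
        """Serialize with verdicts written out as their display strings."""
        record = asdict(self)
        record["overall"] = self.overall.value
        for claim in record["claims"]:
            claim["verdict"] = claim["verdict"].value
        return json.dumps(record, indent=2)


# Placeholder values for illustration only.
review = Review(
    finding="example-finding",
    model="example-model",
    provider="example-provider",
    date="2026-01-01",
    overall=OverallVerdict.ACCEPT_WITH_REVISION,
    claims=[ClaimReview("placeholder claim", ClaimVerdict.NEEDS_CLARIFICATION,
                        "placeholder reasoning")],
)
print(review.to_json())
```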

As AI models get smarter, findings get re-reviewed. The ledger grows. Confidence compounds.

The Living Ledger

Findings accumulate reviews over time from various AI models and occasional manual checks. Each review logs which model performed it.

Date Model Provider Verdict Finding
2026-04-04 gpt-4.1 OpenAI ACCEPT WITH REVISION Zaremba Density Phase Transition: A={1,...
2026-04-04 o3-pro OpenAI REVISE AND RESUBMIT Zaremba Density Phase Transition: A={1,...
2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Cayley Graph Diameters of Zaremba's Sem...
2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Cohen-Lenstra at Scale: h=1 Rate Falls ...
2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Congruence Spectral Gaps for Zaremba's ...
2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Digit 1 Dominance: Five Digits With 1 B...
2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Kronecker Coefficients: Complete S_30 T...
2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Kronecker S_40: Complete Character Tabl...
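Because the verdict column contains spaces, rows in this layout cannot be split on whitespace alone. One way to read them, sketched here under the assumption that model and provider are single tokens (as in the rows above) and that verdicts come from the four overall values listed earlier, is to match the longest known verdict first. `parse_row` is a hypothetical helper, and the truncated finding titles are kept exactly as displayed.

```python
from collections import Counter

# Longest first, so "ACCEPT WITH REVISION" is tried before its prefix "ACCEPT".
VERDICTS = sorted(
    ["ACCEPT", "ACCEPT WITH REVISION", "REVISE AND RESUBMIT", "REJECT"],
    key=len,
    reverse=True,
)


def parse_row(row):
    """Split one ledger row into date, model, provider, verdict, finding."""
    date, model, provider, rest = row.split(" ", 3)
    for verdict in VERDICTS:
        if rest.startswith(verdict + " "):
            return {
                "date": date,
                "model": model,
                "provider": provider,
                "verdict": verdict,
                "finding": rest[len(verdict) + 1:],
            }
    raise ValueError(f"no known verdict in row: {row!r}")


# Rows adapted from the table above.
rows = [
    "2026-04-04 o3-pro OpenAI REVISE AND RESUBMIT Zaremba Density Phase Transition: A={1,...",
    "2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Cayley Graph Diameters of Zaremba's Sem...",
    "2026-04-03 o3-pro OpenAI ACCEPT WITH REVISION Cohen-Lenstra at Scale: h=1 Rate Falls ...",
]
records = [parse_row(r) for r in rows]
tally = Counter(rec["verdict"] for rec in records)
print(tally["ACCEPT WITH REVISION"], tally["REVISE AND RESUBMIT"])  # 2 1
```

Multi-word model names such as "Claude Opus 4.6" would need a delimiter-based export instead of this positional split.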

Real Issues Found

Across 17 findings, reviewers discovered 164 issues in 15 findings. 150 resolved, 14 remaining.

9 Critical
92 Important
63 Minor
Critical
Transitivity proof: Circular dependency in Step 3 — used orbit size to bound |H|, which is what the proof tries to establish.
Rewritten with non-circular resultant-based argument. Borel exclusion strengthened to check all bases.
Critical
Zaremba proof: MOW constant-tracking unverified — no explicit constants in published paper.
Retitled "proof framework". Six known gaps enumerated. rho_eta needs interval certification.
Important
Zaremba density: δ > 1/2 threshold contradicted by our own data.
{2,3,4,5} has δ=0.605 but only 97%. Reframed as two necessary conditions (digit 1 + transitivity).
Important
Digit pair hierarchy: "Closed" exception sets declared without completeness certificate.
Rephrased as conjectural. No branch-and-bound argument provided for finiteness.
Minor
Hausdorff: 3-decimal accuracy claimed without truncation error analysis.
Precision hedged. Convergence study (N=15, 25, 35, ...) added to show resolution above numerical noise.
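The headline numbers in this section are internally consistent, which a few lines of arithmetic confirm. The figures are copied from this page; the variable names are only for illustration.

```python
# Severity counts and resolution totals as stated on this page.
severity = {"critical": 9, "important": 92, "minor": 63}
total_issues = sum(severity.values())
resolved, remaining = 150, 14

assert total_issues == 164
assert resolved + remaining == total_issues

resolved_pct = f"{100 * resolved / total_issues:.0f}%"
print(total_issues, resolved_pct)  # 164 91%
```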

Community Verification

Anyone can submit computation results via our Colab notebooks. Every new submission is automatically re-run on our GPU cluster to confirm the numbers match. Submissions whose numbers do not reproduce are flagged.

1 Submit

Run an experiment on Colab (free T4). Click “Submit to GitHub” — results are pre-filled.

2 Triage

Bot checks against known frontiers. Already computed? Auto-closed. New data? Labeled for verification.

3 Verify

Research agent re-runs the exact same experiment on our cluster. Numbers match? Labeled verified.
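The triage and verify steps above amount to two small decisions, sketched below. This is an illustration under stated assumptions, not the bot's actual code: `Label`, `triage`, and `verify` are hypothetical names, experiments are identified by an opaque key, and "numbers match" is taken to mean exact equality, which suits integer results but would need a tolerance for floating-point output.

```python
from enum import Enum


class Label(Enum):
    AUTO_CLOSED = "auto-closed"
    NEEDS_VERIFICATION = "needs verification"
    VERIFIED = "verified"
    FLAGGED = "flagged"


def triage(experiment_key, known_frontier):
    """Step 2: close submissions that are already covered, queue the rest."""
    if experiment_key in known_frontier:
        return Label.AUTO_CLOSED
    return Label.NEEDS_VERIFICATION


def verify(submitted, rerun):
    """Step 3: compare the submitted numbers with the cluster re-run."""
    return Label.VERIFIED if submitted == rerun else Label.FLAGGED


# Placeholder data for illustration only.
known_frontier = {"experiment-a"}
submitted = {"count": 3, "largest": 28}
rerun = {"count": 3, "largest": 28}

label = triage("experiment-b", known_frontier)
if label is Label.NEEDS_VERIFICATION:
    label = verify(submitted, rerun)
print(label.value)  # verified
```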

Submissions are free. Verification costs GPU time. That’s what Guerrilla Mathematics™ funds.

What This Is NOT

Not traditional peer review

No human referee panel. This is AI-assisted literature cross-referencing with claim-by-claim analysis.

Not proof verification

We check mathematical context, not formal correctness. For formal proofs, use Lean 4.

Not infallible

AI reviewers make errors. That's why the ledger accumulates reviews from multiple models.

Contribute

Any AI model or human researcher can verify our findings, run new experiments, and submit reviews.

Fastest: The Research Agent

If you have Claude Code and a GPU, the research agent handles everything: monitoring experiments, harvesting results, running multi-model reviews, fixing issues, and deploying updates.

git clone https://github.com/cahlen/idontknow && cd idontknow
export OPENAI_API_KEY='sk-...'
./scripts/run_agent.sh              # one cycle
./scripts/run_agent.sh --loop 10m   # autonomous loop
Uses your Claude Code account for analysis. The OpenAI key is optional (used for multi-model reviews).

Manual: Review a Finding

1 Connect to mcp.bigcompute.science
2 Call get_finding("slug")
3 Call verify_finding("slug")
4 Write review per schema
5 Submit PR
{
  "mcpServers": {
    "bigcompute": {
      "url": "https://mcp.bigcompute.science/mcp"
    }
  }
}
22 tools. No auth. arXiv, zbMATH, OEIS, LMFDB, Lean/Mathlib, and more.

Audit Dashboard

17 findings · 44 reviews · 164 issues tracked (150 resolved)

SILVER 7 findings
Claude + o3-pro · Accept w/ revision (3 reviews)
7/7 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-01 Claude Opus 4.6 Anthropic GOLD ACCEPT WITH REVISION
important Clarify which moduli were processed and correct the stated count. resolved
important The prime count 172 is correct (π(1021)=172); the reviewer's '669' ... resolved
minor Cite a proof or give a careful argument for the mod-p to integer lift. resolved
minor Report measured throughput, GPU model specs, and validation checks ... resolved
important Add measured throughput and validation details to the Method sectio... resolved
minor Provide quantitative trend analysis and error bars; temper the asym... resolved
+1 more
View finding →
o3-pro · Accept w/ revision (3 reviews)
4/4 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
important Add wall-clock time, GPU utilization, and validation sampling detai... resolved
minor Provide total wall-clock time, GPU utilisation, and validation samp... resolved
minor Publish aggregated counts and a checksum of the raw file so others ... resolved
important Include randomised cross-checks or a second, different algorithm on... resolved
View finding →
Claude + o3-pro · Accept w/ revision (3 reviews)
9/10 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-01 Claude Opus 4.6 Anthropic SILVER ACCEPT WITH REVISION
minor The three tightest gaps (m=1469: 0.237, m=638: 0.258, m=34: 0.271) ... acknowledged
minor Provide convergence data (e.g. N = 20, 25) and rigorous enclosures ... resolved
minor The threshold σ > 0.277 comes from the Bourgain-Kontorovich circle ... resolved
minor The threshold value depends on the proof framework. State this depe... resolved
minor Add explicit eigensolver stopping criteria and computation paramete... resolved
minor Publish the N = 40 prime data and document the stopping criteria of... resolved
+4 more
View finding →
Claude + o3-pro · Accept w/ revision (3 reviews)
9/13 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-01 Claude Opus 4.6 Anthropic GOLD ACCEPT
important We now provide the actual rank-correlation (Spearman's ρ ~ 0.996 at... resolved
important Supply correlation coefficients and examples where the ranking fails. resolved
important Auto-demoted from fix: fix only adds hedging (2 hedge phrases, no c... acknowledged
important Quantify the truncation error or increase N; otherwise refrain from... resolved
important Without a truncation error analysis, claiming 3-decimal-place accur... resolved
minor Need to run N=25,35 convergence study to bound error on delta=0.002... acknowledged
+7 more
View finding →
Claude + o3-pro · Accept w/ revision (2 reviews)
7/11 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-02 Claude Opus 4.6 Anthropic GOLD ACCEPT
important The finding already uses the standard academic hedge: 'These are, t... disputed
important Add explicit orthogonality verification metrics for both n=20 and n... resolved
minor Include explicit verification results (max absolute orthogonality e... resolved
important The finding already states: 'The previous highest-n full table in t... disputed
important The finding already includes explicit values: 'For Sₙ with n=20, ma... disputed
important This is addressed by the same edit as issue #2 above — the hedging ... resolved
+5 more
View finding →
Claude + o3-pro · Accept w/ revision (2 reviews)
10/10 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-03 Claude Opus 4.6 Anthropic GOLD ACCEPT
important Already resolved in commit 9428528. The finding now includes the ex... resolved
important Add explicit 40-partition shape set with representative examples, l... resolved
minor Supply explicit shape set and a checksum of the output so others ca... resolved
important Clarify in the section heading that our near-rectangular set is bro... resolved
minor The set of 'near-rectangular' partitions used is broader than the s... resolved
important Specify sampling distribution (uniform over unordered triples witho... resolved
+4 more
View finding →
Claude + gpt-4.1 + o3-pro · Revise & resubmit (5 reviews)
12/13 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-01 Claude Opus 4.6 Anthropic GOLD ACCEPT WITH REVISION
2026-04-04 gpt-4.1 OpenAI SILVER ACCEPT WITH REVISION
2026-04-04 o3-pro OpenAI SILVER REVISE AND RESUBMIT
important Add formal reference and clarify that the transitivity claim is emp... resolved
minor Give formal statement and proof or reference (e.g. Bourgain–Fuchs 2... resolved
important Add reproducibility section with SHA-256 checksums of the GPU log a... resolved
minor Provide checksum of the 10^10 bitset, reproducibility scripts, and ... resolved
important Add SHA-256 of the CSV data file and path to the generation script. resolved
minor Publish the script that generated the CSV and SHA-256 of the data f... resolved
+7 more
View finding →
BRONZE 8 findings
Claude + o3-pro · Accept w/ revision (2 reviews)
14/14 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-01 Claude Opus 4.6 Anthropic BRONZE ACCEPT WITH REVISION
minor Publish the full sweep results (or compressed checksum) so others c... resolved
minor Add link to the published full sweep CSV (1,023 rows) with SHA-256 ... resolved
important Replace qualitative statement with log-log OLS regression: exponent... resolved
important Perform a log–log regression with confidence intervals and quantify... resolved
minor Add SHA-256 hashes and GPU timing for all 5 log files in the Reprod... resolved
minor Provide the list (or hash digest) of uncovered denominators and tim... resolved
+8 more
View finding →
Claude + o3-pro · Accept w/ revision (2 reviews)
5/6 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-02 Claude Opus 4.6 Anthropic BRONZE ACCEPT
important Per-GPU timing, communication pattern, and peak memory were added i... resolved
minor Provide per-GPU timing, inter-GPU communication pattern, and peak m... resolved
minor Specify exact GPU clock, compiler flags, baseline CPU model, thread... resolved
important Hardware specs already added in prior remediation (commit 9428528):... resolved
minor Kernel source is linked (matrix_enum.cu). Finding already states 'T... acknowledged
minor Include kernel listing, occupancy, and an ablation study separating... resolved
View finding →
Claude + o3-pro · Revise & resubmit (2 reviews)
15/15 resolved
2026-04-03 o3-pro OpenAI BRONZE REVISE AND RESUBMIT
2026-04-02 Claude Opus 4.6 Anthropic SILVER ACCEPT
critical Four 'closed' exception sets ({1,2,3}=27, {1,2,4}=64, {1,2,5}=374, ... resolved
important The claimed exponent 3-4 is wrong. OLS log-log regression on k=3..1... resolved
minor Include a log-log regression with confidence interval and discuss s... resolved
minor Same as first claim. resolved
important Add explicit log file references so the {1,k} vs {2,k} comparison i... resolved
important Clarify definition of ‘value’ and base claim on independently check... resolved
+9 more
View finding →
Claude + o3-pro · Accept w/ revision (2 reviews)
11/11 resolved
2026-04-03 o3-pro OpenAI BRONZE ACCEPT WITH REVISION
2026-04-02 Claude Opus 4.6 Anthropic BRONZE ACCEPT
minor Replace truncated witness table with complete table for all 25 reso... resolved
minor Publish, for each of the 25 denominators, an explicit numerator who... resolved
important Add the complete list of all 27 exceptions with reference to verifi... resolved
minor Add code path, SHA-256 checksum of exception list, and reproduction... resolved
minor Release the complete list of 27 exceptions together with code and c... resolved
minor Add exhaustive search results for both denominators showing all cop... resolved
+5 more
View finding →
Claude + o3-pro · Accept w/ revision (3 reviews)
10/10 resolved
2026-04-03 o3-pro OpenAI SILVER ACCEPT WITH REVISION
2026-04-01 Claude Opus 4.6 Anthropic BRONZE ACCEPT
important Replace approximate table values with exact computed values and add... resolved
minor Release the full R(d) list (e.g. CSV) and a verifier that recompute... resolved
important Replace overclaim about monotonic increase with precise statistical... resolved
important Clarify the definition of ‘on average’ (e.g. Cesàro mean, logarithm... resolved
minor Report standard error or 95 % confidence interval for the fitted ex... resolved
important Add the confidence interval and clarify the percentage discrepancy ... resolved
+4 more
View finding →
Claude + o3-pro · Accept w/ revision (3 reviews)
11/11 resolved
2026-04-03 o3-pro OpenAI BRONZE ACCEPT WITH REVISION
2026-04-01 Claude Opus 4.6 Anthropic BRONZE ACCEPT WITH REVISION
important Soften the golden ratio connection from a claimed structural link t... resolved
important Reframe golden ratio connection as heuristic, explicitly acknowledg... resolved
minor The claimed value 0.1514 vs observed 0.171 is a 13% discrepancy. Th... resolved
important Add data availability section with checksum commitment and raw (d,α... resolved
minor Provide the raw list of (d,α) pairs or a checksum so that external ... resolved
important Add data availability section with raw data reference and checksum ... resolved
+5 more
View finding →
Claude + o3-pro · Revise & resubmit (3 reviews)
8/8 resolved
2026-04-03 o3-pro OpenAI BRONZE REVISE AND RESUBMIT
2026-04-01 Claude Opus 4.6 Anthropic SILVER ACCEPT
critical The Borel exclusion argument checks that h1 has nonzero (2,1) entry... resolved
critical Step (3) uses the presumed transitivity |orbit|=p^2-1 to lower-boun... resolved
important Compute eigenvectors of h1 and h2 modulo p and show they cannot coi... resolved
important Need an invariant-subspace analysis rather than checking a single m... resolved
critical The proof assumes a shared eigenvector forces a shared eigenvalue, ... resolved
minor Extend computation further and/or supply a complete non-circular al... resolved
+2 more
View finding →
Claude + GPT-5.2 + Grok + o3-pro · Revise & resubmit (5 reviews)
18/21 resolved
2026-04-01 Grok xAI SILVER ACCEPT WITH REVISION
2026-04-02 GPT-5.2 OpenAI SILVER ACCEPT WITH REVISION
2026-04-03 o3-pro OpenAI BRONZE REVISE AND RESUBMIT
2026-04-01 Claude Opus 4.6 Anthropic SILVER REVISE AND RESUBMIT
minor Provide the exact command line, input parameters, final checksum/bi... resolved
important Add explicit command line, input parameters, and checksum reference... resolved
minor Either supply a computer-assisted Dolgopyat proof à la Buzzi-Tsujii... resolved
important Downgrade the claim from 'certified' to numerical heuristic, consis... resolved
important Downgrade the Dolgopyat claim from 'arb-certified' to 'numerical bo... resolved
important The word 'proof' is premature. Replace with 'proof framework' in th... resolved
+15 more
View finding →
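The per-card tallies above add up to the dashboard totals. The check below copies the "resolved/total" pairs from each card in order; the list names are only for illustration.

```python
# (resolved, total) per card, in the order shown above.
silver = [(7, 7), (4, 4), (9, 10), (9, 13), (7, 11), (10, 10), (12, 13)]
bronze = [(14, 14), (5, 6), (15, 15), (11, 11), (10, 10), (11, 11), (8, 8), (18, 21)]

cards = silver + bronze
resolved_sum = sum(r for r, _ in cards)
tracked_sum = sum(t for _, t in cards)

print(len(cards), resolved_sum, tracked_sum)  # 15 150 164
```

The 15 cards match the "164 issues in 15 findings" statement; the remaining 2 of the 17 audited findings have no issue cards listed here.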

Recent Updates

update · GPU Zoo: cards now expandable (tap to see specs + what it can compute)
update · GPU Zoo: interactive comparison with verified specs from NVIDIA
update · Update README: current architecture, key pages, machine discoverability
update · Add LICENSE: CC BY 4.0 (attribution required)
update · Improve AI crawlability: semantic HTML + contact info
review · Regenerate meta.json + certifications.json (now auto-generated)
update · Add /meta.json: machine-readable index for AI crawlers
finding · Add /cite/ page: ready-to-copy citations for every finding
update · Add IndexNow key verification file
finding · Add structured data for machine discoverability on every finding page