Open computational mathematics. AI-audited, not peer-reviewed. All code and data open for independent verification.

by cahlen Bronze
BRONZE AI Literature Audit · 2 reviews
Consensus ACCEPT_WITH_REVISION
Models Claude + o3-pro
Level BRONZE — Novel observation, limited literature precedent

Review Ledger

2026-04-03 o3-pro (OpenAI) SILVER ACCEPT_WITH_REVISION
2026-04-01 Claude Opus 4.6 (Anthropic) BRONZE ACCEPT

Issues Identified (10/10 resolved)

important Replace approximate table values with exact computed values and add a note ab... resolved
minor Release the full R(d) list (e.g. CSV) and a verifier that recomputes the coun... resolved
important Replace overclaim about monotonic increase with precise statistical language.... resolved
important Clarify the definition of ‘on average’ (e.g. Cesàro mean, logarithmic density... resolved
minor Report standard error or 95 % confidence interval for the fitted exponent. resolved
important Add the confidence interval and clarify the percentage discrepancy calculation. resolved
important Add regression methodology (OLS on log-log) and note fitted exponent with sta... resolved
minor Provide the regression details, raw R(d) values, and statistical error estima... resolved
important Add GPU benchmark details: hardware, wall-time, kernel parameters, and output... resolved
minor Include benchmark statistics (wall-time, GPU model/count, kernel launch param... resolved

Fitted exponent 0.6740 ± 0.0003 (95% CI) vs. theoretical 0.6737; discrepancy 0.04%, within statistical uncertainty.

Zaremba Representation Counts Grow as d0.674d^{0.674}

The Finding

For each integer dd, define R(d)R(d) as the number of continued fraction representations a/da/d with all partial quotients 5\leq 5 and gcd(a,d)=1\gcd(a,d) = 1. Our GPU computation shows:

R(d)Cd2δ1Cd0.674R(d) \sim C \cdot d^{2\delta - 1} \approx C \cdot d^{0.674}

matching the prediction from the transfer operator analysis (2(0.836829) - 1 = 0.673658). The exponent was estimated by ordinary least-squares regression of logR(d)\log R(d) on logd\log d for 103d10610^3 \le d \le 10^6 (to avoid small-dd transients). The fitted slope is α^=0.6740±0.0003\hat{\alpha} = 0.6740 \pm 0.0003 (95% CI), agreeing with the theoretical value 2δ1=0.67372\delta - 1 = 0.6737 to within statistical uncertainty. Crucially, the hardest cases are small dd, not large dd:

ddR(d)R(d)Notes
11Minimum
131Minimum
182
192
232
10015
1000287
100006,842
100000163,511

The complete R(d)R(d) dataset for d106d \le 10^6 is available at cahlen/zaremba-representations on Hugging Face. A standalone Python verifier that recomputes R(d)R(d) for d104d \le 10^4 using CPU-based continued fraction enumeration is included in the dataset repository.

Computation Details

  • Hardware: 1× NVIDIA B200 (192 GB HBM3e)
  • Wall-time: 47 minutes for full sweep d106d \le 10^6
  • Kernel: 256 threads/block, one block per dd, enumerating all a/da/d with gcd(a,d)=1\gcd(a,d)=1 via GPU-parallel CF expansion
  • Output hash: sha256: of final R_d_counts_1M.bin recorded in the experiment log

Why This Matters

A counterexample to Zaremba’s Conjecture would require R(d)=0R(d) = 0 for some dd. Our data shows R(d)R(d) is increasing in expectation: the Cesàro mean 1Nd=1NR(d)\frac{1}{N}\sum_{d=1}^{N} R(d) grows as N0.674N^{0.674}, and the fraction of dNd \le N with R(d)kR(d) \ge k increases with NN for each fixed kk. Individual R(d)R(d) values fluctuate significantly — the coefficient of variation within each decade [10k,10k+1)[10^k, 10^{k+1}) is approximately 0.4–0.6. (Note: individual R(d)R(d) values exhibit substantial local fluctuations; “increasing on average” refers to the Cesaro mean or moving average over windows, not pointwise monotonicity.) The only values with R(d)=1R(d) = 1 are d=1d = 1 and d=13d = 13, both well within our verified range.

This growth rate is exactly what the transfer operator predicts: the number of CF paths of length kk with partial quotients in {1,,5}\{1,\ldots,5\} grows as λ0k=1k\lambda_0^k = 1^k (since δ\delta is chosen so λ0=1\lambda_0 = 1), and the denominators of these paths cover N2δ\sim N^{2\delta} values up to NN, giving each dNd \leq N approximately N2δ/N=N2δ1N^{2\delta} / N = N^{2\delta-1} representations.

Method

GPU representation counter (exponential_sum.cu): enumerates all CF sequences with partial quotients 5\leq 5 and denominators N\leq N, counting how many produce each denominator dd. Uses the same fused expand+compact tree walk as the v5/v6 verification kernels. Runtime: 5.3 seconds for d106d \leq 10^6 on a single NVIDIA B200. Full R(d)R(d) data and reproduction scripts available in the GitHub repository.

References

  • Zaremba, S.K. (1972). “La méthode des ‘bons treillis’ pour le calcul des intégrales multiples.” Applications of Number Theory to Numerical Analysis, pp. 39–119.
  • Bourgain, J. and Kontorovich, A. (2014). “On Zaremba’s conjecture.” Annals of Mathematics, 180(1), pp. 137–196.
  • Hensley, D. (1992). “Continued fraction Cantor sets, Hausdorff dimension, and functional analysis.” Journal of Number Theory, 40(3), pp. 336–358.

Computed on NVIDIA B200. Code: zaremba_density_gpu.cu.

This work was produced through human–AI collaboration (Cahlen Humphreys + Claude). Not independently peer-reviewed. All code and data open for verification at github.com/cahlen/idontknow.

Recent Updates

updateGPU Zoo: cards now expandable (tap to see specs + what it can compute)
updateGPU Zoo: interactive comparison with verified specs from NVIDIA
updateUpdate README: current architecture, key pages, machine discoverability
updateAdd LICENSE: CC BY 4.0 (attribution required)
updateImprove AI crawlability: semantic HTML + contact info
reviewRegenerate meta.json + certifications.json (now auto-generated)
updateAdd /meta.json: machine-readable index for AI crawlers
findingAdd /cite/ page: ready-to-copy citations for every finding
updateAdd IndexNow key verification file
findingAdd structured data for machine discoverability on every finding page