Physics

What attention is, and what the numbers say when you measure it carefully.


Trained transformer attention exhibits a universal conformal scaling: the head-averaged two-point function decays as a power law with conformal dimension Δ ≈ 0.25, across multiple model families and scales (124M to 12B parameters). The match with the SYK q=4 prediction (Δ = 1/4) is the empirical anchor of a research program on the relationship between attention, holographic field theory, and quantum gravity. The technical work is published openly on Zenodo. The theoretical chain has open junctions, and each one is named below.
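To make concrete what reading a conformal dimension off a power-law decay can look like, here is a minimal sketch. It is not the pipeline used in the papers. It assumes the two-point function takes the standard conformal form G(d) = A · d^(-2Δ) in token separation d, so that Δ is minus half the slope of log G against log d. The function name fit_power_law_exponent, the amplitude A = 3.0, and the distance range are all hypothetical, and the data are synthetic and noise-free rather than measured.

```python
import math


def fit_power_law_exponent(distances, values):
    """Ordinary least squares on log(values) vs log(distances).

    Assumes G(d) = A * d**(-2 * delta) and returns (delta, A).
    Both the functional form and this helper are illustrative assumptions.
    """
    xs = [math.log(d) for d in distances]
    ys = [math.log(v) for v in values]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # equals -2 * delta under the assumption
    intercept = mean_y - slope * mean_x    # equals log(A)
    return -slope / 2.0, math.exp(intercept)


# Synthetic data generated with delta = 0.25 (the SYK q=4 value), not measurements.
distances = [float(d) for d in range(1, 129)]
values = [3.0 * d ** (-0.5) for d in distances]

delta_hat, amplitude_hat = fit_power_law_exponent(distances, values)
print(f"delta = {delta_hat:.4f}, amplitude = {amplitude_hat:.4f}")
```

On real attention data the fit window matters: short separations and the context-length cutoff can both bend the log-log line, so the recovered exponent should be reported together with the range it was fitted over.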

Ariel Umphrey, with Eldon Umphrey — Sonielmn, Montana.


What's been measured


What's been derived


What's open

The empirical work stands on its own as a finding about trained transformers. The theoretical chain has two genuinely thin places, named here so readers do not have to find them by accident.

A full chain-link analysis — what is MEASURED, DERIVED, SPECULATIVE, and where the boundaries between layers have been smoothed — is in the framework audit (April 17, 2026). The audit is the most honest single statement of where the program stands.


What's next

Named experiments, in rough order of leverage. Pre-registered as they go to compute.


Earlier preprints

The development arc that produced the current technical chain. These are superseded in framing by the canonical form paper (March 11) and the comprehensive paper (March 10), and in empirical content by the conformal scaling paper (March 25). Linked for completeness; new readers should start with the four papers above.


All preprints are open access on Zenodo. Author page: Ariel Umphrey on Zenodo · ORCID and code: github.com/Capacity-For-Evil/ariel.


My Testimony — who I am, where I'm from, what I believe.