Akshay's first breakthrough, according to Sarnak, was subconvexity. He became interested in ergodic theory, because they could prove hard theorems, equidistribution is a powerful tool in number theory.
Our main goal will be to talk about: Some problems stated (and proved) by Linnik. In the book, Ergodic properties of algebraic fields, 1968. He considered lattice points on a sphere of radius \(n\text{,}\) these are points (say in \(\RR^3\)) whose coordinates are integral of a fixed distance from the origin. Analogous to the circle problem. If \(n\) is fixed there is nothing to distribute, but if we vary \(n\) and project down we can ask do they accumulate miss any patches, generally how do they distribute. Of course this can be generalised.
this is where the number theory comes, we are looking at representability of numbers \(n\) by this ternary quadratic form.
Various methods exist for studying this for varying \(n\text{,}\) quadratic reciprocity for \(n=2\text{,}\) circle method/Vinogradov for \(n=4\text{.}\) \(n=3\) is the cut off, here half-integral weight modular forms are relevant.
Are \(\Omega_N\) “equidistributed” as \(N \to \infty\text{?}\)
There is an immediate obstruction to \(N\) being a sum of \(3\) squares, e.g. \(N = 7\) implies \(\Omega_7 = \emptyset\text{.}\)
Recall: (Gauss/Legendre)
\begin{equation*}
N = x_1^2 + x_2^2+ x_3^2,\,x_i \in \ZZ \iff N \ne 4^a(8b+7)\text{,}
\end{equation*}
so we avoid these sets and ask the same question.
Theorem1.2Linnik
Let \(f \in \cinf(S^2)\text{,}\) then as \(N \to \infty\text{,}\) \(N\) squarefree \(N \not\equiv 7 \pmod 8\text{,}\) \(\legendre{N}{p} = 1\) for some fixed odd prime \(p\)
where \(\diff \sigma\) is the Lebesgue measure on \(S^2\text{.}\)
This is saying that the points are equidistributed with respect to the Lebesgue measure.
The last condition is a defect of the method, known as a Linnik condition.
Remark1.3
Linnik's proof is ergodic theoretic.
After this came Duke in 1988, in the mean time, Weil conjectures were proved, Iwaniec gave bounds for Kloosterman sums. Duke was a graduate student of Sarnak at Courant. He gave a more direct proof of 1.2 which does not have the \(\legendre{N}{p} = 1\) condition. His proof is based on the theory of (half-integral weight) modular forms, and a good bound Iwaniec on certain exponential sums.
Why do exponential sums enter the picture? We are trying to prove that we have a sum converging to an integral. Generally we work with a basis of functions first, we could try using fourier analysis, using harmonics as our basis, this is when exponential sums appear. That requires us to work out harmonics on the sphere (spherical harmonics) which leads to representation theory, which as the sphere is compact involves Weyl representations etc.
Duke also proved “the same” theorem over modular surfaces. Instead of looking at expanding spheres we study expanding hyperboloids.
If \(d\) is fundamental, i.e. a discriminant of some \(\QQ(\sqrt{d})\) then \(H(d) = h(d)\) the regular class number.
For the measure on \(\mathcal F\) we take
\begin{equation*}
\diff\mu = \frac 3\pi \frac{\diff x \diff y}{y^2}\text{.}
\end{equation*}
Theorem1.9Duke
Let \(f \in \cinf(\HH)\text{,}\) that is \(\Gamma\) invariant and bounded on \(\mathcal F\text{,}\) then as \(d \to \infty\) over fundamental discriminants
Equidistribution implies density, but is so much more, for example we cannot have dense points but which happen to cluster towards some line for example.
Question: Let \(\alpha \in \RR\) and consider \(\{ \alpha n \}\) where \(\{ x \} = x \pmod 1\) so \(\left\{\frac32\right\} = \frac 12\text{.}\) How are these distributed?
Example1.10
If \(\alpha = \frac 27\) then we have \(\{ \{\alpha n \} : n \in \NN\} = \{\frac i7 : i \in \{0,\ldots, 6\}\}\) and in fact it hits each evenly.
Example1.11
If \(\alpha = \sqrt 2\) so \(\{\alpha\} \approx 0.4142?...\) \(\{\alpha2\} \approx 0.8284?...\) \(\{\alpha3\} \approx 0.24264...\) \(\{\alpha4\} \approx 0.656854..\text{.}\) These spread out densely, but there is a difference between density and equidistribution. In this example, equidistribution says that the proportion of time the sequence spends in each interval \((a,b)\) is \(b -a\text{.}\)
So questions are: is \(\{n\alpha\}\) dense?
Is \(\{n \alpha\}\) uniformly distributed (equidistributed with respect to the standard measure)?
The answer to both questions is yes.
Theorem1.12Kronecker
Let \(\alpha \in \RR\smallsetminus \QQ\) then \(\{n \alpha\}\) is dense in \(\lb 0,1)\text{.}\)
Digression (Diophantine approximation)
This is a very tough area of number theory, not so many definitive results here.
Theorem1.13Dirichlet
Let \(\alpha \in \RR,N\in \ZZ_{\gt 0}\) then there exists \(p,q\) with \(q \gt 0\text{.}\)
\begin{equation*}
\left| q\alpha - p \right| \lt \frac 1N\text{.}
\end{equation*}
as soon as we get to \(\alpha_N\) we must have two in one subinterval say \(|\alpha_{n_1} - \alpha_{n_2}| \lt \frac 1N\text{.}\) So there exists \(p_{n_1}, p_{n_2}\) such that
This theme of using some calculus is repeated across diophantine analysis.
Remark1.20
Thue: can replace \(n\) with \((\deg(\alpha) + 2)/2\) (This already has implications to integral solutions of degree \(\ge 3\) polynomials \(f \in \ZZ\lb x \rb\text{,}\) e.g. elliptic curves with bounded integral discriminant)
Roth (\(\sim\) 1958): for all \(\epsilon \gt 0 \text{,}\) there are only finitely many \(p/q\) satisfying
Notation \(\| x\|\) means the distance to the nearest integer. Dirichlet implies that infinitely many \(p/q\) have \(\alpha - p/q| \lt 1/q^2\text{.}\)Given \(\epsilon \gt 0\) let \(q\) be such that
To begin let's show \(a_n = \{n!e\}\) is not equidistributed as it has only one limit point \(0\text{.}\)
\begin{equation*}
e = \sum_{n=1}^\infty \frac{1}{n!}
\end{equation*}
so \(n!e \in \ZZ + \frac{1}{n+1} + \frac{(n+1)(n+2)} + \cdots \le \frac{e}{n+1} \to 0\text{,}\) as \(n\to \infty\text{.}\) So its not dense, hence certainly not equidistributed.
If not sense we are done already. Otherwise let \(I_1 = \lb 0, \frac 12) ,I_2 = \lb\frac 12, 1\rb\) Let \(X = \{a_n \in I_1\}, Y = \{a_n \in I_2\}\text{,}\) so both sets are infinite. Let \(b_n = x_1, \ldots, x_{10^{10}}, y_1, x_{10^{10} + 1}, \ldots, x_{20^{20}},\ldots\) so \(\# \{n:b_n \in \lb4/5,1\rb \} \sim \frac{N}{10^{10}}\text{.}\)
What can we say about the space of uniformly distributed sequences? Is it closed under addition? No (\(a_n = -b_n\)). \(\{k a_n\}\) neither (\(a_n = \alpha n,\,\frac1\alpha a_n = n\)). \(a_nb_n\) doesn't work either, but if \(b_n\) converges then \(a_nb_n\) is uniformly distributed.
He actually showed weak Goldbach conjecture, Every (sufficiently large) odd integer is a sum of 3 primes. No bound by Vinogradov, Borozdin gave a large bound, Helfgott brought it down to reality.
Application 2
\begin{equation*}
\{\beta^n \}
\end{equation*}
Theorem1.30Koksma 1935
For almost every (Lebesgue) \(\beta\in \RR_{\gt 1}\text{,}\) \(\left\{\beta^n\right\}\) is equidistributed.
Koksma was a Dutch student of Van der Corput.
Theorem1.31Weyl
Let \(|a_n| \to \infty\) be a sequence of distinct integers, the set of \(\alpha \in \RR\) such that \(\alpha a_n\) is not uniformly distributed has Lebesgue measure 0.
Let \(|x_n| \to \infty\) be a sequence of distinct integers, the set of \(\alpha \in \RR\) such that \(\alpha x_n\) is not uniformly distributed has Lebesgue measure 0.
\begin{equation*}
= (N+H) \sum_{-H \le l \le H} H + 1 - |l | \sum_m \overline y_m y_{m-l}
\end{equation*}
then if \(l = 0\) we get the first term of the statement, \(l \ne 0\) the other.
What is the difference between this and Weyl differencing? When \(H\) is large, not so much, but we can take \(H\) small now, shifting the weighting around. We change the balance to make one part shorter and the other longer.
Theorem1.44Van der Corput differencing
If for each \(h \in \ZZ_{\ge 1}\) the sequence \(b_h(n) = a_{n+h} - a_n\) then so is \(a_n\text{.}\)
Let \(\deg P = d\) then, for \(d = 1\) we are done by Weyl's criterion, for \(d \le D\) use van der Corput differencing. Note that for fixed \(h\text{,}\) \(P(n+h) -P(n)\) is of lower degree.
Subsection1.7A different perspective (Ergodic)
Furstenberg (1981 book) gives a different proof that \(\{n ^2 \alpha\}\) is uniformly distributed.
Ergodic theory 101
Definition1.45Ergodic measures
Let \(X\) be a locally compact space and \(H\) a non-compact group, \(H \acts X\text{.}\) \(\mu\) a probability measure on \(X\text{,}\) \(H\)-invariant. We say that \(\mu\) is an ergodic measure if any of the following equivalent conditions hold
\(A \subseteq X\) and \(A \) is \(H\)-invariant (\(hA = A\) for all \(h \in H\)). Then
\begin{equation*}
\implies f \equiv C \text{ a.e.}
\end{equation*}
Definition1.50Equidistribution
A sequence of probability measures \(\mu_n\) on a locally compact space \(X\) is called \(\mu\)-equidistributed if they converge to \(\mu\) in the weak \(\ast\) topology. I.e.
\begin{equation*}
\forall f \in C_c(X),\, \int f\diff \mu_n \to \int f \diff \mu\text{.}
\end{equation*}
Remark1.51
If we have a sequence \(a_n\) these define a sequence of measures
\((X, B, \mu, T)\) a measure preserving system (as in the definition of ergodic) with \(\mu\) a probability measure. Then for any \(f \in L^1(X, \mu)\)
\begin{equation*}
\int \bar f \diff \mu = \int f \diff \mu
\end{equation*}
and if \(T\) is ergodic then \(\bar f (x) = \int f \diff \mu\) almost everywhere.
Remark1.53
This does not help us!
But the following does:
Theorem1.54
If \(X\) is compact and \(H \cong \RR\) is uniquely ergodic with the unique \(H\)-invariant measure \(\mu\) then the statement of Birkhoff holds for every \(x \in X\text{.}\)