CV Research Teaching Extra
Research projects & open problems
Shifts of finite type
Fix a family of forbidden patterns on a transitive graph, i.e. some colorings of some finite subgraphs. Consider an iid coloring on the full graph (or a finite subset) conditioned to avoid all the forbidden patterns. What does a typical configuration look like, and what statistical properties does it have? This model originated in ergodic theory and symbolic dynamics, and lies at the intersection of probability, combinatorics, dynamics, and statistical physics. Many interesting questions arise already in the one dimensional setting. For example: consider bi-infinite binary words, and forbid a single binary pattern. The entropy of the resulting shift space (i.e. the exponential growth rate of the number of allowable words) is described by a simple combinatorial statistic, the auto-correlation polynomial of the forbidden word. In higher dimensions, computing the entropy or establishing mixing properties is much more challenging; and in the non-amenable setting, a nice notion of entropy is still not known. Another natural object is the asymptotic density of 1s, or of another fixed (allowable) pattern. Is there a similar combinatorial statistic that describes these densities?

Talk slides

Continuously-evolving draft: SFT Patterns

Shifts of finite type with a single forbidden word, with N. Chandgotia, B. Marcus, C. Wu. In preparation.

Bias, length, and symmetry in Penney's ante, with M. Drexel, A. Ju, P. Peng. Undergraduate research. In preparation.

2dsft_example
Fix \(\beta \in \mathbb{R}\), and put a Gibbs measure \(\mu_\beta(x)\) on configurations of Red/Green/Blue on a subset of a discrete 2D lattice with \(\mu_\beta(x) \sim \exp(\beta N(x))\), where \(x\) is a configuration of R/G/B, and \(N(x)\) is the number of mono-colored \(2 \times 2\) squares in \(x\). These images are sampled from Glauber dynamics w.r.t. \(\mu\) on a large 2D torus: from left to right, \(\beta = -2, 1,\) and \(9/4\).
Self organized criticality for abelian particle systems
In the 1980s physicists Per Bak, Chao Tang and Kurt Wiesenfeld coined and popularized the idea of self organized criticality (SOC), a phenomenon they observed in real-world systems like avalanches and forest fires. In these systems, energy slowly builds up over time, and is released in shocks with sizes that follow a statistical power law. They proposed a number of simple but mathematically challenging models to study SOC, including the abelian sandpile model, a precursor to activated random walk (ARW). Deepak Dhar later proved that the sandpile has a certain abelian property, which is shared by activated random walk. Simulations suggest that ARW has a critical limiting state that exhibits SOC in a robust way, with little dependence on the exact setup of the model. Research on ARW has recently gained traction, and many results on the critical density are now available. Still, we lack a mathematical description of exactly how SOC arises for ARW. While the two-dimensional case is of particular practical interest, ARW in one dimension, on a tree, or in the mean field (on the complete graph) are more tractable, and are not yet fully solved.

Talk slides

Active phase for the stochastic sandpile on \(\mathbb{Z}\), with C. Hoffman, Y. Hu, and D. Rizzolo. Submitted to CPAM, 2023

Active phase for activated random walk on \(\mathbb{Z}\), with C. Hoffman and L. Rolla. Published in CMP, 2022.

Activated random walk on a cycle, with R. Basu, S. Ganguly, and C. Hoffman. Published in AIHP, 2019.

avalanche_combo
Two avalanches, at different scales, in the stochastic sandpile on a \(200 \times 200\) torus at (approximately) the critical state. Sites colored brighter were toppled more times during the avalanche.
Combinatorial stochastic processes
I am often drawn to questions about random combinatorial objects: markov chains, fragmentation/coagulation processes, interacting particle systems, renewal processes, random number theory. Proving limit theorems, describing stationary configurations or evolutions, or studying mean field behavior of these systems, to name a few objectives, often leads to cutting edge ideas and unsolved problems.

Continuously evolving draft: Combinatorial stochastic processes

Random 'sumset' problems for real numbers, related to the 'Bernoulli factory' problem: Sum constructions

Writeup of a coupon-collector type problem: Geometric coupons

Random walk collisions. A survey I wrote as a graduate student on collisions among random walkers. Contains some original hitting time calculations for random walk on a discrete cycle and on \(\mathbb{Z}\).

candidates
Sample \(n\) iid Uniform\((0,1)\) random variables, or candidates. Each candidate has a block of voters, which is their Voronoi cell in the unit interval. In each step, the candidate with the smallest cell is removed. Eventually, one candidate remains. This is a histogram of the distribution of the location of the election winner with \(n = 51\). One can easily show that the distribution is supported on \((\frac{1}{4}, \frac{3}{4})\). Proving that it is bimodal for any \(n\) is open. (I heard this question from Omer Angel.)
Geometry detection in random graphs
A recurring theme in the study of large finite graphs is to describe hidden geometric features. For a given measure on random graphs, i.e. a random matrix ensemble, we can say it has geometry if it can be roughly-isometrically embedded in a low-dimensional metric space. For Wishart matrices, aka random graphs embedded in a sphere, one can exactly characterize the phase transition, depending on the underlying dimension, between geometry and no geometry. Another combinatorial graph model, called the random intersection graph, shares the same phase transition, at essentially the same critical value. This suggests a universal phenomenon: is it possible to characterize a large class of random graphs that undergo exhibit the same phase transition?

A smooth transition from Wishart to GOE, with M. Racz. Published in JTP, 2018.

When random intersection graphs lose geometry, with S. Bubeck and M. Racz. In preperation.

Finding the source
Run a random growth model or diffusion on a graph up to a fixed time. Given only a snapshot of the set of sites visited by the process, what information can be determined about the process? Can you guess the starting point, the ending point, most visited point, or the distribution of the process? These kinds of questions arise in applied computer science, for example in infection spread through a social network. Giving a precise answer for simple random walk in \(\mathbb{Z}^d\) is already a difficult problem.
2dsrw
A map of the occupation times of a 2D simple random walk run for \(10^7\) steps. Points that were visited more often are colored darker.
Permuted Random Walk
A huge body of probability literature is devoted to studying random walks on graphs that are modified in some way, often with the goal of decreasing the mixing time. In this work we considered permuted random walk, a process alternates taking random walk steps, and moving according to a permutation on the vertices. On a regular tree, the permuted random walk always moves away from the origin faster than the usual random walk. In what generality does this hold -- for other random walks, or other graphs?

Random walks on regular trees can not be slowed down, with O. Angel, Y. Spinka and A. Yehudayoff. Published in EJP, 2024.

Counting clusters in 2D percolation
Consider independent site percolation on a finite graph, like a subset of \(\mathbb{Z}^d\) or the hexagonal lattice. How many open clusters are there? For example, what is the expected number of clusters? For lattices, it is easy to see that the number of clusters is on the same order as the volume of the set you percolate. Can one establish a law of large numbers for the cluster count, and compute the limiting constant? The A5 problem from the 2004 Putnam exam asked to prove a lower bound on this constant. But exactly determining it is a difficult problem.

Counting clusters on a finite grid. My undergraduate thesis (2014), advised by Pete Winkler.

Number of Pieces in Percolated Hexagonal Grids. REU project at the University of Washington (2015), advised by Me.

perc_loopex
A figure from my undergraduate thesis. To estimate the number of open connected components, the idea is to build a large configuration by appending a few columns at a time, and systematically counting how the number of clusters changes.
Stochastic geometry
Consider an iid sequence of random sets, or a Poisson point process of hyperplanes or sets in a metric space. One can ask many natural questions related to the coverage of such a family of sets: what is their union? What is the distribution of the random field counting how many times each point is covered by the union? If the sequence is confined to a compact set, what is the intersection of the sets? A canonical limit object, called the Crofton cell, often arises in connection with these questions, appearing (for example) as the scaling limit of the region containing the origin in a standard hyperplane tesselation of \(\mathbb{R}^d\).

Intersections of random sets, with A. Sarkar. Published in the Journal of Applied Probability, 2022.

continuous coupon collector
Sample iid arcs of length \(\theta\) at uniform random locations on the circle of circumference one, and stop when the union of the arcs covers the circle. This is a plot of the Lebesgue measure of the union of the arcs with \(\theta = 8\times 10^{-4}\), after centering (subtracting the mean) and scaling time by \(\theta^{-1} \log \theta^{-1}\).