Spring 2024

Spring 2024 Theory Seminars will meet on Fridays from 11:00 am - 12:00 pm in GDC 4.304. This schedule will be updated throughout the semester. Please note that this is a new location.

01/19/2024 - Samson Zhou, Texas A&M University: Streaming Euclidean k-median and k-means with o(log n) Space
01/26/2024 - Thomas Rothvoss, University of Washington: The Subspace Flatness Conjecture and Faster Integer Programming

02/02/2024 - Brent Waters, University of Texas: Adaptively-Sound Succinct Arguments for NP from Indistinguishability Obfuscation 
02/09/2024 - Gregory Kehne, University of Texas: Online Covering: Secretaries, Prophets, and Samples
02/16/2024 - NO MEETING THIS WEEK
02/23/2024 - Parikshit Gopalan, Apple: Omniprediction and multigroup fairness 
03/01/2024 - Ainesh Bakshi, MIT:  Learning quantum Hamiltonians at any temperature in polynomial time 
03/08/2024 - Josh Alman, Columbia University: Faster Walsh-Hadamard Transform from Matrix Non-Rigidity 
03/15/2024 - SPRING BREAK
03/22/2024 - Ce Jin, MIT: 0-1 Knapsack in Nearly Quadratic Time
                        ROOM CHANGE: meeting in Wel 3.320 
03/29/2024 -  Ryan Williams, MIT:  Beating Brute Force for Compression Problems
04/05/2024 - Moses Charikar, Stanford: Breaking the Metric Voting Distortion Barrier 
04/12/2024 - Amit Sahai, UCLA: The Mathematics of Hiding Secrets in Software 
                        MEETING TIME IS 2pm

04/19/2024 - Euiwoong Lee, University of Michigan: Recent Progress on Correlation Clustering
04/26/2024 - Yufei Zhao, MIT:  Equiangular lines and eigenvalue multiplicity 
05/03/2024 - ROOM CHANGE: meeting in POB 2.402

 

01/19/2024
Samson Zhou, Texas A&M University
Streaming Euclidean k-median and k-means with o(log n) Space

We consider the classic Euclidean k-median and k-means objectives on insertion-only streams, where the goal is to maintain a (1+epsilon)-approximation to the k-median or k-means cost while using as little memory as possible. Over the last 20 years, clustering in data streams has received a tremendous amount of attention and has been the test-bed for a large variety of new techniques, including coresets, the merge-and-reduce framework, bicriteria approximation, sensitivity sampling, and so on. Despite this intense effort to obtain smaller sketches for these problems, all known techniques require storing at least Omega(log(n Delta)) words of memory, where n is the size of the input and Delta is the aspect ratio. A natural question is whether one can beat this logarithmic dependence on n and Delta. In this paper, we break this barrier by giving the first algorithm that achieves a (1+epsilon)-approximation to the more general (k,z)-clustering problem, using only ~O(dk/epsilon^2) * 2^{z log z} * min(1/epsilon^z, k) * poly(log log(n Delta)) words of memory.

Joint work with Vincent Cohen-Addad and David P. Woodruff.
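
For background, the merge-and-reduce framework mentioned in the abstract can be sketched as follows. This is a generic template, not the new algorithm from the talk; the names coreset, block, and the toy sampling routine are illustrative stand-ins.

    import random

    def merge_and_reduce(stream, coreset, block=100):
        """Generic merge-and-reduce template over an insertion-only stream.
        coreset(points) is any offline routine that shrinks a list of points
        while approximately preserving clustering costs; buckets[i] holds a
        summary of about block * 2^i original points, so only O(log n) live."""
        buckets, batch = [], []
        for p in stream:
            batch.append(p)
            if len(batch) == block:
                cur, level = coreset(batch), 0
                batch = []
                while level < len(buckets) and buckets[level] is not None:
                    cur = coreset(buckets[level] + cur)  # merge two summaries, then reduce
                    buckets[level] = None
                    level += 1
                if level == len(buckets):
                    buckets.append(cur)
                else:
                    buckets[level] = cur
        rest = batch + [p for b in buckets if b is not None for p in b]
        return coreset(rest)

    toy = lambda pts: random.sample(pts, min(len(pts), 50))  # stand-in "coreset"
    print(len(merge_and_reduce(list(range(10000)), toy)))    # 50 summary points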

01/26/2024
Thomas Rothvoss, University of Washington
The Subspace Flatness Conjecture and Faster Integer Programming

In a seminal paper, Kannan and Lov\'asz (1988) considered a quantity $\mu_{KL}(\Lambda,K)$ which denotes the best volume-based lower bound on the \emph{covering radius} $\mu(\Lambda,K)$ of a convex body $K$ with respect to a lattice $\Lambda$. Kannan and Lov\'asz proved that $\mu(\Lambda,K) \leq n \cdot \mu_{KL}(\Lambda,K)$, and the Subspace Flatness Conjecture by Dadush (2012) claims that a $O(\log n)$ factor suffices, which would match the lower bound from the work of Kannan and Lov\'asz. We settle this conjecture up to a constant in the exponent by proving that $\mu(\Lambda,K) \leq O(\log^{3}(n)) \cdot \mu_{KL}(\Lambda,K)$. Our proof is based on the Reverse Minkowski Theorem due to Regev and Stephens-Davidowitz (2017). Following the work of Dadush (2012, 2019), we obtain a $(\log n)^{O(n)}$-time randomized algorithm to solve integer programs in $n$ variables. Another implication of our main result is a near-optimal \emph{flatness constant} of $O(n \log^{3}(n))$.

This is joint work with Victor Reis.
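
For quick reference, the bounds above line up in a single chain (the left inequality holds by definition, since $\mu_{KL}$ is a lower bound on $\mu$):

    $\mu_{KL}(\Lambda,K) \leq \mu(\Lambda,K) \leq O(\log^{3}(n)) \cdot \mu_{KL}(\Lambda,K)$,

improving the factor $n$ of Kannan and Lov\'asz.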

02/02/2024
Brent Waters, University of Texas

Adaptively-Sound Succinct Arguments for NP from Indistinguishability Obfuscation

A succinct non-interactive argument (SNARG) for NP allows a prover to convince a verifier that an NP statement x is true with a proof of size o(|x| + |w|), where w is the associated NP witness. A SNARG satisfies adaptive soundness if the malicious prover can choose the statement to prove after seeing the scheme parameters. In this work, we provide the first adaptively-sound SNARG for NP in the plain model assuming sub-exponentially-hard indistinguishability obfuscation, sub-exponentially-hard one-way functions, and either the (polynomial) hardness of discrete log or the (polynomial) hardness of factoring. This gives the first adaptively-sound SNARG for NP from falsifiable assumptions. All previous SNARGs for NP in the plain model either relied on non-falsifiable cryptographic assumptions or satisfied a weak notion of non-adaptive soundness (where the adversary has to choose the statement it proves before seeing the scheme parameters).

This is joint work with David Wu.
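
For readers unfamiliar with the terminology, adaptive soundness refers to the standard experiment below (a generic formulation, not notation from the paper), in which the adversary sees the parameters before choosing its statement:

    crs <- Setup(1^lambda);   (x, pi) <- Adversary(crs);   the adversary wins if Verify(crs, x, pi) = 1 and x is a false statement.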

02/09/2024 
Gregory Kehne, University of Texas 
Online Covering: Secretaries, Prophets, and Samples

What makes set cover harder to solve online than off? By way of an answer, we give a polynomial-time algorithm for online covering integer programs (IPs) with a competitive ratio of O(\log mn) when the constraints are revealed in random order, essentially matching the best possible offline bound of \Omega(\log n) and circumventing the \Omega(\log m \log n) lower bound known in adversarial order. We then leverage this O(\log mn) competitive algorithm for the prophet version of online integer covering, where constraints are sampled from a sequence of known distributions. We present sample guarantees in the prophet setting, as well as in the setting where random samples from an adversarial instance are revealed at the outset.

This talk is based on joint work with Anupam Gupta and Roie Levin.

02/23/2024 
Parikshit Gopalan, Apple 
Omniprediction and multigroup fairness

Consider a scenario where we are learning a predictor, whose predictions will be evaluated by their expected loss. What if we do not know the precise loss at the time of learning, beyond some generic properties (like convexity)? What if the same predictor will be used in several applications in the future, each with their own loss function? Can we still learn predictors that have strong guarantees? 
This motivates the notion of omnipredictors: predictors with strong loss minimization guarantees across a broad family of loss functions, relative to a benchmark hypothesis class. Omniprediction turns out to be intimately connected to multigroup fairness notions such as multicalibration, and also to other topics like boosting, swap regret minimization, and the approximate rank of matrices. This talk will present recent work in this area, emphasizing these connections. 

03/01/2024 
Ainesh Bakshi, MIT 
Learning quantum Hamiltonians at any temperature in polynomial time

A quantum system of n interacting particles at thermal equilibrium can be described using a polynomial number of parameters, which correspond to the strengths of the forces between particles. A natural open question has been whether these parameters can be estimated efficiently by running experiments on the system. We resolve this question by giving the first polynomial-time algorithm for this task. This improves over prior work, which uses polynomially many samples from the system, but requires exponential time. In this talk, I will introduce the problem and describe some of the key ideas behind our algorithm. No background in quantum information is required.

Based on joint work with Allen Liu, Ankur Moitra and Ewin Tang.

03/08/2024 
Josh Alman, Columbia University 
Faster Walsh-Hadamard Transform from Matrix Non-Rigidity

The Walsh-Hadamard transform is a simple recursively-defined linear transformation with many applications throughout algorithm design and complexity theory. The folklore "fast Walsh-Hadamard transform" algorithm computes it using only N log N arithmetic operations, which is believed to be optimal up to constant factors. In this talk, I'll give two improved constructions:

- A new algorithm which uses (23/24) N log N + O(N) arithmetic operations. This is, to my knowledge, the first improvement of the leading constant. (Based on joint work with Kevin Rao.)

- A new depth-2 linear circuit of size O(N^1.45), improving on size O(N^1.5) which one gets from the fast Walsh-Hadamard transform approach. (Based on joint work with Ashwin Padaki and Yunfeng Guan.)

A key idea behind both constructions is to take advantage of the matrix non-rigidity of the Walsh-Hadamard transform. A matrix is called "rigid" if it's impossible to change a "small" number of its entries to make it have "low" rank. L. Valiant showed that if a matrix is rigid, then this implies lower bounds for computing its linear transformation. However, a recent line of work has shown that many matrices of interest, including the Walsh-Hadamard transform, are actually not rigid. Although a formal converse to Valiant's result is not known, we'll make use of non-rigidity in our new constructions.
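
For reference, here is a minimal implementation of the folklore fast Walsh-Hadamard transform that the abstract uses as its baseline (the classical N log N algorithm, not the new constructions):

    def fwht(a):
        """Fast Walsh-Hadamard transform of a length-N list (N a power of 2),
        using N log2(N) additions/subtractions as in the folklore bound."""
        a = list(a)
        h = 1
        while h < len(a):
            for i in range(0, len(a), h * 2):
                for j in range(i, i + h):
                    x, y = a[j], a[j + h]
                    a[j], a[j + h] = x + y, x - y   # 2x2 Hadamard "butterfly"
            h *= 2
        return a

    print(fwht([1, 0, 1, 0]))  # [2, 2, 0, 0]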

03/22/2024 
Ce Jin, MIT
0-1 Knapsack in Nearly Quadratic Time

We study pseudo-polynomial time algorithms for the fundamental 0-1 Knapsack problem. Recent research interest has focused on its fine-grained complexity with respect to the number of items n and the maximum item weight w_max. Under the (min,+)-convolution hypothesis, 0-1 Knapsack has no O((n+w_max)^{2-δ})-time algorithm for any constant δ > 0 (Cygan-Mucha-Węgrzycki-Włodarczyk 2017 and Künnemann-Paturi-Schneider 2017). On the upper bound side, currently the fastest algorithm runs in O~(n + w_max^{12/5}) time (Chen, Lian, Mao, and Zhang 2023), improving the earlier O(n + w_max^3)-time algorithm by Polak, Rohwedder, and Węgrzycki (2021).

We close this gap between the upper bound and the conditional lower bound (up to subpolynomial factors): The 0-1 Knapsack problem has a deterministic algorithm in O(n + w_max^2 * log^4 w_max) time.
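
For contrast, the standard pseudo-polynomial baseline is the textbook dynamic program below, which runs in O(n * W) time for weight budget W; the algorithms discussed in the talk are far more involved.

    def knapsack_01(items, W):
        """Classic 0-1 Knapsack DP: items is a list of (weight, profit) pairs,
        W is the weight budget. Returns the maximum achievable profit.
        Runs in O(n * W) time and O(W) space."""
        best = [0] * (W + 1)                     # best[c] = max profit with capacity c
        for weight, profit in items:
            for c in range(W, weight - 1, -1):   # descending: each item used once
                best[c] = max(best[c], best[c - weight] + profit)
        return best[W]

    print(knapsack_01([(3, 4), (4, 5), (2, 3)], 6))  # 8: take weights 4 and 2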

03/29/2024 
Ryan Williams, MIT 
Beating Brute Force for Compression Problems

A compression problem is defined with respect to an efficient encoding function f: given a string x, the goal is to find the shortest y such that f(y)=x. The obvious brute-force algorithm for solving a compression problem on n-bit strings runs in time about 2^s, where s is the length of the shortest description y.
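
That brute-force baseline is easy to make concrete (a minimal sketch; the encoding function f below is a hypothetical stand-in):

    from itertools import product

    def shortest_description(f, x, max_len):
        """Brute force: try all strings y in order of increasing length and
        return the first with f(y) == x. Roughly 2^(s+1) calls to f when the
        shortest description has length s."""
        for length in range(max_len + 1):
            for bits in product("01", repeat=length):
                y = "".join(bits)
                if f(y) == x:
                    return y
        return None

    # Toy encoding: y is repeated until it has length 8 (a hypothetical f).
    f = lambda y: (y * 8)[:8] if y else None
    print(shortest_description(f, "01010101", 8))  # "01"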

We show that every compression problem has a Boolean circuit family which finds short descriptions more efficiently than brute force. In particular, our circuits have size roughly 2^{4s/5}. Our construction builds on Fiat-Naor’s data structure for function inversion [SICOMP 1999]: we show how to carefully modify their data structure so that it can be nontrivially implemented using Boolean circuits, and we show how to utilize hashing so that the circuit size is only exponential in the description length.

One corollary is that the Minimum Circuit Size Problem for generic fan-in-two circuits of size s(n) on truth tables of size 2^n can be solved by circuits of size 2^{4w/5 + o(w)} * poly(2^n), where w = s(n) log_2(s(n) + n). This improves over the brute-force approach of trying all possible size-s(n) circuits, for all s(n) >= n.

This talk is based on joint work with Shuichi Hirahara and Rahul Ilango, which will be merged with concurrent and independent work of Noam Mazor and Rafael Pass.

04/05/2024 
Moses Charikar, Stanford 
Breaking the Metric Voting Distortion Barrier

We consider the following well-studied problem of metric distortion in social choice. Suppose we have an election with voters and candidates who lie in a shared metric space. We would like to design a voting rule that chooses a candidate whose average distance to the voters is small. However, instead of having direct access to the distances in the metric space, each voter gives us a ranked list of the candidates in order of distance. Can we design a voting rule that, regardless of the election instance and underlying metric space, chooses a candidate whose cost differs from the true optimum by only a small factor (known as the distortion)? 
A long line of work culminated in finding deterministic voting rules with metric distortion 3, which is the best possible for deterministic rules and many other classes of voting rules. However, without any restrictions, there is still a significant gap in our understanding. Even though the best lower bound is substantially lower at 2.112, finding a randomized rule that guarantees distortion $3 - \epsilon$ for some constant $\epsilon$ has been a major challenge in computational social choice. 
In this work, we give a rule that guarantees distortion less than 2.753 by carefully randomizing between Maximal Lotteries, a natural game-theoretic voting rule dating back to the 1960s, and new voting rules that we introduce.
Based on joint work with Prasanna Ramakrishnan, Kangning Wang, and Hongxun Wu.
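
To make the distortion measure concrete, here is a toy computation on a line metric (illustrative numbers only, not from the talk):

    def distortion(voter_positions, cand_positions, chosen):
        """Distortion of selecting candidate `chosen`: its total distance to the
        voters divided by the total distance of the best candidate."""
        cost = lambda c: sum(abs(v - c) for v in voter_positions)
        return cost(cand_positions[chosen]) / min(cost(c) for c in cand_positions)

    voters = [0.0, 0.0, 1.0]          # two voters at 0, one at 1
    candidates = [0.0, 1.0]           # candidate A at 0, candidate B at 1
    print(distortion(voters, candidates, chosen=1))  # B costs 2, A costs 1 -> 2.0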
 

04/12/2024 
Amit Sahai, UCLA
The Mathematics of Hiding Secrets in Software

At least since the initial public proposal of public-key cryptography based on computational hardness conjectures (Diffie and Hellman, 1976), cryptographers have contemplated the possibility of a “one-way compiler” that translates computer programs into “incomprehensible” but equivalent forms. And yet, the search for such a “one-way compiler” remained elusive for decades.
In this talk, we look back at our community’s attempts to formalize the notion of such a compiler, culminating in our 2001 work with Barak, Goldreich, Impagliazzo, Rudich, Vadhan, and Yang, which proposed the notion of indistinguishability obfuscation (iO). Roughly speaking, iO requires that the compiled versions of any two equivalent programs (with the same size and running time) be indistinguishable to any efficient adversary. The notion of punctured programming, introduced in our work with Waters in 2013, spawned an area of research dedicated to exploring the remarkable power of iO.
We’ll then discuss the intense effort that recently culminated in our 2021 work with Jain and Lin, finally showing how to construct iO in such a way that, for the first time, we can prove the security of our iO scheme based on well-studied computational hardness conjectures in cryptography.

04/19/2024
Euiwoong Lee, University of Michigan
Recent Progress on Correlation Clustering

Correlation Clustering is one of the most well-studied graph clustering problems. The input is a complete graph where each edge is labeled either "+" or "-", and the goal is to partition the vertex set into (an arbitrary number of) clusters to minimize (the number of + edges between different clusters) + (the number of - edges within the same cluster). Until recently, the best polynomial-time approximation ratio was 2.06, nearly matching the integrality gap of 2 for the standard LP relaxation.
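
The objective is a direct transcription of this definition; a small sketch for computing the cost of a candidate clustering:

    def cc_cost(labels, clusters):
        """Correlation clustering cost. labels[(u, v)] is '+' or '-' for each
        unordered pair; clusters maps each vertex to its cluster id."""
        cost = 0
        for (u, v), sign in labels.items():
            same = clusters[u] == clusters[v]
            if sign == '+' and not same:   # "+" edge cut between clusters
                cost += 1
            if sign == '-' and same:       # "-" edge inside a cluster
                cost += 1
        return cost

    labels = {(1, 2): '+', (1, 3): '-', (2, 3): '+'}
    print(cc_cost(labels, {1: 0, 2: 0, 3: 1}))  # cost 1: the "+" edge (2,3) is cut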

Since 2022, we have bypassed this barrier and progressively improved the approximation ratio, with the current ratio being 1.44. Based on a new relaxation inspired by the Sherali-Adams hierarchy, the algorithm introduces and combines several tools considered in different contexts, including "local to global correlation rounding" and "combinatorial preclustering". In this talk, I will describe the current algorithm as well as how it has been inspired by various viewpoints.

Joint work with Nairen Cao, Vincent Cohen-Addad, Shi Li, Alantha Newman, and Lukas Vogl.

04/26/2024 
Yufei Zhao, MIT 
Equiangular lines and eigenvalue multiplicity

I will discuss our solution to a longstanding discrete geometry problem where we determined the maximum number of lines in high dimension pairwise separated by a fixed angle. A key step is a new result in spectral graph theory showing that a connected bounded degree graph has sublinear second eigenvalue multiplicity.

These results prompted further investigations into spherical codes and spectral graph theory. I will highlight some ongoing work and open problems.

Fall 2023

 

Fall 2023 Theory Seminars will meet on Fridays from 11:00 am - 12:00 pm in POB 2.402. This schedule will be updated throughout the semester.

08/15/2023 - Xin Li, Johns Hopkins University: Two Source Extractors for Asymptotically Optimal Entropy, and Improved Ramsey Graphs ***Location: GDC 4.304 Time: 11:00am-12:00pm*** 
08/25/2023 - Geoffrey Mon, UT Austin: Relaxed Local Correctability from Local Testing
09/01/2023 - Kevin Tian, UT Austin: Linear-Sized Sparsifiers via Near-Linear Time Discrepancy Theory
09/08/2023 - Raghu Meka, UCLA: Resolving Matrix Spencer Conjecture Up to poly-logarithmic Rank
09/15/2023 - Scott Aaronson, UT Austin/OpenAI: Watermarking of Large Language Models
09/22/2023 - Peter Manohar, Carnegie Mellon University: Lower Bounds for Locally Decodable Codes from Semirandom CSP Refutation
09/29/2023 -
10/06/2023 - Ali Vakilian, TTIC: Algorithms for Socially Fair Clustering: Min-Max Fairness to Cascaded Norms
10/13/2023 - Keegan Ryan, UCSD: Fast Practical Lattice Reduction through Iterated Compression
10/20/2023 - Aaron Potechin, University of Chicago: Separating MAX 2-AND, MAX DI-CUT and MAX CUT
10/27/2023 - Lijie Chen, UC Berkeley: Polynomial-Time Pseudodeterministic Construction of Primes
11/03/2023 - Jesse Goodman, UT Austin: Exponentially Improved Extractors for Adversarial Sources
11/10/2023 - Greg Bodwin, University of Michigan: Recent Progress on Fault Tolerant Spanners
11/17/2023 - Thatchaphol Saranurak, University of Michigan: Using Isolating Mincuts for Fast Graph Algorithms: A Tutorial
11/24/2023 - THANKSGIVING HOLIDAY
12/01/2023 - Sivakanth Gopi, Microsoft Research:  Higher Order MDS Codes and List Decoding
12/08/2023 - Jinyoung Park, Courant Institute of Mathematical Sciences NYU: Thresholds  

 

08/15/2023
Xin Li, Johns Hopkins University
Two Source Extractors for Asymptotically Optimal Entropy, and Improved Ramsey Graphs

Randomness extractors are functions that extract almost uniform random bits from weak random sources with severe biases. One of the most important and well-studied types of extractors is the so-called two-source extractor, where the input consists of two independent weak sources. Assuming each source has n bits, one can show the existence of two-source extractors for entropy k = log n + O(1). Such an extractor also gives a (bipartite) Ramsey graph with N = 2^n vertices (on each side), such that there is no (bipartite) clique or independent set of size K = O(log N). Explicit constructions of two-source extractors and Ramsey graphs are long-standing open problems, dating back to 1947, when Erdős introduced the probabilistic method.
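
For reference, the standard definition in play here (generic notation, not from the talk): a two-source extractor for min-entropy k with error epsilon is a function

    Ext : {0,1}^n x {0,1}^n -> {0,1}^m

such that Ext(X,Y) is epsilon-close to uniform whenever X and Y are independent sources, each with min-entropy at least k.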

However, despite considerable effort and several recent breakthroughs, previously the best explicit two source extractor only works for entropy k=o(log n log log n). Correspondingly, the best explicit construction of Ramsey graphs only guarantees the non-existence of a clique or independent set of size K=(log N)^{o(log log log N)}.

In this talk, I will describe a new construction of explicit two-source extractors for asymptotically optimal entropy k = O(log n), which gives explicit Ramsey graphs on N vertices with no clique or independent set of size K = log^c N for some absolute constant c > 1.

 

08/25/2023
Geoffrey Mon, UT Austin
Relaxed Local Correctability from Local Testing

A relaxed locally correctable code is equipped with an algorithm that, when given query access to a (potentially corrupted) codeword and an index i, either returns the ith bit of the nearest codeword or detects corruption. We build the first asymptotically good relaxed locally correctable codes with polylogarithmic query complexity, which finally closes the superpolynomial gap between query lower and upper bounds.

Our construction makes use of locally testable codes, which have algorithms that probabilistically detect the presence of corruption with few queries. By combining high-rate locally testable codes of various sizes, we can produce a code with local testers at every scale: we can gradually "zoom in" to any desired codeword index i, and a local tester at each step certifies that the next, smaller restriction of the input has low error. If no local tester detects corruption throughout this process, then the ith bit of the input is likely not corrupt and can be safely returned.

Based on joint work with Vinayak M. Kumar.

 

09/01/2023
Kevin Tian, UT Austin
Linear-Sized Sparsifiers via Near-Linear Time Discrepancy Theory

Discrepancy theory provides powerful tools for producing higher-quality objects which “beat the union bound” in fundamental settings throughout combinatorics and computer science. However, this quality has often come at the price of more expensive algorithms. We introduce a new framework for bridging this gap, by allowing for the efficient implementation of discrepancy-theoretic primitives. Our framework repeatedly solves regularized optimization problems to low accuracy to approximate the partial coloring method of [Rothvoss ’17], and simplifies and generalizes recent work of [Jain-Sah-Sawhney ’23] on fast algorithms for Spencer’s theorem. In particular, our framework only requires that the discrepancy body of interest has exponentially large Gaussian measure and is expressible as a sublevel set of a symmetric, convex function. We combine this framework with new tools for proving Gaussian measure lower bounds to give improved algorithms for a variety of sparsification and coloring problems.

Joint work with Arun Jambulapati and Victor Reis.

 

09/08/2023
Raghu Meka, UCLA 
Resolving Matrix Spencer Conjecture Up to poly-logarithmic Rank

The matrix Spencer conjecture is the following: given n symmetric matrices A1, ..., An, each of spectral norm at most 1, can we find signs for each of them such that their signed sum has spectral norm at most O(sqrt{n})? If true, this would substantially generalize Spencer's 'Six standard deviations suffice' theorem (a classical result in discrepancy theory): Spencer's result corresponds to the matrices Ai being diagonal.
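
The conjectured statement is easy to explore by brute force for tiny n (an illustrative sketch assuming numpy; exponential in n, demo only):

    import itertools
    import numpy as np

    def best_signing(mats):
        """Exhaustively search sign vectors eps in {-1,+1}^n minimizing the
        spectral norm of sum_i eps_i * A_i."""
        n = len(mats)
        return min(
            (np.linalg.norm(sum(e * A for e, A in zip(eps, mats)), 2), eps)
            for eps in itertools.product((-1, 1), repeat=n)
        )

    rng = np.random.default_rng(0)
    mats = []
    for _ in range(8):
        B = rng.standard_normal((4, 4))
        A = B + B.T                              # symmetric
        mats.append(A / np.linalg.norm(A, 2))    # normalize to spectral norm 1
    norm, eps = best_signing(mats)
    print(norm, "vs sqrt(n) =", len(mats) ** 0.5)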

In this talk, I will describe a simple proof of the matrix Spencer conjecture up to poly-logarithmic rank, i.e., when each matrix Ai has rank at most n/(log n)^3. Previously, the result was known only for rank sqrt(n). Our result also has implications for quantum random access codes in the low-error regime: in particular, it implies a (log n - 3 log log n) lower bound for quantum random access codes with an advantage of 1/sqrt(n).

Based on joint work with Nikhil Bansal and Haotian Jiang.

 

09/15/2023
Scott Aaronson, UT Austin / OpenAI
Watermarking of Large Language Models

I'll survey the research that's been done over the last year, by me and others, into the problem of how to insert statistical watermarks into the outputs of Large Language Models, in order (for example) to help detect GPT-enabled academic cheating, propaganda campaigns, and fraud.  I'll explain how, by using pseudorandom functions, one can typically insert a watermark without degrading the quality of the LLM's output at all.  I'll highlight the theoretical computer science aspects of the problem, as well as the challenge of coming up with a watermarking scheme that resists rephrasing attacks, or even of formalizing what one means by that.  I'll also discuss some of the social and political issues around the deployment of LLM watermarking.

 

09/22/2023
Peter Manohar, Carnegie Mellon University
Lower Bounds for Locally Decodable Codes from Semirandom CSP Refutation

A code C is a q-locally decodable code (q-LDC) if one can recover any chosen bit b_i of the k-bit message b with good confidence by randomly querying the n-bit encoding x on at most q coordinates. Existing constructions of 2-LDCs achieve blocklength n = exp(O(k)), and lower bounds show that this is in fact tight. However, when q = 3, far less is known: the best constructions have n = subexp(k), while the best known lower bounds, that have stood for nearly two decades, only show a quadratic lower bound of n >= \Omega(k^2) on the blocklength.

In this talk, we will showcase a new approach to prove lower bounds for LDCs using recent advances in refuting semirandom instances of constraint satisfaction problems. These new tools yield, in the 3-query case, a near-cubic lower bound of n >= \tilde{\Omega}(k^3), improving on the prior best lower bound of n >= \tilde{\Omega}(k^2) of Kerenidis and de Wolf from 2004 by a polynomial factor in k.

 

10/06/2023
Ali Vakilian, TTIC
Algorithms for Socially Fair Clustering: Min-Max Fairness to Cascaded Norms

In center-based clustering methods (e.g., k-means), the assignment cost of a point represents the quality of the clustering for that point. This has motivated a notion of min-max fairness for the clustering problem, also known as socially fair clustering: select k centers that minimize the maximum average clustering cost incurred by any of the L pre-specified groups of points in the input. This problem can be used to select a set of vaccination or polling sites so that different communities in the population are served fairly.

In this talk, I present our O(log L/ log log L)-approximation for the problem, which exponentially improves over the previously known O(L)-approximation and is optimal up to a constant factor. Then, I introduce a generalization of the problem called (p, q)-fair clustering, i.e., clustering with cascaded norms. This is a “relaxed” way of enforcing the fairness requirement, and its objective interpolates between the objective of classic k-clustering and that of socially fair clustering.
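
In code, the min-max objective above reads as follows (a direct transcription of the definition, using 1-D points and k-means costs for simplicity):

    def socially_fair_cost(points, groups, centers):
        """Socially fair k-means objective: the maximum, over groups, of the
        group's average squared distance to its nearest center.
        points: list of floats; groups: list of lists of point indices."""
        def cost(i):
            return min((points[i] - c) ** 2 for c in centers)
        return max(sum(cost(i) for i in g) / len(g) for g in groups)

    points = [0.0, 0.1, 5.0, 9.9, 10.0]
    groups = [[0, 1, 2], [3, 4]]          # two pre-specified groups
    print(socially_fair_cost(points, groups, centers=[0.0, 10.0]))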

 

10/13/2023
Keegan Ryan, UCSD
Fast Practical Lattice Reduction through Iterated Compression

We introduce a new lattice basis reduction algorithm with approximation guarantees analogous to the LLL algorithm and practical performance that far exceeds the current state of the art. We achieve these results by iteratively applying precision management techniques within a recursive algorithm structure and show the stability of this approach. We analyze the asymptotic behavior of our algorithm for arbitrary basis matrices, and we show that the heuristic running time is small and comparable to the cost of performing size reduction, matrix multiplication, and QR factorization on similarly sized matrices. The heuristic running time of our algorithm depends on the log of the condition number of the input basis, which is bounded by the bit length of basis entries in common applications. Our algorithm is fully practical, and we have published our implementation. We experimentally validate our heuristic, give extensive benchmarks against numerous classes of cryptographic lattices, and show that our algorithm significantly outperforms existing implementations.

 

10/20/2023
Aaron Potechin, University of Chicago

Separating MAX 2-AND, MAX DI-CUT and MAX CUT

In 2008, Raghavendra's paper "Optimal algorithms and inapproximability results for every CSP?" showed that, assuming the Unique Games Conjecture (or at least that Unique Games is hard), for any CSP over a finite set of predicates, the optimal worst-case polynomial-time approximation algorithm is to solve a standard semidefinite program and then apply a rounding algorithm. However, since the set of potential rounding functions is extremely large, Raghavendra's result does not tell us what the approximation ratio is for a given CSP. In fact, several basic questions remain open. For example, whether MAX SAT (where the clauses can have any length) admits a 7/8-approximation is still open.

In this talk, I will review Raghavendra's result and why additional work is needed to analyze particular CSPs. I will then describe why MAX 2-AND, MAX DI-CUT and MAX CUT all have different approximation ratios and how we can prove this.

This is joint work with Joshua Brakensiek, Neng Huang, and Uri Zwick.

 

10/27/2023
Lijie Chen, UC Berkeley
Polynomial-Time Pseudodeterministic Construction of Primes

A randomized algorithm for a search problem is pseudodeterministic if it produces a fixed canonical solution to the search problem with high probability. In their seminal work on the topic, Gat and Goldwasser posed as their main open problem whether prime numbers can be pseudodeterministically constructed in polynomial time. 

We provide a positive solution to this question in the infinitely-often regime. In more detail, we give an unconditional polynomial-time randomized algorithm B such that, for infinitely many values of n, B(1^n) outputs a canonical n-bit prime p_n with high probability. More generally, we prove that for every dense property Q of strings that can be decided in polynomial time, there is an infinitely-often pseudodeterministic polynomial-time construction of strings satisfying Q. This improves upon a subexponential-time construction of Oliveira and Santhanam. 

Our construction uses several new ideas, including a novel bootstrapping technique for pseudodeterministic constructions, and a quantitative optimization of the uniform hardness-randomness framework of Chen and Tell, using a variant of the Shaltiel-Umans generator. This talk is based on joint work with Zhenjian Lu, Igor C. Oliveira, Hanlin Ren, and Rahul Santhanam.

11/03/2023
Jesse Goodman, UT Austin
Exponentially Improved Extractors for Adversarial Sources

Randomness is a powerful resource in computer science, with applications in cryptography, distributed computing, algorithm design, and more. Unfortunately, almost all of these applications require access to perfectly uniform bits, while the bits available in nature rarely look so pure. This motivates the study of randomness extractors, which are devices that can convert weak sources of randomness into perfectly uniform bits.

The *independent source model* is the oldest and most well-studied setting in which to study randomness extraction. In this model, the goal is to extract uniform bits from a sequence of N independent random variables X1, …, XN, each with min-entropy at least k. However, in real life, natural sources of randomness may sometimes produce samples with no entropy at all. This motivates the study of *adversarial sources*, which are identical to independent sources, except that only K of them are guaranteed to be “good” (i.e., to contain min-entropy at least k).

Adversarial sources were first introduced by Chattopadhyay, Goodman, Goyal and Li (STOC ‘20), who showed how to extract from just K ≥ N^{0.5} good sources of polylogarithmic entropy. In a follow-up work, Chattopadhyay and Goodman (FOCS ‘21) improved the number of required good sources to K ≥ N^{0.01}. In this talk, I will describe a new construction that works for *exponentially fewer* good sources, K ≥ polylog(N). In fact, for any constant N (e.g., N = 10^82), our extractor requires just K = 4 of the sources to be good. To construct our extractors, we exploit old and new tools from communication complexity, extractor theory, and extremal combinatorics.

Based on joint work with Eshan Chattopadhyay.

 

11/10/2023
Greg Bodwin, University of Michigan
Recent Progress on Fault Tolerant Spanners

Given a large input graph, a k-spanner is a much smaller subgraph that preserves shortest path distances up to an approximation factor of k. When this distance approximation is robust to a bounded set of failing nodes or edges, the spanner is called fault tolerant.  Fault tolerant spanners and their relatives have applications in networking and distributed computing.
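
As a quick illustration of the basic (non-fault-tolerant) definition, here is a checker for the k-spanner property; this sketch assumes the networkx library:

    import networkx as nx

    def is_k_spanner(G, H, k):
        """Check that subgraph H is a k-spanner of G: every pairwise distance
        in H is at most k times the corresponding distance in G."""
        dG = dict(nx.all_pairs_shortest_path_length(G))
        dH = dict(nx.all_pairs_shortest_path_length(H))
        return all(
            dH.get(u, {}).get(v, float("inf")) <= k * d
            for u, nbrs in dG.items() for v, d in nbrs.items()
        )

    G = nx.cycle_graph(6)
    H = G.copy(); H.remove_edge(0, 1)    # drop one edge of the 6-cycle
    print(is_k_spanner(G, H, k=5))       # True: distance 1 stretches to exactly 5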

There has been a recent flurry of progress in our understanding of fault tolerant spanners, including faster construction algorithms and better tradeoffs between spanner size, error, and level of fault tolerance. We will survey this progress, spanning a sequence of 8 papers over the last 5 years. We will explain the new perspectives on the problem that have enabled progress, what has been solved, and what remains open.

 

11/17/2023
Thatchaphol Saranurak, University of Michigan
Using Isolating Mincuts for Fast Graph Algorithms: A Tutorial

The Isolating Mincuts algorithm is a new technique recently introduced by [Li and Panigrahi] and [Abboud, Krauthgamer, Trabelsi]. In the last three years, it has found more than ten applications in fast graph algorithms.
I will give a gentle tutorial on this technique.

12/01/2023
Sivakanth Gopi, Microsoft Research

Higher Order MDS Codes and List Decoding

In this talk, we introduce 'higher order MDS codes', a fundamental generalization of MDS codes. An order-m MDS code, denoted by MDS(m), has the property that any m subspaces formed from columns of its generator matrix intersect as minimally as possible. Ordinary MDS codes correspond to m=2. We will then show that higher order MDS codes are equivalent to various seemingly unrelated concepts in coding theory: (1) optimally list-decodable codes (2) maximally recoverable tensor codes and (3) MDS codes with support constrained generator matrices (such as in GM-MDS theorem). This along with the GM-MDS theorem for Reed-Solomon codes implies that random Reed-Solomon codes over large enough alphabet are optimally list decodable.

This closes a long line of work on list-decodability of Reed-Solomon codes starting with the celebrated result of Guruswami and Sudan (1998) who showed that they are list-decodable up to the Johnson bound. Later works showed that "random" Reed-Solomon codes are list-decodable beyond the Johnson bound. We show that they in fact are optimally list-decodable! Quantitatively, we show that a random Reed-Solomon code of rate R over a large enough field is list decodable from radius 1-R-(1-R)/(L+1) with list size at most L, which proves a conjecture of Shangguan and Tamo (2020).

Subsequent work by Guo, Zhang (2023) and Alrabiah, Guruswami and Li (2023) showed that random RS codes over linear-sized fields achieve list-decoding capacity (which is weaker than optimal list-decoding). Building on this, we show that random AG codes over constant-sized fields achieve list-decoding capacity. For this, we prove a generalized GM-MDS theorem for algebraic codes.

Based on joint work with Joshua Brakensiek, Manik Dhar, Visu Makam and Zihan Zhang.

 

12/08/2023
Jinyoung Park, Courant Institute of Mathematical Sciences, NYU
Thresholds

For a finite set X, a family F of subsets of X is said to be increasing if any set A that contains some set B in F is also in F. The p-biased product measure of F increases as p increases from 0 to 1, and often exhibits a drastic change around a specific value, which is called a "threshold." Thresholds of increasing families have been of great historical interest and a central focus of the study of random discrete structures (e.g. random graphs and hypergraphs), with estimation of thresholds for specific properties the subject of some of the most challenging work in the area.

In 2006, Jeff Kahn and Gil Kalai conjectured that a natural (and often easy to calculate) lower bound q(F) (which we refer to as the “expectation-threshold”) for the threshold is in fact never far from its actual value. A positive answer to this conjecture enables one to narrow down the location of the threshold for any increasing property to a tiny window. In particular, it easily implies several previously very difficult results in probabilistic combinatorics, such as thresholds for perfect hypergraph matchings (Johansson–Kahn–Vu) and bounded-degree spanning trees (Montgomery).

I will present recent progress on this topic. Based on joint work with Keith Frankston, Jeff Kahn, Bhargav Narayanan, and Huy Tuan Pham.
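
As a toy illustration, the p-biased probability of an increasing property can be estimated by sampling; connectivity of the random graph G(n,p) is a standard example (a sketch assuming networkx; connectivity is increasing in the edge set, with threshold around ln(n)/n):

    import networkx as nx

    def p_biased_probability(n, p, trials=200):
        """Monte Carlo estimate of Pr[G(n,p) is connected] -- an increasing
        property of the edge set, so this probability increases with p."""
        hits = sum(nx.is_connected(nx.gnp_random_graph(n, p)) for _ in range(trials))
        return hits / trials

    n = 50
    for p in (0.02, 0.08, 0.14):   # threshold for connectivity is ~ ln(n)/n = 0.078
        print(p, p_biased_probability(n, p))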

Spring 2023

 

Spring 2023 Theory Seminars will meet on Fridays from 2:00 - 3:00 pm in GDC 4.304. This schedule will be updated throughout the semester.

01/13/2023 - Scott Aaronson, UT: Discrete Bulk Reconstruction
01/20/2023 - Robert Andrews, University of Illinois Urbana-Champaign: On Matrix Multiplication and Polynomial Identity Testing
01/27/2023 - William Kretschmer, UT: Quantum Cryptography in Algorithmica
02/03/2023 - David Zuckerman, UT: Almost Chor-Goldreich Sources and Adversarial Random Walks
02/10/2023 - Xiaoyu He, Princeton University: Deletion Code Bounds via Pseudorandomness
02/17/2023 - Jack Zhou, Stanford University: Quantum Pseudoentanglement
02/24/2023 - GRADFEST
03/03/2023 - Robert Tarjan, Princeton University - James S. McDonnell Distinguished University Professor: Self-Adjusting Data Structures **GDC 2.216 Lecture Hall**
03/10/2023 - Daniel Liang, UT: Low-Stabilizer-Complexity Quantum States Are Not Pseudorandom
03/17/2023 - SPRING BREAK
03/24/2023 - Vinod Vaikuntanathan, MIT: Continuous Learning with Errors and Applications: Gaussian Mixtures and Undetectable Backdoors ***11:00am-12:00pm***
03/27/2023 - Jelani Nelson, UC Berkeley: New local differentially private protocols for frequency and mean estimation ***POB 2.302 (Avaya Auditorium) 1:00-2:00pm***
03/29/2023 - Marijn Heule, Carnegie Mellon: Computer-aided Mathematics: Successes, Advances, and Trust ***2:30-3:30pm***
03/31/2023 - Gregory Valiant, Stanford University: Optimization with Limited Memory?
04/07/2023 - Ruizhe Zhang, UT Austin: Solving SDP Faster: A Robust IPM Framework and Efficient Implementation
04/14/2023 - Shuchi Chawla, UT Austin: Pandora's Box with Correlations: Learning and Approximation ***3:00-4:00pm***
04/21/2023 - ***CANCELLED***
04/28/2023 - FINALS
05/03/2023 - Yifeng Teng, Google Research: Prophet Secretary Against the Online Optimal
05/05/2023 - Raghu Meka, UCLA: Strong bounds for 3-progressions

 

01/13/2023
Scott Aaronson, UT
Discrete Bulk Reconstruction

According to the so-called "AdS/CFT correspondence" from quantum gravity, the geometries of certain spacetimes are fully determined by quantum states that live on their boundaries -- indeed, by the entropies of portions of those boundary states. This work investigates to what extent the geometries can be reconstructed from the entropies in polynomial time. Bouland, Fefferman, and Vazirani (2019) argued that the AdS/CFT map can be exponentially complex if one wants to reconstruct regions such as the interiors of black holes. Our main result provides a sort of converse: we show that, in the special case of a single 1D boundary, if the input data consists of a list of entropies of contiguous boundary regions, and if the entropies satisfy a single inequality called Strong Subadditivity, then we can construct a graph model for the bulk in linear time. Moreover, the bulk graph is planar, it has O(N^2) vertices (the information-theoretic minimum), and it's "universal," with only the edge weights depending on the specific entropies in question. From a combinatorial perspective, our problem boils down to an "inverse" of the famous min-cut problem: rather than being given a graph and asked to find a min-cut, here we're given the values of min-cuts separating various sets of vertices, and we need to find a weighted undirected graph consistent with those values. We also make initial progress on the case of multiple 1D boundaries -- where the boundaries could be connected by wormholes -- including an upper bound of O(N^4) vertices whenever a planar bulk graph exists, thus putting the problem into NP.

The talk is meant to be accessible to a CS theory audience, and assumes no particular knowledge of quantum gravity or even quantum mechanics.
Joint work with Jason Pollack (UT Austin).
https://arxiv.org/abs/2210.15601 
 

01/20/2023
Robert Andrews, University of Illinois Urbana-Champaign
On Matrix Multiplication and Polynomial Identity Testing

Determining the complexity of matrix multiplication is a fundamental problem of theoretical computer science. It is popularly conjectured that ω, the matrix multiplication exponent, equals 2. If true, this conjecture would yield fast algorithms for a wide array of problems in linear algebra and beyond. If instead ω > 2, can the hardness of matrix multiplication be leveraged to design algorithms for other problems? In this talk, I will describe how lower bounds on ω can be used to make progress on derandomizing polynomial identity testing. 

01/27/2023
William Kretschmer, UT
Quantum Cryptography in Algorithmica

In this talk, I will discuss the construction of a classical oracle relative to which P = NP yet single-copy secure pseudorandom quantum states exist. In the language of Impagliazzo's five worlds, this is a construction of pseudorandom states in "Algorithmica," and hence shows that in a black-box setting, quantum cryptography based on pseudorandom states is possible even if one-way functions do not exist. As a consequence, we demonstrate that there exists a property of a cryptographic hash function that simultaneously (1) suffices to construct pseudorandom quantum states, (2) holds for a random oracle, and thus plausibly holds for existing hash functions like SHA3, and (3) is independent of the P vs. NP question in the black box setting. This offers further evidence that one-way functions are not necessary for computationally-secure quantum cryptography. Our proof builds on recent work of Aaronson, Ingram, and Kretschmer (2022). Based on joint work with Luowen Qian, Makrand Sinha, and Avishay Tal.

02/03/2023
David Zuckerman, UT
Almost Chor-Goldreich Sources and Adversarial Random Walks

A Chor-Goldreich (CG) source is a sequence of random variables where each has min-entropy, even conditioned on the previous ones.  We extend this notion in several ways, most notably allowing each random variable to have only Shannon entropy conditioned on previous ones.  We achieve pseudorandomness results for Shannon-CG sources that were not known to hold even for standard CG sources, and even for the weaker model of Santha-Vazirani sources.

Specifically, we construct a deterministic condenser that on input a Shannon-CG source, outputs a distribution that is close to having constant entropy gap, namely its min-entropy is only an additive constant less than its length.  Therefore, we can simulate any randomized algorithm with small failure probability using almost CG sources with no multiplicative slowdown. This result extends to randomized protocols as well, and any setting in which we cannot simply cycle over all seeds, and a "one-shot" simulation is needed.  Moreover, our construction works in an online manner, since it is based on random walks on expanders.

Our main technical contribution is a novel analysis of random walks, which should be of independent interest. We analyze walks with adversarially correlated steps, each step being entropy-deficient, on good enough lossless expanders. We prove that such walks (or certain interleaved walks on two expanders) accumulate entropy.

Joint work with Dean Doron, Dana Moshkovitz, and Justin Oh.

02/10/2023
Xiaoyu He, Princeton University
Deletion Code Bounds via Pseudorandomness

Deletion channels, introduced by Levenshtein in the 1960s, are noisy channels that delete characters from the input. A (binary) k-deletion code of length n is a set C of binary strings of length n capable of correcting k such errors, i.e. satisfying the property that no pair of elements of C shares a common subsequence of length at least n-k. Let D(n, k) be the size of the largest k-deletion code of length n. Two central problems are to determine (a) the order of D(n, k) for constant k, and (b) the supremum of all 0<p<1 such that D(n, pn) grows exponentially in n (the so-called "zero-rate threshold"). In this talk, we establish the first nontrivial lower bound for problem (a) when k > 1, improving the greedy lower bound for D(n,k) by a logarithmic factor. We also prove the first nontrivial upper bound for problem (b), showing that D(n,pn) = 2^o(n) for p > 1/2 - 10^{-60}. The proofs use a variety of techniques from extremal and probabilistic combinatorics, especially pseudorandomness and a regularity lemma.
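
The defining condition can be checked directly for small codes via the standard longest-common-subsequence dynamic program (an illustrative sketch):

    def lcs_length(a, b):
        """Length of the longest common subsequence of strings a and b."""
        dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
        for i, x in enumerate(a):
            for j, y in enumerate(b):
                dp[i + 1][j + 1] = dp[i][j] + 1 if x == y else max(dp[i][j + 1], dp[i + 1][j])
        return dp[len(a)][len(b)]

    def is_k_deletion_code(C, k):
        """C corrects k deletions iff no two codewords share a common
        subsequence of length >= n - k (n = codeword length)."""
        n = len(next(iter(C)))
        return all(lcs_length(u, v) < n - k
                   for u in C for v in C if u < v)

    print(is_k_deletion_code({"0000", "1111"}, 3))  # True: LCS length 0 < 1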

Based on joint works with Noga Alon, Gabriela Bourla, Ben Graham, Venkatesan Guruswami, Noah Kravitz, and Ray Li.

02/17/2023
Jack Zhou, Stanford University
Quantum Pseudoentanglement

Quantum pseudorandom states are efficiently constructable states which nevertheless masquerade as Haar-random states to poly-time observers. First defined by Ji, Liu and Song, such states have found a number of applications ranging from cryptography to the AdS/CFT correspondence. A fundamental question is exactly how much entanglement is required to create such states. Haar-random states, as well as t-designs for t ≥ 2, exhibit near-maximal entanglement. Here we provide the first construction of pseudorandom states with only polylogarithmic entanglement entropy across an equipartition of the qubits, which is the minimum possible. Our construction can be based on any one-way function secure against quantum attack. We additionally show that the entanglement in our construction is fully "tunable", in the sense that one can have pseudorandom states with entanglement Θ(f(n)) for any desired function ω(log n) ≤ f(n) ≤ O(n).

More fundamentally, our work calls into question to what extent entanglement is a "feelable" quantity of quantum systems. Inspired by recent work of Gheorghiu and Hoban, we define a new notion which we call "pseudoentanglement", which are ensembles of efficiently constructable quantum states which hide their entanglement entropy. We show such states exist in the strongest form possible while simultaneously being pseudorandom states. We also describe diverse applications of our result from entanglement distillation to property testing to quantum gravity.

03/03/2023
Robert Tarjan, Princeton University - James S. McDonnell Distinguished University Professor
Self-Adjusting Data Structures

Data structures are everywhere in computer software.  Classical data structures are specially designed to make each individual operation fast.  A more flexible approach is to design the structure so that it adapts to its use.  This idea has produced data structures that perform well in practice and have surprisingly good performance guarantees.  In this talk I’ll review some recent work on such data structures, specifically self-adjusting search trees and self-adjusting heaps.

Robert Tarjan is the James S. McDonnell Distinguished University Professor of Computer Science at Princeton University.  He has held academic positions at Cornell, Berkeley, Stanford, and NYU, and industrial research positions at Bell Labs, NEC, HP, Microsoft, and Intertrust Technologies.  He has invented or co-invented many of the most efficient known data structures and graph algorithms.  He was awarded the first Nevanlinna Prize from the International Mathematical Union in 1982 for “outstanding contributions to mathematical aspects of information science,” the Turing Award in 1986 with John Hopcroft for “fundamental achievements in the design and analysis of algorithms and data structures,” and the Paris Kanellakis Award in Theory and Practice in 1999 with Daniel Sleator for the invention of splay trees.  He is a member of the U.S. National Academy of Sciences, the U.S. National Academy of Engineering, the American Academy of Arts and Sciences, and the American Philosophical Society.

03/10/2023
Daniel Liang, UT
Low-Stabilizer-Complexity Quantum States Are Not Pseudorandom

Pseudorandom quantum states are a set of quantum states that are indistinguishable from a uniformly random (i.e. Haar random) quantum state. While they have been shown to be extremely useful for cryptographic purposes, a natural question is "what resources might be needed to construct them?" We attempt to answer the question relative to the Clifford + T gate set, a commonly used universal quantum gate set. As the name implies, the T gate is the most expensive operation in this set, and the number of T gates needed to create a state is often the most important measure. Our results are as follows: given blackbox access to an n-qubit quantum state that is promised to be either (1) a state prepared with Clifford gates and at most t uses of the T gate or (2) a Haar random state, we give an algorithm that distinguishes between the two cases with probability at least 1-delta using at most O(exp(t) log(1/delta)) samples and O(n exp(t) log(1/delta)) time. As such, the number of T gates necessary for the construction of pseudorandom quantum states is strictly greater than O(log n). Our proofs expand on recent results by Gross, Nezami, and Walter (2021). Based on joint work with Sabee Grewal, Vishnu Iyer, and William Kretschmer.

03/24/2023
Vinod Vaikuntanathan, MIT
Continuous Learning with Errors and Applications: Gaussian Mixtures and Undetectable Backdoors

I will describe two results at the interface of statistics, machine learning and cryptography both of which build on the recently formulated continuous learning with errors (CLWE) problem.

First, I will show that CLWE is as hard as the widely studied (discrete) learning with errors (LWE) problem using techniques from leakage-resilient cryptography. In turn, I will use this to show the nearly optimal hardness of the long-studied Gaussian mixture learning problem.

Secondly, I will show an application of CLWE to machine learning. In the increasingly common setting where the training of models is outsourced, I will describe a method whereby a malicious trainer can use cryptography to insert an *undetectable* backdoor in a classifier. Using a secret key, the trainer can then slightly alter inputs to create large deviations in the model output. Without the secret key, the existence of the backdoor is hidden.

The talk is based on joint works with Shafi Goldwasser, Michael P. Kim and Or Zamir; and with Aparna Gupte and Neekon Vafa.

Vinod Vaikuntanathan is a professor of computer science at MIT, interested in cryptography and its applications across computer science. Vinod is the co-inventor of modern fully homomorphic encryption systems and many other lattice-based cryptographic primitives. He is a recipient of the Gödel Prize (2022), MIT Harold E. Edgerton Faculty Award (2018), DARPA Young Faculty Award (2018), the Sloan Faculty Fellowship (2013) and the Microsoft Faculty Fellowship (2014).

03/27/2023
Jelani Nelson, UC Berkeley
New local differentially private protocols for frequency and mean estimation

Consider the following examples of distributed applications: a texting app wants to train ML models for autocomplete based on text history residing on-device across millions of devices, or the developers of some other app want to understand common app settings by their users. In both cases, and many others, a third party wants to understand something in the aggregate about a large distributed database but under the constraint that each individual record requires some guarantee of privacy. Protocols satisfying so-called local differential privacy have become the gold standard for guaranteeing privacy in such situations, and in this talk I will discuss new such protocols for two of the most common problems that require solutions in this framework: frequency estimation, and mean estimation.
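
As a concrete warm-up instance of local differential privacy for frequency estimation, here is the classical randomized response mechanism (a baseline sketch, not one of the new protocols in the talk):

    import math
    import random

    def randomized_response(bit, epsilon):
        """Report the true bit with probability e^eps/(e^eps + 1); this satisfies
        eps-local differential privacy for a single binary attribute."""
        p = math.exp(epsilon) / (math.exp(epsilon) + 1)
        return bit if random.random() < p else 1 - bit

    def estimate_frequency(reports, epsilon):
        """Unbiased estimate of the true fraction of 1s from the noisy reports:
        E[report] = f*(2p-1) + (1-p), so solve for f."""
        p = math.exp(epsilon) / (math.exp(epsilon) + 1)
        mean = sum(reports) / len(reports)
        return (mean - (1 - p)) / (2 * p - 1)

    data = [1] * 300 + [0] * 700
    reports = [randomized_response(b, epsilon=1.0) for b in data]
    print(estimate_frequency(reports, epsilon=1.0))  # approximately 0.3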

Based on joint works with subsets of Hilal Asi, Vitaly Feldman, Huy Le Nguyen, and Kunal Talwar.

Jelani Nelson is a Professor of Electrical Engineering and Computer Sciences at UC Berkeley, interested in randomized algorithms, sketching and streaming algorithms, dimensionality reduction, and differential privacy. He is a recipient of the Presidential Early Career Award for Scientist and Engineers (PECASE), and a Sloan Research Fellowship. He is also Founder and President of AddisCoder, Inc., a nonprofit that provides algorithms training to high school students in Ethiopia and Jamaica.

03/29/2023
Marijn Heule, Carnegie Mellon University 
Computer-aided Mathematics: Successes, Advances, and Trust

Progress in satisfiability (SAT) solving has made it possible to determine the correctness of complex systems and answer long-standing open questions in mathematics. The SAT-solving approach is completely automatic and can produce clever though potentially gigantic proofs. We can have confidence in the correctness of the answers because highly trustworthy systems can validate the underlying proofs regardless of their size.

We demonstrate the effectiveness of the SAT approach by presenting some successes, including the solution of the Boolean Pythagorean Triples problem, computing the fifth Schur number, and resolving the remaining case of Keller’s conjecture. Moreover, we constructed and validated proofs for each of these results. The second part of the talk focuses on notorious math challenges for which automated reasoning may well be suitable. In particular, we discuss advances in applying SAT-solving techniques to the Hadwiger-Nelson problem (chromatic number of the plane), optimal schemes for matrix multiplication, and the Collatz conjecture.

Marijn Heule is an Associate Professor at Carnegie Mellon University and received his PhD at Delft University of Technology (2008). His contributions to automated reasoning have enabled him and others to solve hard problems in formal verification and mathematics. He has developed award-winning SAT solvers and his preprocessing and proof-producing techniques are used in many state-of-the-art solvers. Marijn won multiple best paper awards at international conferences, including at SAT, CADE, IJCAR, TACAS, HVC, and IJCAI-JAIR. He is one of the editors of the Handbook of Satisfiability. This 900+ page handbook has become the reference for SAT research.

03/31/2023
Gregory Valiant, Stanford University
Optimization with Limited Memory?

In many high-dimensional optimization settings, there are significant gaps between the amount of data or number of iterations required by algorithms whose memory usage scales linearly with the dimension versus more complex and memory-intensive algorithms.  Do some problems inherently require super-linear (or quadratic memory) to be solved efficiently? In this talk, we will survey the lay-of-the-land of fundamental tradeoffs between the amount of available memory, and the amount of data or queries to the function being optimized.  This will include a discussion of our recent work showing that for optimizing a convex function, given the ability to query the function value/gradient at points, a (significantly) super-linear amount of memory is required to achieve the optimal rate of convergence obtained by algorithms using more than quadratic memory.  Throughout, the emphasis will be on the many open problems in this area.

04/07/2023
Ruizhe Zhang, UT Austin
Solving SDP Faster: A Robust IPM Framework and Efficient Implementation

Semidefinite programming (SDP) is one of the most important tools in optimization theory. In this talk, I will introduce a new robust interior-point method (IPM) analysis for SDP. Under this new framework, we can improve the running time for solving SDPs with an n×n variable and m constraints up to ϵ accuracy. For the case m = Ω(n^2), our algorithm can solve SDPs in m^ω time, where ω is the matrix multiplication exponent. This suggests that solving SDP is nearly as fast as solving a linear system with an equal number of variables and constraints. It is the first result showing that tall dense SDPs can be solved in nearly-optimal running time, and it also improves the state-of-the-art SDP solver [Jiang, Kathuria, Lee, Padmanabhan and Song, FOCS 2020]. In addition to our new IPM analysis, our algorithm also relies on a number of new techniques that might be of further interest, such as maintaining the inverse of a Kronecker product using lazy updates, and a general amortization scheme for positive semidefinite matrices. Based on joint work with Baihe Huang, Shunhua Jiang, Zhao Song, and Runzhou Tao.

04/14/2023
Shuchi Chawla, UT Austin
Pandora's Box with Correlations: Learning and Approximation

In the Pandora's Box problem, the algorithm is provided with a number of boxes with unknown (stochastic) rewards contained inside them. The algorithm can open any box at some cost, discover the reward inside, and based on these observations can choose one box and keep the reward contained in it. Given the distributions from which the rewards are drawn, the algorithm must determine an order in which to open the boxes as well as when to stop and accept the best reward found so far. In general, an optimal algorithm may make both decisions adaptively based on instantiations observed previously.

The Pandora's Box problem and its extensions capture many kinds of optimization problems with stochastic input where the algorithm can obtain instantiations of input random variables at some cost. Previous work on these problems assumes that the random variables are distributed independently. In this work, we consider Pandora's Box-type problems with correlations. Whereas the independent distributions setting is efficiently solvable, the correlated setting is hard and captures as special cases several algorithmic problems that have been studied previously -- e.g. min sum set cover and optimal decision tree. We provide the first approximation algorithms for the correlated setting under various distributional models.

This talk is based on two papers with coauthors Evangelia Gergatsouli, Jeremy McMahan, Yifeng Teng, Christos Tzamos, and Ruimin Zhang. The first appeared at FOCS'20 and the second is under preparation.
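
To fix ideas, here is a Monte Carlo evaluation of one simple non-adaptive policy in the independent-rewards setting (a toy sketch, not an algorithm from the talk): open boxes in a fixed order and stop once the best reward seen clears a threshold.

    import random

    def run_policy(costs, reward_dists, order, stop_at, trials=10000):
        """Average net value (best reward kept minus opening costs) of the policy:
        open boxes in `order`, stopping early once a reward >= stop_at is seen.
        reward_dists[i]() draws an independent reward for box i."""
        total = 0.0
        for _ in range(trials):
            best, spent = 0.0, 0.0
            for i in order:
                spent += costs[i]
                best = max(best, reward_dists[i]())
                if best >= stop_at:
                    break
            total += best - spent
        return total / trials

    costs = [0.1, 0.4]
    dists = [lambda: random.uniform(0, 1), lambda: random.uniform(0, 2)]
    print(run_policy(costs, dists, order=[1, 0], stop_at=1.2))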

05/03/2023
Yifeng Teng, Google Research
Prophet Secretary Against the Online Optimal

In the prophet secretary problem, a sequence of random variables with independent known distributions arrive in uniformly random order. Upon seeing a realized value at each step, an online decision-maker has to either select it and stop or irrevocably discard it. The goal is to maximize the expected value of the selected variable. Traditionally, the chosen benchmark is the expected reward of the prophet, who knows all the values in advance and can always select the maximum one.

In this work, we study the prophet secretary problem against a less pessimistic but equally well-motivated benchmark: the online optimal. Here, the main goal is to find polynomial-time algorithms that guarantee near-optimal expected reward. As a warm-up, we present a quasi-polynomial time approximation scheme (QPTAS) through careful discretization and non-trivial bundling processes. Using the toolbox developed for the QPTAS, coupled with a novel frontloading technique that enables us to reduce the number of decisions we need to make, we are able to remove the dependence on the number of items in the exponent of the running time, and obtain the first polynomial time approximation scheme (PTAS) for this problem.
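
To make the two benchmarks concrete, here is a toy Monte-Carlo comparison (illustrative only, and far simpler than the paper's setting: the online-optimal computation below assumes a fixed, known arrival order, whereas prophet secretary randomizes it):

    import random

    def draw(dist):
        return random.choices([v for v, _ in dist], [p for _, p in dist])[0]

    def prophet(dists, trials=100_000):
        """Monte-Carlo estimate of E[max]: the prophet benchmark."""
        return sum(max(draw(d) for d in dists) for _ in range(trials)) / trials

    def online_optimal(dists):
        """Backward induction V_t = E[max(X_t, V_{t+1})] for a fixed, known
        arrival order: accept X_t iff it beats the continuation value."""
        v = 0.0
        for d in reversed(dists):
            v = sum(p * max(x, v) for x, p in d)
        return v

    uniform3 = [(0, 1/3), (1, 1/3), (2, 1/3)]
    dists = [uniform3, uniform3]
    print(prophet(dists), online_optimal(dists))   # ~1.44 vs ~1.33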

05/05/2023
Raghu Meka, UCLA
Strong bounds for 3-progressions

Suppose you have a set S of integers from \{1,2,\ldots,N\} that contains at least N/C elements. Then for large enough N, must S contain three equally spaced numbers (i.e., a 3-term arithmetic progression)?
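
For concreteness: \{3, 7, 11\} is a 3-term progression with common difference 4, while the powers of two contain none (the sum of two distinct powers of two has two binary 1s, so it is never twice a power of two). A brute-force check, purely illustrative:

    from itertools import combinations

    def has_3ap(s):
        """True iff s contains x < y < z that are equally spaced (x + z == 2y)."""
        s, pts = set(s), sorted(set(s))
        return any(2 * y - x in s for x, y in combinations(pts, 2))

    print(has_3ap({3, 7, 11}))         # True
    print(has_3ap({1, 2, 4, 8, 16}))   # False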

In 1953, Roth showed that this is indeed the case when C = O(\log \log N), while Behrend in 1946 showed that C can be at most 2^{O(\sqrt{\log N})}. Since then, the problem has been a cornerstone of the area of additive combinatorics. Following a series of remarkable results, a celebrated paper from 2020 due to Bloom and Sisask improved the lower bound on C to C = (\log N)^{1+c}, for some constant c > 0.

This talk will describe new work showing that C > 2^{\Omega((\log N)^{0.09})}, thus getting closer to Behrend's construction.

Based on joint work with Zander Kelley.

Fall 2022

Fall 2022 Theory Seminars will meet on Fridays from 1:45 - 2:45 pm in GDC 4.302. This schedule will be updated throughout the semester.

 

09/02/2022 - David Wu, UT: Batch Arguments for NP and More from Standard Bilinear Group Assumptions
09/09/2022 - Joe Neeman, UT: Bubble Clusters in Geometry and Computer Science
09/16/2022 - Chinmay Nirkhe, IBM Quantum: NLTS Hamiltonians from Good Quantum Codes
09/23/2022 - Sam Hopkins, MIT: Matrix Discrepancy from Quantum Communication **BEGINS AT 1:45 PM**
10/14/2022 - Danupon Nanongkai, Max Planck Institute for Informatics: Negative-Weight Single-Source Shortest Paths in Near-Linear Time
10/21/2022 - Pravesh Kothari, CMU: The Kikuchi Matrix Method
10/28/2022 - Nicole Wein, DIMACS: Closing the Gap Between Directed Hopsets and Shortcut Sets
11/04/2022 - Shuichi Hirahara, National Institute of Informatics, Tokyo, Japan: NP-Hardness of Learning Programs and Partial MCSP
11/11/2022 - Nikhil Bansal, University of Michigan: The Power of Two Choices for Balls into Bins: Beyond Greedy Strategies
11/18/2022 - Huacheng Yu, Princeton: Strong XOR Lemma for Communication with Bounded Rounds
12/02/2022 - 
12/09/2022 - 

09/02/2022
David Wu, UT
Batch Arguments for NP and More from Standard Bilinear Group Assumptions

Non-interactive batch arguments for NP provide a way to amortize the cost of NP verification across multiple instances. They enable a computationally-bounded prover to convince a verifier of multiple NP statements with communication that is much smaller than the total witness length and with verification time that is smaller than individually checking each instance.

In this talk, I describe a new and direct approach for constructing batch arguments from standard cryptographic assumptions on groups with bilinear maps (e.g., the subgroup decision or the k-Lin assumption). Our approach avoids heavy tools like correlation-intractable hash functions or probabilistically-checkable proofs common to previous approaches. In turn, we also obtain the first construction of batch arguments for NP from standard bilinear map assumptions.

As corollaries to our main construction, we also obtain a delegation scheme for RAM programs (also known as a succinct non-interactive argument for P) as well as an aggregate signature scheme supporting bounded aggregation from standard bilinear map assumptions in the plain model.

Joint work with Brent Waters.

09/09/2022
Joe Neeman, UT
Bubble Clusters in Geometry and Computer Science

On the plane, a circle is the smallest-perimeter shape with a given area. In three (and higher) dimensions, it's a sphere. Now what if you want to enclose two (or more) given areas using a minimal perimeter? You can do better than two circles, because if you smoosh two circles together then you can reuse part of the perimeter for both shapes. For two sets, the problem (known as the "double bubble conjecture") was solved about 20 years ago.

I'll talk about some recent progress in this area (joint work with E. Milman), some open problems, and the relevance of all this to computer science.

09/16/2022
Chinmay Nirkhe, IBM Quantum
NLTS Hamiltonians from Good Quantum Codes

The quantum PCP conjecture is one of the central open questions in quantum complexity theory. It asserts that calculating even a rough approximation to the ground energy of a local Hamiltonian is intractable even for quantum devices. The widely believed separation between the complexity classes NP and QMA necessitates that polynomial length classical proofs do not exist for calculating the ground energy. This further implies that low-energy states of local Hamiltonians cannot be described by constant depth quantum circuits. 

The ``No low-energy trivial states (NLTS)'' conjecture by Freedman and Hastings posited the existence of such Hamiltonians. This talk will describe a line of research culminating in a proof of the NLTS conjecture by Anshu, Breuckmann, and Nirkhe. The construction is based on quantum error correction and in the talk, I will elaborate on how error correction, local Hamiltonians, and low-depth quantum circuits are related.

09/23/2022 **BEGINS AT 1:45 PM**
Sam Hopkins, MIT
Matrix Discrepancy from Quantum Communication

Discrepancy is the mathematical study of balance; it has broad applications across TCS. For example, given vectors v_1,...,v_n \in R^m, one may ask how well they may be split into two groups S and \bar{S} to minimize || \sum_{i \in S} v_i - \sum_{i \in \bar{S}} v_i ||, where ||...|| is some norm. Spencer's famous result, "6 standard deviations suffice", gives essentially a full characterization in the setting that the v_i's are bounded in infinity norm and ||...|| is itself the infinity norm. A natural question, the "Matrix Spencer Conjecture", asks for a non-commutative version of this picture: can matrices be balanced as well as vectors can? This question has eluded much progress despite substantial attention in the last 10 years.
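
To make the objective concrete, here is a brute-force computation of the best signing for a tiny instance (illustrative only; Spencer's theorem concerns what is achievable as n and m grow):

    import itertools
    import numpy as np

    def best_discrepancy(V):
        """min over signings eps in {-1,+1}^n of ||sum_i eps_i v_i||_infinity."""
        V = np.asarray(V, dtype=float)
        return min(
            np.max(np.abs(np.array(eps) @ V))       # eps @ V = sum_i eps_i v_i
            for eps in itertools.product((-1, 1), repeat=len(V))
        )

    rng = np.random.default_rng(0)
    V = rng.choice((-1.0, 1.0), size=(10, 10))      # 10 random sign vectors in R^10
    print(best_discrepancy(V))                      # Spencer guarantees at most 6*sqrt(n)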

In this talk I will describe a new communication-based perspective on discrepancy theory, showing that discrepancy upper bounds (i.e., theorems saying that a certain amount of balance is always possible) are actually equivalent to communication lower bounds (i.e. theorems saying that certain communication problems lack efficient protocols). This perspective opens a line of attack on the Matrix Spencer conjecture, via a lower-bounds problem in quantum communication which is of independent interest. In the talk (time permitting), I will show a new proof of Spencer's theorem via communication lower bounds, and I will describe how that proof can be generalized to prove a special case of the Matrix Spencer Conjecture, where the matrices to be balanced have polynomially-small rank.

Based on joint work with Prasad Raghavendra and Abhishek Shetty.

10/14/2022
Danupon Nanongkai, Max Planck Institute for Informatics
Negative-Weight Single-Source Shortest Paths in Near-Linear Time

We present a randomized algorithm that computes single-source shortest paths (SSSP) in O(m log^8(n) log W) time when edge weights are integral, possibly negative, and at least -W. This essentially resolves the classic negative-weight SSSP problem. In contrast to all recent developments that rely on sophisticated continuous optimization methods and dynamic algorithms, our algorithm is simple: it requires only a simple graph decomposition and elementary combinatorial tools. (Based on a FOCS 2022 paper with Aaron Bernstein and Christian Wulff-Nilsen.)
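
For context, the textbook baseline being improved here is Bellman-Ford, which tolerates negative weights (absent negative cycles) but costs O(mn) time; a minimal sketch:

    def bellman_ford(n, edges, s):
        """Distances from s with possibly negative edge weights; O(mn) time.
        edges: list of (u, v, w). Assumes no negative cycle is reachable
        from s (one more relaxation pass would detect it)."""
        dist = [float("inf")] * n
        dist[s] = 0
        for _ in range(n - 1):                  # n-1 rounds of relaxing every edge
            for u, v, w in edges:
                if dist[u] + w < dist[v]:
                    dist[v] = dist[u] + w
        return dist

    print(bellman_ford(3, [(0, 1, 5), (1, 2, -4), (0, 2, 2)], 0))   # [0, 5, 1]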

10/21/2022
Pravesh Kothari, CMU
The Kikuchi Matrix Method

In this talk, I will present a new method that reduces understanding an appropriate notion of girth in hypergraphs to unraveling the spectrum of a "Kikuchi" matrix associated with the hypergraph. 

I will discuss three applications of this technique:

  1. Finding a refutation algorithm for smoothed instances of constraint satisfaction problems (obtained by randomly perturbing the literal patterns in a worst-case instance with a small probability) that matches the best running-time vs constraint-density trade-offs known for the significantly easier special case of random CSPs, 
  2. Confirming Feige's 2008 Conjecture that postulated an extremal girth vs density trade-off (a.k.a. Moore bounds) for k-uniform hypergraphs that generalizes the Alon-Hoory-Linial Moore bound for graphs, 
  3. Proving a cubic lower bound on the block length of 3-query locally decodable codes, improving on the prior best quadratic lower bound from the early 2000s. 

Based on joint works with Omar Alrabiyah (Berkeley), Tim Hsieh (CMU), Peter Manohar (CMU), Sidhanth Mohanty (Berkeley), and Venkat Guruswami (Berkeley). 

10/28/2022
Nicole Wein, DIMACS
Closing the Gap Between Directed Hopsets and Shortcut Sets

A shortcut set is a (small) set of edges that when added to a directed graph G produces a graph with the same transitive closure as G while reducing the diameter as much as possible. A hopset is defined similarly but for approximate distances instead of reachability. One motivation for such structures is that in many settings (e.g. distributed, parallel, dynamic, streaming), computation is faster when the diameter of the graph is small. Thus, to compute, for instance, st-reachability in such a setting, it is useful to find a shortcut set as a preprocessing step, and then run an st-reachability algorithm on the resulting graph.

This talk is about existential bounds for shortcut sets and hopsets: given a budget of ~O(n) added edges, what is the smallest diameter achievable? A folklore algorithm says that diameter O(n^{1/2}) is possible for any graph. A recent breakthrough of Kogan and Parter improves this to O(n^{1/3}) for shortcut sets and O(n^{2/5}) for hopsets. In recent work, we ask whether hopsets (approximate distances) are truly harder than shortcut sets (reachability). We close the gap between hopsets and shortcut sets by providing an O(n^{1/3})-diameter construction for hopsets.
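
A toy instance of the tradeoff (illustrative; not the Kogan-Parter or new construction): on a directed n-vertex path, roughly sqrt(n) shortcut edges between consecutive "hubs" spaced sqrt(n) apart already cut the diameter from n-1 to O(sqrt(n)).

    import math
    from collections import deque

    def max_finite_dist(n, adj):
        """Largest finite BFS distance over all ordered pairs (the
        relevant 'diameter' for reachability)."""
        best = 0
        for s in range(n):
            dist, q = {s: 0}, deque([s])
            while q:
                u = q.popleft()
                for v in adj[u]:
                    if v not in dist:
                        dist[v] = dist[u] + 1
                        q.append(v)
            best = max(best, max(dist.values()))
        return best

    n = 400
    k = math.isqrt(n)                                        # hub spacing sqrt(n)
    adj = [[i + 1] if i + 1 < n else [] for i in range(n)]   # directed path
    for h in range(0, n - k, k):
        adj[h].append(h + k)                                 # shortcut: hub -> next hub
    print(max_finite_dist(n, adj))                           # 56 here, vs 399 without shortcuts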

Based on joint work with Aaron Bernstein.

11/04/2022
Shuichi Hirahara, National Institute of Informatics, Tokyo, Japan
NP-Hardness of Learning Programs and Partial MCSP

In his seminal paper on the theory of NP-completeness, Leonid Levin proved NP-hardness of the partial function version of the Minimum DNF Size Problem in 1973, but delayed his publication because he hoped to extend it to NP-hardness of the Minimum Circuit Size Problem (MCSP). In this talk, we prove NP-hardness of the partial function version of MCSP. Our proofs are inspired by NP-hardness of learning programs, a question posed by Ker-I Ko in 1990.

11/11/2022
Nikhil Bansal, University of Michigan
The Power of Two Choices for Balls into Bins: Beyond Greedy Strategies

In the classical two-choice process for assigning balls into bins, each ball chooses two bins uniformly at random and is placed greedily in the least loaded of the two bins. This power-of-two-choices paradigm has been highly influential and leads to substantially better load balancing than a random assignment of balls into bins.
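
A quick simulation of the classical process (illustrative): with n balls into n bins, one uniformly random choice per ball gives a maximum load of roughly log n / log log n, while greedy two-choice drops it to roughly log log n.

    import random

    def max_load(n, choices):
        """Throw n balls into n bins; each ball goes to the least loaded
        of `choices` uniformly random candidate bins (greedy)."""
        bins = [0] * n
        for _ in range(n):
            picks = [random.randrange(n) for _ in range(choices)]
            bins[min(picks, key=lambda b: bins[b])] += 1
        return max(bins)

    n = 1_000_000
    print(max_load(n, 1), max_load(n, 2))   # typically around 8 vs 4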

Somewhat surprisingly, the greedy strategy turns out to be quite sub-optimal for some natural generalizations. One such setting is the graphical process where the bins correspond to the vertices of a graph G, and at any time a random edge is picked and a ball must be assigned to one of its end-points. Another setting is where the balls can also be deleted arbitrarily by an oblivious adversary.  In this talk we will see why the greedy strategy can perform poorly, and I will describe other strategies for these settings that are close to optimal.

Based on joint works with Ohad Feldheim and William Kuszmaul.

11/18/2022
Huacheng Yu, Princeton
Strong XOR Lemma for Communication with Bounded Rounds

In this talk, we show a strong XOR lemma for bounded-round two-player randomized communication. For a function f:X×Y→{0,1}, the n-fold XOR function f^⊕n:X^n×Y^n→{0,1} maps n input pairs (x_1,...,x_n), (y_1,...,y_n) to the XOR of the n output bits f(x_1,y_1)⊕...⊕f(x_n,y_n). We prove that if every r-round communication protocol that computes f with probability 2/3 uses at least C bits of communication, then any r-round protocol that computes f^⊕n with probability 1/2+exp(-O(n)) must use n(r^{-O(r)}C-1) bits. When r is a constant and C is sufficiently large, this is Omega(nC) bits. It matches the communication cost and the success probability of the trivial protocol that computes the n bits f(x_i,y_i) independently and outputs their XOR, up to a constant factor in n.

A similar XOR lemma has been proved for f whose communication lower bound can be obtained via bounding the discrepancy [Shaltiel03]. By the equivalence between the discrepancy and the correlation with 2-bit communication protocols, our new XOR lemma implies the previous result.
 

Spring 2022

02/18/2022 - Oliver Korten, Columbia: The Hardest Explicit Construction
03/11/2022 - Matthew Ferland, USC: Winning the War by (Strategically) Losing Battles: Settling the Complexity of Grundy-Values in Undirected Geography
03/25/2022 -  Lisa Sauermann, MIT: List-decoding for Reed-Solomon codes
04/01/2022 - Kevin Tian, Stanford University: Semi-Random Sparse Recovery in Nearly-Linear Time
04/08/2022 - Debmalya Panigrahi, Duke University: Recent Results in Minimum Cut Algorithms
04/15/2022 - William Hoza, Simons Institute: Fooling Constant-Depth Threshold Circuits
04/22/2022 - Sushant Sachdeva, University of Toronto: Almost Linear Time Algorithms for Max-Flow and More
04/29/2022 - John Kallaugher, Sandia National Laboratories: A Quantum Advantage for a Natural Streaming Problem **HYBRID EVENT in GDC 4.304 - Event will begin at 2:00PM.**
05/06/2022 - Dean Doron, Ben-Gurion University: Approximating Large Powers of Stochastic Matrices in Small Space

02/18/2022
Oliver Korten, Columbia
The Hardest Explicit Construction

We investigate the complexity of explicit construction problems, where the goal is to produce a particular object of size n possessing some pseudorandom property in time polynomial in n. We give overwhelming evidence that APEPP, defined originally by Kleinberg et al., is the natural complexity class associated with explicit constructions of objects whose existence follows from the probabilistic method, by placing a variety of such construction problems in this class. We then demonstrate that a result of Jeřábek on provability in Bounded Arithmetic, when reinterpreted as a reduction between search problems, shows that constructing a truth table of high circuit complexity is complete for APEPP under P^NP reductions. This illustrates that Shannon's classical proof of the existence of hard boolean functions is in fact a universal probabilistic existence argument: derandomizing his proof implies a generic derandomization of the probabilistic method. As a corollary, we prove that EXP^NP contains a language of circuit complexity 2^{n^{Ω(1)}} if and only if it contains a language of circuit complexity 2^n/(2n). Finally, for several of the problems shown to lie in APEPP, we demonstrate direct polynomial time reductions to the explicit construction of hard truth tables.

03/11/2022
Matthew Ferland, USC
Winning the War by (Strategically) Losing Battles: Settling the Complexity of Grundy-Values in Undirected Geography

We settle two long-standing complexity-theoretical questions, open since 1981 and 1993, in combinatorial game theory (CGT).

We prove that the Grundy value (a.k.a. nim-value, or nimber) of Undirected Geography is PSPACE-complete to compute. This exhibits a stark contrast with a result from 1993 that Undirected Geography is polynomial-time solvable. By distilling to a simple reduction, our proof further establishes a dichotomy theorem, providing a "phase transition to intractability" in Grundy-value computation, sharply characterized by a maximum degree of four: the Grundy value of Undirected Geography over any degree-three graph is polynomial-time computable, but over degree-four graphs, even when planar and bipartite, is PSPACE-hard. Additionally, we show, for the first time, how to construct Undirected Geography instances with Grundy value ∗n and size polynomial in n.

We strengthen a result from 1981 showing that sums of tractable partisan games are PSPACE-complete in two fundamental ways. First, since Undirected Geography is an impartial ruleset, we extend the hardness of sums to impartial games, a strict subset of partisan games. Second, the 1981 construction is not built from a natural ruleset, instead using a long sum of tailored short-depth game positions. We use the sum of two Undirected Geography positions to create our hard instances. Our result also has computational implications for Sprague-Grundy Theory (1930s), which shows that the Grundy value of the disjunctive sum of any two impartial games can be computed, in polynomial time, from their Grundy values. In contrast, we prove that assuming PSPACE ≠ P, there is no general polynomial-time method to summarize two polynomial-time solvable impartial games so as to efficiently solve their disjunctive sum.
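
For readers less familiar with Sprague-Grundy theory, the positive side mentioned above is easy to state in code: a position's Grundy value is the mex of its options' values, and the value of a disjunctive sum is the XOR (nim-sum) of the components' values. A generic sketch on a toy ruleset (not Undirected Geography):

    from functools import lru_cache

    def mex(s):
        """Minimum excludant: least non-negative integer not in s."""
        g = 0
        while g in s:
            g += 1
        return g

    @lru_cache(maxsize=None)
    def grundy(pile):
        """Grundy value of one pile in the subtraction game where a move
        removes 1, 2, or 3 tokens (an illustrative impartial ruleset)."""
        return mex({grundy(pile - t) for t in (1, 2, 3) if t <= pile})

    # Sprague-Grundy: the player to move loses the sum iff the XOR is 0.
    print(grundy(5), grundy(7), grundy(5) ^ grundy(7))   # 1 3 2: first player wins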

03/25/2022
Lisa Sauermann, MIT
List-decoding for Reed-Solomon codes

Reed-Solomon codes are an important and intensively studied class of error-correcting codes. After giving some background, this talk will discuss the so-called list-decoding problem for Reed-Solomon codes. More specifically, we prove that for any fixed list-decoding parameters, there exist Reed-Solomon codes with a certain rate, which is optimal up to a constant factor. This in particular answers a question of Guo, Li, Shangguan, Tamo, and Wootters about list-decodability of Reed-Solomon codes with radius close to 1. Joint work with Asaf Ferber and Matthew Kwan.
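
For background, a Reed-Solomon codeword is simply the evaluation table of a low-degree polynomial, and list-decoding asks for all codewords within a radius too large for unique decoding. A toy encoder over a prime field (illustrative):

    def rs_encode(msg, n, p=7919):
        """Encode k message symbols as the evaluations at n points of GF(p)
        of the degree-(k-1) polynomial whose coefficients are the message."""
        return [sum(c * pow(x, i, p) for i, c in enumerate(msg)) % p
                for x in range(n)]

    # Rate k/n = 3/8. Two degree-<3 polynomials agree on at most 2 points,
    # so distinct codewords differ in at least n - k + 1 = 6 coordinates.
    print(rs_encode([5, 1, 2], 8))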

04/01/2022
Kevin Tian, Stanford University
Semi-Random Sparse Recovery in Nearly-Linear Time

Sparse recovery is one of the most fundamental and well-studied inverse problems. Standard statistical formulations of the problem are provably solved by general convex programming techniques and more practical, fast (nearly-linear time) iterative methods. However, these latter "fast algorithms" have previously been observed to be brittle in various real-world settings.
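
A representative member of this family of fast iterative methods is iterative hard thresholding (IHT), whose per-iteration cost is two matrix-vector products; a minimal numpy sketch (the paper's semi-random-robust algorithm is substantially more involved):

    import numpy as np

    def iht(A, b, s, iters=300):
        """Iterative hard thresholding for b = A x* with x* s-sparse.
        Works when A acts nearly isometrically on sparse vectors (RIP)."""
        n, d = A.shape
        x = np.zeros(d)
        for _ in range(iters):
            x = x + A.T @ (b - A @ x) / n       # gradient step on ||Ax-b||^2/2
            small = np.argsort(np.abs(x))[:-s]  # all but the s largest entries
            x[small] = 0.0                      # hard threshold to s-sparsity
        return x

    rng = np.random.default_rng(1)
    n, d, s = 200, 400, 5
    A = rng.normal(size=(n, d))
    x_star = np.zeros(d)
    x_star[rng.choice(d, s, replace=False)] = rng.normal(size=s)
    x_hat = iht(A, A @ x_star, s)
    print(np.linalg.norm(x_hat - x_star))       # near 0 under RIP-type conditions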

We investigate the brittleness of fast sparse recovery algorithms to generative model changes through the lens of studying their robustness to a "helpful" semi-random adversary, a framework which tests whether an algorithm overfits to input assumptions. We consider the following basic model: let A be an n×d measurement matrix which contains an unknown m×d subset of rows G which are bounded and satisfy the restricted isometry property (RIP), but is otherwise arbitrary. Letting x* in R^d be s-sparse, and given either exact measurements b = Ax* or noisy measurements b = Ax* + xi, we design algorithms recovering x* information-theoretically optimally in nearly-linear time. We extend our algorithm to hold for weaker generative models relaxing our planted RIP row subset assumption to a natural weighted variant, and show that our method's guarantees naturally interpolate the quality of the measurement matrix to, in some parameter regimes, run in sublinear time.

Our approach differs from that of prior fast iterative methods with provable guarantees under semi-random generative models (Cheng and Ge '18, Li et al. '20), which typically separate the problem of learning the planted instance from the estimation problem, i.e. they attempt to first learn the planted "good" instance (in our case, the matrix G). However, natural conditions on a submatrix which make sparse recovery tractable, such as RIP, are NP-hard to verify and hence first learning a sufficient row reweighting appears challenging. We eschew this approach and design a new iterative method, tailored to the geometry of sparse recovery, which is provably robust to our semi-random model. Our hope is that our approach opens the door to new robust, efficient algorithms for other natural statistical inverse problems.

Based on joint work with Jonathan Kelner, Jerry Li, Allen Liu, and Aaron Sidford

04/08/2022
Debmalya Panigrahi, Duke University
Recent Results in Minimum Cut Algorithms

Minimum cut problems are among the most basic questions in algorithm design. In the last few years, there has been a resurgence in new results in this domain, resulting in the first improvements in many decades for problems such as global min-cut, vertex min-cut, all-pairs min-cut, etc. In this talk, I will explore some of these results, focusing on the broad themes and techniques that have driven this progress. 

This talk will be based on papers that have appeared in FOCS '20, STOC '21, and FOCS '21.

04/15/2022
William Hoza, Simons Institute
Fooling Constant-Depth Threshold Circuits

We present the first non-trivial pseudorandom generator (PRG) for linear threshold (LTF) circuits of arbitrary constant depth and super-linear size. This PRG fools circuits with depth d and n^{1 + delta} wires, where delta = exp(-O(d)), using seed length O(n^{1 - delta}) and with error exp(-n^{delta}). This tightly matches the best known lower bounds for this circuit class. As a consequence of our result, all the known hardness for LTF circuits has now effectively been translated into pseudorandomness. This brings the extensive effort in the last decade to construct PRGs and deterministic circuit-analysis algorithms for this class to the point where any subsequent improvement would yield breakthrough lower bounds.

A key ingredient in our construction is a pseudorandom restriction procedure that has tiny failure probability, but simplifies the function to a non-natural "hybrid computational model" that combines decision trees and LTFs. As part of our proof we also construct an "extremely low-error" PRG for the class of functions computable by an arbitrary function of s linear threshold functions that can handle even the extreme setting of parameters s = n/polylog(n) and epsilon = exp(-n/polylog(n)).

Joint work with Pooya Hatami, Avishay Tal, and Roei Tell.

04/22/2022
Sushant Sachdeva, University of Toronto
Almost Linear Time Algorithms for Max-Flow and More
***There will be an additional 15-minute optional portion of the talk immediately afterwards in order to fully convey the main ideas of the work.***

We give the first almost-linear time algorithm for computing exact maximum flows and minimum-cost flows on directed graphs. By well-known reductions, this implies almost-linear time algorithms for several problems including bipartite matching, optimal transport, and undirected vertex connectivity.

Our algorithm is designed using a new Interior Point Method (IPM) that builds the flow as a sequence of almost-linear number of approximate undirected minimum-ratio cycles, each of which is computed and processed very efficiently using a new dynamic data structure.

Our framework extends to give an almost-linear time algorithm for computing flows that minimize general edge-separable convex functions to high accuracy. This gives the first almost-linear time algorithm for several problems including entropy-regularized optimal transport, matrix scaling, p-norm flows, and Isotonic regression.

Joint work with Li Chen, Rasmus Kyng, Yang Liu, Richard Peng, and Maximilian Probst Gutenberg.

04/29/2022
John Kallaugher, Sandia National Laboratories
A Quantum Advantage for a Natural Streaming Problem
**HYBRID EVENT in GDC 4.304 - Event will begin at 2:00PM.**

Data streaming, in which a large dataset is received as a "stream" of updates, is an important model in the study of space-bounded computation. Starting with the work of Le Gall [SPAA '06], it has been known that quantum streaming algorithms can use asymptotically less space than their classical counterparts for certain problems. However, so far, all known examples of quantum advantages in streaming are for problems that are either specially constructed for that purpose, or require many streaming passes over the input.

We give a one-pass quantum streaming algorithm for one of the best studied problems in classical graph streaming - the triangle counting problem. Almost-tight parametrized upper and lower bounds are known for this problem in the classical setting; our algorithm uses polynomially less space in certain regions of the parameter space, resolving a question posed by Jain and Nayak in 2014 on achieving quantum advantages for natural streaming problems.

05/06/2022
Dean Doron, Ben-Gurion University
Approximating Large Powers of Stochastic Matrices in Small Space

We give a deterministic space-efficient algorithm for approximating powers of stochastic matrices. On input a w×w stochastic matrix A, our algorithm approximates A^n in space O(log n + √(log n) · log w) to within high accuracy. This improves upon the seminal work by Saks and Zhou, which requires O(log^{3/2} n + √(log n) · log w) space, in the regime n ≫ w.
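
For contrast, computing A^n exactly is easy in *time* via repeated squaring; the content of the result is doing it in tiny *space*, which the straightforward method below (storing full w×w intermediates) does not achieve:

    import numpy as np

    def matrix_power(A, n):
        """A^n by repeated squaring: O(log n) multiplications, but it keeps
        full w x w intermediate matrices in memory."""
        result = np.eye(A.shape[0])
        while n:
            if n & 1:
                result = result @ A
            A = A @ A
            n >>= 1
        return result

    A = np.array([[0.9, 0.1],
                  [0.4, 0.6]])          # rows sum to 1: a stochastic matrix
    print(matrix_power(A, 10**6))       # rows -> stationary distribution (0.8, 0.2)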

Fall 2021

08/27/2021 - Alexandros Hollender, University of Oxford: The Complexity of Gradient Descent: CLS = PPAD ∩ PLS
09/03/2021 - Nathan Klein, University of Washington: A (Slightly) Improved Approximation Algorithm for Metric TSP
09/10/2021 - Vishesh Jain, Stanford University, Stein Fellow: Towards the Sampling Lovász Local Lemma
09/17/2021 - Seth Pettie, University of Michigan: Information Theoretic Limits of Cardinality Estimation: Fisher Meets Shannon
10/01/2021 - Roei Tell, IAS+DIMACS: Hardness vs Randomness, Revised: Uniform, Non-Black-Box, and Instance-Wise
10/08/2021 - Eric Vigoda, UCSB: Computational Phase Transition for Approximate Counting
10/15/2021 - Srikanth Srinivasan, Aarhus University: Superpolynomial Lower Bounds Against Low-Depth Algebraic Circuits
10/22/2021 - Amnon Ta-Shma, Tel-Aviv University: Expander Random Walks: A Fourier-Analytic Approach
11/05/2021 -  Joe Neeman, UT Austin: Robust Testing of Low-Dimensional Functions *Hybrid Event: To be hosted virtually and in person in GDC 4.304*
11/12/2021 - Virginia Williams, MIT: Hardness of Approximate Diameter: Now for Undirected Graphs
11/19/2021 -  Minshen Zhu, Purdue University: Exponential Lower Bounds for Locally Decodable and Correctable Codes for Insertions and Deletions * Event Begins at 11:00 AM*
12/03/2021 - Shai Evra, Hebrew University of Jerusalem: Locally Testable Codes with Constant Rate, Distance, and Locality

08/27/2021
Alexandros Hollender, University of Oxford
The Complexity of Gradient Descent: CLS = PPAD ∩ PLS

We study search problems that can be solved by performing Gradient Descent on a bounded convex polytopal domain and show that this class is equal to the intersection of two well-known classes: PPAD and PLS. As our main underlying technical contribution, we show that computing a Karush-Kuhn-Tucker (KKT) point of a continuously differentiable function over the domain [0,1]^2 is PPAD ∩ PLS-complete. This is the first natural problem to be shown complete for this class. Our results also imply that the class CLS (Continuous Local Search) - which was defined by Daskalakis and Papadimitriou as a more "natural" counterpart to PPAD ∩ PLS and contains many interesting problems - is itself equal to PPAD ∩ PLS.
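
As a concrete instance of the search problem, projected gradient descent on [0,1]^2 converges to an approximate KKT point: a point where each coordinate of the gradient either vanishes or pushes against an active box constraint. A toy sketch (the function and parameters are illustrative):

    import numpy as np

    def projected_gd(grad, x0, eta=0.01, iters=10_000):
        """Projected gradient descent on the box [0,1]^2; the limit point
        satisfies the KKT conditions for minimizing f over the box."""
        x = np.array(x0, dtype=float)
        for _ in range(iters):
            x = np.clip(x - eta * grad(x), 0.0, 1.0)
        return x

    # f(x, y) = (x-2)^2 + (y-0.25)^2: the unconstrained minimum lies outside
    # the box, so the KKT point is on the boundary, at (1, 0.25).
    grad = lambda z: 2 * (z - np.array([2.0, 0.25]))
    print(projected_gd(grad, [0.5, 0.9]))   # ~[1.0, 0.25]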

09/03/2021
Nathan Klein, University of Washington
A (Slightly) Improved Approximation Algorithm for Metric TSP

I will describe work in which we obtain a 3/2-epsilon approximation algorithm for metric TSP, for some epsilon>10^{-36}. This slightly improves over the classical 3/2 approximation algorithm due to Christofides [1976] and Serdyukov [1978]. The talk will focus on giving an overview of the key ideas involved, such as properties of Strongly Rayleigh distributions and the structure of near minimum cuts. This is joint work with Anna Karlin and Shayan Oveis Gharan. 

09/10/2021
Vishesh Jain, Stanford University, Stein Fellow
Towards the Sampling Lovász Local Lemma

For a constraint satisfaction problem which satisfies the condition of the Lovász local lemma (LLL), the celebrated algorithm of Moser and Tardos allows one to efficiently find a satisfying assignment. In the past few years, much work has gone into understanding whether one can efficiently sample from (approximately) the uniform distribution on satisfying assignments under LLL-like conditions. I will discuss recent progress on this problem, joint with Huy Tuan Pham (Stanford) and Thuy Duong Vuong (Stanford).
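
The Moser-Tardos algorithm itself is strikingly simple: start from a uniformly random assignment and, while some constraint is violated, resample the variables of a violated constraint. A minimal sketch for k-SAT (illustrative; it terminates quickly when the LLL condition holds):

    import random

    def moser_tardos(n, clauses, rng=random.Random(0)):
        """clauses: lists of nonzero signed literals, e.g. [1, -3, 4].
        Returns a satisfying assignment as a list of booleans."""
        x = [rng.random() < 0.5 for _ in range(n + 1)]     # 1-indexed variables
        def violated(c):
            return all(x[abs(l)] != (l > 0) for l in c)    # every literal false
        while True:
            bad = [c for c in clauses if violated(c)]
            if not bad:
                return x[1:]
            for l in bad[0]:                               # resample one bad clause
                x[abs(l)] = rng.random() < 0.5

    print(moser_tardos(4, [[1, 2, 3], [-1, 2, 4], [1, -2, -4], [-1, -3, 4]]))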

09/17/2021
Seth Pettie, University of Michigan
Information Theoretic Limits of Cardinality Estimation: Fisher Meets Shannon

Estimating the cardinality (number of distinct elements) of a large multiset is a classic problem in streaming and sketching, dating back to Flajolet and Martin's classic Probabilistic Counting (PCSA) algorithm from 1983. In this paper we study the intrinsic tradeoff between the space complexity of the sketch and its estimation error in the random oracle model. We define a new measure of efficiency for cardinality estimators called the Fisher-Shannon (Fish) number H/I. It captures the tension between the limiting Shannon entropy (H) of the sketch and its normalized Fisher information (I), which characterizes the variance of a statistically efficient, asymptotically unbiased estimator.

Our results are as follows:

  • We prove that all base-q variants of Flajolet and Martin's PCSA sketch have Fish-number H_0/I_0 ≈ 1.98016 and that every base-q variant of (Hyper)LogLog has Fish-number worse than H_0/I_0, but that they tend to H_0/I_0 in the limit as q→∞. Here H_0, I_0 are precisely defined constants.

  • We describe a sketch called Fishmonger that is based on a smoothed, entropy-compressed variant of PCSA with a different estimator function. It is proved that with high probability, Fishmonger processes a multiset of [U] such that at all times, its space is O(log^2 log U) + (1+o(1))(H_0/I_0)b ≈ 1.98b bits and its standard error is 1/sqrt(b).

We give circumstantial evidence that H_0/I_0 is the optimum Fish-number of mergeable sketches for Cardinality Estimation. We define a class of linearizable sketches and prove that no member of this class can beat H_0/I_0. The popular mergeable sketches are, in fact, also linearizable.
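
To fix intuition about the objects being compared, here is a toy LogLog-flavored estimator (illustrative only; neither PCSA nor Fishmonger): each of b registers remembers the deepest hash "rank" it has seen, and the register contents determine the estimate.

    import random

    def loglog_estimate(stream, b=256):
        """Toy LogLog-style cardinality estimator (Durand-Flajolet flavor)."""
        regs = [0] * b
        for item in stream:
            h = hash((item, "salt")) & (2**64 - 1)   # stand-in for a random oracle
            j, rest = h % b, h // b
            rank = 1
            while rest & 1 == 0 and rank < 64:       # position of lowest set bit
                rank += 1
                rest >>= 1
            regs[j] = max(regs[j], rank)
        return 0.39701 * b * 2 ** (sum(regs) / b)    # 0.39701: classic LogLog constant

    stream = [random.randrange(10**12) for _ in range(100_000)]
    print(loglog_estimate(stream))                   # ~1e5, ~1/sqrt(b) relative error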

10/01/2021
Roei Tell, IAS+DIMACS
Hardness vs Randomness, Revised: Uniform, Non-Black-Box, and Instance-Wise

Textbook hardness-to-randomness converts circuit lower bounds into PRGs. But is this black-box approach really necessary for derandomization? In this talk I'll show how to revamp the classical hardness-to-randomness framework, converting new types of *uniform lower bounds* into *non-black-box derandomization*. This yields conclusions such as promiseBPP = promiseP without PRGs, and reveals a close connection between worst-case derandomization and the new types of uniform lower bounds. Another instantiation of this new framework allows us to deduce, under plausible hypotheses, that randomness can be eliminated at essentially no observable cost when solving decision problems.

Based on joint work with Lijie Chen.

10/08/2021
Eric Vigoda, UCSB
Computational Phase Transition for Approximate Counting

Spectral independence is a new technique, recently introduced by Anari et al (2020), for establishing rapid mixing results for Markov Chain Monte Carlo (MCMC) algorithms. We show that spectral independence implies optimal convergence rate for a variety of MCMC algorithms. This yields a dramatic computational phase transition for various approximate counting problems such as weighted independent sets.
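
As a concrete example of the chains in question, here is Glauber dynamics for the hardcore (weighted independent set) model with fugacity lambda; the computational phase transition concerns how fast such chains mix as lambda crosses the tree threshold. A sketch (illustrative):

    import random

    def glauber_hardcore(adj, lam, steps, rng=random.Random(0)):
        """Glauber dynamics for independent sets I weighted by lam^|I|.
        adj: dict mapping each vertex to its list of neighbors."""
        ind, verts = set(), list(adj)
        for _ in range(steps):
            v = rng.choice(verts)                    # resample v's spin
            if any(u in ind for u in adj[v]):
                ind.discard(v)                       # blocked by a neighbor
            elif rng.random() < lam / (1 + lam):
                ind.add(v)
            else:
                ind.discard(v)
        return ind

    cycle = {i: [(i - 1) % 8, (i + 1) % 8] for i in range(8)}
    print(glauber_hardcore(cycle, lam=1.0, steps=10_000))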

This talk is based on joint works with Zongchen Chen (MIT) and Kuikui Liu (Washington).

10/15/2021
Srikanth Srinivasan, Aarhus University
Superpolynomial Lower Bounds Against Low-Depth Algebraic Circuits

Every multivariate polynomial P(x_1,...,x_n) can be written as a sum of monomials, i.e. a sum of products of variables and field constants. In general, the size of such an expression is the number of monomials that have a non-zero coefficient in P.

What happens if we add another layer of complexity and consider expressions that are sums of products of sums (of variables and field constants)? Now, it becomes unclear how to prove that a given polynomial P(x_1,...,x_n) does not have small expressions of this form. In this work, we solve exactly this problem.

More precisely, we prove that certain explicit polynomials have no polynomial-sized "Sigma-Pi-Sigma" (sums of products of sums) representations. We can also show similar results for Sigma-Pi-Sigma-Pi, Sigma-Pi-Sigma-Pi-Sigma and so on for all "constant-depth" expressions.
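
In symbols, a Sigma-Pi-Sigma expression of top fan-in $s$ computes a polynomial of the form (a sketch of the standard definition)
\[ P(x_1,\ldots,x_n) \;=\; \sum_{i=1}^{s} \prod_{j=1}^{d_i} \ell_{i,j}(x_1,\ldots,x_n), \]
where each $\ell_{i,j}$ is an affine form; for instance, $x^2 - y^2 = (x+y)(x-y)$ is a Sigma-Pi-Sigma expression with $s=1$. The lower bound says that for certain explicit $P$, any such expression must have size super-polynomial in $n$.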

Based on joint work with Nutan Limaye (ITU Copenhagen + IIT Bombay) and Sébastien Tavenas (USMB + Univ. Grenoble, CNRS).

10/22/2021
Amnon Ta-Shma, Tel-Aviv University
Expander Random Walks: A Fourier-Analytic Approach

We consider the following question: Assume the vertices of an expander graph are labelled by {1,-1}. What test functions can or cannot distinguish t independent samples from those obtained by a random walk?

We prove that all symmetric functions are fooled by random walks on expanders with constant spectral gap. We use Fourier analysis, noting that the bias of a character $\chi_S$ is determined not only by the cardinality of S, but also by the distances between the elements of S.

The talk is based on two papers:
- Expander Random Walks: A Fourier-Analytic Approach with Gil Cohen and Noam Peri, and,
- Expander Random Walks: The General Case and Limitations with Gil Cohen, Dor Minzer, Shir Peleg and Aaron Potechin.

11/05/2021
Joe Neeman, UT Austin
Robust Testing of Low-Dimensional Functions  *Hybrid Event: To be hosted virtually and in person in GDC 4.304*

A natural problem in high-dimensional inference is to decide if a classifier f:R^n→{−1,1} depends on a small number of linear directions of its input data. Call a function g:R^n→{−1,1} a linear k-junta if it is completely determined by some k-dimensional subspace of the input space. A recent work of the authors showed that linear k-juntas are testable. Thus there exists an algorithm to distinguish between: 1. f:R^n→{−1,1} which is a linear k-junta with surface area s, 2. f is ϵ-far from any linear k-junta with surface area (1+ϵ)s, where the query complexity of the algorithm is independent of the ambient dimension n.

Following the surge of interest in noise-tolerant property testing, in this paper we prove a noise-tolerant (or robust) version of this result. Namely, we give an algorithm which, given any c>0, ϵ>0, distinguishes between: 1. f:R^n→{−1,1} has correlation at least c with some linear k-junta with surface area s. 2. f has correlation at most c−ϵ with any linear k-junta with surface area at most s. The query complexity of our tester is k^{poly(s/ϵ)}.

Using our techniques, we also obtain a fully noise tolerant tester with the same query complexity for any class C of linear k-juntas with surface area bounded by s. As a consequence, we obtain a fully noise tolerant tester with query complexity k^{O(poly(log k/ϵ))} for the class of intersections of k halfspaces (for constant k) over the Gaussian space. Our query complexity is independent of the ambient dimension n. Previously, no non-trivial noise tolerant testers were known even for a single halfspace.

11/12/2021
Virginia Williams, MIT
Hardness of Approximate Diameter: Now for Undirected Graphs

Approximating the graph diameter is a basic task of both theoretical and practical interest. A simple folklore algorithm can output a 2-approximation to the diameter in linear time by running BFS from an arbitrary vertex. It has been open whether a better approximation is possible in near-linear time.
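
That folklore algorithm fits in a few lines: by the triangle inequality, for any vertex w, ecc(w) ≤ D ≤ 2·ecc(w). A sketch:

    from collections import deque

    def diameter_2_approx(adj, w=0):
        """BFS from an arbitrary vertex w of a connected undirected graph;
        its eccentricity satisfies ecc(w) <= D <= 2*ecc(w)."""
        dist, q = {w: 0}, deque([w])
        while q:
            u = q.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    q.append(v)
        return max(dist.values())   # ecc(w): a 2-approximation of D

    path = {i: [j for j in (i - 1, i + 1) if 0 <= j < 10] for i in range(10)}
    print(diameter_2_approx(path, w=5))   # 5; the true diameter is 9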

A series of papers on fine-grained complexity have led to strong hardness results for diameter, culminating in our recent tradeoff curve in the upcoming FOCS'21, joint with Ray Li and Mina Dalirrooyfard, showing that under the Strong Exponential Time Hypothesis (SETH), for any integer k≥2 and δ>0, a (2−1/k−δ)-approximation for diameter in m-edge graphs requires m^{1+1/(k−1)−o(1)} time. In particular, the simple linear-time 2-approximation algorithm is optimal.

In this talk I will give an overview of the known algorithms and fine-grained lower bounds for diameter, including the SETH-based optimality of the 2-approximation algorithm.

11/19/2021
Minshen Zhu, Purdue University
Exponential Lower Bounds for Locally Decodable and Correctable Codes for Insertions and Deletions * Event Begins at 11:00 AM*

Locally Decodable Codes (LDCs) are error-correcting codes for which individual message symbols can be quickly recovered despite errors in the codeword. LDCs for Hamming errors have been studied extensively in the past few decades, where a major goal is to understand the amount of redundancy that is necessary and sufficient to decode from large amounts of error, with small query complexity. Despite exciting progress, we still don't have satisfactory answers in several important parameter regimes. For example, in the case of 3-query LDCs, the gap between existing constructions and lower bounds is superpolynomial in the message length.

In this work we study LDCs for insertion and deletion errors, called Insdel LDCs. Their study was initiated by Ostrovsky and Paskin-Cherniavsky (Information Theoretic Security, 2015), who gave a reduction from Hamming LDCs to Insdel LDCs with a small blowup in the code parameters. On the other hand, the only known lower bounds for Insdel LDCs come from those for Hamming LDCs, and thus there is no separation between them.

Here we prove new, strong lower bounds for the existence of Insdel LDCs. In particular, we show that 2-query linear Insdel LDCs do not exist, and give an exponential lower bound for the length of all q-query Insdel LDCs with constant q. For q ≥ 3 our bounds are exponential in the existing lower bounds for Hamming LDCs. Furthermore, our exponential lower bounds continue to hold for adaptive decoders, and even in private-key settings where the encoder and decoder share secret randomness. This exhibits a strict separation between Hamming LDCs and Insdel LDCs. Our strong lower bounds also hold for the related notion of Insdel LCCs (except in the private-key setting), via an analogous reduction from Hamming LCCs to Insdel LCCs. Our techniques are based on a delicate design and analysis of hard distributions of insertion and deletion errors, which depart significantly from typical techniques used in analyzing Hamming LDCs.

12/03/2021
Shai Evra, Hebrew University of Jerusalem
Locally Testable Codes with Constant Rate, Distance, and Locality

A locally testable code (LTC) is an error correcting code that has a property-tester. The tester reads q bits that are randomly chosen, and rejects words with probability proportional to their distance from the code. The parameter q is called the locality of the tester.
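
The canonical (but low-rate) example is the Hadamard code with the 3-query BLR linearity test; the rejection probability scales with the word's distance from the code. A toy sketch (illustrative; the c3 constructions below are far more sophisticated):

    import random

    def blr_test(f, n, trials=2_000, rng=random.Random(0)):
        """BLR linearity test with q = 3 queries per trial: f passes iff
        f(x) ^ f(y) == f(x ^ y) for random x, y in {0,1}^n (as bitmasks).
        Returns the empirical rejection rate."""
        rejections = 0
        for _ in range(trials):
            x, y = rng.getrandbits(n), rng.getrandbits(n)
            rejections += f(x) ^ f(y) != f(x ^ y)
        return rejections / trials

    parity = lambda z: bin(z).count("1") & 1           # linear: always passes
    noisy = lambda z: parity(z) ^ (z % 7 == 0)         # corrupted on ~14% of inputs
    print(blr_test(parity, 16), blr_test(noisy, 16))   # 0.0 vs roughly 0.3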

LTCs were initially studied as important components of PCPs, and since then the topic has evolved on its own. High rate LTCs could be useful in practice: before attempting to decode a received word, one can save time by first quickly testing if it is close to the code.

An outstanding open question has been whether there exist "c3-LTCs", namely LTCs with *c*onstant rate, *c*onstant distance, and *c*onstant locality.

In this work we construct such codes based on a new two-dimensional complex which we call a left-right Cayley complex. This is essentially a graph which, in addition to vertices and edges, also has squares. Our codes can be viewed as a two-dimensional version of (the one-dimensional) expander codes, where the codewords are functions on the squares rather than on the edges.