Professor
My research mostly involves aspects of group theory and representation theory in various contexts. I work on mathematical aspects of AI, with focus on geometric deep learning. I also work on automorphic forms and their Fourier coefficients, with motivation from the connection to scattering amplitudes in QFT and string theory. Some of my current research projects include:
Below I list my publications and talks.
I have also written a book on automorphic forms.
In this paper, we develop a manifestly geometric framework for equivariant manifold neural ordinary differential equations (NODEs) and use it to analyse their modelling capabilities for symmetric data. First, we consider the action of a Lie group G on a smooth manifold M and establish the equivalence between equivariance of vector fields, symmetries of the corresponding Cauchy problems, and equivariance of the associated NODEs. We also propose a novel formulation, based on Lie theory for symmetries of differential equations, of the equivariant manifold NODEs in terms of the differential invariants of the action of G on M, which provides an efficient parameterisation of the space of equivariant vector fields in a way that is agnostic to both the manifold M and the symmetry group G. Second, we construct augmented manifold NODEs, through embeddings into flows on the tangent bundle TM, and show that they are universal approximators of diffeomorphisms on any connected M. Furthermore, we show that universality persists in the equivariant case and that the augmented equivariant manifold NODEs can be incorporated into the geometric framework using higher-order differential invariants. Finally, we consider the induced action of G on different fields on M and show how it can be used to generalise previous work, on, e.g., continuous normalizing flows, to equivariant models in any geometry.
Discrete symmetry groups arise in string theory in many contexts and several inter- esting physical quantities are defined on spaces that carry an action of these groups. Scattering amplitudes in particular have to be invariant under the group action and are often associated with automorphic forms. It has been noted first by Green, Miller and Vanhove as well as Pioline that these automorphic forms belong to very special automorphic representations, so-called small representations, leading to a rich interplay between mathematics and physics. This development and some open questions are sketched in this contribution.
We survey the mathematical foundations of geometric deep learning, focusing on group equivariant and gauge equivariant neural networks. We develop gauge equivariant convolutional neural networks on arbitrary manifolds \(\mathcal{M}\) using principal bundles with structure group K and equivariant maps between sections of associated vector bundles. We also discuss group equivariant neural networks for homogeneous spaces \(\mathcal {M}=G/K\), which are instead equivariant with respect to the global symmetry \(G\) on \(\mathcal {M}\). Group equivariant layers can be interpreted as intertwiners between induced representations of \(G\), and we show their relation to gauge equivariant convolutional layers. We analyze several applications of this formalism, including semantic segmentation and object detection networks. We also discuss the case of spherical networks in great detail, corresponding to the case \(\mathcal {M}=S^2=\textrm{SO}(3)/\textrm{SO}(2)\). Here we emphasize the use of Fourier analysis involving Wigner matrices, spherical harmonics and Clebsch-Gordan coefficients for \(G=\textrm{SO}(3)\), illustrating the power of representation theory for deep learning.
High-resolution wide-angle fisheye images are becoming more and more important for robotics applications such as autonomous driving. However, using ordinary convolutional neural networks or vision transformers on this data is problematic due to projection and distortion losses introduced when projecting to a rectangular grid on the plane. We introduce the HEAL-SWIN transformer, which combines the highly uniform Hierarchical Equal Area iso-Latitude Pixelation (HEALPix) grid used in astrophysics and cosmology with the Hierarchical Shifted-Window (SWIN) transformer to yield an efficient and flexible model capable of training on high-resolution, distortion-free spherical data. In HEAL-SWIN, the nested structure of the HEALPix grid is used to perform the patching and windowing operations of the SWIN transformer, resulting in a one-dimensional representation of the spherical data with minimal computational overhead. We demonstrate the superior performance of our model for semantic segmentation and depth regression tasks on both synthetic and real automotive datasets. Our code is available at https://github.com/JanEGerken/HEAL-SWIN.
We use Poincaré series for massive Maass-Jacobi forms to define a “massive theta lift”, and apply it to the examples of the constant function and the modular invariant j-function, with the Siegel-Narain theta function as integration kernel. These theta integrals are deformations of known one-loop string threshold corrections. Our massive theta lifts fall off exponentially, so some Rankin-Selberg integrals are finite without Zagier renormalization.
We analyze the role of rotational equivariance in convolutional neural networks (CNNs) applied to spherical images. We compare the performance of the group equivariant networks known as S2CNNs and standard non-equivariant CNNs trained with an increasing amount of data augmentation. The chosen architectures can be considered baseline references for the respective design paradigms. Our models are trained and evaluated on single or multiple items from the MNIST- or FashionMNIST dataset projected onto the sphere. For the task of image classification, which is inherently rotationally invariant, we find that by considerably increasing the amount of data augmentation and the size of the networks, it is possible for the standard CNNs to reach at least the same performance as the equivariant network. In contrast, for the inherently equivariant task of semantic segmentation, the non-equivariant networks are consistently outperformed by the equivariant networks with significantly fewer parameters. We also analyze and compare the inference latency and training times of the different networks, enabling detailed tradeoff considerations between equivariant architectures and data augmentation for practical problems.
We study some special features of F24, the holomorphic c = 12 superconformal field theory (SCFT) given by 24 chiral free fermions. We construct eight different Lie superalgebras of “physical” states of a chiral superstring compactified on F24, and we prove that they all have the structure of Borcherds-Kac-Moody superalgebras. This produces a family of new examples of such superalgebras. The models depend on the choice of an = 1 supercurrent on F24, with the admissible choices labeled by the semisimple Lie algebras of dimension 24. We also discuss how F24, with any such choice of supercurrent, can be obtained via orbifolding from another distinguished c = 12 holomorphic SCFT, the = 1 supersymmetric version of the chiral CFT based on the E8 lattice.
We discuss a set of heterotic and type II string theory compactifications to dimensions that are characterized by factorized internal worldsheet CFTs of the form , where are self-dual (super) vertex operator algebras. In the cases with spacetime supersymmetry, we show that the BPS states form a module for a Borcherds–Kac–Moody (BKM) (super)algebra, and we prove that for each model the BKM (super)algebra is a symmetry of genus zero BPS string amplitudes. We compute the supersymmetric indices of these models using both Hamiltonian and path integral formalisms. The path integrals are manifestly automorphic forms closely related to the Borcherds–Weyl–Kac denominator. Along the way, we comment on various subtleties inherent to these low-dimensional string compactifications.
We consider supergravity in five-dimensional Anti-De Sitter space AdS5 with minimal supersymmetry, encoded by a Sasaki-Einstein metric on a five-dimensional compact manifold M. Our main result reveals how the Sasaki-Einstein metric emerges from a canonical state in the dual CFT, defined by a superconformal gauge theory in four dimensional Minkowski space ℝ3,1in the t’Hooft limit where the rank N tends to infinity. We obtain explicit finite N−approximations to the Sasaki-Einstein metric, expressed in terms of a canonical (i.e. background free) BPS-state on the gauge theory side. We also provide a string theory interpretation of the BPS-state in question, which sheds new light on the previously noted intriguing duality of giant gravitons.
We consider a special class of unipotent periods for automorphic forms on a finite cover of a reductive adelic group , which we refer to as Fourier coefficients associated to the data of a ‘Whittaker pair’. We describe a quasi-order on Fourier coefficients, and an algorithm that gives an explicit formula for any coefficient in terms of integrals and sums involving higher coefficients. The maximal elements for the quasi-order are ‘Levi-distinguished’ Fourier coefficients, which correspond to taking the constant term along the unipotent radical of a parabolic subgroup, and then further taking a Fourier coefficient with respect to a -distinguished nilpotent orbit in the Levi quotient. Thus one can express any Fourier coefficient, including the form itself, in terms of higher Levi-distinguished coefficients. In companion papers we use this result to determine explicit Fourier expansions of minimal and next-to-minimal automorphic forms on split simply-laced reductive groups, and to obtain Euler product decompositions of certain Fourier coefficients.
In this paper we analyze Fourier coefficients of automorphic forms on a finite cover G of an adelic split simply-laced group. Let π be a minimal or next-to-minimal automorphic representation of G. We prove that any η∈π is completely determined by its Whittaker coefficients with respect to (possibly degenerate) characters of the unipotent radical of a fixed Borel subgroup, analogously to the Piatetski-Shapiro–Shalika formula for cusp forms on GLn. We also derive explicit formulas expressing the form, as well as all its maximal parabolic Fourier coefficient in terms of these Whittaker coefficients. A consequence of our results is the non-existence of cusp forms in the minimal and next-to-minimal automorphic spectrum. We provide detailed examples for G of type D5 and E8 with a view towards applications to scattering amplitudes in string theory.
We study the question of Eulerianity (factorizability) for Fourier coefficients of automorphic forms, and we prove a general transfer theorem that allows one to deduce the Eulerianity of certain coefficients from that of another coefficient. We also establish a ‘hidden’ invariance property of Fourier coefficients. We apply these results to minimal and next-to-minimal automorphic representations, and deduce Eulerianity for a large class of Fourier and Fourier–Jacobi coefficients. In particular, we prove Eulerianity for parabolic Fourier coefficients with characters of maximal rank for a class of Eisenstein series in minimal and next-to-minimal representations of groups of ADE-type that are of interest in string theory.
We show that Fourier coefficients of automorphic forms attached to minimal or next-to-minimal automorphic representations of are completely determined by certain highly degenerate Whittaker coefficients. We give an explicit formula for the Fourier expansion, analogously to the Piatetski-Shapiro–Shalika formula. In addition, we derive expressions for Fourier coefficients associated to all maximal parabolic subgroups. These results have potential applications for scattering amplitudes in string theory.
We provide an introduction to the theory of Eisenstein series and automorphic forms on real simple Lie groups G, emphasising the role of representation theory. It is useful to take a slightly wider view and define all objects over the (rational) adeles A, thereby also paving the way for connections to number theory, representation theory and the Langlands program. Most of the results we present are already scattered throughout the mathematics literature but our exposition collects them together and is driven by examples. Many interesting aspects of these functions are hidden in their Fourier coefficients with respect to unipotent subgroups and a large part of our focus is to explain and derive general theorems on these Fourier expansions. Specifically, we give complete proofs of the Langlands constant term formula for Eisenstein series on adelic groups G(A) as well as the Casselman–Shalika formula for the p-adic spherical Whittaker function associated to unramified automorphic representations of G(Q_p). In addition, we explain how the classical theory of Hecke operators fits into the modern theory of automorphic representations of adelic groups, thereby providing a connection with some key elements in the Langlands program, such as the Langlands dual group LG and automorphic L-functions. Somewhat surprisingly, all these results have natural interpretations as encoding physical effects in string theory. We therefore also introduce some basic concepts of string theory, aimed toward mathematicians, emphasising the role of automorphic forms. In particular, we provide a detailed treatment of supersymmetry constraints on string amplitudes which enforce differential equations of the same type that are satisfied by automorphic forms. Our treatise concludes with a detailed list of interesting open questions and pointers to additional topics which go beyond the scope of this book.
We define a very general class of CHL-models associated with any string theory (bosonic or supersymmetric) compactified on an internal CFT C x T^d. We take the orbifold by a pair (g,\delta), where g is a (possibly non-geometric) symmetry of C and \delta is a translation along T^d. We analyze the T-dualities of these models and show that in general they contain Atkin-Lehner type symmetries. This generalizes our previous work on N=4 CHL-models based on heterotic string theory on T^6 or type II on K3 x T^2, as well as the `monstrous’ CHL-models based on a compactification of heterotic string theory on the Frenkel-Lepowsky-Meurman CFT V^{\natural}.
In this note, we expand on some technical issues raised in \cite{PPV} by the authors, as well as providing a friendly introduction to and summary of our previous work. We construct a set of heterotic string compactifications to 0+1 dimensions intimately related to the Monstrous moonshine module of Frenkel, Lepowsky, and Meurman (and orbifolds thereof). Using this model, we review our physical interpretation of the genus zero property of Monstrous moonshine. Furthermore, we show that the space of (second-quantized) BPS-states forms a module over the Monstrous Lie algebras 𝔪g—some of the first and most prominent examples of Generalized Kac-Moody algebras—constructed by Borcherds and Carnahan. In particular, we clarify the structure of the module present in the second-quantized string theory. We also sketch a proof of our methods in the language of vertex operator algebras, for the interested mathematician.
We investigate Fourier coefficients of automorphic forms on split simply-laced Lie groups G. We show that for automorphic representations of small Gelfand-Kirillov dimension the Fourier coefficients are completely determined by certain degenerate Whittaker vectors on G. Although we expect our results to hold for arbitrary simply-laced groups, we give complete proofs only for G=SL(3) and G=SL(4). This is based on a method of Ginzburg that associates Fourier coefficients of automorphic forms with nilpotent orbits of G. Our results complement and extend recent results of Miller and Sahi. We also use our formalism to calculate various local (real and p-adic) spherical vectors of minimal representations of the exceptional groups E_6, E_7, E_8 using global (adelic) degenerate Whittaker vectors, correctly reproducing existing results for such spherical vectors obtained by very different methods.
We provide a physics derivation of Monstrous moonshine. We show that the McKay-Thompson series Tg, g∈𝕄, can be interpreted as supersymmetric indices counting spacetime BPS-states in certain heterotic string models. The invariance groups of these series arise naturally as spacetime T-duality groups and their genus zero property descends from the behaviour of these heterotic models in suitable decompactification limits. We also show that the space of BPS-states forms a module for the Monstrous Lie algebras 𝔪g, constructed by Borcherds and Carnahan. We argue that 𝔪g arise in the heterotic models as algebras of spontaneously broken gauge symmetries, whose generators are in exact correspondence with BPS-states. This gives 𝔪g an interpretation as a kind of BPS-algebra.
We consider four dimensional CHL models with sixteen spacetime supersymmetries obtained from orbifolds of type IIA superstring on K3 x T^2 by a Z_N symmetry acting (possibly) non-geometrically on K3. We show that most of these models (in particular, for geometric symmetries) are self-dual under a weak-strong duality acting on the heterotic axio-dilaton modulus S by a “Fricke involution” S –> -1/NS. This is a novel symmetry of CHL models that lies outside of the standard SL(2,Z)-symmetry of the parent theory, heterotic strings on T^6. For self-dual models this implies that the lattice of purely electric charges is N-modular, i.e. isometric to its dual up to a rescaling of its quadratic form by N. We verify this prediction by determining the lattices of electric and magnetic charges in all relevant examples. We also calculate certain BPS-saturated couplings and verify that they are invariant under the Fricke S-duality. For CHL models that are not self-dual, the strong coupling limit is dual to type IIA compactified on T^6/Z_N, for some Z_N-symmetry preserving half of the spacetime supersymmetries.
We study the second quantized version of the twisted twining genera of generalized Mathieu moonshine, and prove that they give rise to Siegel modular forms with infinite product representations. Most of these forms are expected to have an interpretation as twisted partition functions counting 1/4 BPS dyons in type II superstring theory on K3\times T^2 or in heterotic CHL-models. We show that all these Siegel modular forms, independently of their possible physical interpretation, satisfy an “S-duality” transformation and a “wall-crossing formula”. The latter reproduces all the eta-products of an older version of generalized Mathieu moonshine proposed by Mason in the ’90s. Surprisingly, some of the Siegel modular forms we find coincide with the multiplicative (Borcherds) lifts of Jacobi forms in umbral moonshine.
Motivated by string theory scattering amplitudes that are invariant under a discrete U-duality, we study Fourier coefficients of Eisenstein series on Kac-Moody groups. In particular, we analyse the Eisenstein series on E9(R), E10(R) and E11(R) corresponding to certain degenerate principal series at the values s=3/2 and s=5/2 that were studied in 1204.3043. We show that these Eisenstein series have very simple Fourier coefficients as expected for their role as supersymmetric contributions to the higher derivative couplings R4 and ∂4R4 coming from 1/2-BPS and 1/4-BPS instantons, respectively. This suggests that there exist minimal and next-to-minimal unipotent automorphic representations of the associated Kac-Moody groups to which these special Eisenstein series are attached. We also provide complete explicit expressions for degenerate Whittaker vectors of minimal Eisenstein series on E6(R), E7(R) and E8(R) that have not appeared in the literature before.
Generalised moonshine is reviewed from the point of view of holomorphic orbifolds, putting special emphasis on the role of the third cohomology group H^3(G, U(1)) in characterising consistent constructions. These ideas are then applied to the case of Mathieu moonshine, i.e. the recently discovered connection between the largest Mathieu group M_24 and the elliptic genus of K3. In particular, we find a complete list of twisted twining genera whose modular properties are controlled by a class in H^3(M_24, U(1)), as expected from general orbifold considerations.
The Mathieu twisted twining genera, i.e. the analogues of Norton’s generalised Moonshine functions, are constructed for the elliptic genus of K3. It is shown that they satisfy the expected consistency conditions, and that their behaviour under modular transformations is controlled by a 3-cocycle in H^3(M_24,U(1)), just as for the case of holomorphic orbifolds. This suggests that a holomorphic VOA may be underlying Mathieu Moonshine.
The hypermultiplet moduli space \(M_H\) in type II string theories compactified on a Calabi-Yau threefold \(X\) is largely constrained by supersymmetry (which demands quaternion-Kählerity), S-duality (which requires an isometric action of \(SL(2, Z)\)) and regularity. Mathematically, \(M_H\) ought to encode all generalized Donaldson-Thomas invariants on \(X\) consistently with wall-crossing, modularity and homological mirror symmetry. We review recent progress towards computing the exact metric on \(M_H\), or rather the exact complex contact structure on its twistor space.
When formulated in twistor space, the D-instanton corrected hypermultiplet moduli space in N=2 string vacua and the Coulomb branch of rigid N=2 gauge theories on R3×S1 are strikingly similar and, to a large extent, dictated by consistency with wall-crossing. We elucidate this similarity by showing that these two spaces are related under a general duality between, on one hand, quaternion-Kahler manifolds with a quaternionic isometry and, on the other hand, hyperkahler manifolds with a rotational isometry, further equipped with a hyperholomorphic circle bundle with a connection. We show that the transition functions of the hyperholomorphic circle bundle relevant for the hypermultiplet moduli space are given by the Rogers dilogarithm function, and that consistency across walls of marginal stability is ensured by the motivic wall-crossing formula of Kontsevich and Soibelman. We illustrate the construction on some simple examples of wall-crossing related to cluster algebras for rank 2 Dynkin quivers. In an appendix we also provide a detailed discussion on the general relation between wall-crossing and the theory of cluster algebras.
We continue our study of algebraic properties of N=4 topological amplitudes in heterotic string theory compactified on T^2, initiated in arXiv:1102.1821. In this work we evaluate a particular one-loop amplitude for any enhanced gauge group h \subset e_8 + e_8, i.e. for arbitrary choice of Wilson line moduli. We show that a certain analytic part of the result has an infinite product representation, where the product is taken over the positive roots of a Lorentzian Kac-Moody algebra g^{++}. The latter is obtained through double extension of the complement g= (e_8 + e_8)/h. The infinite product is automorphic with respect to a finite index subgroup of the full T-duality group SO(2,18;Z) and, through the philosophy of Borcherds-Gritsenko-Nikulin, this defines the denominator formula of a generalized Kac-Moody algebra G(g^{++}), which is an ‘automorphic correction’ of g^{++}. We explicitly give the root multiplicities of G(g^{++}) for a number of examples.
The perturbative spectrum of BPS-states in the E_8 x E_8 heterotic string theory compactified on T^2 is analysed. We show that the space of BPS-states forms a representation of a certain Borcherds algebra G which we construct explicitly using an auxiliary conformal field theory. The denominator formula of an extension G_{ext} \supset G of this algebra is then found to appear in a certain heterotic one-loop N=4 topological string amplitude. Our construction thus gives an N=4 realisation of the idea envisioned by Harvey and Moore, namely that the `algebra of BPS-states’ controls the threshold corrections in the heterotic string.
We survey recent results on quantum corrections to the hypermultiplet moduli space M in type IIA/B string theory on a compact Calabi-Yau threefold X, or, equivalently, the vector multiplet moduli space in type IIB/A on X x S^1. Our main focus lies on the problem of resumming the infinite series of D-brane and NS5-brane instantons, using the mathematical machinery of automorphic forms. We review the proposal that whenever the low-energy theory in D=3 exhibits an arithmetic “U-duality” symmetry G(Z) the total instanton partition function arises from a certain unitary automorphic representation of G, whose Fourier coefficients reproduce the BPS-degeneracies. For D=4, N=2 theories on R^3 x S^1 we argue that the relevant automorphic representation falls in the quaternionic discrete series of G, and that the partition function can be realized as a holomorphic section on the twistor space Z over M. We also offer some comments on the close relation with N=2 wall crossing formulae.