Share

Publications

Publications

CMAP Theses  are available by following this link:
Discover CMAP theses

Listed below, are sorted by year, the publications appearing in the HAL open archive.

2023

  • Is interpolation benign for random forest regression?
    • Arnould Ludovic
    • Boyer Claire
    • Scornet Erwan
    , 2023. Statistical wisdom suggests that very complex models, interpolating training data, will be poor at predicting unseen examples. Yet, this aphorism has been recently challenged by the identification of benign overfitting regimes, specially studied in the case of parametric models: generalization capabilities may be preserved despite model high complexity. While it is widely known that fully-grown decision trees interpolate and, in turn, have bad predictive performances, the same behavior is yet to be analyzed for Random Forests (RF). In this paper, we study the trade-off between interpolation and consistency for several types of RF algorithms. Theoretically, we prove that interpolation regimes and consistency cannot be achieved simultaneously for several non-adaptive RF. Since adaptivity seems to be the cornerstone to bring together interpolation and consistency, we study interpolating Median RF which are proved to be consistent in the interpolating regime. This is the first result conciliating interpolation and consistency for RF, highlighting that the averaging effect introduced by feature randomization is a key mechanism, sufficient to ensure the consistency in the interpolation regime and beyond. Numerical experiments show that Breiman's RF are consistent while exactly interpolating, when no bootstrap step is involved. We theoretically control the size of the interpolation area, which converges fast enough to zero, giving a necessary condition for exact interpolation and consistency to occur in conjunction.
  • Individual and population approaches for calibrating division rates in population dynamics: Application to the bacterial cell cycle
    • Doumic Marie
    • Hoffmann Marc
    , 2023, 40, pp.1-81. Modelling, analysing and inferring triggering mechanisms in population reproduction is fundamental in many biological applications. It is also an active and growing research domain in mathematical biology. In this chapter, we review the main results developed over the last decade for the estimation of the division rate in growing and dividing populations in a steady environment. These methods combine tools borrowed from PDE's and stochastic processes, with a certain view that emerges from mathematical statistics. A focus on the application to the bacterial cell division cycle provides a concrete presentation, and may help the reader to identify major new challenges in the field. (10.1142/9789811266140_0001)
    DOI : 10.1142/9789811266140_0001
  • Scaling limits of bisexual Galton-Watson processes
    • Bansaye Vincent
    • Caballero Maria-Emilia
    • Méléard Sylvie
    • San Martín Jaime
    Stochastics: An International Journal of Probability and Stochastic Processes, Taylor & Francis: STM, Behavioural Science and Public Health Titles, 2023, 95 (5), pp.749-784. Bisexual Galton-Watson processes are discrete Markov chains where reproduction events are due to mating of males and females. Owing to this interaction, the standard branching property of Galton-Watson processes is lost. We prove tightness for conveniently rescaled bisexual Galton-Watson processes, based on recent techniques developed by Bansaye, Caballero and Méléard. We also identify the possible limits of these rescaled processes as solutions of a stochastic system, coupling two equations through singular coefficients in Poisson terms added to square roots as coefficients of Brownian motions. Under some additional integrability assumptions, pathwise uniqueness of this limiting system of stochastic differential equations and convergence of the rescaled processes are obtained. Two examples corresponding to mutual fidelity are considered. (10.1080/17442508.2022.2123706)
    DOI : 10.1080/17442508.2022.2123706
  • Accuracy assessment of an internal resistance model of Li-ion batteries in immersion cooling configuration
    • Solai Elie
    • Beaugendre Heloise
    • Bieder Ulrich
    • Congedo Pietro Marco
    Applied Thermal Engineering, Elsevier, 2023, 220 (119656). Internal resistance is a critical parameter of the thermal behavior of Li-ion battery cells. This paper proposes an innovative way to deal with the uncertainties related to this physical parameter using experimental data and numerical simulation. First, a CFD model is validated against an experimental configuration representing the behavior of heated Li-ion battery cells under constant discharging current conditions. Secondly, an Uncertainty Quantification based methodology is proposed to represent the internal resistance and its inherent uncertainties. Thanks to an accurate and fast to compute surrogate model, the impact of those uncertainties on the temperature evolution of Li-ion cells is quantified. Finally, Bayesian inference of the internal resistance model parameters using experimental measurements is performed, reducing the prediction uncertainty by almost 95% for some temperatures of interest. Finally, an enhanced internal model is constructed by considering the state of charge and temperature dependency on internal resistance. This model is implemented in the CFD code and used to model a full discharge of the Li-ion batteries. The resulting temperature evolution computed with the two different resistance models is compared for the low state of charge situations. (10.1016/j.applthermaleng.2022.119656)
    DOI : 10.1016/j.applthermaleng.2022.119656
  • Goal-oriented sensitivity analysis of hyperparameters in deep learning
    • Novello Paul
    • Poëtte Gaël
    • Lugato David
    • Congedo Pietro Marco
    Journal of Scientific Computing, Springer Verlag, 2023, 94 (3), pp.45. Tackling new machine learning problems with neural networks always means optimizing numerous hyperparameters that define their structure and strongly impact their performances. In this work, we study the use of goal-oriented sensitivity analysis, based on the Hilbert-Schmidt Independence Criterion (HSIC), for hyperparameter analysis and optimization. Hyperparameters live in spaces that are often complex and awkward. They can be of different natures (categorical, discrete, boolean, continuous), interact, and have inter-dependencies. All this makes it non-trivial to perform classical sensitivity analysis. We alleviate these difficulties to obtain a robust analysis index that is able to quantify hyperparameters’ relative impact on a neural network’s final error. This valuable tool allows us to better understand hyperparameters and to make hyperparameter optimization more interpretable. We illustrate the benefits of this knowledge in the context of hyperparameter optimization and derive an HSIC-based optimization algorithm that we apply on MNIST and Cifar, classical machine learning data sets, but also on the approximation of Runge function and Bateman equations solution, of interest for scientific machine learning. This method yields competitive and cost-effective neural networks. (10.1007/s10915-022-02083-4)
    DOI : 10.1007/s10915-022-02083-4
  • Observation of light thermalization to negative-temperature Rayleigh-Jeans equilibrium states in multimode optical fibers
    • Baudin K.
    • Garnier J.
    • Fusaro A.
    • Berti N.
    • Michel C.
    • Krupa K.
    • Millot G.
    • Picozzi Antonio
    Physical Review Letters, American Physical Society, 2023, 130 (6), pp.063801. Although the temperature of a thermodynamic system is usually believed to be a positive quantity, under particular conditions, negative temperature equilibrium states are also possible. Negative temperature equilibriums have been observed with spin systems, cold atoms in optical lattices and two-dimensional quantum superfluids. Here we report the observation of Rayleigh-Jeans thermalization of light waves to negative temperature equilibrium states. The optical wave relaxes to the equilibrium state through its propagation in a multimode optical fiber, i.e., in a conservative Hamiltonian system. The bounded energy spectrum of the optical fiber enables negative temperature equilibriums with high energy levels (high order fiber modes) more populated than low energy levels (low order modes). Our experiments show that negative temperature speckle beams are featured, in average, by a non-monotonous radial intensity profile. The experimental results are in quantitative agreement with the Rayleigh-Jeans theory without free parameters. Bringing negative temperatures to the field of optics opens the door to the investigation of fundamental issues of negative temperature states in a flexible experimental environment. (10.1103/PhysRevLett.130.063801)
    DOI : 10.1103/PhysRevLett.130.063801
  • Numerical shape optimization among convex sets
    • Bogosel Beniamin
    Applied Mathematics and Optimization, Springer Verlag (Germany), 2023, 87 (1), pp.1. This article proposes a new discrete framework for approximating solutions to shape optimization problems under convexity constraints. The numerical method, based on the support function or the gauge function, is guaranteed to generate discrete convex shapes and is easily implementable using standard optimization software. The framework can handle various objective functions ranging from geometric quantities to functionals depending on partial differential equations. Width or diameter constraints are handled using the support function. Functionals depending on a convex body and its polar body can be handled using a unified framework. (10.1007/s00245-022-09920-w)
    DOI : 10.1007/s00245-022-09920-w
  • Unbalanced CO-Optimal Transport
    • Tran Quang Huy
    • Janati Hicham
    • Courty Nicolas
    • Flamary Rémi
    • Redko Ievgen
    • Demetci Pinar
    • Singh Ritambhara
    , 2023. Optimal transport (OT) compares probability distributions by computing a meaningful alignment between their samples. CO-optimal transport (COOT) takes this comparison further by inferring an alignment between features as well. While this approach leads to better alignments and generalizes both OT and Gromov-Wasserstein distances, we provide a theoretical result showing that it is sensitive to outliers that are omnipresent in real-world data. This prompts us to propose unbalanced COOT for which we provably show its robustness to noise in the compared datasets. To the best of our knowledge, this is the first such result for OT methods in incomparable spaces. With this result in hand, we provide empirical evidence of this robustness for the challenging tasks of heterogeneous domain adaptation with and without varying proportions of classes and simultaneous alignment of samples and features across single-cell measurements. (10.1609/aaai.v37i8.26193)
    DOI : 10.1609/aaai.v37i8.26193
  • Robust prediction interval estimation for Gaussian processes by cross-validation method
    • Acharki Naoufal
    • Bertoncello Antoine
    • Garnier Josselin
    Computational Statistics and Data Analysis, Elsevier, 2023, 178, pp.107597. Probabilistic regression models typically use the Maximum Likelihood Estimation or Cross-Validation to fit parameters. These methods can give an advantage to the solutions that fit observations on average, but they do not pay attention to the coverage and the width of Prediction Intervals. A robust two-step approach is used to address the problem of adjusting and calibrating Prediction Intervals for Gaussian Processes Regression. First, the covariance hyperparameters are determined by a standard Cross-Validation or Maximum Likelihood Estimation method. A Leave-One-Out Coverage Probability is introduced as a metric to adjust the covariance hyperparameters and assess the optimal type II Coverage Probability to a nominal level. Then a relaxation method is applied to choose the hyperparameters that minimize the Wasserstein distance between the Gaussian distribution with the initial hyperparameters (obtained by Cross-Validation or Maximum Likelihood Estimation) and the proposed Gaussian distribution with the hyperparameters that achieve the desired Coverage Probability. The method gives Prediction Intervals with appropriate coverage probabilities and small widths. (10.1016/j.csda.2022.107597)
    DOI : 10.1016/j.csda.2022.107597
  • Development and Theoretical Analysis of the Algorithms for Optimal Control and Reinforcement Learning
    • Kaledin Maksim
    , 2023. In this PhD dissertation, we address the problems of optimal stopping and learning in Markov decision processes used in reinforcement learning (RL). In the first direction, we derive complexity estimates for the algorithm called Weighted Stochastic Mesh (WSM) and give a new method for comparing the complexity of optimal stopping algorithms with the semi tractability index. We show that WSM is optimal with respect to this criterion when the commonly used regression methods are much mess effectiveFor reinforcement learning, we give a non-asymptotic convergence analysis of a stochastic approximation scheme with two time scales - gradient TD - under assumptions of "martingale increment" noise - buffer replay - and of "Markov noise" (when learning is done along a single run). We obtain upper bounds that are rate-optimal by constructing an error expansion method that provides accurate control of the remainders terms.We also present a new algorithm for variance reduction in policy gradient schemes. The proposed approach is based on minimising an estimator for the empirical variance of the weighted rewards. We establish theoretical and practical gains over the classical actor-critic (A2C) method.
  • Truncation errors and modified equations for the lattice Boltzmann method via the corresponding Finite Difference schemes
    • Bellotti Thomas
    ESAIM: Mathematical Modelling and Numerical Analysis, Société de Mathématiques Appliquées et Industrielles (SMAI) / EDP, 2023, 57, pp.1225–1255. Lattice Boltzmann schemes are efficient numerical methods to solve a broad range of problems under the form of conservation laws. However, they suffer from a chronic lack of clear theoretical foundations. In particular, the consistency analysis and the derivation of the modified equations are still open issues. This has prevented, until today, to have an analogous of the Lax equivalence theorem for Lattice Boltzmann schemes. We propose a rigorous consistency study and the derivation of the modified equations for any lattice Boltzmann scheme under acoustic and diffusive scalings. This is done by passing from a kinetic (lattice Boltzmann) to a macroscopic (Finite Difference) point of view at a fully discrete level in order to eliminate the non-conserved moments relaxing away from the equilibrium. We rewrite the lattice Boltzmann scheme as a multi-step Finite Difference scheme on the conserved variables, as introduced in our previous contribution. We then perform the usual analyses for Finite Difference by exploiting its precise characterization using matrices of Finite Difference operators. Though we present the derivation of the modified equations until second-order under acoustic scaling, we provide all the elements to extend it to higher orders, since the kinetic-macroscopic connection is conducted at the fully discrete level. Finally, we show that our strategy yields, in a more rigorous setting, the same results as previous works in the literature. (10.1051/m2an/2023008)
    DOI : 10.1051/m2an/2023008
  • STATISTICAL INFERENCE FOR ROUGH VOLATILITY: MINIMAX THEORY
    • Chong Carsten
    • Hoffmann Marc
    • Liu Yanghui
    • Szymanski Grégoire
    • Rosenbaum Mathieu
    , 2023. Rough volatility models have gained considerable interest in the quantitative finance community in recent years. In this paradigm, the volatility of the asset price is driven by a fractional Brownian motion with a small value for the Hurst parameter H. In this work, we provide a rigorous statistical analysis of these models. To do so, we establish minimax lower bounds for parameter estimation and design procedures based on wavelets attaining them. We notably obtain an optimal speed of convergence of n −1/(4H+2) for estimating H based on n sampled data, extending results known only for the easier case H > 1/2 so far. We therefore establish that the parameters of rough volatility models can be inferred with optimal accuracy in all regimes.
  • Automated Market Makers: Mean-Variance Analysis of LPs Payoffs and Design of Pricing Functions
    • Bergault Philippe
    • Bertucci Louis
    • Bouba David
    • Guéant Olivier
    , 2023. We analyze the performance of Liquidity Providers (LPs) providing liquidity to different types of Automated Market Makers (AMMs). This analysis is carried out using a mean / standard deviation viewpoint \`a la Markowitz, though based on the PnL of LPs compared to that of agents holding coins outside of AMMs. We show that LPs tend to perform poorly in a wide variety of CFMMs under realistic market conditions. We then explore an alternative AMM design in which an oracle feeds the current market exchange rate to the AMM which then quotes a bid/ask spread. This allows us to define an efficient frontier for the performance of LPs in an idealized world with perfect information and to show that the smart use of oracles greatly improves LPs' risk / return profile, even in the case of a lagged oracle.
  • Optimal incentives in a limit order book: a SPDE control approach
    • Baldacci Bastien
    • Bergault Philippe
    , 2023. With the fragmentation of electronic markets, exchanges are now competing in order to attract trading activity on their platform. Consequently, they developed several regulatory tools to control liquidity provision / consumption on their liquidity pool. In this paper, we study the problem of an exchange using incentives in order to increase market liquidity. We model the limit order book as the solution of a stochastic partial differential equation (SPDE) as in [12]. The incentives proposed to the market participants are functions of the time and the distance of their limit order to the mid-price. We formulate the control problem of the exchange who wishes to modify the shape of the order book by increasing the volume at specific limits. Due to the particular nature of the SPDE control problem, we are able to characterize the solution with a classic Feynman-Kac representation theorem. Moreover, when studying the asymptotic behavior of the solution, a specific penalty function enables the exchange to obtain closed-form incentives at each limit of the order book. We study numerically the form of the incentives and their impact on the shape of the order book, and analyze the sensitivity of the incentives to the market parameters.
  • A Formal Disproof of Hirsch Conjecture
    • Allamigeon Xavier
    • Canu Quentin
    • Strub Pierre-Yves
    , 2023, pp.17-29. The purpose of this paper is the formal verification of a counterexample of Santos et al. to the so-called Hirsch Conjecture on the diameter of polytopes (bounded convex polyhedra). In contrast with the pen-and-paper proof, our approach is entirely computational: we have implemented in Coq and proved correct an algorithm that explicitly computes, within the proof assistant, vertex-edge graphs of polytopes as well as their diameter. The originality of this certificate-based algorithm is to achieve a tradeoff between simplicity and efficiency. Simplicity is crucial in obtaining the proof of correctness of the algorithm. This proof splits into the correctness of an abstract algorithm stated over proof-oriented data types and the correspondence with a low-level implementation over computation-oriented data types. A special effort has been made to reduce the algorithm to a small sequence of elementary operations (e.g. matrix multiplications, basic routines on graphs), in order to make the derivation of the correctness of the low-level implementation more transparent. Efficiency allows us to scale up to polytopes with a challenging combinatorics. For instance, we formally check the two counterexamples to Hirsch conjecture due to Matschke, Santos and Weibel, respectively 20- and 23-dimensional polytopes with 36425 and 73224 vertices involving rational coefficients with up to 20 digits. We also illustrate the performance of the method by computing the list of vertices or the diameter of well-known classes of polytopes, such as (polars of) cyclic polytopes involved in McMullen's Upper Bound Theorem. (10.1145/3573105.3575678)
    DOI : 10.1145/3573105.3575678
  • State and parameter learning with PARIS particle Gibbs
    • Cardoso Gabriel Victorino
    • Janati Yazid
    • Le Corff Sylvain
    • Moulines Éric
    • Olsson Jimmy
    , 2023. Non-linear state-space models, also known as general hidden Markov models, are ubiquitous in statistical machine learning, being the most classical generative models for serial data and sequences in general. The particle-based, rapid incremental smoother (PARIS) is a sequential Monte Carlo (SMC) technique allowing for efficient online approximation of expectations of additive functionals under the smoothing distribution in these models. Such expectations appear naturally in several learning contexts, such as likelihood estimation (MLE) and Markov score climbing (MSC). PARIS has linear computational complexity, limited memory requirements and comes with non-asymptotic bounds, convergence results and stability guarantees. Still, being based on selfnormalised importance sampling, the PARIS estimator is biased. Our first contribution is to design a novel additive smoothing algorithm, the Parisian particle Gibbs (PPG) sampler, which can be viewed as a PARIS algorithm driven by conditional SMC moves, resulting in bias-reduced estimates of the targeted quantities. We substantiate the PPG algorithm with theoretical results, including new bounds on bias and variance as well as deviation inequalities. Our second contribution is to apply PPG in a learning framework, covering MLE and MSC as special examples. In this context, we establish, under standard assumptions, non-asymptotic bounds highlighting the value of bias reduction and the implicit Rao-Blackwellization of PPG. These are the first non-asymptotic results of this kind in this setting. We illustrate our theoretical results with numerical experiments supporting our claims. (10.48550/arXiv.2301.00900)
    DOI : 10.48550/arXiv.2301.00900
  • Non-asymptotic convergence bounds for Sinkhorn iterates and their gradients: a coupling approach
    • Greco Giacomo
    • Noble Maxence
    • Conforti Giovanni
    • Durmus Alain
    , 2023. Computational optimal transport (OT) has recently emerged as a powerful framework with applications in various fields. In this paper we focus on a relaxation of the original OT problem, the entropic OT problem, which allows to implement efficient and practical algorithmic solutions, even in high dimensional settings. This formulation, also known as the Schrödinger Bridge problem, notably connects with Stochastic Optimal Control (SOC) and can be solved with the popular Sinkhorn algorithm. In the case of discrete-state spaces, this algorithm is known to have exponential convergence; however, achieving a similar rate of convergence in a more general setting is still an active area of research. In this work, we analyze the convergence of the Sinkhorn algorithm for probability measures defined on the d-dimensional torus , that admit densities with respect to the Haar measure. In particular, we prove pointwise exponential convergence of Sinkhorn iterates and their gradient. Our proof relies on the connection between these iterates and the evolution along the Hamilton-Jacobi-Bellman equations of value functions obtained from SOC-problems. Our approach is novel in that it is purely probabilistic and relies on coupling by reflection techniques for controlled diffusions on the torus.
  • Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters
    • Noble Maxence
    • de Bortoli Valentin
    • Doucet Arnaud
    • Durmus Alain
    , 2023. Multi-marginal Optimal Transport (mOT), a generalization of OT, aims at minimizing the integral of a cost function with respect to a distribution with some prescribed marginals. In this paper, we consider an entropic version of mOT with a tree-structured quadratic cost, i.e., a function that can be written as a sum of pairwise cost functions between the nodes of a tree. To address this problem, we develop Tree-based Diffusion Schrödinger Bridge (TreeDSB), an extension of the Diffusion Schrödinger Bridge (DSB) algorithm. TreeDSB corresponds to a dynamic and continuous state-space counterpart of the multi-marginal Sinkhorn algorithm. A notable use case of our methodology is to compute Wasserstein barycenters which can be recast as the solution of a mOT problem on a star-shaped tree. We demonstrate that our methodology can be applied in high-dimensional settings such as image interpolation and Bayesian fusion.
  • An optimal control-based numerical method for scalar transmission problems with sign-changing coefficients
    • Ciarlet Patrick
    • Lassounon David
    • Rihani Mahran
    SIAM Journal on Numerical Analysis, Society for Industrial and Applied Mathematics, 2023, 61 (3), pp.1316-1339. In this work, we present a new numerical method for solving the scalar transmission problem with sign-changing coefficients. In electromagnetism, such a transmission problem can occur if the domain of interest is made of a classical dielectric material and a metal or a metamaterial, with for instance an electric permittivity that is strictly negative in the metal or metamaterial. The method is based on an optimal control reformulation of the problem. Contrary to other existing approaches, the convergence of this method is proved without any restrictive condition. In particular, no condition is imposed on the a priori regularity of the solution to the problem, and no condition is imposed on the meshes, other than that they fit with the interface between the two media. Our results are illustrated by some (2D) numerical experiments. (10.1137/22M1495998)
    DOI : 10.1137/22M1495998
  • Design patterns of hierarchies for order structures
    • Allamigeon Xavier
    • Canu Quentin
    • Cohen Cyril
    • Sakaguchi Kazuhiko
    • Strub Pierre-Yves
    , 2023. Using order structures in a proof assistant naturally raises the problem of working with multiple instances of a same structure over a common type of elements. This goes against the main design pattern of hierarchies used for instance in Coq's MathComp or Lean's mathlib libraries, where types are canonically associated to at most one instance and instances share a common overloaded syntax. We present new design patterns to leverage these issues, and apply them to the formalization of order structures in the MathComp library. A common idea in these patterns is underloading, i.e., a disambiguation of operators on a common type. In addition, our design patterns include a way to deal with duality in order structures in a convenient way. We hence formalize a large hierarchy which includes partial orders, semilattices, lattices as well as many variants. We finally pay a special attention to order substructures. We introduce a new kind of structure called prelattice. They are abstractions of semilattices, and allow us to deal with finite lattices and their sublattices within a common signature. As an application, we report on significant simplifications of the formalization of the face lattices of polyhedra in the Coq-Polyhedra library.
  • Error estimates for finite differences approximations of the total variation
    • Caillaud Corentin
    • Chambolle Antonin
    IMA Journal of Numerical Analysis, Oxford University Press (OUP), 2023, 43 (2), pp.692--736. We present a convergence rate analysis of the Rudin-Osher-Fatemi (ROF) denoising problem for two different discretizations of the total variation. The first discretization is the well-known isotropic total variation that suffers from a blurring effect in a special diagonal direction. We prove that in the setting corresponding to this direction, the discrete ROF energy converges to the continuous one in O(h^2/3). The second total variation is based on Raviart-Thomas fields and achieves a O(h) convergence rate for the same quantity under some standard hypotheses. (10.1093/imanum/drac001)
    DOI : 10.1093/imanum/drac001
  • The nonlocal isoperimetric problem for polygons: Hardy-Littlewood and Riesz inequalities
    • Bogosel Beniamin
    • Bucur Dorin
    • Fragalà Ilaria
    Mathematische Annalen, Springer Verlag, 2023. Given a non-increasing and radially symmetric kernel in $L ^ 1 _{\rm loc} (\Bbb{R} ^ 2 ; \Bbb{R} _+)$, we investigate counterparts of the classical Hardy-Littlewood and Riesz inequalities when the class of admissible domains is the family of polygons with given area and $N$ sides. The latter corresponds to study the polygonal isoperimetric problem in nonlocal version. We prove that, for every $N \geq 3$, the regular $N$-gon is optimal for Hardy-Littlewood inequality. Things go differently for Riesz inequality: while for $N = 3$ and $N = 4$ it is known that the regular triangle and the square are optimal, for $N\geq 5$ we prove that symmetry or symmetry breaking may occur (i.e.\ the regular $N$-gon may be optimal or not), depending on the value of $N$ and on the choice of the kernel. (10.1007/s00208-023-02683-x)
    DOI : 10.1007/s00208-023-02683-x
  • A Robust Low-Voltage-Ride-Through Strategy for Grid-Forming Converters Based on Reactive Power Synchronization
    • Deng Han
    • Qi Yang
    • Fang Jingyang
    • Tang Yi
    • Debusschere Vincent
    IEEE Transactions on Power Electronics, Institute of Electrical and Electronics Engineers, 2023, 38 (1), pp.346--357. (10.1109/TPEL.2022.3204912)
    DOI : 10.1109/TPEL.2022.3204912
  • Time reversal of spinal processes for linear and non-linear branching processes near stationarity
    • Henry Benoît
    • Méléard Sylvie
    • Chi Tran Viet
    Electronic Journal of Probability, Institute of Mathematical Statistics (IMS), 2023, 28 (none). We consider a stochastic individual-based population model with competition, trait-structure affecting reproduction and survival, and changing environment. The changes of traits are described by jump processes, and the dynamics can be approximated in large population by a non-linear PDE with a non-local mutation operator. Using the fact that this PDE admits a non-trivial stationary solution, we can approximate the non-linear stochastic population process by a linear birth-death process where the interactions are frozen, as long as the population remains close to this equilibrium. This allows us to derive, when the population is large, the equation satisfied by the ancestral lineage of an individual uniformly sampled at a fixed time $T$, which is the path constituted of the traits of the ancestors of this individual in past times $t\leq T$. This process is a time inhomogeneous Markov process, but we show that the time reversal of this process possesses a very simple structure (e.g. time-homogeneous and independent of $T$). This extends recent results where the authors studied a similar model with a Laplacian operator but where the methods essentially relied on the Gaussian nature of the mutations. (10.1214/23-EJP911)
    DOI : 10.1214/23-EJP911
  • Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning
    • Dieuleveut Aymeric
    • Fort Gersende
    • Moulines Eric
    • Wai Hoi-To
    IEEE Transactions on Signal Processing, Institute of Electrical and Electronics Engineers, 2023, 71, pp.3117-3148. Stochastic approximation (SA) is a classical algorithm that has had since the early days a huge impact on signal processing, and nowadays on machine learning, due to the necessity to deal with a large amount of data observed with uncertainties. An exemplar special case of SA pertains to the popular stochastic (sub)gradient algorithm which is the working horse behind many important applications. A lesser-known fact is that the SA scheme also extends to non-stochastic-gradient algorithms such as compressed stochastic gradient, stochastic expectation-maximization, and a number of reinforcement learning algorithms. The aim of this article is to overview and introduce the non-stochastic-gradient perspectives of SA to the signal processing and machine learning audiences through presenting a design guideline of SA algorithms backed by theories. Our central theme is to propose a general framework that unifies existing theories of SA, including its non-asymptotic and asymptotic convergence results, and demonstrate their applications on popular non-stochastic-gradient algorithms. We build our analysis framework based on classes of Lyapunov functions that satisfy a variety of mild conditions. We draw connections between non-stochastic-gradient algorithms and scenarios when the Lyapunov function is smooth, convex, or strongly convex. Using the said framework, we illustrate the convergence properties of the non-stochastic-gradient algorithms using concrete examples. Extensions to the emerging variance reduction techniques for improved sample complexity will also be discussed. (10.1109/TSP.2023.3301121)
    DOI : 10.1109/TSP.2023.3301121