Skip to main content
Cornell University

arXiv submission will be down for maintenance beginning 14:00 EDT Tuesday June 30th. The site should otherwise remain in operation.

Learn about arXiv becoming an independent nonprofit.
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > math.OC

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Optimization and Control

  • New submissions
  • Cross-lists
  • Replacements

See recent articles

Showing new listings for Monday, 29 June 2026

Total of 41 entries
Showing up to 2000 entries per page: fewer | more | all

New submissions (showing 13 of 13 entries)

[1] arXiv:2606.27563 [pdf, html, other]
Title: On strongly quasiconvex pseudomonotone equilibrium problems in Hadamard spaces
Luisa Marie Després, Nicholas Pischke
Comments: 33 pages
Subjects: Optimization and Control (math.OC)

We study two proximal point type methods for finding equilibrium points of pseudomonotone and strongly quasiconvex bifunctions. Extending results by A. Iusem and F. Lara, we prove the strong convergence of these methods over general complete geodesic metric spaces of nonpositive curvature, so-called Hadamard spaces. Our arguments are quite elementary and in particular effective, yielding sublinear non-asymptotic guarantees for the distance of the iterates towards the solution. These quantitative results are novel even in the context of Euclidean spaces, the original setting of the work by Iusem and Lara, and the simplicity of our arguments allows us to either weaken or even fully discharge various assumptions featuring in this previous work. We also provide an existence result for solutions of equilibrium problems generated by suitably semicontinuous, pseudomonotone and strongly quasiconvex bifunctions over general Hadamard spaces, and derive from this that every lower-semicontinuous strongly quasiconvex function over a Hadamard space has a minimizer, answering a question of the second author.

[2] arXiv:2606.27602 [pdf, html, other]
Title: Uncertainty-aware Power System Planning via Gradient Descent
Mehrnoush Ghazanfariharandi, Robert Mieth
Subjects: Optimization and Control (math.OC)

Power system planning models provide important guidance on long-term investment strategies with significant socio-economic impact. To remain computationally manageable, however, such planning models compromise on the level of complexity with which power system operations and physics are captured. A common approach in most planning models is to collapse multi-stage power system operational processes into a single stage and, as a result, give up on the ability to account for uncertainty in each operational stage. In light of newly emerging load patterns and the continuing adoption of weather-dependent stochastic renewable generation, this uncertainty, however, becomes increasingly impactful on operations, and ignoring it has been shown to cause underinvestment in transmission capacity and flexible resources. In this work, we present a computational approach for power system expansion planning that explicitly considers two-stage day-ahead (DA) and real-time (RT) operational decisions under uncertainty while retaining time-coupling constraints to allow modeling generator ramping and energy storage. To solve the resulting optimization problem efficiently, we employ a projected stochastic gradient descent algorithm combined with a primal-dual optimization framework and an exponential moving average smoothing strategy to improve convergence stability. We evaluate the resulting investment decisions within a two-stage DA and RT simulation framework and compare them with a classic expansion planning model that assumes perfect knowledge of renewable generation. Our experiments show that the proposed framework achieves lower total system costs while ensuring that the implemented technology portfolio achieves set renewable integration targets.

[3] arXiv:2606.27610 [pdf, html, other]
Title: Policy Gradient Learning for Distributionally Robust Markov Decision Processes under Wasserstein Ambiguity
Yadh Hafsi, Samy Mekkaoui, Huyên Pham, Kaixin Yan
Subjects: Optimization and Control (math.OC)

We study finite-horizon Markov Decision Processes (MDPs) under distributional uncertainty in the transition kernels and develop a policy-gradient framework for Wasserstein distributionally robust control. Ambiguity is modeled by state-action dependent Wasserstein balls around nominal transition kernels, leading to a max-min control problem over randomized policies and admissible transition laws. Since the worst-case transition law depends implicitly on the policy parameters, the usual policy-gradient argument does not apply. We address this difficulty by using a Wasserstein dual reformulation of the robust Bellman recursion and analyzing its directional differentiability. This yields an explicit recursive characterization of the robust policy gradient. Building on this characterization, we propose a robust actor-critic algorithm and illustrate its behavior on discrete and continuous benchmark examples.

[4] arXiv:2606.27640 [pdf, other]
Title: Multidimensional quadratic BSDEs with weak interactions and their applications in mean-field games of controls
Ulrich Horst, Emil Schmidek, Huilin Zhang
Subjects: Optimization and Control (math.OC); Probability (math.PR)

The well-posedness of multidimensional quadratic backward stochastic differential equations (qBSDEs) remains one of the central open problems in BSDE theory. Motivated by a mean-field utility maximization model with price impact, we introduce a new class of multidimensional qBSDEs that lies beyond the scope of existing well-posedness results (see (2.15) for the generator). In order to study the limit as the number of players tends to infinity, we establish existence and uniqueness for a large class of such qBSDEs under suitable smallness conditions imposed on each individual dynamics. A key feature of our approach is that the smallness condition is independent of the dimension of the BSDE system. In particular, the system itself is not confined to a small neighborhood, which allows us to analyze the mean-field limit of the underlying utility maximization problem. Such a condition is also natural in view of the well-known fact that general multidimensional qBSDEs may fail to admit solutions in the absence of suitable smallness assumptions. In addition, we derive a stability result for this class of equations based on the application of Picard iterations. Finally, using this stability result, we establish quantitative convergence rates toward the corresponding mean-field equilibria in two settings: Nash and Radner equilibria.

[5] arXiv:2606.27722 [pdf, html, other]
Title: A Backstepping Framework for Unconstrained Accelerated Optimization Algorithms
Song Chen, Jiaxu Liu, Chao Xu
Subjects: Optimization and Control (math.OC)

This paper introduces a control-theoretic perspective on unconstrained optimization algorithms using the backstepping methods. We model the optimization process as an augmented strict-feedback system given by $\dot{x}_1 = x_2$, $\dot{x}_2 = u$, and $\dot{z} = q(x_1,z)$, with a regulated output $y = \nabla f(x_1)$. This formulation recasts the development of unconstrained optimization algorithms as a feedback control problem, where the goal is to design the input $u$ to ensure $y(t) \to 0$. By employing backstepping, we recursively synthesize the actual feedback law $u$ after initially selecting a virtual control for $x_1$. For convex objective functions, we develop a general synthesis framework for augmented strict-feedback systems and specialize it to the standard strict-feedback case. This unified framework successfully recovers the constant-parameter Nesterov flow and the proportional-integral-derivative (PID) accelerated optimizer as direct corollaries. We further establish that, given a fixed virtual control, the universal second-step law is inverse optimal with respect to an induced outer-tracking problem. This reveals that the optimality of the control law is conditionally dependent on the target manifold prescribed by the virtual control, rather than holding globally across all possible backstepping designs. Finally, we formulate a formal optimal-backstepping theorem that elevates this optimality principle to the virtual-control stage by solving a reduced Hamilton--Jacobi--Bellman problem. These contributions collectively yield a robust and general backstepping-driven paradigm for the analysis and design of continuous-time unconstrained optimization algorithms.

[6] arXiv:2606.28010 [pdf, html, other]
Title: A primal-dual splitting algorithm for monotone inclusions with applications
Changchi Huang, Jigen Peng, Liqian Qin, Yuchao Tang
Comments: 43 pages
Subjects: Optimization and Control (math.OC)

In this paper, we study a broad class of structured monotone inclusion problems in real Hilbert spaces. We propose a novel primal-dual splitting algorithm for solving such inclusions, which accommodates multiple monotone operators and cocoercive terms, as well as a composite monotone operator involving the linear map. The algorithm combines forward evaluations for the cocoercive components with backward resolvent steps for the monotone operators and employs a dual update for the linear composition term. It generalizes and unifies several existing methods, while requiring only a single resolvent or operator evaluation per iteration. We prove weak convergence of the iterates under standard assumptions on monotonicity and cocoercivity. Furthermore, we establish strong convergence under a mild regularity condition, such as uniform monotonicity. Numerical experiments on image deblurring and denoising problems demonstrate the efficiency and flexibility of the proposed algorithm.

[7] arXiv:2606.28106 [pdf, other]
Title: Fractional quadratic obstructions to the local controllability of the Burgers equation
Thomas Perrin
Subjects: Optimization and Control (math.OC); Analysis of PDEs (math.AP)

We study the local controllability near zero of the Burgers equation with a scalar control and a fixed space-dependent source profile, in the case where the linearized system fails to be controllable and a second-order analysis is therefore required. We prove that quadratic obstructions to finite-time controllability can be quantified by Sobolev norms of the control with fractional negative exponents ranging over a full interval. To our knowledge, this is the first example, for a natural physical PDE, of a continuous scale of fractional quadratic obstructions, and the first such continuous scale for finite-time local controllability. Our explicit constructions shed light on the origin of fractional obstructions for partial differential equations, by relating the obstruction exponent to the regularity, and in some cases to the physical-space singularity, of the source profile.
We identify the natural structural conditions on the source profile leading to obstructions quantified by the $H^{-1}$ and $H^{-5/4}$ norms of the control, thereby providing a general framework in which the previously studied case of a constant source profile fits naturally. In this constant-profile case, we improve existing results by identifying the arithmetic condition on the Fourier mode which ensures that a small-time obstruction actually persists in finite time.
Finally, we derive sharp nonlinear remainder estimates adapted to the precise regularity of the source profile. These estimates make most of our obstruction results optimal with respect to the smallness assumption imposed on the control.

[8] arXiv:2606.28124 [pdf, html, other]
Title: Reservoir Zero-Coordinatewise Projected Subspace Search for Minimization Over Sparse Symmetric Sets in Machine Learning
Morteza Kimiaei, Shima Shabani, Michael Breuss
Comments: 42 Pages, 6 figures
Subjects: Optimization and Control (math.OC)

We study a class of nonconvex cardinality-constrained optimization problems arising in sparse learning. These problems are NP-hard due to the combinatorial nature of sparsity constraints. We introduce a Reservoir Zero-Coordinatewise Projected Subspace Search (RZCW-PSS) algorithm, a simplex-style method on sparse manifolds that integrates coordinatewise search, symmetry-aware swap-based support updates, randomized low-dimensional subspace exploration, and zero-coordinatewise reservoir injection. The proposed method augments classical coordinate and swap moves with sparse-compatible subspace searches constructed from a dynamically maintained reservoir of previously accepted feasible points. A key feature of the approach is a refined reservoir initialization strategy that embeds sparse projection directly into a uniform sampling procedure, preserving geometric diversity within the feasible set. The algorithm also includes an optional support-identification safeguard that enforces full-support stabilization under a fixed support-change decrease threshold. We establish that, under the stated regularity, sampling, and subproblem-accuracy assumptions, every full-support accumulation point of the RZCW-PSS iterates is Beck--Hallak zero-coordinatewise stationary almost surely; with the safeguard and full-support initialization, this conclusion applies to all accumulation points. We further prove a conditional local linear convergence rate after support stabilization and derive the corresponding logarithmic local iteration complexity. Numerical experiments on synthetic sparse learning problems demonstrate that RZCW-PSS improves robustness and solution quality while remaining computationally competitive with Partial Simplex Search, Basic Feasible Search, and Zero-Coordinatewise Search methods.

[9] arXiv:2606.28207 [pdf, html, other]
Title: Three-Body Earth-Moon Transfers with Different Departure/Arrival Orbital Altitudes: New Phenomenon and Diffusion Model-Augmented Construction
Shuyue Fu, Wenxuan Zhang, Di Wu, Shengping Gong, Peng Shi
Subjects: Optimization and Control (math.OC)

Construction of Earth-Moon transfers is the basis of missions to explore the Moon and cislunar space. The traditional grid search method suffers from a relatively low convergence rate and computational efficiency, mainly focusing on the distribution of transfer characteristic parameters. Moreover, when constructing transfers with different departure/arrival orbital altitudes, the process of grid search and trajectory correction should be repeated with a low convergence rate and computational efficiency. To address these limitations of the traditional grid search method, this paper is devoted to exploring an effective way to augment the grid search method. Bi-impulsive Earth-Moon transfers from a circular Earth parking orbit to a circular Moon target orbit in the Earth-Moon planar circular restricted three-body problem are considered in this paper. Firstly, the transfers are constructed, and the corresponding solution space is explored in terms of construction parameters, including departure phase angle at the Earth parking orbit, initial-to-circular velocity ratio, and time of flight. An interesting phenomenon about the discontinuous behavior of the time-of-flight distribution with respect to departure phase angle is identified. This phenomenon is further used to train a diffusion model, which aims to augment the traditional grid search method and generate high-quality initial guesses for transfers with different departure/arrival orbital altitudes. The construction results of the proposed method are presented and analyzed. The proposed diffusion model-augmented grid search method improves the convergence rate by 47.34-56.25% and saves the wall-clock time by 39.39-40.52% over the traditional grid search method relatively, while ensuring comparable transfer characteristics.

[10] arXiv:2606.28230 [pdf, html, other]
Title: A Fletcher's Augmented Lagrangian-Based Stochastic First-Order Method for Nonconvex Equality-Constrained Optimization
Yawen Cui, Qiankun Shi, Xiao Wang, Xiantao Xiao
Subjects: Optimization and Control (math.OC)

In this paper, we study nonconvex equality-constrained optimization problems in which only stochastic first-order approximations of the objective and constraint functions are available. Owing to the stochasticity in both objective and constraints, most existing stochastic first-order methods incur relatively high oracle complexity, particularly in terms of stochastic constraint function evaluations. To address this issue, we develop a stochastic first-order method based on a decomposed stochastic search direction, and employ Fletcher's augmented Lagrangian as a smooth merit function for step-size selection. To cope with the possible loss of uniform nondegeneracy of the stochastic Jacobian, we introduce an event decomposition based on the smallest singular value, which enables us to control perturbations in the stochastic search direction. Under an additional Lipschitz continuity assumption on the second-order derivatives of the objective and constraint functions, we show that the proposed algorithm attains a stochastic \(\epsilon\)-KKT point with an expected total oracle complexity of \(\mathcal O(\epsilon^{-3})\) in terms of stochastic gradient and stochastic constraint function evaluations. Finally, we present numerical experiments to demonstrate the performance of the proposed method.

[11] arXiv:2606.28254 [pdf, html, other]
Title: Graphon Mean Field Game of mutual holding
Daorong Cui, Shuoqing Deng, Yang Xiang
Subjects: Optimization and Control (math.OC)

This paper studies the mean field game of mutual holding proposed by Djete and Touzi(AAP, 2024), and consider the case where the interactions among agents are described by a graphon. We adopt the formulation on the enlarged space which is modeled using the joint law of the value process and the graphon label, as in Lacker and Soret(MOR, 2023). Under suitable conditions on the graphon function, we are able to provide the explicit characterization of the optimal strategy, prove the wellposedness of associated Mckean-Vlasov SDE and establish the convergence results of the Nash equilibria. The key technique consists in a detailed analysis of the continuity property under the $\mathcal{WOP}_2$ metric, and tailor-made arguments for different graphon equilibria under different regularities of the model.

[12] arXiv:2606.28278 [pdf, html, other]
Title: Sharp First-Order Lower Bounds under Sublevel $α$-Polyak-Lojasiewicz Conditions
Saeed Masiha, Negar Kiyavash, Patrick Thiran
Subjects: Optimization and Control (math.OC)

We study the optimal complexity of first-order methods under the $\alpha$-Polyak-Lojasiewicz condition with $\alpha\in[1,2)$. This condition bounds the suboptimality gap by a power $\alpha$ of the gradient norm; $\alpha=2$ recovers the classical Polyak-Lojasiewicz condition, $\alpha=1$ corresponds to a Holder error-bound regime, and intermediate exponents arise near degenerate minima in local Kurdyka-Lojasiewicz geometry. We first prove a structural obstruction: if global smoothness and a global $\alpha$-Polyak-Lojasiewicz inequality are imposed on $\mathbb{R}^d$, then every such function is constant for $\alpha<2$. This motivates the globally smooth, sublevel-$\alpha$-Polyak-Lojasiewicz class, where the inequality is required only on the initial sublevel set.
On this class, we prove sharp minimax lower bounds for first-order methods. In the deterministic oracle model, any first-order method requires $\Omega(L\tau^{2/\alpha}\epsilon^{-(2-\alpha)/\alpha})$ queries to reach accuracy $\epsilon$, matching gradient descent. In the bounded-variance stochastic-gradient oracle model, any stochastic first-order method requires $\Omega(L\sigma^2\tau^{4/\alpha}\epsilon^{-(4-\alpha)/\alpha})$ queries in the noise-dominated regime, matching known SGD upper rates under trajectory-containment assumptions.

[13] arXiv:2606.28307 [pdf, html, other]
Title: Second-Order KKT Guarantees for Bregman ADMM in Nonconvex and Non-Lipschitz Optimization
Shuang Li, Zhihui Zhu, Qiuwei Li
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)

We analyze Bregman ADMM for nonconvex linearly constrained problems under two-sided relative smoothness, a condition that replaces the standard Lipschitz gradient assumption with a Hessian comparison relative to a Bregman kernel. This setting covers polynomial objectives arising in matrix and tensor models for which a global Lipschitz-gradient constant need not exist. We show that on an invariant open state-space domain, one iteration of Bregman ADMM defines a smooth primal--dual fixed-point map whose strict-saddle KKT points are unstable fixed points; consequently, from random initialization the iterates converge to a strict saddle with probability zero. Combined with existing first-order convergence results, this yields almost-sure second-order stationarity of limiting KKT points. We extend the analysis to a multi-block star consensus formulation for distributed optimization. The technical novelty lies in a determinant reduction with a Bregman-specific symmetrization and scaling step in the two block spectral argument, together with a null space cancellation exploiting the star graph structure in the consensus case. Numerical experiments on distributed matrix factorization illustrate the theory, and a symmetric tensor factorization example demonstrates the broader Bregman proximal splitting idea beyond the separable consensus setting.

Cross submissions (showing 8 of 8 entries)

[14] arXiv:2606.27503 (cross-list from math.CO) [pdf, html, other]
Title: Tropical Fermat--Weber Problems over Non-Finite Data and their Inverse Formulations
John Sabol, Ruriko Yoshida
Subjects: Combinatorics (math.CO); Optimization and Control (math.OC)

The term tropical pseudonorm refers to a family of (not necessarily symmetric) gauge functions that arise in tropical or idempotent geometry. An important characteristic of these gauges is their invariance under translation by a constant vector, allowing them to descent naturally to tropical projective spaces. In this work, we explore the tropical one-infinity pseudonorm, a polyhedral hybrid gauge that allows for tunable asymmetry, in the context of a Fermat--Weber location problem. We extend previous formulations in considering non-finite data, and we investigate several variants of the inverse problem, providing linear programming formulations for their solution.

[15] arXiv:2606.27551 (cross-list from math.AG) [pdf, other]
Title: Any-dimensional Positivstellensätze for symmetric functions
Sebastian Debus, Robin Schabert
Comments: 26 pages
Subjects: Algebraic Geometry (math.AG); Functional Analysis (math.FA); Optimization and Control (math.OC)

Positivstellensätze provide certificates of positivity for polynomials. Extending these certificates to symmetric functions, uniformly across all dimensions, presents structural challenges. For instance, the underlying domain is not semialgebraic. In this paper, we prove two Positivstellensätze for symmetric functions that are uniformly bounded below by some $\varepsilon > 0$. These are infinite dimensional analogous of theorems of Pólya and Reznick. The proof relates evaluations of the (truncated) power sum map $(p_2,p_3,\dots)$ to moments of discrete probability measures on the compact interval $[-1,1]$. This yields a characterization of the orbit space of the infinite symmetric group. Finally, we provide an alternative proof of existing Positivstellensätze for normalized symmetric functions.

[16] arXiv:2606.27767 (cross-list from cs.LG) [pdf, html, other]
Title: Difference of Convex Programming in the Wasserstein Space with Applications to MMD Optimization
Clément Bonet, Pierre-Cyril Aubin-Frankowski, Youssef Mroueh
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)

Optimizing functionals over the space of probability measures is now ubiquitous in machine learning. A widely used approach is to perform the optimization directly over the Wasserstein space, but many objective functionals of practical interest are non-convex along Wasserstein geodesics, making the analysis of standard first-order methods challenging. In this work, we study a class of objectives over the Wasserstein space that admit a difference-of-convex (DC) decomposition and we lift the classical convex-concave procedure (CCCP) to this setting. Under smoothness and strong convexity assumptions on the convex components of the decomposition, we prove almost stationarity along the iterates of the resulting algorithm. Our main focus is on the Maximum Mean Discrepancy (MMD) and the Energy Distance (ED) functionals, for which we develop explicit Wasserstein DC decompositions, and establish local convergence of the scheme under mild assumptions. Empirically, we show that well-chosen DC decompositions yield faster and more stable convergence than Wasserstein gradient descent on these MMD objectives.

[17] arXiv:2606.27899 (cross-list from math.ST) [pdf, other]
Title: Optimal Estimators for Heavy-Tailed Mean Estimation via Convex Analysis
Bart P.G. van Parys, Bert Zwart
Subjects: Statistics Theory (math.ST); Optimization and Control (math.OC); Probability (math.PR)

We study optimal estimation of the location parameter of a distribution known only to lie in a symmetric moment class $\mathcal C_0$: the mean-zero distributions with bounded moment $\int\phi\, d\mathbb P\le B$ for a fixed even $\phi$. Our main result concerns the fixed-margin regime, where the error margin $\Delta$ is fixed as $n\to\infty$: we give an exact large-deviation characterization of the smallest worst-case probability $\beta_n(\Delta)$ of an error exceeding $\Delta$ that any measurable estimator can guarantee with $n$ observations. Its exponential rate is exactly a two-point Hellinger exponent over the class shifted to means $\pm\Delta$, $r(\Delta)=-\log\sup_{\mathbb P_{\pm\Delta}\in\mathcal C_{\pm\Delta}}\int\sqrt{d\mathbb P_{-\Delta}\, d\mathbb P_{\Delta}}$, achieved non-asymptotically, $\beta_n(\Delta)\le e^{-nr(\Delta)}$, by a monotone $M$-estimator synthesized from a two-parameter convex program. Lagrangian duality collapses the infinite-dimensional search over estimating functions to two multipliers, which determine a pair of envelopes characterizing the optimal estimating functions; the sandwich shape posited ad hoc in prior constructions emerges naturally. For bounded variance ($\phi(x)=x^2$, $B=\sigma^2$) the exponent is $r(\Delta)=\tfrac12\log(1+\Delta^2/\sigma^2)$. In the fixed-confidence regime, holding $\beta$ fixed and letting the optimal margin $\Delta_n(\beta)$ shrink with $n$, the same synthesis stays optimal to leading order for several concrete classes. As $\beta\downarrow0$ it attains the sharp constant $\sqrt2$ of Catoni for bounded variance and the constant $L(\alpha)$ of Lee and Bhatt et al. for bounded $\alpha$-moments, $\alpha\in(1,2)$, thereby shown tight; for slowly varying $\phi$ it is leading-order minimax at every fixed $\beta$. The least-favorable distributions are simple, supported on at most three atoms.

[18] arXiv:2606.28059 (cross-list from cs.IR) [pdf, html, other]
Title: Fast and Feasible: Permutation-based Constrained Reranking for Revenue Maximization
Svetlana Shirokovskikh, Anastasiia Soboleva, Ekaterina Solodneva, Aleksandr Katrutsa, Roman Loginov, Egor Samosvat
Subjects: Information Retrieval (cs.IR); Optimization and Control (math.OC)

Search and recommender systems have produced highly relevant search results. A natural next step in the development of such systems in e-commerce is to rerank these results to increase the platform's revenue from paid promotion products. However, maximizing revenue alone may degrade the user experience by reducing relevance or increasing fraud risk. To avoid this, we state the reranking problem as an integer linear program ($ILP$) that maximizes revenue subject to per-query constraints on other metrics, e.g., relevance. Since solving $ILP$ exactly for every query is slow for deployment to the online service, we propose a lightweight permutation-based reranking approximation algorithm PermR. At each step, the algorithm selects a pair of neighboring items and swaps them to either improve the objective or repair a violated constraint. We evaluate PermR across multiple categories of a large classified platform in offline and online settings. PermR achieves about 63\% of the ILP revenue improvement, within production latency limits, preserving all constraints. In a 14-day online A/B test over 56 million search queries, PermR increased revenue by $2$\%.

[19] arXiv:2606.28100 (cross-list from cs.GT) [pdf, html, other]
Title: Discrete Event Population Updates: finding game theoretic emergent behaviour in queueing systems with simulation
Vincent Knight, Geraint I. Palmer-Liyu, Thomas Hutton
Subjects: Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC); Probability (math.PR); Populations and Evolution (q-bio.PE)

Strategic behaviour in queueing systems has been studied extensively in the behavioural queueing literature, but almost exclusively for systems that admit closed-form expressions for the cost or utility experienced by a strategic user. Evolutionary game theory offers a mature framework for analysing populations whose individual payoffs depend on the composition of the population itself, and would in principle apply to a much wider class of queueing systems; its application has, however, been constrained by the same closed-form requirement. We introduce Discrete Event Population Updates (DEPU), a general algorithmic framework that couples a single long run of a discrete event simulation (DES) directly to an evolutionary population update rule, removing that constraint. We present two implementations: Discrete Event Replicator Dynamics (DERD), which follows an Euler discretisation of the replicator dynamics equation, and Discrete Event Moran Replacement (DEMR), which maintains a finite population updated via Moran-style copying events. Both are applied to a multi-server jockeying model for which no closed-form fitness expressions are available. On the jockeying model considered, DEPU reaches comparable precision tens of times faster than the standard practice of nesting short simulations inside an outer evolutionary loop, and because each operating point then costs only a single simulation run it also makes systematic parameter sweeps tractable. This brings the toolkit of evolutionary dynamics within reach of any system a modeller can build in a discrete event simulator.

[20] arXiv:2606.28123 (cross-list from cs.LG) [pdf, html, other]
Title: Dangerous Liaisons of Convex Learning and Non-Affine Aggregation
Thomas Boudou, Batiste Le Bars, Nirupam Gupta, Aurélien Bellet
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)

Last-iterate convergence and generalization guarantees in first-order convex learning hinge on the monotonicity of the update operator. While linear averaging preserves the monotonicity of gradient updates, this property is often violated when gradients are aggregated non-affinely, as in modern pipelines enforcing constraints like adaptivity, privacy, robustness or fairness. Whether it is possible to design non-affine aggregation rules that maintain monotonicity has remained an open question. We answer this question negatively: we prove that the monotonicity of aggregated gradients is preserved if and only if the aggregation rule is positively affine. Consequently, non-affine aggregation prevents steady convergence and substantially degrade algorithmic stability. We quantify these drawbacks and propose a path forward by identifying sufficient conditions under which monotonicity can be restored. Our results provide a unified theoretical framework explaining the disparate failure modes observed in modern learning systems.

[21] arXiv:2606.28312 (cross-list from q-fin.MF) [pdf, html, other]
Title: Optimal Deployment of Electric Aircraft for Canadian Domestic Flights
Elham Soufiani, Mehrdad Pirnia
Comments: 6 pages, 4 images, Presented at the IEEE ITEC EATS 2026 (Transportation Electrification Conference and Expo or Electric Aircraft Technologies Symposium) took place from June 10-12, 2026, at the VIBE Credit Union Showplace in Novi, Michigan
Subjects: Mathematical Finance (q-fin.MF); Optimization and Control (math.OC)

This paper presents a multi-period mixed-integer linear programming (MILP) framework for planning the transition from conventional to electric aircraft in regional aviation. The model jointly optimizes fleet acquisition, infrastructure deployment, and service allocation over time, while accounting for policy constraints such as emissions reduction targets, electric service share, and budget limits. A real-world case study based on Helijet's short-haul network in British Columbia demonstrates the applicability of the model. The results show that electrification can reduce emissions by more than 70\% within five years while remaining economically viable. However, the transition is primarily limited by the capacity of the fleet and operational structure, rather than the charging infrastructure, leading to unmet demand under direct aircraft replacement. These findings emphasize the need for coordinated planning across fleet sizing, scheduling, and route prioritization to ensure a practical and efficient transition to electric aviation.

Replacement submissions (showing 20 of 20 entries)

[22] arXiv:2407.06408 (replaced) [pdf, html, other]
Title: Projection, Degeneracy, and Singularity Degree for Spectrahedra
Haesol Im, Woosuk L. Jung, David Torregrosa-Belén, Henry Wolkowicz
Subjects: Optimization and Control (math.OC)

Facial reduction, FR, is a regularization technique for convex programs where the strict feasibility constraint qualification, CQ, this http URL this CQ holds generically, failure is pervasive in applications such as semidefinite relaxations of hard discrete optimization problems. In this paper we relate FR to the analysis of the convergence behaviour of a semismooth Newton root finding method for the projection onto a spectrahedron, i.e., onto the intersection of a linear manifold and the semidefinite cone. We examine the effect of failure of strict feasibility on the projection problem. In the process, we derive an elegant formula for the projection onto a face of the semidefinite cone obtained via regularization and discuss pathologies that arise in the absence of strict feasibility. We show further that the ill-conditioning of the Jacobian of the Newton method near optimality characterizes the degeneracy of the nearest point in the spectrahedron. We apply the results, both theoretically and empirically, to the problem of finding nearest points to the sets of: (i) correlation matrices or the elliptope; and (ii) semidefinite relaxations of permutation matrices or the vontope, i.e., the feasible sets for the semidefinite relaxations of the max-cut and quadratic assignment problems, respectively.

[23] arXiv:2504.05609 (replaced) [pdf, html, other]
Title: Extended SQP Methods in Nonsmooth Difference Programming Applied to Problems with Variational Inequality Constraints
Boris S. Mordukhovich, Yixia Song, Shangzhi Zeng, Jin Zhang
Subjects: Optimization and Control (math.OC)

This paper explores a new class of constrained difference programming problems, where the objective and constraints are formulated as differences of functions, without requiring their convexity. To investigate such problems, novel variants of the extended sequential quadratic method are introduced. These algorithms iteratively solve strongly convex quadratic subproblems constructed via linear approximations of the given data by using their gradients and subgradients. The convergence of the proposed methods is rigorously analyzed by employing, in particular, the Polyak-Łojasiewicz-Kurdyka property that ensures global convergence for various classes of functions in the problem formulation, e.g., semialgebraic ones. The original framework is further extended to address difference programming problems with variational inequality (VI) constraints. By reformulating VI constraints via regularized gap functions, such problems are naturally embedded into constrained difference programming that leads us to direct applications of the proposed algorithms. Numerical experiments for the class of continuous network design problems demonstrate the efficiency of the new methods.

[24] arXiv:2505.00349 (replaced) [pdf, html, other]
Title: Burer-Monteiro factorizability of nuclear norm regularized optimization
Wenqing Ouyang, Ting Kei Pong, Man-Chung Yue
Subjects: Optimization and Control (math.OC)

This paper studies the relationship between the nuclear norm-regularized minimization problem, which minimizes the sum of a $C^2$ function $h$ and a positive multiple of the nuclear norm, denoted by $f$, and its factorized problem obtained by the Burer-Monteiro technique. We are interested in deriving conditions that ensure every second-order stationary point of the factorized problem corresponds to a global minimizer of $f$, a property we call the $r$-factorizability of $f$ in this paper. Under suitable restricted isometry property (RIP) type assumptions on $h$, we prove the $r$-factorizability of $f$. Moreover, the RIP constant in our paper is tight, in the sense that concrete non-$r$-factorizable $f$ can be constructed when the RIP-type assumption fails to hold. Our technique for constructing such examples is novel and may be of independent interest: specifically, we use a variant of the Von Neumann's trace inequality and relate the existence of such examples to the optimal value of a quadratic program involving the RIP constant, then we explicitly solve this optimization problem to identify the parameter regimes in which such worst-case counterexamples can be constructed.

[25] arXiv:2507.09165 (replaced) [pdf, html, other]
Title: Factorization-free Orthogonal Projection onto the Positive Semidefinite Cone with Composite Polynomial Filtering
Shucheng Kang, Haoyu Han, Antoine Groudiev, Heng Yang
Subjects: Optimization and Control (math.OC)

We propose a factorization-free method for orthogonal projection onto the positive semidefinite (PSD) cone, leveraging composite polynomial filtering. Inspired by recent advances in homomorphic encryption, our approach approximates the PSD cone projection operator using a carefully optimized composite polynomial evaluated exclusively via matrix-matrix multiplications. This approach enables efficient GPU implementations with low-precision arithmetic, significantly outperforming the classical PSD cone projection using state-of-the-art GPU-based eigenvalue decomposition solvers. Specifically, our method achieves a consistent relative error of $10^{-3}$ in half-precision arithmetic with only 22 matrix-matrix multiplications, providing roughly a $10\times$ speed-up over NVIDIA's cuSOLVER routines on various large-scale matrices. In single-precision arithmetic with emulation on B200 GPUs, our approach maintains competitive accuracy while achieving up to a $2\times$ speed-up. Consequently, for a $10,000 \times 10,000$ dense symmetric matrix, our method requires approximately $55$ ms in half-precision and $400$ ms in single-precision arithmetic on B200 GPUs. Integration into a first-order semidefinite programming solver confirms that our low-precision projections reliably yield solutions of moderate accuracy.

[26] arXiv:2510.04473 (replaced) [pdf, html, other]
Title: Introduction to Model-Based Derivative-Free Optimization
Lindon Roberts
Subjects: Optimization and Control (math.OC)

The field of derivative-free optimization (DFO) studies algorithms for nonlinear optimization that do not rely on the availability of gradient or Hessian information. It is primarily designed for settings when functions are black-box, expensive to evaluate and/or noisy. A widely used and studied class of DFO methods for local optimization is model-based DFO, where the general principles from derivative-based nonlinear optimization algorithms are followed, but local Taylor-type approximations are replaced with alternative local models constructed by interpolation. This document provides an overview of the basic algorithms and analysis for model-based DFO, covering worst-case complexity, approximation theory for polynomial interpolation models, and extensions to constrained and noisy problems.

[27] arXiv:2510.24550 (replaced) [pdf, html, other]
Title: Sum of Squares Submodularity
Anna Deza, Georgina Hall
Subjects: Optimization and Control (math.OC); Algebraic Geometry (math.AG); Functional Analysis (math.FA)

We introduce the notion of $t$-sum of squares (sos) submodularity, which is a hierarchy, indexed by $t$, of sufficient algebraic conditions for certifying submodularity of set functions. We show that, for fixed $t$, each level of the hierarchy can be verified via a semidefinite program of size polynomial in $n$, the size of the ground set of the set function. This is particularly relevant given existing hardness results around testing whether a set function is submodular (Crama, 1989). We derive several equivalent algebraic characterizations of $t$-sos submodularity and identify submodularity-preserving operations that also preserve $t$-sos submodularity. We further present a complete classification of the cases for which submodularity and $t$-sos submodularity coincide, as well as examples of $t$-sos-submodular functions. We demonstrate the usefulness of $t$-sos submodularity through three applications: (i) a new convex approach to submodular regression, involving minimal manual tuning; (ii) a systematic procedure to derive lower bounds on the submodularity ratio in approximate submodular maximization, and (iii) improved difference-of-submodular decompositions for difference-of-submodular optimization. Overall, our work builds a new bridge between discrete optimization and real algebraic geometry by connecting sum of squares-based algebraic certificates to a fundamental discrete structure, submodularity.

[28] arXiv:2602.10920 (replaced) [pdf, other]
Title: Data assimilation via model reference adaptation for linear and nonlinear dynamical systems
Benedikt Kaltenbach, Christian Aarset, Tram Thi Ngoc Nguyen
Subjects: Optimization and Control (math.OC); Analysis of PDEs (math.AP); Dynamical Systems (math.DS)

We address data assimilation for linear and nonlinear dynamical systems via the so-called model reference adaptive system. Continuing our theoretical developments, we deliver the first practical implementation of this approach for online parameter identification with time series data. Our semi-implicit scheme couples a modified state equation with a parameter evolution law that is driven by model-data residuals. We demonstrate four benchmark problems of increasing complexity: the Darcy flow, the Fisher-KPP equation, a nonlinear potential equation and finally, an Allen-Cahn type equation. Across all cases, explicit model reference adaptive system construction, verified assumptions and numerically stable reconstructions underline our proposed method as a reliable, versatile tool for data assimilation and real-time inversion.

[29] arXiv:2602.16862 (replaced) [pdf, html, other]
Title: Action-Space Entropy Regularization in Bayesian Markowitz
Andy Au
Comments: 23 pages, 1 figure
Subjects: Optimization and Control (math.OC); Portfolio Management (q-fin.PM)

We solve the entropy-regularized mean--variance portfolio problem under Bayesian drift uncertainty. We combine continuous-time Bayesian filtering with stochastic policy optimization; the main finding is negative: the two mechanisms are orthogonal. Posterior dynamics are policy-independent, so entropy regularization cannot accelerate learning about the unknown drift. The mean control is identical to the deterministic Bayesian Markowitz feedback, and entropy enters only through policy variance. On the technical side, the optimal policy is Gaussian, the value function is quadratic in wealth, and the leading belief-dependent coefficient closes in exponential form. The framework recovers both parent models as limiting cases.

[30] arXiv:2603.15606 (replaced) [pdf, html, other]
Title: Saddle Point Evasion via Curvature-Regularized Gradient Dynamics
Liraz Mudrik, Isaac Kaminer, Sean Kragelund, Abram H. Clark
Comments: Published in IEEE Control Systems Letters. 6 pages, 3 figures
Journal-ref: IEEE Control Systems Letters, vol. 10, pp. 625-630, 2026
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)

Nonconvex optimization underlies many modern machine learning and control tasks, where saddle points pose the dominant obstacle to reliable convergence in high-dimensional settings. Escaping these saddle points deterministically using continuous-time optimization remains an open challenge: gradient descent is blind to curvature, stochastic perturbation methods lack deterministic guarantees, and Newton-type approaches suffer from Hessian singularity. Adopting the perspective of viewing optimization algorithms as dynamical systems, we present Curvature-Regularized Gradient Dynamics (CRGD), which augments the objective with a smooth penalty on the negative Hessian eigenvalues, yielding an augmented cost that serves as an optimization Lyapunov function with user-selectable convergence rates to second-order stationary points. Numerical experiments confirm that CRGD converges to second-order stationary points, even in regimes where gradient descent fails.

[31] arXiv:2603.17256 (replaced) [pdf, html, other]
Title: On the sensitivity of the subspace predictor to behavioral perturbations
Dian Jin, Jeremy Coulson
Comments: 6 pages, 2 figures
Subjects: Optimization and Control (math.OC)

Behavioral systems define discrete-time LTI systems in terms of a set of trajectories, which forms a linear subspace. This subspace underlies the subspace predictor used in data-driven prediction and control. In practice, such subspaces are typically represented through data matrices. For robustness certification and uncertainty quantification, however, these matrix representations are coordinate-dependent and therefore do not provide a coordinate-free way to quantify uncertainty. In this work, we derive an explicit prediction error bound in terms of behavioral distance between the true subspace and an estimate, showing that the predictor is locally Lipschitz with respect to behavioral perturbations. We also present a one-step prediction error bound that is relevant for receding-horizon implementations, which becomes computable when combined with existing behavioral-distance certificates. Numerical studies show that our bound is tighter than an existing data-matrix perturbation bound, and remains computable, though more conservative, when combined with an existing behavioral distance certificate.

[32] arXiv:2604.21132 (replaced) [pdf, html, other]
Title: The Method of Ellipcenters for Strongly Convex Functions
Yunier Bello-Cruz
Comments: 15 pages
Subjects: Optimization and Control (math.OC)

The Method of Ellipcenters (ME), introduced in~\cite{ME2025} for strongly convex quadratic minimization, uses two gradient evaluations per iteration: one at the current iterate and one at a companion point on the same level set. We extend ME to the broader class of strongly convex functions with Lipschitz continuous gradient. We prove that ME contracts unconditionally at the linear rate $1-\mu^2/L^2$, and that at every step where the two gradient directions are linearly independent, which, in dimension at least two, is every step generically, it matches the rate of gradient descent with exact line search. In that linearly independent case, a midpoint argument exploiting the level-set symmetry yields a further per-step improvement, which is global when the angle between the two gradients is uniformly bounded away from zero. The same symmetry forces this angle to be obtuse, so the improvement is strictly active at every such step. ME also converges in at most two steps in dimension two. Numerical experiments on regularized logistic regression confirm the theoretical predictions.

[33] arXiv:2605.03400 (replaced) [pdf, html, other]
Title: A Quadratic-Approximation-Based Stochastic Approximation Method for Weakly Convex Stochastic Programming
Yule Zhang, Benqi Liu, Xiantao Xiao, Liwei Zhang
Comments: 40 pages, 8 figures, 1 table
Subjects: Optimization and Control (math.OC)

We propose a novel stochastic approximation algorithm, termed PMQSopt, for solving weakly convex stochastic optimization problems involving expectation-valued functions. The algorithm is constructed by integrating the proximal method of multipliers with quadratic approximations of the original stochastic problem. We analyze the sample complexity of PMQSopt in terms of the total number of stochastic gradient evaluations required. The convergence of the algorithm is characterized by three metrics associated with the $\epsilon$-KKT conditions: the average squared norm of the gradient of the Moreau envelope of the Lagrangian, the average constraint violation, and the average complementarity violation. For each of these metrics, we establish an expected convergence rate of $\mathcal{O}(T^{-1/4})$ after $T$ iterations. Furthermore, we show that with probability at least $1-1/T^{2/3}$, the gradient of the Lagrangian satisfies an $\mathcal{O}(T^{-1/8})$ bound; with probability at least $1-2/T^{2/3}$, the constraint violation achieves an $\mathcal{O}(T^{-1/4})$ bound; and with probability at least $1-3/T^{2/3}$, the complementarity violation attains an $\mathcal{O}(T^{-1/4})$ bound. All results are established under two mild conditions: (i) weak convexity of all problem functions, and (ii) the existence of a strictly feasible point. The proposed PMQSopt algorithm is a sequentially strongly convex programming method that is readily implementable. Numerical experiments illustrate its practical performance.

[34] arXiv:2606.08783 (replaced) [pdf, other]
Title: OptMuon: Closed-Loop Orthogonalized Momentum Methods for Stochastic Optimization with Zero-Noise Optimality
Ganzhao Yuan
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Numerical Analysis (math.NA)

Orthogonalized momentum updates, as used in Muon-style optimizers, have recently shown strong empirical stability in large-scale deep learning. However, most current orthogonalized methods are still paired with fixed, externally scheduled, or otherwise open-loop magnitude rules, so their scale is not directly calibrated from the realized optimization trajectory. Motivated by the closed-loop perspective behind Lipschitz-free and noise-adaptive methods, we propose OptMuon, a family of adaptive momentum orthogonalization methods for stochastic nonconvex optimization. OptMuon combines Muon-style polar-factor directions with a trajectory-dependent AdaGrad-Norm-type coefficient schedule, so that the update magnitude is determined by the observed gradient and momentum history rather than by a prescribed Lipschitz-dependent rule. The schedule does not use the smoothness constant, the variance level, or the bounded-gradient constant in parameter selection, and its running-maximum correction prevents isolated gradient spikes from causing excessive coefficient collapse. Under lower-boundedness, unbiased stochastic gradients with bounded variance, smoothness, and an almost-sure bounded stochastic-gradient condition, we prove two complementary expected-stationarity guarantees. OptMuon-A achieves the noise-adaptive rate \(\tilde{\mathcal O}(T^{-1/2}+\sigma^{1/2}T^{-1/4})\) under average smoothness, while OptMuon-I achieves \(\tilde{\mathcal O}(T^{-1/2}+\sigma^{1/3}T^{-1/3})\) under individual smoothness. In the zero-noise regime, both bounds automatically reduce to a nearly optimal deterministic first-order rate \(\tilde{\mathcal O}(T^{-1/2})\) without manual hyperparameter retuning. These results show that closed-loop scalar adaptation can be combined with Muon-style momentum orthogonalization while retaining noise adaptivity and zero-noise optimality up to logarithmic factors.

[35] arXiv:2606.13987 (replaced) [pdf, html, other]
Title: Proof and More Variations of Bellman's Lost-in-a-forest Problem
Zhipeng Deng
Comments: V2 corrects typos, modifies some wordings in proof, adds literature and more results
Subjects: Optimization and Control (math.OC)

In this paper, based on our previous general formulation and computational solution to Bellman's Lost-in-a-forest Problem, we provide the proof of general solution and obtained more variations and results related to this problem. This paper provides generalized formalized method connecting curve covering, lost-in-the-forest problem, and traveling salesman problem with neighborhoods. We prove the equivalence and convergence. We also provide more results of searching for two lines, connection to Wetzel's unit arc covering problem, variations with closed path, variations in three dimensions, etc. The results include general calculation equations, partial analytical results, and numerical results.

[36] arXiv:2606.23250 (replaced) [pdf, html, other]
Title: Sparse Feedback Implementation for Sender-Receiver Transportation Linear-Quadratic Control
Anders Hansson
Subjects: Optimization and Control (math.OC)

We study a sparse linear-quadratic problem for transportation dynamics. The goal is to compute the optimal control signal without applying the usually dense optimal feedback gain directly. We show that the optimal feedback gain can be factorized as the product of a sparse matrix and the inverse of another sparse matrix from the right. This factorization enables the control signal to be computed with much less computational effort than direct multiplication by the dense gain. The factorization also enables a distributed implementation. The main message is that linear quadratic control need not appear dense when expressed in graph-adapted coordinates, and that intrinsic sparsity can be revealed under the proposed formulation.

[37] arXiv:2501.07400 (replaced) [pdf, html, other]
Title: Derivation of effective gradient flow equations and dynamical truncation of training data in Deep Learning
Thomas Chen
Comments: AMS Latex, 36 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Analysis of PDEs (math.AP); Optimization and Control (math.OC); Machine Learning (stat.ML)

We derive explicit equations governing the cumulative biases and weights in Deep Learning with ReLU activation function, based on gradient descent for the Euclidean loss in the input layer, and under the assumption that the weights are, in a precise sense, adapted to the coordinate system distinguished by the activations. We show that gradient descent corresponds to a dynamical process in the input layer, whereby clusters of data are progressively reduced in complexity ("truncated") at an exponential rate that increases with the number of data points that have already been truncated. We provide a detailed discussion of several types of solutions to the gradient flow equations. A main motivation for this work is to shed light on the interpretability question in supervised learning.

[38] arXiv:2510.21613 (replaced) [pdf, other]
Title: Beyond Smoothed Analysis: Analyzing the Simplex Method by the Book
Eleon Bach, Alexander E. Black, Sophie Huiberts, Sean Kafer
Subjects: Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC)

Narrowing the gap between theory and practice is a longstanding goal of the algorithm analysis community. To further progress our understanding of how algorithms work in practice, we propose a new algorithm analysis framework that we call by the book analysis. In contrast to earlier frameworks, by the book analysis not only models an algorithm's input data, but also the algorithm itself. Results from by the book analysis are meant to correspond well with established knowledge of an algorithm's practical behavior, as they are meant to be grounded in observations from implementations, input modeling best practices, and measurements on practical benchmark instances. We apply our framework to the simplex method, an algorithm which is beloved for its excellent performance in practice and notorious for its high running time under worst-case analysis. The simplex method similarly showcased the state of the art framework smoothed analysis (Spielman and Teng, STOC'01). We explain how our framework overcomes several weaknesses of smoothed analysis and we prove that under input scaling assumptions, feasibility tolerances and other design principles used by simplex method implementations, the simplex method indeed attains a polynomial running time.

[39] arXiv:2602.00334 (replaced) [pdf, html, other]
Title: Adaptive Momentum and Nonlinear Damping for Neural Network Training
Aikaterini Karoni, Rajit Rajpal, Benedict Leimkuhler, Gabriel Stoltz
Comments: 31 pages, 13 figures. Accepted at ICML 2026
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)

Momentum Stochastic Gradient Descent (mSGD) relies on a fixed momentum coefficient shared across all parameters, failing to account for the heterogeneous structure of modern loss landscapes. In this work, we adopt a continuous-time formulation to introduce individual, adaptive momentum coefficients regulated by the kinetic energy of each model parameter. This mechanism automatically adjusts to evolving training dynamics to maintain stability without sacrificing convergence speed. We demonstrate that this adaptive friction is inextricably linked to cubic damping, a suppression mechanism from structural dynamics. We additionally introduce two optimization schemes by augmenting the continuous dynamics of mSGD and Adam with a cubic damping term. Empirically, our methods demonstrate robustness and match or outperform Adam on training ViT, BERT, and GPT2 tasks where mSGD typically struggles. We further provide theoretical results establishing the exponential convergence of the proposed schemes.

[40] arXiv:2606.11347 (replaced) [pdf, html, other]
Title: Annealed Entropic Allocation for Ranking and Selection
Xin Fei, Juergen Branke
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)

We propose annealed entropic allocation, an adaptive sampling policy based on an annealed, weighted soft-min formulation of static budget allocation. We replace the maximin large-deviation rate objective with a weighted log-sum-exp surrogate that blends challenger-specific pairwise scores through soft-min weights, avoiding hard switching when several challengers are nearly active. To capture tail behavior beyond the leading exponent, the surrogate incorporates saddlepoint prefactors from refined pairwise tail asymptotics. Because these corrections are subexponential, decreasing the annealing temperature with the budget preserves the same first-order target allocation. For the static problem, we prove uniform convergence to the hard minimum, concentration of soft-min weights on active challengers, and continuity of the induced target-allocation map under fixed weights. Experiments show that the proposed methods are consistently competitive: the no-saddlepoint ablation performs best in symmetric Gaussian and exponential slippage settings, while saddlepoint weighting can help in heterogeneous or asymmetric cases.

[41] arXiv:2606.15105 (replaced) [pdf, html, other]
Title: Optimal Ground-to-Air Interception with Time-Varying Acceleration Bounds
Or Nahum, Vitaly Shaferman
Comments: This work has been submitted for journal publication. 37 Pages, 10 figures
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)

This paper proposes novel optimal-control-based guidance laws for ground-to-air missiles with time-varying acceleration bounds. In such engagements, as the missile climbs in altitude, its acceleration bound decreases, which may lead to acceleration saturation and significant miss distances if not explicitly accounted for. The proposed guidance laws incorporate hard acceleration command constraints directly into a linear-quadratic optimal-control framework, in contrast to conventional unbounded or softly constrained approaches. Analytically based guidance laws are developed for linear zero-order and first-order strictly proper missile dynamics with arbitrary-order linear target dynamics. Unlike the constant hard-bound case with minimum-phase missile dynamics, time-varying acceleration command bounds permit an initial unsaturated interval in which the proposed guidance laws can anticipate future saturation and reshape the acceleration profile accordingly. This enables earlier maneuvers when the missile possesses greater low-altitude maneuverability, fundamentally altering the structure of the optimal solution. The proposed approach is evaluated in nonlinear simulations and compared with equivalent unbounded and softly constrained optimal guidance laws. The results demonstrate substantially improved interception performance under saturation, reduced tuning requirements compared to softly constrained guidance laws, and enhanced capability in challenging engagement scenarios.

Total of 41 entries
Showing up to 2000 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status