Modified Monotone Policy Iteration for Interpretable Policies in Markov Decision Processes and the Impact of State Ordering Rules

Optimizing interpretable policies for Markov Decision Processes (MDPs) can be computationally intractable for large-scale MDPs, e.g., for monotone policies, the optimal interpretable policy depends on the initial state distribution, precluding standard dynamic programming techniques. Previous work has proposed Monotone Policy Iteration (MPI) to produce a feasible solution for warm starting a Mixed Integer Linear Program … Read more

A more efficient reformulation of complex SDP as real SDP

This note proposes a novel reformulation of complex semidefinite programs (SDPs) as real SDPs. As an application, we present an economical reformulation of complex SDP relaxations of complex polynomial optimization problems as real SDPs and derive some further reductions by exploiting structure of the complex SDP relaxations. Various numerical examples demonstrate that our new reformulation … Read more

Exact convergence rate of the last iterate in subgradient methods

\(\) We study the convergence of the last iterate in subgradient methods applied to the minimization of a nonsmooth convex function with bounded subgradients. We first introduce a proof technique that generalizes the standard analysis of subgradient methods. It is based on tracking the distance between the current iterate and a different reference point at … Read more

Essays on Average Incremental Cost Pricing for Independent System Operators

In a series of essays, we introduce average incremental cost (AIC) pricing and present examples to help understand its advantages. In non-convex markets, AIC pricing is the rough equivalent to marginal cost pricing in convex markets. We present a qualitative comparison of current ISO pricing methods and the AIC approach. We argue that AIC better … Read more

One-Pass Average Incremental Cost Pricing

Since the inception of ISOs, Locational Marginal Prices (LMPs) alone were not incentive compatible because an auction winner who offered its avoidable costs could lose money at the LMP. ISOs used make-whole payments to ensure market participants did not lose money. Make-whole payments were not public creating transparency issues. Over time, the ISOs employed methods … Read more

Inexact Direct-Search Methods for Bilevel Optimization Problems

In this work, we introduce new direct search schemes for the solution of bilevel optimization (BO) problems. Our methods rely on a fixed accuracy black box oracle for the lower-level problem, and deal both with smooth and potentially nonsmooth true objectives. We thus analyze for the first time in the literature direct search schemes in … Read more

Planning a Zero-Emission Mixed-Fleet Public Bus System with Minimal Life Cycle Cost

The variety of available technology options for the operation of zero-emission bus systems gives rise to the problem of finding an optimal technology decision for bus operators. Among others, overnight charging, opportunity charging and hydrogen-based technology options are frequently pursued technological solutions. As their operating conditions are strongly influenced by the urban context, an optimal … Read more

A Short Proof of Tight Bounds on the Smallest Support Size of Integer Solutions to Linear Equations

\(\) In this note, we study the size of the support of solutions to linear Diophantine equations $Ax=b, ~x\in\Z^n$ where $A\in\Z^{m\times n}, b\in\Z^n$. We give an asymptotically tight upper bound on the smallest support size, parameterized by $\|A\|_\infty$ and $m$, and taken as a worst case over all $b$ such that the above system has … Read more

Global convergence of a BFGS-type algorithm for nonconvex multiobjective optimization problems

We propose a modified BFGS algorithm for multiobjective optimization problems with global convergence, even in the absence of convexity assumptions on the objective functions. Furthermore, we establish the superlinear convergence of the method under usual conditions. Our approach employs Wolfe step sizes and ensures that the Hessian approximations are updated and corrected at each iteration … Read more

On the Computation of Restricted Normal Cones

Restricted normal cones are of interest, for instance, in the theory of local error bounds, where they have recently been used to characterize the exis- tence of a constrained Lipschitzian error bound. In this paper, we establish rela- tions between two concepts for restricted normals. The first of these concepts was introduced in the late … Read more