Mengdi Wang – Optimization Online

A Single Time-Scale Stochastic Approximation Method for Nested Stochastic Optimization

Published: 2018/12/27

We study constrained nested stochastic optimization problems in which the objective function is a composition of two smooth functions whose exact values and derivatives are not available. We propose a single time-scale stochastic approximation algorithm, which we call the Nested Averaged Stochastic Approximation (NASA), to find an approximate stationary point of the problem. The algorithm … Read more

Primal-Dual π Learning: Sample Complexity and Sublinear Run Time for Ergodic Markov Decision Problems

Published: 2017/10/17

Mengdi Wang

Dynamic Programming

Consider the problem of approximating the optimal policy of a Markov decision process (MDP) by sampling state transitions. In contrast to existing reinforcement learning methods that are based on successive approximations to the nonlinear Bellman equation, we propose a Primal-Dual π Learning method in light of the linear duality between the value and policy. The … Read more

Lower Bound On the Computational Complexity of Discounted Markov Decision Problems

Published: 2017/05/20

Yichen Chen

Mengdi Wang

Dynamic Programming, Linear Programming complexity, markov decision processes

We study the computational complexity of the infinite-horizon discounted-reward Markov Decision Problem (MDP) with a finite state space $\cS$ and a finite action space $\cA$. We show that any randomized algorithm needs a running time at least $\Omega(\carS^2\carA)$ to compute an $\epsilon$-optimal policy with high probability. We consider two variants of the MDP where the … Read more

Randomized Linear Programming Solves the Discounted Markov Decision Problem In Nearly-Linear (Sometimes Sublinear) Running Time

Published: 2017/04/05, Updated: 2017/09/13

Mengdi Wang

Dynamic Programming, Linear Programming duality, linear programming, markov decision processes, primal-dual algorithms, randomized algorithm, running-time complexity, stochastic approximation

We propose a randomized linear programming algorithm for approximating the optimal policy of the discounted Markov decision problem. By leveraging the value-policy duality, the algorithm adaptively samples state transitions and makes exponentiated primal-dual updates. We show that it finds an ε-optimal policy using nearly-linear running time in the worst case. For Markov decision processes that … Read more

Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning

Published: 2016/12/07

Yichen Chen

Mengdi Wang

Dynamic Programming reinforcement learning, stochastic primal-dual methods

We study the online estimation of the optimal policy of a Markov decision process (MDP). We propose a class of Stochastic Primal-Dual (SPD) methods which exploit the inherent minimax duality of Bellman equations. The SPD methods update a few coordinates of the value and policy estimates as a new state transition is observed. These methods … Read more

Worst-Case Hardness of Approximation for Sparse Optimization with L0 Norm

Published: 2016/02/22, Updated: 2016/03/29

Yichen Chen

Mengdi Wang

Combinatorial Optimization, Nonlinear Optimization approximation hard, complexity, l0 norm, np-hard, sparsity

In this paper, we consider sparse optimization problems with L0 norm penalty or constraint. We prove that it is strongly NP-hard to find an approximate optimal solution within certain error bound, unless P = NP. This provides a lower bound for the approximation error of any deterministic polynomial-time algorithm. Applying the complexity result to sparse … Read more

Blessing of Massive Scale: Spatial Graphical Model Estimation with a Total Cardinality Constraint

Published: 2015/11/18

Ethan Fang

Mengdi Wang

Han Liu

Combinatorial Optimization, Statistics complexity, graphical models, high-dimensional data, l0-constraint method, nonconvex optimization

We consider the problem of estimating high dimensional spatial graphical models with a total cardinality constraint (i.e., the l0-constraint). Though this problem is highly nonconvex, we show that its primal-dual gap diminishes linearly with the dimensionality and provide a convex geometry justification of this ‘blessing of massive scale’ phenomenon. Motivated by this result, we propose … Read more

Random Multi-Constraint Projection: Stochastic Gradient Methods for Convex Optimization with Many Constraints

Published: 2015/11/10, Updated: 2017/04/13

Convex and Nonsmooth Optimization convex optimization, random projection, stochastic algorithms, stochastic gradient

Consider convex optimization problems subject to a large number of constraints. We focus on stochastic problems in which the objective takes the form of expected values and the feasible set is the intersection of a large number of convex sets. We propose a class of algorithms that perform both stochastic gradient descent and random feasibility … Read more

Vanishing Price of Anarchy in Large Coordinative Nonconvex Optimization

Published: 2015/07/22, Updated: 2015/07/28

Mengdi Wang

Convex Optimization, Global Optimization Theory cooperative optimization, cutting planes, duality, nonconvex optimization, price of anarchy

We focus on a class of nonconvex cooperative optimization problems that involve multiple participants. We study the duality framework and provide geometric and analytic character- izations of the duality gap. The dual problem is related to a market setting in which each participant pursuits self interests at a given price of common goods. The duality … Read more

Stochastic Compositional Gradient Descent: Algorithms for Minimizing Compositions of Expected-Value Functions

Published: 2014/11/13, Updated: 2015/09/01

Ethan Fang

Mengdi Wang

Han Liu

Convex Optimization, Statistics, Stochastic Programming composition, sample complexity, statistical learning, stochastic gradient, stochastic optimization

Classical stochastic gradient methods are well suited for minimizing expected-value objective functions. However, they do not apply to the minimization of a nonlinear function involving expected values or a composition of two expected-value functions, i.e., problems of the form $\min_x \E_v\[f_v\big(\E_w [g_w(x)]\big) \]$. In order to solve this stochastic composition problem, we propose a class … Read more