Stefania Bellavia – Optimization Online

Alternate Training of Shared and Task-Specific Parameters for Multi-Task Neural Networks

Published: 2024/01/08

Nonlinear Optimization, Stochastic Programming multi-task learning, neural networks, stochastic gradient

This paper introduces novel alternate training procedures for hard-parameter sharing Multi-Task Neural Networks (MTNNs). Traditional MTNN training faces challenges in managing conflicting loss gradients, often yielding sub-optimal performance. The proposed alternate training method updates shared and task-specific weights alternately, exploiting the multi-head architecture of the model. This approach reduces computational costs, enhances training regularization, and … Read more

An optimally fast objective-function-free minimization algorithm using random subspaces

Published: 2023/10/25

Nonlinear Optimization, Stochastic Programming, Unconstrained Optimization evaluation complexity, nonlinear optimization, objective-function-free optimization (OFFO), sketching, stochastic adaptive regularization methods

Article Download View An optimally fast objective-function-free minimization algorithm using random subspaces

Inexact Newton methods with matrix approximation by sampling for nonlinear least-squares and systems

Published: 2023/08/31

Stefania Bellavia

Greta Malaspina

Benedetta Morini

Nonlinear Optimization, Nonlinear Systems and Least-Squares, Stochastic Programming

We develop and analyze stochastic inexact Gauss-Newton methods for nonlinear least-squares problems and inexact Newton methods for nonlinear systems of equations. Random models are formed using suitable sampling strategies for the matrices involved in the deterministic models. The analysis of the expected number of iterations needed in the worst case to achieve a desired level … Read more

Regularized methods via cubic subspace minimization for nonconvex optimization

Published: 2023/06/26

Nonlinear Optimization, Unconstrained Optimization

\(\) The main computational cost per iteration of adaptive cubic regularization methods for solving large-scale nonconvex problems is the computation of the step \(s_k\), which requires an approximate minimizer of the cubic model. We propose a new approach in which this minimizer is sought in a low dimensional subspace that, in contrast to classical approaches, … Read more

SLiSeS: Subsampled Line Search Spectral Gradient Method for Finite Sums

Published: 2023/06/01, Updated: 2024/10/09

Stefania Bellavia

Nataša Krejić

Nataša Krklec Jerinkić

Marcos Raydan

Data Science Algorithms, Nonlinear Optimization, Optimization in Data Science finite sum minimization, line-search, spectral gradient methods, subsampling

Citation SLiSes Article Download View SLiSeS: Subsampled Line Search Spectral Gradient Method for Finite Sums

Trust-region algorithms: probabilistic complexity and intrinsic noise with applications to subsampling techniques

Published: 2021/12/12

Nonlinear Systems and Least-Squares, Unconstrained Optimization evaluation complexity, finite-sum optimization, inexact functions and derivatives, probabilistic analysis, subsampling methods, trust-region methods

A trust-region algorithm is presented for finding approximate minimizers of smooth unconstrained functions whose values and derivatives are subject to random noise. It is shown that, under suitable probabilistic assumptions, the new method finds (in expectation) an epsilon-approximate minimizer of arbitrary order q > 0 in at most O(epsilon^{-(q+1)}) inexact evaluations of the function and … Read more

A stochastic first-order trust-region method with inexact restoration for finite-sum minimization

Published: 2021/06/22

Nonlinear Optimization finite sum minimization, inexact restoration, sub-sampling, trust-region methods, worst-case evaluation complexity

We propose a stochastic first-order trust-region method with inexact function and gradient evaluations for solving finite-sum minimization problems. At each iteration, the function and the gradient are approximated by sampling. The sample size in gradient approximations is smaller than the sample size in function approximations and the latter is determined using a deterministic rule inspired … Read more

The Impact of Noise on Evaluation Complexity: The Deterministic Trust-Region Case

Published: 2021/04/06, Updated: 2023/02/20

Unconstrained Optimization evaluation complexity, inexact functions and derivatives, noise, trust-region methods

Intrinsic noise in objective function and derivatives evaluations may cause premature termination of optimization algorithms. Evaluation complexity bounds taking this situation into account are presented in the framework of a deterministic trust-region method. The results show that the presence of intrinsic noise may dominate these bounds, in contrast with what is known for methods in … Read more

High-order Evaluation Complexity of a Stochastic Adaptive Regularization Algorithm for Nonconvex Optimization Using Inexact Function Evaluations and Randomly Perturbed Derivatives

Published: 2020/05/12

Bound-constrained Optimization, Constrained Nonlinear Optimization, Unconstrained Optimization evaluation complexity, inexact functions and derivatives, regularization, stochastic analysis

A stochastic adaptive regularization algorithm allowing random noise in derivatives and inexact function values is proposed for computing strong approximate minimizers of any order for inexpensively constrained smooth optimization problems. For an objective function with Lipschitz continuous p-th derivative in a convex neighbourhood of the feasible set and given an arbitrary optimality order q, it … Read more

A relaxed interior point method for low-rank semidefinite programming problems with applications to matrix completion

Published: 2019/09/16, Updated: 2021/03/26

Stefania Bellavia

Jacek Gondzio

Margherita Porcelli

Nonlinear Systems and Least-Squares, Semi-definite Programming interior point methods, low rank, matrix-completion problems, semidefinite programming

A new relaxed variant of interior point method for low-rank semidefinite programming problems is proposed in this paper. The method is a step outside of the usual interior point framework. In anticipation to converging to a low-rank primal solution, a special nearly low-rank form of all primal iterates is imposed. To accommodate such a (restrictive) … Read more