Holonomic extended least angle regression



Marc Härkönen1 · Tomonari Sei2 · Yoshihiro Hirose3

Received: 30 August 2018 / Revised: 19 March 2020 / Accepted: 15 September 2020
© The Author(s) 2020

Abstract

One of the main problems studied in statistics is the fitting of models. Ideally, we would like to explain a large dataset with as few parameters as possible, and there have been numerous attempts at automating this process. Most notably, the Least Angle Regression algorithm, or LARS, is a computationally efficient algorithm that ranks the covariates of a linear model. The algorithm has been extended to a class of distributions in the generalized linear model by using properties of exponential family manifolds as dually flat manifolds. However, this extension assumes that the normalizing constant of the joint distribution of the observations is easy to compute. This is often not the case: for example, the normalizing constant may contain a complicated integral. We circumvent this issue when the normalizing constant satisfies a holonomic system, a system of linear partial differential equations with a finite-dimensional space of solutions. In this paper we present a modification of the holonomic gradient method and incorporate it into the extended LARS algorithm. We call the result the holonomic extended least angle regression algorithm, or HELARS. The algorithm was implemented in the statistical software R and tested on real and simulated datasets.

Keywords Generalized linear model · Holonomic gradient method · Least angle regression

This work was partly supported by JSPS KAKENHI Grant Numbers JP17K00044, JP18K18008, JST CREST Grant Number JPMJCR1763, and the Vilho, Yrjö and Kalle Väisälä Foundation.

✉ Marc Härkönen, [email protected]

1 School of Mathematics, Georgia Institute of Technology, 686 Cherry St NW, Atlanta, GA 30332, USA

2 Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan

3 Faculty of Information Science and Technology and Global Station for Big Data and Cybersecurity, Hokkaido University, Kita 14, Nishi 9, Kita-ku, Sapporo, Hokkaido 060-0814, Japan


1 Introduction

In model selection, one would ideally like to choose a statistical model that fits the data well while remaining simple enough to allow meaningful interpretation and explanatory power. In this paper we consider model simplification of linear and generalized linear models, where we want to choose a subset of covariates to include in the model.

Over the past two decades there have been many advances in sparse modeling. One of the most famous methods is L1-regularization: the Least Absolute Shrinkage and Selection Operator (LASSO) [23]. LASSO is defined only for the normal linear regression problem, but its idea has been applied to many other settings; for example, Park and Hastie [19] considered generalized linear models, and Yuan and Lin [25] treated Gaussian graphical models. Least Angle Regression (LARS) [6] is an efficient algorithm that ranks the covariates of a linear model.
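For reference, the LASSO estimator in the normal linear model with response $y \in \mathbb{R}^n$ and design matrix $X \in \mathbb{R}^{n \times p}$ solves the penalized least squares problem

$$\hat{\beta}^{\mathrm{lasso}} = \mathop{\mathrm{arg\,min}}_{\beta \in \mathbb{R}^p} \left\{ \tfrac{1}{2} \| y - X \beta \|_2^2 + \lambda \| \beta \|_1 \right\}, \qquad \lambda \ge 0,$$

where the $L_1$ penalty shrinks some coefficients exactly to zero and thereby performs variable selection. As a minimal sketch of the covariate ranking produced by LARS (using the lars package in R on simulated data, not the HELARS implementation developed in this paper):

    library(lars)                        # Efron et al.'s least angle regression

    set.seed(1)
    n <- 100; p <- 5
    x <- matrix(rnorm(n * p), n, p)      # simulated covariates
    y <- x[, 1] - 2 * x[, 2] + rnorm(n)  # response depends only on the first two covariates

    fit <- lars(x, y, type = "lar")      # trace the full LARS path
    print(fit)                           # reports the order in which covariates enter

Covariates that enter the path early are ranked as more relevant; it is this ranking behavior that the extended LARS algorithm, and in turn HELARS, carries over to generalized linear models.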