Statistics Seminar: Amichai Painsky

Date:

Mon, 15/01/2018, 15:30-16:30

Location:

Hevra 4412

Lecturer:

Amichai Painsky, MIT and Hebrew University

Title: Universal Loss and Gaussian Learning Bounds

Abstract:

In this talk I address two fundamental predictive modeling problems: choosing a universal loss function, and approaching non-linear learning problems with linear means.

A loss function quantifies the difference between the true values and the estimated fits for a given instance of data. Different loss functions reflect different merits, and the choice of a "correct" loss may sometimes be questionable. Here, I show that for binary classification problems, the Bernoulli log-likelihood loss (log-loss) is universal with respect to practical alternatives. In other words, I show that by minimizing the log-loss we minimize an upper bound on any smooth, convex and unbiased binary loss function. This property justifies the broad use of log-loss in regression, in decision trees, as an InfoMax criterion (cross-entropy minimization) and in many other applications.
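As a rough illustration of what such an upper bound can look like, the sketch below checks one well-known special case numerically: by Pinsker's inequality, the expected excess log-loss of predicting q when the true probability is p (the Bernoulli KL divergence, in nats) is at least twice the expected excess quadratic loss, (p - q)^2. This is not the general bound from the talk, only a familiar instance of the same idea; the function names and the grid are assumptions made for the example.

```python
import math


def bernoulli_kl(p, q):
    """KL divergence D(Bernoulli(p) || Bernoulli(q)) in nats.

    This equals the expected excess log-loss incurred by predicting q
    when the true success probability is p.
    """
    def term(a, b):
        # Convention: 0 * log(0 / b) = 0
        return 0.0 if a == 0.0 else a * math.log(a / b)

    return term(p, q) + term(1.0 - p, 1.0 - q)


def excess_quadratic(p, q):
    """Expected excess quadratic (Brier) loss of predicting q when truth is p."""
    return (p - q) ** 2


# Check Pinsker's inequality, KL(p || q) >= 2 * (p - q)^2, on a grid of
# probabilities strictly inside (0, 1). A small tolerance absorbs rounding.
grid = [i / 100 for i in range(1, 100)]
violations = [
    (p, q)
    for p in grid
    for q in grid
    if bernoulli_kl(p, q) < 2.0 * excess_quadratic(p, q) - 1e-12
]

print("grid points violating the bound:", len(violations))
```

On this grid no pair (p, q) should violate the inequality, so driving the excess log-loss toward zero also drives the excess quadratic loss toward zero.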

I then address a Gaussian representation problem which utilizes the log-loss. In this problem we look for an embedding of arbitrary data which maximizes its "Gaussian part" while preserving the original dependence between the variables and the target. This embedding provides an efficient (and practical) representation, as it allows us to exploit the favorable properties of a Gaussian distribution. I introduce different methods and show that the optimal Gaussian embedding is governed by the non-linear canonical correlations of the data. This result provides a fundamental limit on our ability to Gaussianize arbitrary datasets and solve complex problems by linear means.

Departmental Seminar

Department of Statistics and Data Science

Faculty of Social Sciences
