Marc Potters, Estimating Large Noisy Matrices: Random Matrix Theory Meets Cross-Validation

Event
Seminar

Date: Thursday, 5 December 2019

Time: 18.00 - 19.00 GMT
Location: City and Guilds building - CAGB LT 200

Audience: Open to all
Cost: Free

Add event to calendar

For further details:

Contact: Rula Murtada

BIO: As Chief Investment Officer, Marc oversees the investment process of all CFM funds and serves on the Board of Directors of CFM S.A. In addition, together with Jean-Philippe Bouchaud, he supervises the research team with particular focus on developing concrete applications in financial forecasting, portfolio construction, risk control and execution. Marc maintains strong links with academia and as an expert in Random Matrix Theory has taught at UCLA and Sorbonne University. Marc obtained his PhD in physics from Princeton University and joined CFM in October 1995 as a researcher in quantitative finance. Marc continues to publish papers in statistical finance with his research team and co-authored ‘Theory of Financial Risk and Derivative Pricing’ with Jean-Philippe.

ABSTRACT: The estimation of large covariance matrices is a recurring problem in data analysis, in particular in finance where it is a pre-requisite of Markowitz portfolio optimization. The sample covariance matrices (SCM) is not the best estimate of true covariance, especially when the matrix is large. Using tools from random matrix theory (RMT) and free probability one can compute the eigenvalue spectrum of the SCM from independent or even auto-correlated samples. One can also compute the optimal estimator of true covariance from sample data under a natural hypothesis of absence of prior knowledge about eigenvectors. This estimator can be expressed in RMT language but is easier to understand in the optimization/validation (O/V) framework used in machine learning. I will present a recent leave-one-out algorithm that combines simplicity and good results.

Functions of the true covariance matrix (such as its inverse) can also be estimated using the same schemes. I speculate that the link between the RMT and O/V frameworks can help us distinguish between signal and noise in very complex noisy data sets such as neural recording data.

Add event to calendar

See all events