TY - JOUR AB - Representations of probability measures in reproducing kernel Hilbert spacesprovide a flexible framework for fully nonparametric hypothesis tests ofindependence, which can capture any type of departure from independence,including nonlinear associations and multivariate interactions. However, theseapproaches come with an at least quadratic computational cost in the number ofobservations, which can be prohibitive in many applications. Arguably, it isexactly in such large-scale datasets that capturing any type of dependence isof interest, so striking a favourable tradeoff between computational efficiencyand test performance for kernel independence tests would have a direct impacton their applicability in practice. In this contribution, we provide anextensive study of the use of large-scale kernel approximations in the contextof independence testing, contrasting block-based, Nystrom and random Fourierfeature approaches. Through a variety of synthetic data experiments, it isdemonstrated that our novel large scale methods give comparable performancewith existing methods whilst using significantly less computation time andmemory. AU - Zhang,Q AU - Filippi,S AU - Gretton,A AU - Sejdinovic,D DO - 10.1007/s11222-016-9721-7 EP - 130 PY - 2017/// SN - 1573-1375 SP - 113 TI - Large-Scale Kernel Methods for Independence Testing T2 - Statistics and Computing UR - http://dx.doi.org/10.1007/s11222-016-9721-7 UR - http://hdl.handle.net/10044/1/56039 VL - 28 ER -