Repository logo
 

Fast Nonparametric Clustering of Structured Time-Series.

Accepted version
Peer-reviewed

Type

Article

Change log

Authors

Hensman, James 
Rattray, Magnus 
Lawrence, Neil David  ORCID logo  https://orcid.org/0000-0001-9258-1030

Abstract

In this publication, we combine two Bayesian nonparametric models: the Gaussian Process (GP) and the Dirichlet Process (DP). Our innovation in the GP model is to introduce a variation on the GP prior which enables us to model structured time-series data, i.e., data containing groups where we wish to model inter- and intra-group variability. Our innovation in the DP model is an implementation of a new fast collapsed variational inference procedure which enables us to optimize our variational approximation significantly faster than standard VB approaches. In a biological time series application we show how our model better captures salient features of the data, leading to better consistency with existing biological classifications, while the associated inference algorithm provides a significant speed-up over EM-based variational inference.

Description

Keywords

Cluster Analysis, Computational Biology, Computer Simulation, Gene Expression Profiling, Normal Distribution, Statistics, Nonparametric

Journal Title

IEEE Trans Pattern Anal Mach Intell

Conference Name

Journal ISSN

0162-8828
1939-3539

Volume Title

37

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Rights

All rights reserved