Loading...
Thumbnail Image
Publication

Learning from Irregularly-Sampled Time Series

Citations
Altmetric:
Abstract
Irregularly-sampled time series are characterized by non-uniform time intervals between successive measurements. Such time series naturally occur in application areas including climate science, ecology, biology, and medicine. Irregular sampling poses a great challenge for modeling this type of data as there can be substantial uncertainty about the values of the underlying temporal processes. Moreover, different time series are not necessarily synchronized or of the same length, which makes it difficult to deal with using standard machine learning methods that assume fixed-dimensional data spaces. The goal of this thesis is to develop scalable probabilistic tools for modeling a large collection of irregularly-sampled time series defined over a common time interval. We first introduce an uncertainty-aware kernel framework based on a Gaussian process (GP) representation of the time series and then demonstrate how to significantly scale up the model by linearizing the kernel with various acceleration techniques. To further reduce the computational overhead of the GP representation and improve the expressiveness of the model, we propose a generalized uncertainty-aware framework that integrates a posterior GP sampler with arbitrary black-box models including neural networks. We propose a linear time and linear space sampling algorithm and show how to efficiently train the entire framework end-to-end. To better model the uncertainty by utilizing the information from an entire dataset collectively, we reframe our task as a missing data problem that aims at learning the distribution of the latent temporal process. We first study the missing data problem under a simplified setting where the data are defined on a finite-dimensional space and introduce a model based on generative adversarial networks for learning from incomplete data. To relax the finite-dimensional constraint, we propose a unified encoder-decoder framework that can be trained as a density model or an implicit generative model. We finally introduce a specific architecture within this framework to efficiently represent and learn from irregularly-sampled continuous time series.
Type
dissertation
Date
2020-05
Publisher
License
License
http://creativecommons.org/licenses/by/4.0/