Semi-parametric Gaussian copula models for machine learning

Kaufmann, Dinu. Semi-parametric Gaussian copula models for machine learning. 2017, Doctoral Thesis, University of Basel, Faculty of Science.

Available under License CC BY-NC-ND (Attribution-NonCommercial-NoDerivatives).


Official URL: http://edoc.unibas.ch/diss/DissB_12475

Downloads: Statistics Overview


The aim of machine learning and statistics is to learn and predict from data. With the introduction of copulas, probabilistic models and algorithms can benefit from the separation of dependency and marginals. The additional flexibility allows to generalise better and increase the prediction accuracy. Based on this observation, this work enlightens different models within the framework of a semi-parametric Gaussian copula model.
The first model we consider is archetypal analysis. We show that the Gaussian copula approximates the dependency structure of the generative model we consider. With copula archetypal analysis, we present a new model, which extends the applicability of the original model. Our second contribution refers to the semi-parametric Gaussian copula extension of principal component analysis. We consider the model in the context of parametric appearance models for facial appearance. We show, that the copula relaxation leads ultimately to a higher specificity and provide a unifying way of combining different data. The third contribution is Bayesian sub-network estimation within the framework of Gaussian graphical models. We show that the Markov blanket of a set of query variables has analytical form and can be efficiently estimated. Our last contribution is the motivation of time-resolved information flows in the context of directed information and Pearlian graphs. We show, how to discover information flows in non-stationary time series and give a convenient estimator.
At the core of these models lies the semi-parametric Gaussian copula model. In this work we show how it allows to relax certain assumptions in the aforementioned models. Ultimately, this leads to non-Gaussian and latent linear models, which better apply to real-world data sets.
Advisors:Roth, Volker and Gutmann, Michael
Faculties and Departments:05 Faculty of Science > Departement Mathematik und Informatik > Informatik > Biomedical Data Analysis (Roth)
UniBasel Contributors:Kaufmann, Dinu and Roth, Volker
Item Type:Thesis
Thesis Subtype:Doctoral Thesis
Thesis no:12475
Thesis status:Complete
Number of Pages:1 Online-Ressource (XII, 130 Seiten)
Identification Number:
edoc DOI:
Last Modified:05 Apr 2018 17:36
Deposited On:01 Mar 2018 10:28

Repository Staff Only: item control page