Geometric Methods in Probabilistic Modelling

Research output: Book/ReportPh.D. thesisResearch

Standard

Geometric Methods in Probabilistic Modelling. / Mallasto, Anton.

Department of Computer Science, Faculty of Science, University of Copenhagen, 2019.

Research output: Book/ReportPh.D. thesisResearch

Harvard

Mallasto, A 2019, Geometric Methods in Probabilistic Modelling. Department of Computer Science, Faculty of Science, University of Copenhagen. <https://soeg.kb.dk/permalink/45KBDK_KGL/1pioq0f/alma99123532514605763>

APA

Mallasto, A. (2019). Geometric Methods in Probabilistic Modelling. Department of Computer Science, Faculty of Science, University of Copenhagen. https://soeg.kb.dk/permalink/45KBDK_KGL/1pioq0f/alma99123532514605763

Vancouver

Mallasto A. Geometric Methods in Probabilistic Modelling. Department of Computer Science, Faculty of Science, University of Copenhagen, 2019.

Author

Mallasto, Anton. / Geometric Methods in Probabilistic Modelling. Department of Computer Science, Faculty of Science, University of Copenhagen, 2019.

Bibtex

@phdthesis{4f841d30d7fa40d8a29abdab8fec5776,
title = "Geometric Methods in Probabilistic Modelling",
abstract = "During the past decade, machine learning has established itself as the foundation of rtificial intelligence, viewing learning as a statistical task that can be well quantified.An increasingly popular toolkit in machine learning is provided by geometry, which we consider in this thesis in two main categories: (i) the study of the geometry induced by machine learning models; (ii) the design of machine learning models that respect specified geometric properties of data.First, most machine learning approaches quantify learning objectives by minimizing a loss function between a model and given data. The model can be deterministic or probabilistic, outputting point predictions or stochastic predictions, respectively. In contrast to deterministic models, probabilistic modelling allows us to carry out uncertainty quantification, which helps assess whether a prediction is trustworthy or not. The loss function often describes some kind of geometric similarity between the model and the data, and thus in the case of probabilistic modelling requires studying the geometry of probability measures, which in this thesis is taken to be the optimal transport geometry.Second, all data has structure. Sometimes, the data structure is not very restrictive, e.g., when the data lives in a vector space, in which case any appropriate vector could a priori be a data point. However, depending on the application, the data structure could be more restrictive. For example, if we are interested in location data on earth, then all possible data points have to lie approximately on a sphere. More generally, the data might naturally live on some low-dimensional surface. This restricts us to models that take the known geometry into account, which can be enforced through the machinery of Riemannian geometry.",
author = "Anton Mallasto",
year = "2019",
language = "English",
publisher = "Department of Computer Science, Faculty of Science, University of Copenhagen",

}

RIS

TY - BOOK

T1 - Geometric Methods in Probabilistic Modelling

AU - Mallasto, Anton

PY - 2019

Y1 - 2019

N2 - During the past decade, machine learning has established itself as the foundation of rtificial intelligence, viewing learning as a statistical task that can be well quantified.An increasingly popular toolkit in machine learning is provided by geometry, which we consider in this thesis in two main categories: (i) the study of the geometry induced by machine learning models; (ii) the design of machine learning models that respect specified geometric properties of data.First, most machine learning approaches quantify learning objectives by minimizing a loss function between a model and given data. The model can be deterministic or probabilistic, outputting point predictions or stochastic predictions, respectively. In contrast to deterministic models, probabilistic modelling allows us to carry out uncertainty quantification, which helps assess whether a prediction is trustworthy or not. The loss function often describes some kind of geometric similarity between the model and the data, and thus in the case of probabilistic modelling requires studying the geometry of probability measures, which in this thesis is taken to be the optimal transport geometry.Second, all data has structure. Sometimes, the data structure is not very restrictive, e.g., when the data lives in a vector space, in which case any appropriate vector could a priori be a data point. However, depending on the application, the data structure could be more restrictive. For example, if we are interested in location data on earth, then all possible data points have to lie approximately on a sphere. More generally, the data might naturally live on some low-dimensional surface. This restricts us to models that take the known geometry into account, which can be enforced through the machinery of Riemannian geometry.

AB - During the past decade, machine learning has established itself as the foundation of rtificial intelligence, viewing learning as a statistical task that can be well quantified.An increasingly popular toolkit in machine learning is provided by geometry, which we consider in this thesis in two main categories: (i) the study of the geometry induced by machine learning models; (ii) the design of machine learning models that respect specified geometric properties of data.First, most machine learning approaches quantify learning objectives by minimizing a loss function between a model and given data. The model can be deterministic or probabilistic, outputting point predictions or stochastic predictions, respectively. In contrast to deterministic models, probabilistic modelling allows us to carry out uncertainty quantification, which helps assess whether a prediction is trustworthy or not. The loss function often describes some kind of geometric similarity between the model and the data, and thus in the case of probabilistic modelling requires studying the geometry of probability measures, which in this thesis is taken to be the optimal transport geometry.Second, all data has structure. Sometimes, the data structure is not very restrictive, e.g., when the data lives in a vector space, in which case any appropriate vector could a priori be a data point. However, depending on the application, the data structure could be more restrictive. For example, if we are interested in location data on earth, then all possible data points have to lie approximately on a sphere. More generally, the data might naturally live on some low-dimensional surface. This restricts us to models that take the known geometry into account, which can be enforced through the machinery of Riemannian geometry.

UR - https://soeg.kb.dk/permalink/45KBDK_KGL/1pioq0f/alma99123532514605763

M3 - Ph.D. thesis

BT - Geometric Methods in Probabilistic Modelling

PB - Department of Computer Science, Faculty of Science, University of Copenhagen

ER -

ID: 234995464