Selective Imputation for Multivariate Time Series Datasets with Missing Values

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Standard

Selective Imputation for Multivariate Time Series Datasets with Missing Values. / Blazquez-Garcia, Ane; Wickstrom, Kristoffer; Yu, Shujian; Mikalsen, Karl Oyvind; Boubekki, Ahcene; Conde, Angel; Mori, Usue; Jenssen, Robert; Lozano, Jose A.

I: IEEE Transactions on Knowledge and Data Engineering, Bind 35, Nr. 9, 2023, s. 9490-9501.

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Harvard

Blazquez-Garcia, A, Wickstrom, K, Yu, S, Mikalsen, KO, Boubekki, A, Conde, A, Mori, U, Jenssen, R & Lozano, JA 2023, 'Selective Imputation for Multivariate Time Series Datasets with Missing Values', IEEE Transactions on Knowledge and Data Engineering, bind 35, nr. 9, s. 9490-9501. https://doi.org/10.1109/TKDE.2023.3240858

APA

Blazquez-Garcia, A., Wickstrom, K., Yu, S., Mikalsen, K. O., Boubekki, A., Conde, A., Mori, U., Jenssen, R., & Lozano, J. A. (2023). Selective Imputation for Multivariate Time Series Datasets with Missing Values. IEEE Transactions on Knowledge and Data Engineering, 35(9), 9490-9501. https://doi.org/10.1109/TKDE.2023.3240858

Vancouver

Blazquez-Garcia A, Wickstrom K, Yu S, Mikalsen KO, Boubekki A, Conde A o.a. Selective Imputation for Multivariate Time Series Datasets with Missing Values. IEEE Transactions on Knowledge and Data Engineering. 2023;35(9):9490-9501. https://doi.org/10.1109/TKDE.2023.3240858

Author

Blazquez-Garcia, Ane ; Wickstrom, Kristoffer ; Yu, Shujian ; Mikalsen, Karl Oyvind ; Boubekki, Ahcene ; Conde, Angel ; Mori, Usue ; Jenssen, Robert ; Lozano, Jose A. / Selective Imputation for Multivariate Time Series Datasets with Missing Values. I: IEEE Transactions on Knowledge and Data Engineering. 2023 ; Bind 35, Nr. 9. s. 9490-9501.

Bibtex

@article{6038054b6e084fbb906579d6161edd29,

title = "Selective Imputation for Multivariate Time Series Datasets with Missing Values",

abstract = "Multivariate time series often contain missing values for reasons such as failures in data collection mechanisms. Since these missing values can complicate the analysis of time series data, imputation techniques are typically used to deal with this issue. However, the quality of the imputation directly affects the performance of downstream tasks. In this paper, we propose a selective imputation method that identifies a subset of timesteps with missing values to impute in a multivariate time series dataset. This selection, which will result in shorter and simpler time series, is based on both reducing the uncertainty of the imputations and representing the original time series as good as possible. In particular, the method uses multi-objective optimization techniques to select the optimal set of points, and in this selection process, we leverage the beneficial properties of the Multi-task Gaussian Process (MGP). The method is applied to different datasets to analyze the quality of the imputations and the performance obtained in downstream tasks, such as classification or anomaly detection. The results show that much shorter and simpler time series are able to maintain or even improve both the quality of the imputations and the performance of the downstream tasks. ",

keywords = "imputation, irregular sampling, missing data, Multivariate time series",

author = "Ane Blazquez-Garcia and Kristoffer Wickstrom and Shujian Yu and Mikalsen, {Karl Oyvind} and Ahcene Boubekki and Angel Conde and Usue Mori and Robert Jenssen and Lozano, {Jose A.}",

note = "Publisher Copyright: {\textcopyright} 1989-2012 IEEE.",

year = "2023",

doi = "10.1109/TKDE.2023.3240858",

language = "English",

volume = "35",

pages = "9490--9501",

journal = "IEEE Transactions on Knowledge and Data Engineering",

issn = "1041-4347",

publisher = "IEEE Computer Society Press",

number = "9",

}

RIS

TY - JOUR

T1 - Selective Imputation for Multivariate Time Series Datasets with Missing Values

AU - Blazquez-Garcia, Ane

AU - Wickstrom, Kristoffer

AU - Yu, Shujian

AU - Mikalsen, Karl Oyvind

AU - Boubekki, Ahcene

AU - Conde, Angel

AU - Mori, Usue

AU - Jenssen, Robert

AU - Lozano, Jose A.

PY - 2023

Y1 - 2023

N2 - Multivariate time series often contain missing values for reasons such as failures in data collection mechanisms. Since these missing values can complicate the analysis of time series data, imputation techniques are typically used to deal with this issue. However, the quality of the imputation directly affects the performance of downstream tasks. In this paper, we propose a selective imputation method that identifies a subset of timesteps with missing values to impute in a multivariate time series dataset. This selection, which will result in shorter and simpler time series, is based on both reducing the uncertainty of the imputations and representing the original time series as good as possible. In particular, the method uses multi-objective optimization techniques to select the optimal set of points, and in this selection process, we leverage the beneficial properties of the Multi-task Gaussian Process (MGP). The method is applied to different datasets to analyze the quality of the imputations and the performance obtained in downstream tasks, such as classification or anomaly detection. The results show that much shorter and simpler time series are able to maintain or even improve both the quality of the imputations and the performance of the downstream tasks.

AB - Multivariate time series often contain missing values for reasons such as failures in data collection mechanisms. Since these missing values can complicate the analysis of time series data, imputation techniques are typically used to deal with this issue. However, the quality of the imputation directly affects the performance of downstream tasks. In this paper, we propose a selective imputation method that identifies a subset of timesteps with missing values to impute in a multivariate time series dataset. This selection, which will result in shorter and simpler time series, is based on both reducing the uncertainty of the imputations and representing the original time series as good as possible. In particular, the method uses multi-objective optimization techniques to select the optimal set of points, and in this selection process, we leverage the beneficial properties of the Multi-task Gaussian Process (MGP). The method is applied to different datasets to analyze the quality of the imputations and the performance obtained in downstream tasks, such as classification or anomaly detection. The results show that much shorter and simpler time series are able to maintain or even improve both the quality of the imputations and the performance of the downstream tasks.

KW - imputation

KW - irregular sampling

KW - missing data

KW - Multivariate time series

UR - http://www.scopus.com/inward/record.url?scp=85148442415&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2023.3240858

DO - 10.1109/TKDE.2023.3240858

M3 - Journal article

AN - SCOPUS:85148442415

VL - 35

SP - 9490

EP - 9501

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 9

ER -

ID: 364497888

Datalogisk Institut