Systematic Analysis of Distribution Shifts in Cross-Subject Glucose Prediction Using Wearable Physiological Data

Andrew Beten; Luna Lococo; Ayaan Baig; Thilini Karunarathna

doi:10.3390/ECSA-12-26583

Previous Article in event

Design and Implementation of an IoT-Based Respiratory Motion Sensor

Next Article in event

Ensemble Learning-assisted Spectroelectrochemical Sensing Platform for detection of fluoride in water

Systematic Analysis of Distribution Shifts in Cross-Subject Glucose Prediction Using Wearable Physiological Data

^{*

1},

¹,

¹,

^{*

2}

¹ College of Arts and Sciences, Ohio State University, Columbus, OH 43210, United States
² Department of Mechanical and Electrical Systems Engineering, Kyoto University of Advanced Science, Kyoto 615-0096, Japan

Academic Editor: Stefano Mariani

Published: 07 November 2025 by MDPI in The 12th International Electronic Conference on Sensors and Applications session Student Session

https://doi.org/10.3390/ECSA-12-26583

Abstract:

Wearable sensors offer a promising platform for non-invasive glucose monitoring by indirectly predicting glucose levels from physiological signals. However, machine learning models trained on such data often suffer degraded performance when applied to new individuals due to distribution shifts in physiological patterns. This study investigates how the inter-subject distribution shift impacts the performance of glucose prediction models trained on wearable data. We utilize the BIGIDEAs dataset, which includes simultaneous recordings of glucose levels and multimodal physiological signals. Personalized XGBoost regression models were trained on data from 10 subjects and evaluated on 5 held-out subjects to assess cross-subject generalization. Distribution shifts in glucose profiles between training and test subjects were quantified using the Anderson-Darling (AD) statistic. Results show that models trained on one individual performed poorly when tested on others. Repeated measures correlation analysis revealed significant positive correlations between the AD statistic and model performance metrics, including RMSE, NRMSE, and MARD. Our findings highlight the challenge of inter-individual generalization and the need for distribution-aware models. We propose personalized calibration and subject phenotyping as future directions to enhance model generalizability.

Keywords: wearable physiological sensing; predictive modelling; continuous glucose monitoring; distribution shift; XGBoost

View paper

19 Reads
0 Recommendations