// Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation
All of Us Imputation
Wearable sensors enable health researchers to continuously collect data pertaining to the physiological state of individuals in real-world settings. However, such data can be subject to extensive missingness due to a complex combination of factors. In this work, we study the problem of imputation of missing step count data, one of the most ubiquitous forms of wearable sensor data. We construct a novel and large scale data set consisting of a training set with over 3 million hourly step count observations and a test set with over 2.5 million hourly step count observations. We propose a domain knowledge-informed sparse self-attention model for this task that captures the temporal multi-scale nature of step-count data. We assess the performance of the model relative to baselines and conduct ablation studies to verify our specific model designs. The repository contains code for our paper Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation (Wei et al.), which was accepted at Conference on Health, Inference, and Learning (CHIL) 2024.
This project aims to address the issue of missing data in hourly step count data collected from smart watches and activity trackers (e.g., Apple Watch, Fitbit devices). To address this problem, we proposed a sparse self-attention model which captures the multi-timescale physical activity patterns and also designed a input feature representation to combine the time encoding (hour of day, day of week) with a temporally local activity pattern representation. Also, we curated a large-scale cohort from the Fitbit database of the All of Us research program to serve as the train and test dataset. The results show that our model can outperform all the considered baselines on the overall test dataset.

Details & Specifications
Published:
Category:
Technologies, Datasets
Tags:
Missingness,
Imputation,
Physiological,
Physical Activity
Imputation,
Physiological,
Physical Activity
License:
- Dr. Benjamin M. Marlin (UMass)
- Dr. James Rehg (UIUC)
- Dr. Santosh Kumar (Memphis)
- Hui Wei (UMass)
- Colin Samplawski (UMass)
- Maxwell Xu (Georgia Tech)

