Publications
2024 |
Iman Deznabi, Yunfei Lou, Abhinav Shaw, Natcha Simsiri, Tauhidur Rahman, Madalina Fiterau Dynamic Clustering via Branched Deep Learning Enhances Personalization of Stress Prediction from Mobile Sensor Data Journal Article In: Nature Scientific Reports, vol. 14, no. 6631, 2024. @article{nokey, |
Iman Deznabi, Peeyush Kumar, Madalina Fiterau Zero-shot micro-climate prediction with deep learning Workshop 2024. @workshop{Deznabi2024, Weather station data is a valuable resource for climate prediction, however, its reliability can be limited in remote locations. To compound the issue, making local predictions often relies on sensor data that may not be accessible for a new, previously unmonitored location. In response to these challenges, we propose a novel zero-shot learning approach designed to forecast various climate measurements at new and unmonitored locations. Our method surpasses conventional weather forecasting techniques in predicting microclimate variables by leveraging knowledge extracted from other geographic locations. |
2023 |
Iman Deznabi, Madalina Fiterau Conference on Health, Inference, and Learning (CHIL 2023), vol. 209, 2023. @conference{Deznabi2023, The analysis of multivariate time series data is challenging due to the various frequencies of signal changes that can occur over both short and long terms. Furthermore, standard deep learning models are often unsuitable for such datasets, as signals are typically sampled at different rates. To address these issues, we introduce MultiWave, a novel framework that enhances deep learning time series models by incorporating components that operate at the intrinsic frequencies of signals. MultiWave uses wavelets to decompose each signal into subsignals of varying frequencies and groups them into frequency bands. Each frequency band is handled by a different component of our model. A gating mechanism combines the output of the components to produce sparse models that use only specific signals at specific frequencies. Our experiments demonstrate that MultiWave accurately identifies informative frequency bands and improves the performance of various deep learning models, including LSTM, Transformer, and CNN-based models, for a wide range of applications. It attains top performance in stress and affect detection from wearables. It also increases the AUC of the best-performing model by 5% for in-hospital COVID-19 mortality prediction from patient blood samples and for human activity recognition from accelerometer and gyroscope data. We show that MultiWave consistently identifies critical features and their frequency components, thus providing valuable insights into the applications studied. |
2022 |
Edmond Cunningham, Adam D Cobb, Susmit Jha Principal Component Flows Conference 2022. @conference{nokey, Normalizing flows map an independent set of latent variables to their samples using a bijective transformation. Despite the exact correspondence between samples and latent variables, their high level relationship is not well understood. In this paper we characterize the geometric structure of flows using principal manifolds and understand the relationship between latent variables and samples using contours. We introduce a novel class of normalizing flows, called principal component flows (PCF), whose contours are its principal manifolds, and a variant for injective flows (iPCF) that is more efficient to train than regular injective flows. PCFs can be constructed using any flow architecture, are trained with a regularized maximum likelihood objective and can perform density estimation on all of their principal manifolds. In our experiments we show that PCFs and iPCFs are able to learn the principal manifolds over a variety of datasets. Additionally, we show that PCFs can perform density estimation on data that lie on a manifold with variable dimensionality, which is not possible with existing normalizing flows. |
Christen H Fleming, Iman Deznabi, Shauhin Alavi, Margaret C Crofoot, Ben T Hirsch, E Patricia Medici, Michael J Noonan, Roland Kays, William F Fagan, Daniel Sheldon, Justin M Calabrese Population-level inference for home-range areas Journal Article In: Methods in Ecology and Evolution journal (2022), vol. 13, iss. 5, pp. 1027-1041, 2022. @article{nokey, 1. Home-range estimates are a common product of animal tracking data, as each range represents the area needed by a given individual. Population-level inference of home-range areas—where multiple individual home ranges are considered to be sampled from a population—is also important to evaluate changes over time, space or covariates such as habitat quality or fragmentation, and for comparative analyses of species averages. Population-level home-range parameters have traditionally been estimated by first assuming that the input tracking data were sampled independently when calculating home ranges via conventional kernel density estimation (KDE) or minimal convex polygon (MCP) methods, and then assuming that those individual home ranges were measured exactly when calculating the population-level estimates. This conventional approach does not account for the temporal autocorrelation that is inherent in modern tracking data, nor for the uncertainties of each individual home-range estimate, which are often large and heterogeneous. 2. Here, we introduce a statistically and computationally efficient framework for the population-level analysis of home-range areas, based on autocorrelated kernel density estimation (AKDE), that can account for variable temporal autocorrelation and estimation uncertainty. 3. We apply our method to empirical examples on lowland tapir Tapirus terrestris, kinkajou Potos flavus, white-nosed coati Nasua narica, white-faced capuchin monkey Cebus capucinus and spider monkey Ateles geoffroyi, and quantify differences between species, environments and sexes. 4. Our approach allows researchers to more accurately compare different populations with different movement behaviours or sampling schedules while retaining statistical precision and power when individual home-range uncertainties vary. Finally, we emphasize the estimation of effect sizes when comparing populations, rather than mere significance tests. |
2021 |
Iman Deznabi, Mohit Iyyer, Madalina Fiterau Predicting in-hospital mortality by combining clinical notes with time-series data Conference Association for Computational Linguistics (ACL-IJCNLP 2021) Findings, 2021. @conference{Deznabi2021b, In intensive care units (ICUs), patient health is monitored through (1) continuous vital signals from various medical devices, and (2) clinical notes consisting of opinions and summaries from doctors which are recorded in electronic health records (EHR). It is difficult to jointly model these two sources of information because clinical notes, unlike vital signals, are collected at irregular intervals and their contents are relatively unstructured. In this paper, we present a model that combines both sources of information about ICU patients to make accurate in-hospital mortality predictions. We apply a fine-tuned BERT model to each of the patient's clinical notes. The resulting embeddings are then combined to obtain the overall embedding for the entire text part of the data. This is then combined with the output of an LSTM model that encodes patients' vital signals. Our model improves upon the state of the art for mortality prediction, attaining an AUC score of 0.9, compared to the previous 0.87, setting a new standard for mortality prediction on the MIMIC III benchmark. |
Edmond Cunningham, Madalina Fiterau A Change of Variables Method For Rectangular Matrix-Vector Products Conference 2021. @conference{nokey, Rectangular matrix-vector products (MVPs) are used extensively throughout machine learning and are fundamental to neural networks such as multi-layer perceptrons. However, the use of rectangular MVPs in successive normalizing flow transformations is notably missing. This paper identifies this methodological gap and plugs it with a tall and wide MVP change of variables formula. Our theory builds up to a practical algorithm that envelops existing dimensionality increasing flow methods such as augmented flows. We show that tall MVPs are closely related to the stochastic inverse of wide MVPs and empirically demonstrate that they improve density estimation over existing dimension changing methods. |
Iman Deznabi, Tamanna Motahar, Ali Sarvghad, Madalina Fiterau, Narges Mahyar Impact of the COVID-19 Pandemic on the Academic Community Results from a survey conducted at University of Massachusetts Amherst Journal Article In: ACM, Digital Government: Research and Practive, COVID-19 Commentary, vol. 2, iss. 2, no. 22, pp. 1-12, 2021. @article{Deznabi2021, The COVID-19 pandemic has significantly impacted academic life in the United States and beyond. To gain a better understanding of its impact on the academic community, we conducted a large-scale survey at the University of Massachusetts Amherst. We collected multifaceted data from students, staff, and faculty on several aspects of their lives, such as mental and physical health, productivity, and finances. All our respondents expressed mental and physical issues and concerns, such as increased stress and depression levels. Financial difficulties seem to have the most considerable toll on staff and undergraduate students, while productivity challenges were mostly expressed by faculty and graduate students. As universities face many important decisions with respect to mitigating the effects of this pandemic, we present our findings with the intent of shedding light on the challenges faced by various academic groups in the face of the pandemic, calling attention to the differences between groups. We also contribute a discussion highlighting how the results translate to policies for the effective and timely support of the categories of respondents who need them most. Finally, the survey itself, which includes conditional logic allowing for personalized questions, serves as a template for further data collection, facilitating a comparison of the impact on campuses across the United States. |
2020 |
Edmond Cunningham; Renos Zabounidis; Abhinav Agrawal; Madalina Fiterau; Daniel Sheldon Normalizing Flows Across Dimensions Workshop International Conference on Machine Learning (ICML) Workshops 2020, ICML 2020 Inductive biases, invariances and generalization in RL (BIG) workshop, 2020. @workshop{DBLP:journals/corr/abs-2006-13070, |
Purva Pruthi; Javier González; Xiaoyu Lu; Madalina Fiterau Structure Mapping for Transferability of Causal Models Workshop International Conference on Machine Learning (ICML) Workshops 2020, ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models, 2020. @workshop{DBLP:journals/corr/abs-2007-09445, |
2019 |
Brett Beaulieu-Jones; Samuel G Finlayson; Corey Chivers; Irene Chen; Matthew McDermott; Jaz Kandola; Adrian V Dalca; Andrew Beam; Madalina Fiterau; Tristan Naumann Trends and Focus of Machine Learning Applications for Health Research Journal Article In: JAMA network open, vol. 2, no. 10, pp. e1914051–e1914051, 2019. @article{beaulieu2019trends, |
Yi Ren Fung; Ziqiang Guan; Ritesh Kumar; Yeahuay Joie Wu; Madalina Fiterau Alzheimer's Disease Brain MRI Classification: Challenges and Insights Workshop IJCAI ARIAL Workshop, 2019. @workshop{DBLP:journals/corr/abs-1906-04231, |
Surya Teja Devarakonda; Yeahuay Joie Wu; Yi Ren Fung; Madalina Fiterau FLARe: Forecasting by Learning Anticipated Representations Proceedings Article In: Proceedings of the Machine Learning for Healthcare Conference, MLHC 2019, 9-10 August 2019, Ann Arbor, Michigan, USA, pp. 53–65, Proceedings of Machine Learning Research, 2019. @inproceedings{DBLP:conf/mlhc/DevarakondaWFF19, |
Jason A Fries; Paroma Varma; Vincent S Chen; Ke Xiao; Heliodoro Tejeda; Priyanka Saha; Jared Dunnmon; Henry Chubb; Shiraz Maskatia; Madalina Fiterau; Scott Delp; Euan Ashley; Christopher Ré; James R Priest Weakly Supervised Classification of Aortic Valve Malformations using Unlabeled Cardiac MRI Sequences Journal Article In: Nature communications, vol. 10, no. 1, pp. 1–10, 2019. @article{fries2019weakly, |
Debasmita Ghose; Shasvat Mukeshkumar Desai; Sneha Bhattacharya; Deep Chakraborty; Madalina Fiterau; Tauhidur Rahman Pedestrian Detection in Thermal Images Using Saliency Maps Workshop Conference on Computer Vision and Pattern Recognition (CVPR) Workshops 2019, Computer Vision Foundation, 2019. @workshop{DBLP:conf/cvpr/GhoseDBCFR19, |
Abhishek Sharma; Aritra Ghosh; Madalina Fiterau Generative Sequential Stochastic Model for Marked Point Processes Workshop International Conference on Machine Learning (ICML) Workshops 2019, ICML Time Series Workshop, 2019. @workshop{ICMLTS19:gssmmpp, |
Rheeya Uppaal; Bryon Kucharski; Bhanu Pratap; Iman Deznabi; Madalina Fiterau Multi-resolution Attention with Signal Splitting for Multivariate Time Series Classification Workshop International Conference on Machine Learning (ICML) Workshops 2019, ICML Time Series Workshop, 2019. @workshop{ICMLTS19:mass, Real world multivariate time series pose three significant challenges: irregularity in sampling, missing values, and varying sampling frequencies among signals. Recent work for inference on such data aims at solving one of these issues, however a unified model is still lacking. We present a unified method which handles all three: Multi-resolution Attention with Signal Splitting (MASS). Our method is model-agnostic and can be applied to any existing model, significantly boosting predictive performance. MASS uses parallel multi-resolution blocks to model different resolution data streams, in addition to splitting signals into components of specific resolutions, to provide approximately a 3% improvement on the Physionet Challenge 2012 Dataset. We also compare to the state of the art TBM and GRU-D models, showcasing promising results against them. |
Abhinav Shaw; Natcha Simsiri; Iman Deznaby; Madalina Fiterau; Tauhidur Rahman Personalized Student Stress Prediction with Deep Multitask Network Workshop International Conference on Machine Learning (ICML) Workshops 2019, ICML Adaptive and Multitask Learning Workshop, 2019. @workshop{DBLP:journals/corr/abs-1906-11356, With the growing popularity of wearable devices, the ability to utilize physiological data collected from these devices to predict the wearer's mental state such as mood and stress suggests great clinical applications, yet such a task is extremely challenging. In this paper, we present a general platform for personalized predictive modeling of behavioural states like students' level of stress. Through the use of Auto-encoders and Multitask learning we extend the prediction of stress to both sequences of passive sensor data and high-level covariates. Our model outperforms the state-of-the-art in the prediction of stress level from mobile sensor data, obtaining a 45.6 % improvement in F1 score on the StudentLife dataset. |
Bhanu Pratap Singh; Iman Deznabi; Bharath Narasimhan; Bryon Kucharski; Rheeya Uppaal; Akhila Josyula; Madalina Fiterau Multi-resolution Networks For Flexible Irregular Time Series Modeling (Multi-FIT) Technical Report 2019. @techreport{DBLP:journals/corr/abs-1905-00125, |
Ziqiang Guan; Ritesh Kumar; Yi Ren Fung; Yeahuay Wu; Madalina Fiterau A Comprehensive Study of Alzheimer's Disease Classification Using Convolutional Neural Networks Technical Report 2019. @techreport{DBLP:journals/corr/abs-1904-07950, |