Publications

Ricards Marcinkevics^, Patricia Reis Wolfertstetter^, Ugne Klimiene^, Kieran Chin-Cheong, Alyssia Paschke, Julia Zerres, Markus Denzinger, David Niederberger, Sven Wellmann, Ece Özkan Elsen^†, Christian Knorr^†, Julia E. Vogt^†
^ denotes shared first authorship, ^† denotes shared last authorshipInterpretable and intervenable ultrasonography-based machine learning models for pediatric appendicitisMedical Image Analysis

Abstract

Appendicitis is among the most frequent reasons for pediatric abdominal surgeries. Previous decision support systems for appendicitis have focused on clinical, laboratory, scoring, and computed tomography data and have ignored abdominal ultrasound, despite its noninvasive nature and widespread availability. In this work, we present interpretable machine learning models for predicting the diagnosis, management and severity of suspected appendicitis using ultrasound images. Our approach utilizes concept bottleneck models (CBM) that facilitate interpretation and interaction with high-level concepts understandable to clinicians. Furthermore, we extend CBMs to prediction problems with multiple views and incomplete concept sets. Our models were trained on a dataset comprising 579 pediatric patients with 1709 ultrasound images accompanied by clinical and laboratory data. Results show that our proposed method enables clinicians to utilize a human-understandable and intervenable predictive model without compromising performance or requiring time-consuming image annotation when deployed. For predicting the diagnosis, the extended multiview CBM attained an AUROC of 0.80 and an AUPR of 0.92, performing comparably to similar black-box neural networks trained and tested on the same dataset.

Authors

Ricards Marcinkevics^*, Patricia Reis Wolfertstetter^*, Ugne Klimiene^*, Kieran Chin-Cheong, Alyssia Paschke, Julia Zerres, Markus Denzinger, David Niederberger, Sven Wellmann, Ece Özkan Elsen^†, Christian Knorr^†, Julia E. Vogt^†
^* denotes shared first authorship, ^† denotes shared last authorship

Submitted

Medical Image Analysis

Date

01.01.2024

Link DOI Code

MZH Kolk, S Ruipérez-Campillo, L Alvarez-Florez, B Deb, EJ Bekkers, CP Allaart, ALCJ van der Lingen, P Clopton, I Isgum, AAM Wilde, RE Knops, SM Narayan, FVY TjongDynamic prediction of malignant ventricular arrhythmias using neural networks in patients with an implantable cardioverter-defibrillatorLancet eBiomedicine

Abstract

Background Risk stratification for ventricular arrhythmias currently relies on static measurements that fail to adequately capture dynamic interactions between arrhythmic substrate and triggers over time. We trained and internally validated a dynamic machine learning (ML) model and neural network that extracted features from longitudinally collected electrocardiograms (ECG), and used these to predict the risk of malignant ventricular arrhythmias. Methods A multicentre study in patients implanted with an implantable cardioverter-defibrillator (ICD) between 2007 and 2021 in two academic hospitals was performed. Variational autoencoders (VAEs), which combine neural networks with variational inference principles, and can learn patterns and structure in data without explicit labelling, were trained to encode the mean ECG waveforms from the limb leads into 16 variables. Supervised dynamic ML models using these latent ECG representations and clinical baseline information were trained to predict malignant ventricular arrhythmias treated by the ICD. Model performance was evaluated on a hold-out set, using time-dependent receiver operating characteristic (ROC) and calibration curves. Findings 2942 patients (61.7 ± 13.9 years, 25.5% female) were included, with a total of 32,129 ECG recordings during a mean follow-up of 43.9 ± 35.9 months. The mean time-varying area under the ROC curve for the dynamic model was 0.738 ± 0.07, compared to 0.639 ± 0.03 for a static (i.e. baseline-only model). Feature analyses indicated dynamic changes in latent ECG representations, particularly those affecting the T-wave morphology, were of highest importance for model predictions. Interpretation Dynamic ML models and neural networks effectively leverage routinely collected longitudinal ECG recordings for personalised and updated predictions of malignant ventricular arrhythmias, outperforming static models.

Authors

MZH Kolk, S Ruipérez-Campillo, L Alvarez-Florez, B Deb, EJ Bekkers, CP Allaart, ALCJ van der Lingen, P Clopton, I Isgum, AAM Wilde, RE Knops, SM Narayan, FVY Tjong

Submitted

Lancet eBiomedicine

Date

01.01.2024

Laura Manduchi^, Moritz Vandenhirtz^, Alain Ryser, Julia E. Vogt
^* denotes shared first authorshipTree Variational AutoencodersSpotlight at Neural Information Processing Systems, NeurIPS 2023

Abstract

We propose Tree Variational Autoencoder (TreeVAE), a new generative hierarchical clustering model that learns a flexible tree-based posterior distribution over latent variables. TreeVAE hierarchically divides samples according to their intrinsic characteristics, shedding light on hidden structures in the data. It adapts its architecture to discover the optimal tree for encoding dependencies between latent variables. The proposed tree-based generative architecture enables lightweight conditional inference and improves generative performance by utilizing specialized leaf decoders. We show that TreeVAE uncovers underlying clusters in the data and finds meaningful hierarchical relations between the different groups on a variety of datasets, including real-world imaging data. We present empirically that TreeVAE provides a more competitive log-likelihood lower bound than the sequential counterparts. Finally, due to its generative nature, TreeVAE is able to generate new samples from the discovered clusters via conditional sampling.

Authors

Laura Manduchi^*, Moritz Vandenhirtz^*, Alain Ryser, Julia E. Vogt
^* denotes shared first authorship

Submitted

Spotlight at Neural Information Processing Systems, NeurIPS 2023

Date

20.12.2023

Sonia Laguna, Julian Heidenreich, Jiugeng Sun, Nil\"ufer Cetin, Ibrahim Al Hazwani, Udo Schlegel, Furui Cheng, Mennatallah El-AssadyExpLIMEable: An exploratory framework for LIMENeurIPS 2023, XAI in Action: Past, Present, and Future Applications

Abstract

ExpLIMEable is a tool to enhance the comprehension of Local Interpretable Model-Agnostic Explanations (LIME), particularly within the realm of medical image analysis. LIME explanations often lack robustness due to variances in perturbation techniques and interpretable function choices. Powered by a convolutional neural network for brain MRI tumor classification, ExpLIMEable seeks to mitigate these issues. This explainability tool allows users to tailor and explore the explanation space generated post hoc by different LIME parameters to gain deeper insights into the model’s decision-making process, its sensitivity, and limitations. We introduce a novel dimension reduction step on the perturbations seeking to find more informative neighborhood spaces and extensive provenance tracking to support the user. This contribution ultimately aims to enhance the robustness of explanations, key in high-risk domains like healthcare

Authors

Sonia Laguna, Julian Heidenreich, Jiugeng Sun, Nil\"ufer Cetin, Ibrahim Al Hazwani, Udo Schlegel, Furui Cheng, Mennatallah El-Assady

Submitted

NeurIPS 2023, XAI in Action: Past, Present, and Future Applications

Date

16.12.2023

Ricards Marcinkevics^, Sonia Laguna^, Moritz Vandenhirtz, Julia E. Vogt
^* denotes shared first authorshipBeyond Concept Bottleneck Models: How to Make Black Boxes Intervenable?XAI in Action: Past, Present, and Future Applications, NeurIPS 2023

Abstract

Recently, interpretable machine learning has re-explored concept bottleneck models (CBM), comprising step-by-step prediction of the high-level concepts from the raw features and the target variable from the predicted concepts. A compelling advantage of this model class is the user's ability to intervene on the predicted concept values, consequently affecting the model's downstream output. In this work, we introduce a method to perform such concept-based interventions on already-trained neural networks, which are not interpretable by design. Furthermore, we formalise the model's intervenability as a measure of the effectiveness of concept-based interventions and leverage this definition to fine-tune black-box models. Empirically, we explore the intervenability of black-box classifiers on synthetic tabular and natural image benchmarks. We demonstrate that fine-tuning improves intervention effectiveness and often yields better-calibrated predictions. To showcase the practical utility of the proposed techniques, we apply them to chest X-ray classifiers and show that fine-tuned black boxes can be as intervenable and more performant than CBMs.

Authors

Ricards Marcinkevics^*, Sonia Laguna^*, Moritz Vandenhirtz, Julia E. Vogt
^* denotes shared first authorship

Submitted

XAI in Action: Past, Present, and Future Applications, NeurIPS 2023

Date

16.12.2023

Alexander Marx, Francesco Di Stefano, Heike Leutheuser, Kieran Chin-Cheong, Marc Pfister, Marie-Anne Burckhardt, Sara Bachmann^†, Julia E. Vogt^†
^† denotes shared last authorshipBlood glucose forecasting from temporal and static information in children with T1DFrontiers in Pediatrics

Abstract

Background: The overarching goal of blood glucose forecasting is to assist individuals with type 1 diabetes (T1D) in avoiding hyper- or hypoglycemic conditions. While deep learning approaches have shown promising results for blood glucose forecasting in adults with T1D, it is not known if these results generalize to children. Possible reasons are physical activity (PA), which is often unplanned in children, as well as age and development of a child, which both have an effect on the blood glucose level. Materials and Methods: In this study, we collected time series measurements of glucose levels, carbohydrate intake, insulin-dosing and physical activity from children with T1D for one week in an ethics approved prospective observational study, which included daily physical activities. We investigate the performance of state-of-the-art deep learning methods for adult data—(dilated) recurrent neural networks and a transformer—on our dataset for short-term (30 min) and long-term (2 h) prediction. We propose to integrate static patient characteristics, such as age, gender, BMI, and percentage of basal insulin, to account for the heterogeneity of our study group. Results: Integrating static patient characteristics (SPC) proves beneficial, especially for short-term prediction. LSTMs and GRUs with SPC perform best for a prediction horizon of 30 min (RMSE of 1.66 mmol/l), a vanilla RNN with SPC performs best across different prediction horizons, while the performance significantly decays for long-term prediction. For prediction during the night, the best method improves to an RMSE of 1.50 mmol/l. Overall, the results for our baselines and RNN models indicate that blood glucose forecasting for children conducting regular physical activity is more challenging than for previously studied adult data. Conclusion: We find that integrating static data improves the performance of deep-learning architectures for blood glucose forecasting of children with T1D and achieves promising results for short-term prediction. Despite these improvements, additional clinical studies are warranted to extend forecasting to longer-term prediction horizons.

Authors

Alexander Marx, Francesco Di Stefano, Heike Leutheuser, Kieran Chin-Cheong, Marc Pfister, Marie-Anne Burckhardt, Sara Bachmann^†, Julia E. Vogt^†
^† denotes shared last authorship

Submitted

Frontiers in Pediatrics

Date

14.12.2023

Thomas M. Sutter^, Alain Ryser^, Joram Liebeskind, Julia E. Vogt
^* denotes shared first authorshipDifferentiable Random Partition ModelsNeurips 2023

Abstract

Partitioning a set of elements into an unknown number of mutually exclusive subsets is essential in many machine learning problems. However, assigning elements, such as samples in a dataset or neurons in a network layer, to an unknown and discrete number of subsets is inherently non-differentiable, prohibiting end-to-end gradient-based optimization of parameters. We overcome this limitation by proposing a novel two-step method for inferring partitions, which allows its usage in variational inference tasks. This new approach enables reparameterized gradients with respect to the parameters of the new random partition model. Our method works by inferring the number of elements per subset and, second, by filling these subsets in a learned order. We highlight the versatility of our general-purpose approach on three different challenging experiments: variational clustering, inference of shared and independent generative factors under weak supervision, and multitask learning.

Authors

Thomas M. Sutter^*, Alain Ryser^*, Joram Liebeskind, Julia E. Vogt
^* denotes shared first authorship

Submitted

Neurips 2023

Date

12.12.2023

Alice Bizeul, Carl AllenSimVAE: Narrowing the gap between Discriminative & Generative Representation LearningNeurIPS 2023 Workshop on Mathematics of Modern Machine Learning

Abstract

Self-supervised representation learning is a powerful paradigm that leverages the relationship between semantically similar data, such as augmentations, extracts of an image or sound clip, or multiple views/modalities. Recent methods, e.g. SimCLR, CLIP and DINO, have made significant strides, yielding representations that achieve state-of-the-art results on multiple downstream tasks. A number of self-supervised discriminative approaches have been proposed, e.g. instance discrimination, latent clustering and contrastive methods. Though often intuitive, a comprehensive theoretical understanding of their underlying mechanisms or *what* they learn eludes. Meanwhile, generative approaches, such as variational autoencoders (VAEs), fit a specific latent variable model and have principled appeal, but lag significantly in terms of performance. We present a theoretical analysis of self-supervised discriminative methods and a graphical model that reflects the assumptions they implicitly make and unifies these methods. We show that fitting this model under an ELBO objective improves representations over previous VAE methods on several common benchmarks, narrowing the gap to discriminative methods, and can also preserve information lost by discriminative approaches. This work brings new theoretical insight to modern machine learning practice.

Authors

Alice Bizeul, Carl Allen

Submitted

NeurIPS 2023 Workshop on Mathematics of Modern Machine Learning

Date

07.11.2023

Claudio Fanconi^, Moritz Vandenhirtz^, Severin Husmann, Julia E. Vogt
^* denotes shared first authorshipThis Reads Like That: Deep Learning for Interpretable Natural Language ProcessingConference on Empirical Methods in Natural Language Processing, EMNLP 2023

Abstract

Prototype learning, a popular machine learning method designed for inherently interpretable decisions, leverages similarities to learned prototypes for classifying new data. While it is mainly applied in computer vision, in this work, we build upon prior research and further explore the extension of prototypical networks to natural language processing. We introduce a learned weighted similarity measure that enhances the similarity computation by focusing on informative dimensions of pre-trained sentence embeddings. Additionally, we propose a post-hoc explainability mechanism that extracts prediction-relevant words from both the prototype and input sentences. Finally, we empirically demonstrate that our proposed method not only improves predictive performance on the AG News and RT Polarity datasets over a previous prototype-based approach, but also improves the faithfulness of explanations compared to rationale-based recurrent convolutions.

Authors

Claudio Fanconi^*, Moritz Vandenhirtz^*, Severin Husmann, Julia E. Vogt
^* denotes shared first authorship

Submitted

Conference on Empirical Methods in Natural Language Processing, EMNLP 2023

Date

25.10.2023

Link DOI Code

Imant Daunhawer, Kai Schumacher, Anna Badura, Julia E. Vogt, Holger Michel, Sven WellmannValidating the early phototherapy prediction tool across cohortsFrontiers in Pediatrics, 2023

Abstract

Background: Hyperbilirubinemia of the newborn infant is a common disease worldwide. However, recognized early and treated appropriately, it typically remains innocuous. We recently developed an early phototherapy prediction tool (EPPT) by means of machine learning (ML) utilizing just one bilirubin measurement and few clinical variables. The aim of this study is to test applicability and performance of the EPPT on a new patient cohort from a different population. Materials and methods: This work is a retrospective study of prospectively recorded neonatal data from infants born in 2018 in an academic hospital, Regensburg, Germany, meeting the following inclusion criteria: born with 34 completed weeks of gestation or more, at least two total serum bilirubin (TSB) measurement prior to phototherapy. First, the original EPPT—an ensemble of a logistic regression and a random forest—was used in its freely accessible version and evaluated in terms of the area under the receiver operating characteristic curve (AUROC). Second, a new version of the EPPT model was re-trained on the data from the new cohort. Third, the predictive performance, variable importance, sensitivity and specificity were analyzed and compared across the original and re-trained models. Results: In total, 1,109 neonates were included with a median (IQR) gestational age of 38.4 (36.6–39.9) and a total of 3,940 bilirubin measurements prior to any phototherapy treatment, which was required in 154 neonates (13.9%). For the phototherapy treatment prediction, the original EPPT achieved a predictive performance of 84.6% AUROC on the new cohort. After re-training the model on a subset of the new dataset, 88.8% AUROC was achieved as evaluated by cross validation. The same five variables as for the original model were found to be most important for the prediction on the new cohort, namely gestational age at birth, birth weight, bilirubin to weight ratio, hours since birth, bilirubin value. Discussion: The individual risk for treatment requirement in neonatal hyperbilirubinemia is robustly predictable in different patient cohorts with a previously developed ML tool (EPPT) demanding just one TSB value and only four clinical parameters. Further prospective validation studies are needed to develop an effective and safe clinical decision support system.

Authors

Imant Daunhawer, Kai Schumacher, Anna Badura, Julia E. Vogt, Holger Michel, Sven Wellmann

Submitted

Frontiers in Pediatrics, 2023

Date

09.10.2023

Zixuan Xiao, Michal Muszynski, Ricards Marcinkevics, Lukas Zimmerli, Adam D. Ivankay, Dario Kohlbrenner, Manuel Kuhn, Yves Nordmann, Ulrich Muehlner, Christian Clarenbach, Julia E. Vogt, Thomas BrunschwilerBreathing New Life into COPD Assessment: Multisensory Home-monitoring for Predicting Severity25th ACM International Conference on Multimodal Interaction, ICMI'23

Abstract

Chronic obstructive pulmonary disease (COPD) is a significant public health issue, affecting more than 100 million people worldwide. Remote patient monitoring has shown great promise in the efficient management of patients with chronic diseases. This work presents the analysis of the data from a monitoring system developed to track COPD symptoms alongside patients’ self-reports. In particular, we investigate the assessment of COPD severity using multisensory home-monitoring device data acquired from 30 patients over a period of three months. We describe a comprehensive data pre-processing and feature engineering pipeline for multimodal data from the remote home-monitoring of COPD patients. We develop and validate predictive models forecasting i) the absolute and ii) differenced COPD Assessment Test (CAT) scores based on the multisensory data. The best obtained models achieve Pearson’s correlation coefficient of 0.93 and 0.37 for absolute and differenced CAT scores. In addition, we investigate the importance of individual sensor modalities for predicting CAT scores using group sparse regularization techniques. Our results suggest that feature groups indicative of the patient’s general condition, such as static medical and physiological information, date, spirometer, and air quality, are crucial for predicting the absolute CAT score. For predicting changes in CAT scores, sleep and physical activity features are most important, alongside the previous CAT score value. Our analysis demonstrates the potential of remote patient monitoring for COPD management and investigates which sensor modalities are most indicative of COPD severity as assessed by the CAT score. Our findings contribute to the development of effective and data-driven COPD management strategies.

Authors

Zixuan Xiao, Michal Muszynski, Ricards Marcinkevics, Lukas Zimmerli, Adam D. Ivankay, Dario Kohlbrenner, Manuel Kuhn, Yves Nordmann, Ulrich Muehlner, Christian Clarenbach, Julia E. Vogt, Thomas Brunschwiler

Submitted

25th ACM International Conference on Multimodal Interaction, ICMI'23

Date

09.10.2023

KO Kristinsdottir, S Ruipérez-Campillo, T HelgasonQuantifying Spasticity: A ReviewChapter of the Book "Stroke-Management Pearls"

Abstract

A precise method to measure spasticity is fundamental in improving the quality of life of spastic patients. The measurement methods that exist for spasticity have long been considered scarce and inadequate, which can partly be explained by a lack of consensus in the definition of spasticity. Spasticity quantification methods can be roughly classified according to whether they are based on neurophysiological or biomechanical mechanisms, clinical scales, or imaging techniques. This article reviews methods from all classes and further discusses instrumentation, dimensionality, and EMG onset detection methods. The objective of this article is to provide a review on spasticity measurement methods used to this day in an effort to contribute to the advancement of both the quantification and treatment of spasticity.

Authors

KO Kristinsdottir, S Ruipérez-Campillo, T Helgason

Submitted

Chapter of the Book "Stroke-Management Pearls"

Date

04.10.2023

Ruibin Feng, Brototo Deb, Prasanth Ganesan, Fleur VY Tjong, Albert J Rogers, Samuel Ruipérez-Campillo, Sulaiman Somani, Paul Clopton, Tina Baykaner, Miguel Rodrigo, James Zou, Fracois Haddad, Matei Zahari, Sanjiv M. NarayanSegmenting computed tomograms for cardiac ablation using machine learning leveraged by domain knowledge encodingFrontiers in cardiovascular medicine

Abstract

Background Segmentation of computed tomography (CT) is important for many clinical procedures including personalized cardiac ablation for the management of cardiac arrhythmias. While segmentation can be automated by machine learning (ML), it is limited by the need for large, labeled training data that may be difficult to obtain. We set out to combine ML of cardiac CT with domain knowledge, which reduces the need for large training datasets by encoding cardiac geometry, which we then tested in independent datasets and in a prospective study of atrial fibrillation (AF) ablation. Methods We mathematically represented atrial anatomy with simple geometric shapes and derived a model to parse cardiac structures in a small set of N = 6 digital hearts. The model, termed “virtual dissection,” was used to train ML to segment cardiac CT in N = 20 patients, then tested in independent datasets and in a prospective study. Results In independent test cohorts (N = 160) from 2 Institutions with different CT scanners, atrial structures were accurately segmented with Dice scores of 96.7% in internal (IQR: 95.3%–97.7%) and 93.5% in external (IQR: 91.9%–94.7%) test data, with good agreement with experts (r = 0.99; p < 0.0001). In a prospective study of 42 patients at ablation, this approach reduced segmentation time by 85% (2.3 ± 0.8 vs. 15.0 ± 6.9 min, p < 0.0001), yet provided similar Dice scores to experts (93.9% (IQR: 93.0%–94.6%) vs. 94.4% (IQR: 92.8%–95.7%), p = NS). Conclusions Encoding cardiac geometry using mathematical models greatly accelerated training of ML to segment CT, reducing the need for large training sets while retaining accuracy in independent test data. Combining ML with domain knowledge may have broad applications.

Authors

Ruibin Feng, Brototo Deb, Prasanth Ganesan, Fleur VY Tjong, Albert J Rogers, Samuel Ruipérez-Campillo, Sulaiman Somani, Paul Clopton, Tina Baykaner, Miguel Rodrigo, James Zou, Fracois Haddad, Matei Zahari, Sanjiv M. Narayan

Submitted

Frontiers in cardiovascular medicine

Date

02.10.2023

L Pancorbo^, S Ruipérez-Campillo^, F Castells, J Millet
^* denotes shared first authorshipHeterogeneity Quantification of Electrophysiological Signal Propagation in High-Density Multielectrode RecordingsIEEE Computing in Cardiology (50th CinC, 2023)

Abstract

This study presents a novel metric to evaluate the heterogeneity of cardiac substrate by using vector maps derived from omnipolar electrograms. This metric determines the level of disorganisation of electrical propagation having the potential to classify cardiac tissue under the catheter. We tested the methodology on propagation maps obtained from experimental recordings with and without electrical stimulation, under the assumption that the former exhibit greater heterogeneity. Results show the discriminatory behaviour of the parameter (p < 0.001), assigning higher values to non-stimulated maps and lower values in cases with stimulation. The clinical relevance of this paper lies in the introduction of a new metric defined on omnipolarderived vector maps, capable of identifying and quantifying areas of disorganised electrical propagation within the heart. This parameter has the potential to make orientation-independent catheterisation procedures more efficient providing electrophysiologists with valuable information for the management of arrhythmias.

Authors

L Pancorbo^*, S Ruipérez-Campillo^*, F Castells, J Millet
^* denotes shared first authorship

Submitted

IEEE Computing in Cardiology (50th CinC, 2023)

Date

01.10.2023

RT Ors-Quixal, E Ramírez-Candela, S Ruipérez-Campillo, F Castells, J Millet3D CNN as an Approach to Predict the Cerebral Performance of Comatose PatientsIEEE Computing in Cardiology (50th CinC, 2023)

Abstract

Many patients remain in a comatose state after initially surviving a resuscitation following a cardiac arrest. The prognosis in this state carries the decision of life support withdrawal, thus needing an objective and deterministic guideline. The objective of this study, is to assist this decision by providing a model able to predict the cerebral performance category (CPC) of comatose patients following cardiac arrest from their electroencephalographic (EEG) signal. To achieve this, binary classifiers built with 3D Convolutional Neural Networks (CNNs) followed by Dense Neural Networks (DNN) are used in combination with a “divide and conquer” strategy, thus enabling the automatic extraction of features from the tensors of EEG signals, taking into consideration the spatial relation of the signals according to the electrodes’ distribution on the scalp. This work was submitted under the team name “BioITACA UPV” to “Predicting Neurological Recovery from Coma After Cardiac Arrest: The George B. Moody PhysioNet Challenge 2023”, and while the team did not score in the official phase, results obtained from a held-out subset of the training set demonstrate the capability of the model to classify by CPC from short segments of 5 seconds to long recordings of EEG data. Results show an average accuracy of 0.76 between the CPC classifiers and capability to discern between a good or bad outcome prognosis.

Authors

RT Ors-Quixal, E Ramírez-Candela, S Ruipérez-Campillo, F Castells, J Millet

Submitted

IEEE Computing in Cardiology (50th CinC, 2023)

Date

01.10.2023

M Pedron, P Ganesan, R Feng, B Deb, H Chang, S Ruipérez-Campillo, S Somani, Y Desai, AJ Rogers, P Clopton, SM NarayanDefining the Predictive Ceiling of Electrogram Features Alone for Predicting Outcomes From Atrial Fibrillation AblationIEEE Computing in Cardiology (50th CinC, 2023)

Abstract

The aim of this study is to improve the prediction of long-term outcomes in patients with atrial fibrillation solely using electrogram (EGM) features. We developed three distinct models based on data from a cohort of N=561 patients, each targeting different aspects of EGM analysis: Principal Component Analysis (PCA): We applied PCA to analyze the variances of eigenvectors projecting more than a fixed threshold of the overall variance (15%). To identify common projection axes among these eigenvectors, we employed the k-means algorithm for clustering. Auto Regressive: This technique involves applying a bijective transformation to the coefficients, which are subsequently used as input for various machine learning classifiers, including Random Forest or Support Vector Classifier. Feature Engineering: We performed feature engineering by extracting voltage, rate, and shape similarity metrics from raw EGM (Electrogram) data.

Authors

M Pedron, P Ganesan, R Feng, B Deb, H Chang, S Ruipérez-Campillo, S Somani, Y Desai, AJ Rogers, P Clopton, SM Narayan

Submitted

IEEE Computing in Cardiology (50th CinC, 2023)

Date

01.10.2023

E Ramítez, S Ruipérez-Campillo, F Castells, R Casado-Arroyo, J MilletSynchronization of Conventional Electrocardiogram Recordings for Accurate Vectorcardiography ReconstructionIEEE Computing in Cardiology (50th CinC, 2023)

Abstract

The vectorcardiogram (VCG) provides a comprehensive representation of the heart's electrical activity in 3D aiding in the diagnosis and treatment of cardiovascular diseases. The conventional electrocardiogram (ECG) records twelve leads intermittently at intervals of 2.5 seconds, with lead II typically recorded continuously, which poses a challenge for reconstructing the VCG, as each lead's beats belong to different time instances. The purpose of this research is to propose and validate a methodology for accurately synchronizing the recording beats to reconstruct the VCG. To achieve this goal, a phantom was created to mimic the standard 12-lead ECG setup. The temporal offset of each beat from the first is calculated using cross-correlation utilizing the continuous lead and the same offset is applied to all leads, and finally reconstructing the VCG. The results demonstrate precise synchronization, as evidenced by Pearson correlation values of 0.9959±0.0034 , an MAE of 0.0077±0.0024 mV , and an RMSE of 0.0119±0.0038 mV in the VCG reconstruction. This technique is essential for the accurate diagnosis and treatment of cardiovascular diseases and can be applied to conventional ECG recordings taken on paper to obtain VCG.

Authors

E Ramítez, S Ruipérez-Campillo, F Castells, R Casado-Arroyo, J Millet

Submitted

IEEE Computing in Cardiology (50th CinC, 2023)

Date

01.10.2023

MZH Kolk, S Ruipérez-Campillo, B Deb, E Bekkers, CP Allaart, AJ Rogers, ACJ Van Der Lingen, I Isgum, B De Vos, P Clopton, othersOptimising Patient Selection for Primary Prevention ICD Implantation: Utilising Multimodal Machine Learning to Assess Risk of ICD Non-Benefit.Europace

Abstract

Aims Left ventricular ejection fraction (LVEF) is suboptimal as a sole marker for predicting sudden cardiac death (SCD). Machine learning (ML) provides new opportunities for personalized predictions using complex, multimodal data. This study aimed to determine if risk stratification for implantable cardioverter-defibrillator (ICD) implantation can be improved by ML models that combine clinical variables with 12-lead electrocardiograms (ECG) time-series features. Methods and results A multicentre study of 1010 patients (64.9 ± 10.8 years, 26.8% female) with ischaemic, dilated, or non-ischaemic cardiomyopathy, and LVEF ≤ 35% implanted with an ICD between 2007 and 2021 for primary prevention of SCD in two academic hospitals was performed. For each patient, a raw 12-lead, 10-s ECG was obtained within 90 days before ICD implantation, and clinical details were collected. Supervised ML models were trained and validated on a development cohort (n = 550) from Hospital A to predict ICD non-arrhythmic mortality at three-year follow-up (i.e. mortality without prior appropriate ICD-therapy). Model performance was evaluated on an external patient cohort from Hospital B (n = 460). At three-year follow-up, 16.0% of patients had died, with 72.8% meeting criteria for non-arrhythmic mortality. Extreme gradient boosting models identified patients with non-arrhythmic mortality with an area under the receiver operating characteristic curve (AUROC) of 0.90 [95% confidence intervals (CI) 0.80-1.00] during internal validation. In the external cohort, the AUROC was 0.79 (95% CI 0.75-0.84). Conclusions ML models combining ECG time-series features and clinical variables were able to predict non-arrhythmic mortality within three years after device implantation in a primary prevention population, with robust performance in an independent cohort.

Authors

MZH Kolk, S Ruipérez-Campillo, B Deb, E Bekkers, CP Allaart, AJ Rogers, ACJ Van Der Lingen, I Isgum, B De Vos, P Clopton, others

Submitted

Europace

Date

15.09.2023

R Cervigón, S Ruipérez-Campillo, J Millet, F CastellsLoneliness and Heart Rate in Older AdultsIEEE Mediterranean Conference on Medical and Biological Engineering and Computing (2023)

Abstract

The purpose of the study is to better understand the complex nature of loneliness in older adults and the potential contributing factors that may impact their sense of connection and well-being. The study utilized a mixed-methods approach, combining quantitative measures such as heart rate monitoring with qualitative data collected through interviews and surveys. The findings suggest that loneliness in older adults may be influenced by multiple factors, including their level of education, resilience, and empathy and incidence in spontaneous heart rate variations. Results highlight the importance of empathy in promoting social connectedness and reducing feelings of loneliness in older adults, may have implications for developing targeted interventions aimed at reducing loneliness and improving the well-being of older adults.

Authors

R Cervigón, S Ruipérez-Campillo, J Millet, F Castells

Submitted

IEEE Mediterranean Conference on Medical and Biological Engineering and Computing (2023)

Date

14.09.2023

Ece Özkan Elsen^, Thomas M. Sutter^, Yurong Hu, Sebastian Balzer, Julia E. Vogt
^* denotes shared first authorshipM(otion)-mode Based Prediction of Ejection Fraction using EchocardiogramsGCPR 2023

Abstract

Early detection of cardiac dysfunction through routine screening is vital for diagnosing cardiovascular diseases. An important metric of cardiac function is the left ventricular ejection fraction (EF), where lower EF is associated with cardiomyopathy. Echocardiography is a popular diagnostic tool in cardiology, with ultrasound being a low-cost, real-time, and non-ionizing technology. However, human assessment of echocardiograms for calculating EF is time-consuming and expertise-demanding, raising the need for an automated approach. In this work, we propose using the M(otion)-mode of echocardiograms for estimating the EF and classifying cardiomyopathy. We generate multiple artificial M-mode images from a single echocardiogram and combine them using off-the-shelf model architectures. Additionally, we extend contrastive learning (CL) to cardiac imaging to learn meaningful representations from exploiting structures in unlabeled data allowing the model to achieve high accuracy, even with limited annotations. Our experiments show that the supervised setting converges with only ten modes and is comparable to the baseline method while bypassing its cumbersome training process and being computationally much more efficient. Furthermore, CL using M-mode images is helpful for limited data scenarios, such as having labels for only 200 patients, which is common in medical applications.

Authors

Ece Özkan Elsen^*, Thomas M. Sutter^*, Yurong Hu, Sebastian Balzer, Julia E. Vogt
^* denotes shared first authorship

Submitted

GCPR 2023

Date

01.09.2023

Authors

Alexander Immer, Christoph Schultheiss, Julia E. Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx

Submitted

Proceedings of the 40th International Conference on Machine Learning, ICML 2023

Date

04.07.2023

Laura Manduchi^, Moritz Vandenhirtz^, Alain Ryser, Julia E. Vogt
^* denotes shared first authorshipTree Variational AutoencodersICML 2023 Workshop on Structured Probabilistic Inference & Generative Modeling

Abstract

We propose a new generative hierarchical clustering model that learns a flexible tree-based posterior distribution over latent variables. The proposed Tree Variational Autoencoder (TreeVAE) hierarchically divides samples according to their intrinsic characteristics, shedding light on hidden structures in the data. It adapts its architecture to discover the optimal tree for encoding dependencies between latent variables, improving generative performance. We show that TreeVAE uncovers underlying clusters in the data and finds meaningful hierarchical relations between the different groups on several datasets. Due to its generative nature, TreeVAE can generate new samples from the discovered clusters via conditional sampling.

Authors

Laura Manduchi^*, Moritz Vandenhirtz^*, Alain Ryser, Julia E. Vogt
^* denotes shared first authorship

Submitted

ICML 2023 Workshop on Structured Probabilistic Inference & Generative Modeling

Date

30.06.2023

Authors

Paweł Czyż, Frederic Grabowski, Julia E. Vogt, Niko Beerenwinkel, Alexander Marx

Submitted

Arxiv

Date

19.06.2023

Ricards Marcinkevics^, Pamuditha N. Silva^, Anna-Katharina Hankele^, Charlyn Dörnte, Sarah Kadelka, Katharina Csik, Svenja Godbersen, Algera Goga, Lynn Hasenöhrl, Pascale Hirschi, Hasan Kabakci, Mary P. LaPierre, Johanna Mayrhofer, Alexandra C. Title, Xuan Shu, Nouell Baiioud, Sandra Bernal, Laura Dassisti, Mara D. Saenz-de-Juano, Meret Schmidhauser, Giulia Silvestrelli, Simon Z. Ulbrich, Thea J. Ulbrich, Tamara Wyss, Daniel J. Stekhoven, Faisal S. Al-Quaddoomi, Shuqing Yu, Mascha Binder, Christoph Schultheiβ, Claudia Zindel, Christoph Kolling, Jörg Goldhahn, Bahram Kasmapour Seighalani, Polina Zjablovskaja, Frank Hardung, Marc Schuster, Anne Richter, Yi-Ju Huang, Gereon Lauer, Herrad Baurmann, Jun Siong Low, Daniela Vaqueirinho, Sandra Jovic, Luca Piccoli, Sandra Ciesek, Julia E. Vogt, Federica Sallusto, Markus Stoffel^†, Susanne E. Ulbrich^†
^ denotes shared first authorship, ^† denotes shared last authorshipMachine learning analysis of humoral and cellular responses to SARS-CoV-2 infection in young adultsFrontiers in Immunology

Abstract

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) induces B and T cell responses, contributing to virus neutralization. In a cohort of 2,911 young adults, we identified 65 individuals who had an asymptomatic or mildly symptomatic SARS-CoV-2 infection and characterized their humoral and T cell responses to the Spike (S), Nucleocapsid (N) and Membrane (M) proteins. We found that previous infection induced CD4 T cells that vigorously responded to pools of peptides derived from the S and N proteins. By using statistical and machine learning models, we observed that the T cell response highly correlated with a compound titer of antibodies against the Receptor Binding Domain (RBD), S and N. However, while serum antibodies decayed over time, the cellular phenotype of these individuals remained stable over four months. Our computational analysis demonstrates that in young adults, asymptomatic and paucisymptomatic SARS-CoV-2 infections can induce robust and long-lasting CD4 T cell responses that exhibit slower decays than antibody titers. These observations imply that next-generation COVID-19 vaccines should be designed to induce stronger cellular responses to sustain the generation of potent neutralizing antibodies.

Authors

Ricards Marcinkevics^*, Pamuditha N. Silva^*, Anna-Katharina Hankele^*, Charlyn Dörnte, Sarah Kadelka, Katharina Csik, Svenja Godbersen, Algera Goga, Lynn Hasenöhrl, Pascale Hirschi, Hasan Kabakci, Mary P. LaPierre, Johanna Mayrhofer, Alexandra C. Title, Xuan Shu, Nouell Baiioud, Sandra Bernal, Laura Dassisti, Mara D. Saenz-de-Juano, Meret Schmidhauser, Giulia Silvestrelli, Simon Z. Ulbrich, Thea J. Ulbrich, Tamara Wyss, Daniel J. Stekhoven, Faisal S. Al-Quaddoomi, Shuqing Yu, Mascha Binder, Christoph Schultheiβ, Claudia Zindel, Christoph Kolling, Jörg Goldhahn, Bahram Kasmapour Seighalani, Polina Zjablovskaja, Frank Hardung, Marc Schuster, Anne Richter, Yi-Ju Huang, Gereon Lauer, Herrad Baurmann, Jun Siong Low, Daniela Vaqueirinho, Sandra Jovic, Luca Piccoli, Sandra Ciesek, Julia E. Vogt, Federica Sallusto, Markus Stoffel^†, Susanne E. Ulbrich^†
^* denotes shared first authorship, ^† denotes shared last authorship

Submitted

Frontiers in Immunology

Date

29.05.2023

Link DOI Code

Moritz Vandenhirtz, Laura Manduchi, Ricards Marcinkevics, Julia E. VogtSignal Is Harder To Learn Than Bias: Debiasing with Focal LossDomain Generalization Workshop, ICLR 2023

Abstract

Spurious correlations are everywhere. While humans often do not perceive them, neural networks are notorious for learning unwanted associations, also known as biases, instead of the underlying decision rule. As a result, practitioners are often unaware of the biased decision-making of their classifiers. Such a biased model based on spurious correlations might not generalize to unobserved data, leading to unintended, adverse consequences. We propose Signal is Harder (SiH), a variational-autoencoder-based method that simultaneously trains a biased and unbiased classifier using a novel, disentangling reweighting scheme inspired by the focal loss. Using the unbiased classifier, SiH matches or improves upon the performance of state-of-the-art debiasing methods. To improve the interpretability of our technique, we propose a perturbation scheme in the latent space for visualizing the bias that helps practitioners become aware of the sources of spurious correlations.

Authors

Moritz Vandenhirtz, Laura Manduchi, Ricards Marcinkevics, Julia E. Vogt

Submitted

Domain Generalization Workshop, ICLR 2023

Date

04.05.2023

Thomas M. Sutter, Laura Manduchi, Alain Ryser, Julia E. VogtLearning Group Importance using the Differentiable Hypergeometric DistributionICLR 2023

Abstract

Partitioning a set of elements into subsets of a priori unknown sizes is essential in many applications. These subset sizes are rarely explicitly learned - be it the cluster sizes in clustering applications or the number of shared versus independent generative latent factors in weakly-supervised learning. Probability distributions over correct combinations of subset sizes are non-differentiable due to hard constraints, which prohibit gradient-based optimization. In this work, we propose the differentiable hypergeometric distribution. The hypergeometric distribution models the probability of different group sizes based on their relative importance. We introduce reparameterizable gradients to learn the importance between groups and highlight the advantage of explicitly learning the size of subsets in two typical applications: weakly-supervised learning and clustering. In both applications, we outperform previous approaches, which rely on suboptimal heuristics to model the unknown size of groups.

Authors

Thomas M. Sutter, Laura Manduchi, Alain Ryser, Julia E. Vogt

Submitted

ICLR 2023

Date

01.05.2023

Abstract

Humans naturally integrate various senses to understand our surroundings, enabling us to compensate for partially missing sensory input.On the contrary, machine learning models excel at harnessing extensive datasets but face challenges in handling missing data effectively. While utilizing multiple data types provides a more comprehensive perspective, it also raises the likelihood of encountering missing values, underscoring the significance of proper missing data management in machine learning techniques. In this thesis, we advocate for developing machine learning models that emulate the human approach of merging diverse sensory inputs into a unified representation, demonstrating resilience in the face of missing input sources. Generating labels for multiple data types is laborious and often costly, resulting in a scarcity of fully annotated multimodal datasets. On the other hand, multimodal data naturally possesses a form of weak supervision. We understand that these samples describe the same event and assume that certain underlying generative factors are shared among the group members, providing a form of weak guidance. Our thesis focuses on learning from data characterized by weak supervision, delving into the interrelationships among group members. We start by exploring novel techniques for machine learning models capable of processing multimodal inputs while effectively handling missing data. Our emphasis is on variational autoencoders (VAE) for learning from weakly supervised data. We introduce a generalized formulation of probabilistic aggregation functions, designed to overcome the limitations of previous …

Authors

Thomas M. Sutter

Date

30.09.2022

Johannes Pohl, Alain Ryser, Janne Marieke Veerbeek, Geert Verheyden, Julia Elisabeth Vogt, Andreas Rüdiger Luft, Chris Awai EasthopeClassification of functional and non-functional arm use by inertial measurement units in individuals with upper limb impairment after strokeFrontiers in Physiology

Abstract

Background: Arm use metrics derived from wrist-mounted movement sensors are widely used to quantify the upper limb performance in real-life conditions of individuals with stroke throughout motor recovery. The calculation of real-world use metrics, such as arm use duration and laterality preferences, relies on accurately identifying functional movements. Hence, classifying upper limb activity into functional and non-functional classes is paramount. Acceleration thresholds are conventionally used to distinguish these classes. However, these methods are challenged by the high inter and intra-individual variability of movement patterns. In this study, we developed and validated a machine learning classifier for this task and compared it to methods using conventional and optimal thresholds.Methods: Individuals after stroke were video-recorded in their home environment performing semi-naturalistic daily tasks while wearing wrist-mounted inertial measurement units. Data were labeled frame-by-frame following the Taxonomy of Functional Upper Limb Motion definitions, excluding whole-body movements, and sequenced into 1-s epochs. Actigraph counts were computed, and an optimal threshold for functional movement was determined by receiver operating characteristic curve analyses on group and individual levels. A logistic regression classifier was trained on the same labels using time and frequency domain features. Performance measures were compared between all classification methods.Results: Video data (6.5 h) of 14 individuals with mild-to-severe upper limb impairment were labeled. Optimal activity count thresholds were ≥20.1 for the affected side and ≥38.6 for the unaffected side and showed high predictive power with an area under the curve (95% CI) of 0.88 (0.87,0.89) and 0.86 (0.85, 0.87), respectively. A classification accuracy of around 80% was equivalent to the optimal threshold and machine learning methods and outperformed the conventional threshold by ∼10%. Optimal thresholds and machine learning methods showed superior specificity (75–82%) to conventional thresholds (58–66%) across unilateral and bilateral activities.Conclusion: This work compares the validity of methods classifying stroke survivors’ real-life arm activities measured by wrist-worn sensors excluding whole-body movements. The determined optimal thresholds and machine learning classifiers achieved an equivalent accuracy and higher specificity than conventional thresholds. Our open-sourced classifier or optimal thresholds should be used to specify the intensity and duration of arm use.

Authors

Johannes Pohl, Alain Ryser, Janne Marieke Veerbeek, Geert Verheyden, Julia Elisabeth Vogt, Andreas Rüdiger Luft, Chris Awai Easthope

Submitted

Frontiers in Physiology

Date

28.09.2022

Johannes Pohl, Alain Ryser, Janne Marieke Veerbeek, Geert Verheyden, Julia Elisabeth Vogt, Andreas Rüdiger Luft, Chris Awai EasthopeAccuracy of gait and posture classification using movement sensors in individuals with mobility impairment after strokeFrontiers in Physiology

Abstract

Background: Stroke leads to motor impairment which reduces physical activity, negatively affects social participation, and increases the risk of secondary cardiovascular events. Continuous monitoring of physical activity with motion sensors is promising to allow the prescription of tailored treatments in a timely manner. Accurate classification of gait activities and body posture is necessary to extract actionable information for outcome measures from unstructured motion data. We here develop and validate a solution for various sensor configurations specifically for a stroke population.Methods: Video and movement sensor data (locations: wrists, ankles, and chest) were collected from fourteen stroke survivors with motor impairment who performed real-life activities in their home environment. Video data were labeled for five classes of gait and body postures and three classes of transitions that served as ground truth. We trained support vector machine (SVM), logistic regression (LR), and k-nearest neighbor (kNN) models to identify gait bouts only or gait and posture. Model performance was assessed by the nested leave-one-subject-out protocol and compared across five different sensor placement configurations.Results: Our method achieved very good performance when predicting real-life gait versus non-gait (Gait classification) with an accuracy between 85% and 93% across sensor configurations, using SVM and LR modeling. On the much more challenging task of discriminating between the body postures lying, sitting, and standing as well as walking, and stair ascent/descent (Gait and postures classification), our method achieves accuracies between 80% and 86% with at least one ankle and wrist sensor attached unilaterally. The Gait and postures classification performance between SVM and LR was equivalent but superior to kNN.Conclusion: This work presents a comparison of performance when classifying Gait and body postures in post-stroke individuals with different sensor configurations, which provide options for subsequent outcome evaluation. We achieved accurate classification of gait and postures performed in a real-life setting by individuals with a wide range of motor impairments due to stroke. This validated classifier will hopefully prove a useful resource to researchers and clinicians in the increasingly important field of digital health in the form of remote movement monitoring using motion sensors.

Authors

Johannes Pohl, Alain Ryser, Janne Marieke Veerbeek, Geert Verheyden, Julia Elisabeth Vogt, Andreas Rüdiger Luft, Chris Awai Easthope

Submitted

Frontiers in Physiology

Date

26.09.2022

Hanna Ragnarsdottir^, Laura Manduchi^, Holger Michel, Fabian Laumer, Sven Wellmann, Ece Özkan Elsen^†, Julia E. Vogt^†
^* denotes shared first authorship, ^† denotes shared last authorshipInterpretable Prediction of Pulmonary Hypertension in Newborns Using EchocardiogramsDAGM German Conference on Pattern Recognition

Abstract

Pulmonary hypertension (PH) in newborns and infants is a complex condition associated with several pulmonary, cardiac, and systemic diseases contributing to morbidity and mortality. Therefore, accurate and early detection of PH is crucial for successful management. Using echocardiography, the primary diagnostic tool in pediatrics, human assessment is both time-consuming and expertise-demanding, raising the need for an automated approach. In this work, we present an interpretable multi-view video-based deep learning approach to predict PH for a cohort of 194 newborns using echocardiograms. We use spatio-temporal convolutional architectures for the prediction of PH from each view, and aggregate the predictions of the different views using majority voting. To the best of our knowledge, this is the first work for an automated assessment of PH in newborns using echocardiograms. Our results show a mean F1-score of 0.84 for severity prediction and 0.92 for binary detection using 10-fold cross-validation. We complement our predictions with saliency maps and show that the learned model focuses on clinically relevant cardiac structures, motivating its usage in clinical practice.

Authors

Hanna Ragnarsdottir^*, Laura Manduchi^*, Holger Michel, Fabian Laumer, Sven Wellmann, Ece Özkan Elsen^†, Julia E. Vogt^†
^* denotes shared first authorship, ^† denotes shared last authorship

Submitted

DAGM German Conference on Pattern Recognition

Date

20.09.2022

Thomas M. Sutter^, Alain Ryser^, Joram Liebeskind, Julia E Vogt
^* denotes shared first authorshipDifferentiable Set PartitioningICML 2023 Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators

Abstract

Partitioning a set of elements into an unknown number of mutually exclusive subsets is essential in many machine learning problems. However, assigning elements, such as samples in a dataset or neurons in a network layer, to an unknown and discrete number of subsets is inherently non-differentiable, prohibiting end-to-end gradient-based optimization of parameters. We overcome this limitation by proposing a novel two-step method for inferring partitions, which allows its usage in variational inference tasks. This new approach enables reparameterized gradients with respect to the parameters of the new random partition model. Our method works by inferring the number of elements per subset and, second, by filling these subsets in a learned order. We highlight the versatility of our general-purpose approach on two different challenging experiments: multitask learning and inference of shared and independent generative factors under weak supervision.

Authors

Thomas M. Sutter^*, Alain Ryser^*, Joram Liebeskind, Julia E Vogt
^* denotes shared first authorship

Submitted

ICML 2023 Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators

Date

17.09.2022

Z Yuan, MJ O’Melia, K Li, J Lyu, F Zhou, P Jothikumar, NA Rohner, MP Manspeaker, DM Francis, K Bai, C Ge, MN Rushdi, L Chingozha, S Ruipérez-Campillo, H Lu, SN Thomas, C ZhuTumor microenvironments impair T cell receptor affinity and functionbioRxiv

Abstract

CD8+ T cells underpin effective anti-tumor immune responses in melanoma; however, their functions are attenuated due to various immunosuppressive factors in the tumor microenvironment (TME), resulting in disease progression. T cell function is elicited by the T cell receptor (TCR), which recognizes antigen peptide-major histocompatibility complex (pMHC) expressed on tumor cells via direct physical contact, i.e., two-dimensional (2D) interaction. TCR–pMHC 2D affinity plays a central role in antigen recognition and discrimination, and is sensitive to both the conditions of the T cell and the microenvironment in which it resides. Herein, we demonstrate that CD8+ T cells residing in TME have lower 2D TCR–pMHC bimolecular affinity and TCR–pMHC–CD8 trimolecular avidity, pull fewer TCR–pMHC bonds by endogenous forces, flux lower level of intracellular calcium in response to antigen stimulation, exhibit impaired in vivo activation, and show diminished anti-tumor effector function. These detrimental effects are localized in the tumor and tumor draining lymph node (TdLN), and affect both antigen-inexperienced and antigen-experienced CD8+ T cells irrespective of their TCR specificities. These findings implicate impaired antigen recognition as a mechanism of T cell dysfunction in the TME.

Authors

Z Yuan, MJ O’Melia, K Li, J Lyu, F Zhou, P Jothikumar, NA Rohner, MP Manspeaker, DM Francis, K Bai, C Ge, MN Rushdi, L Chingozha, S Ruipérez-Campillo, H Lu, SN Thomas, C Zhu

Submitted

bioRxiv

Date

13.09.2022

ISegarra, S Ruipérez-Campillo, FCastells, J MilletNovel Method for Orientation-Independent Analysis in Equi-Spaced Multi-Electrode ArraysIEEE Computing in Cardiology (49th CinC, 2022)

Abstract

The diagnosis and treatment of cardiac arrhythmias relies on catheter recordings, that may be inefficient because of the continued use of the bipolar processing and analysis techniques of traditional catheters, missing the potential of the novel matrix catheters. This results in the need of more processing of the signals and longer cardiac scans to obtain accurate information about the state of the tissue being analysed. This study proposes a new clique configuration to compute omnipolar EGM (oEGM) in multi-electrode array catheters to obtain parameters of interest in a more robust and accurate manner. Numerous simulations with varying input parameters are designed to emulate the propagation of electrical activity on the cardiac tissue surface captured by the catheter and characterise the differences between the classic method of omnipolar analysis (triangular clique) and our proposed new method (cross clique). The results show that the cross clique is more robust to variations in the direction of wave propagation, and more accurate in the estimation of the local activation time (LAT).

Authors

ISegarra, S Ruipérez-Campillo, FCastells, J Millet

Submitted

IEEE Computing in Cardiology (49th CinC, 2022)

Date

04.09.2022

RC Klein, RE van Lieshout, MZH Kolk, K Geijtenbeek, R Vos, S Ruipérez-Campillo, R Feng, B Deb, P Ganesan, RE Knops, I Isgum, SM Narayan, E Bekkers, B Vos FVY TongWeakly-Supervised Deep Learning for Left Ventricle Fibrosis Segmentation in Cardiac MRI Using Image-Level LabelsIEEE Computing in Cardiology (49th CinC, 2022)

Abstract

Automated segmentation of myocardial fibrosis in late gadolinium enhancement (LGE) cardiac MRI (CMR) has the potential to improve efficiency and precision of diagnosis and treatment of cardiomyopathies. However, state-of-the-art Deep Learning approaches require manual pixel-level annotations. Using weaker labels can greatly reduce manual annotation time and expedite dataset curation, which is why we propose fibrosis segmentation methods using either slice-level or stack-level fibrosis labels. 5759 short-axis LGE CMR image slices were retrospectively obtained from 482 patients. U-Nets with slice-level and stack-level supervision are trained with 446 weakly-labeled patients by making use of a myocardium segmentation U-Net and fibrosis classification Dilated Residual Networks (DRN). For comparison, a U-Net is trained with pixel-level supervision using a training set of 81 patients. On the proprietary test set of 24 patients, pixel-level, slice-level and stack-level supervision reach Dice scores of 0.74, 0.70 and 0.70, while on the external Emidec dataset of 100 patients Dice scores of 0.55, 0.61 and 0.52 were obtained. Results indicate that using larger weakly-annotated datasets can approach the performance of methods using pixel-level annotated datasets and potentially improve generalization to external datasets.

Authors

RC Klein, RE van Lieshout, MZH Kolk, K Geijtenbeek, R Vos, S Ruipérez-Campillo, R Feng, B Deb, P Ganesan, RE Knops, I Isgum, SM Narayan, E Bekkers, B Vos FVY Tong

Submitted

IEEE Computing in Cardiology (49th CinC, 2022)

Date

04.09.2022

FE van Lieshout, RC Klein, MZH Kolk, K van Geijtenbeek, R Vos, S Ruipérez-Campillo, R Feng, B Deb, P Ganesan, RE Knops, I Isgum, SM Narayan, E Bekkers, B Vos, FVY TjongDeep Learning for Ventricular Arrhythmia Prediction Using Fibrosis Segmentations on Cardiac MRI DataIEEE Computing in Cardiology (49th CinC, 2022)

Abstract

Many patients at high risk of life-threatening ventricular arrhythmias (VA) and sudden cardiac death (SCD) who received an implantable cardioverter defibrillator (ICD), never receive appropriate device therapy. The presence of fibrosis on LGE CMR imaging is shown to be associated with increased risk of VA. Therefore, there is a strong need for both automatic segmentation and quantification of cardiac fibrosis as well as better risk stratification for SCD. This study first presents a novel two-stage deep learning network for the segmentation of left ventricle myocardium and fibrosis on LGE CMR images. Secondly it aims to effectively predict device therapy in ICD patients by using a graph neural network approach which incorporates both myocardium and fibrosis features as well as the left ventricle geometry. Our segmentation network outperforms previous state-of-the-art methods on 2D CMR data, reaching a Dice score of 0.82 and 0.77 on myocardium and fibrosis segmentation, respectively. The ICD therapy prediction network reaches an AUC of 0.60 while using only CMR data and outperforms baseline methods based on current guideline markers for ICD implantation. This work lays a strong basis for future research on improved risk stratification for VA and SCD.

Authors

FE van Lieshout, RC Klein, MZH Kolk, K van Geijtenbeek, R Vos, S Ruipérez-Campillo, R Feng, B Deb, P Ganesan, RE Knops, I Isgum, SM Narayan, E Bekkers, B Vos, FVY Tjong

Submitted

IEEE Computing in Cardiology (49th CinC, 2022)

Date

04.09.2022

S Ruipérez-Campillo, J Millet, FCastellsClassification of Atrial Tachycardia Types Using Dimensional Transforms of ECG Signals and Machine LearningIEEE Computing in Cardiology (49th CinC, 2022)

Abstract

Accurate non-invasive diagnoses in the context of cardiac diseases are problems that hitherto remain unresolved. We propose an unsupervised classification of atrial flutter (AFL) using dimensional transforms of ECG signals in high dimensional vector spaces. A mathematical model is used to generate synthetic signals based on clinical AFL signals, and hierarchical clustering analysis and novel machine learning (ML) methods are designed for the un-supervised classification. Metrics and accuracy parameters are created to assess the performance of the model, proving the power of this novel approach for the diagnosis of AFL from ECG using innovative AI algorithms.

Authors

S Ruipérez-Campillo, J Millet, FCastells

Submitted

IEEE Computing in Cardiology (49th CinC, 2022)

Date

04.09.2022

R Cervigón, E Franco, S Ruipérez-Campillo, C Lozano, F Castells, J MorenoAutocorrelation Function for Predicting Arrhythmic Recurrences in Patients Undergoing Persistent Atrial Fibrillation AblationIEEE Computing in Cardiology (49th CinC, 2022)

Abstract

Persistent atrial fibrillation ablation has a high recurrence rate. In this work, we performed an analysis of bipolar intracavitary signals obtained with a conventional 24-pole diagnostic catheter (Woven Orbiter) placed in the right atrium and coronary sinus in a cohort of patients with persistent atrial fibrillation undergoing ablation to detect features predictive of acute procedural success (conversion to sinus rhythm during ablation) and the occurrence of recurrences. The goal is to arrive at a quantitative description of the degree of randomness of the atrial response in atrial fibrillation and to demonstrate the presence of hidden periodic components. This was done by the determination of the autocorrelation function. Results showed that higher correlation in relative maximum peaks, and a lower dominant atrial frequency (greater distance between relative amplitude maxima) may be associated with a greater likelihood of achieving reversion to sinus rhythm and lower probability of recurrences. A larger study is needed to draw conclusions.

Authors

R Cervigón, E Franco, S Ruipérez-Campillo, C Lozano, F Castells, J Moreno

Submitted

IEEE Computing in Cardiology (49th CinC, 2022)

Date

04.09.2022

ML Cardo, A Chulián, S Ruipérez-Campillo, J Millet, F Castells, R CervigónECG Analysis to Study Social Connections in Older Cardiac PatientsIEEE Computing in Cardiology (49th CinC, 2022)

Abstract

Loneliness in older adults is associated with functional decline, depression and even death. Given the prevalence of loneliness, the aim of this study was to examine the association between loneliness and cardiac biomarkers in older people that attend to cardiology consultation. The results showed that loneliness was more prevalent in women than in men, and it was associated with marital status too. ECG recording were analyzed and QT interval and T-wave length showed higher values in people suffering from loneliness, as well as higher cardiac frequency, where the presence of meaning in life be a protective factor. Studies with a larger sample size are needed, but these results appear to show a relationship between biomarkers and mental state.

Authors

ML Cardo, A Chulián, S Ruipérez-Campillo, J Millet, F Castells, R Cervigón

Submitted

IEEE Computing in Cardiology (49th CinC, 2022)

Date

04.09.2022

Alain Ryser, Laura Manduchi, Fabian Laumer, Holger Michel, Sven Wellmann, Julia E. VogtAnomaly Detection in Echocardiograms with Dynamic Variational Trajectory ModelsThe Seventh Machine Learning for Healthcare Conference, MLHC 2022

Abstract

We propose a novel anomaly detection method for echocardiogram videos. The introduced method takes advantage of the periodic nature of the heart cycle to learn three variants of a variational latent trajectory model (TVAE). While the first two variants (TVAE-C and TVAE-R) model strict periodic movements of the heart, the third (TVAE-S) is more general and allows shifts in the spatial representation throughout the video. All models are trained on the healthy samples of a novel in-house dataset of infant echocardiogram videos consisting of multiple chamber views to learn a normative prior of the healthy population. During inference, maximum a posteriori (MAP) based anomaly detection is performed to detect out-of-distribution samples in our dataset. The proposed method reliably identifies severe congenital heart defects, such as Ebstein’s Anomaly or Shone-complex. Moreover, it achieves superior performance over MAP-based anomaly detection with standard variational autoencoders when detecting pulmonary hypertension and right ventricular dilation. Finally, we demonstrate that the proposed method enables interpretable explanations of its output through heatmaps highlighting the regions corresponding to anomalous heart structures.

Authors

Alain Ryser, Laura Manduchi, Fabian Laumer, Holger Michel, Sven Wellmann, Julia E. Vogt

Submitted

The Seventh Machine Learning for Healthcare Conference, MLHC 2022

Date

05.08.2022

Ricards Marcinkevics, Ece Özkan Elsen, Julia E. VogtDebiasing Deep Chest X-Ray Classifiers using Intra- and Post-processing MethodsThe Seventh Machine Learning for Healthcare Conference, MLHC 2022

Abstract

Deep neural networks for image-based screening and computer-aided diagnosis have achieved expert-level performance on various medical imaging modalities, including chest radiographs. Recently, several works have indicated that these state-of-the-art classifiers can be biased with respect to sensitive patient attributes, such as race or gender, leading to growing concerns about demographic disparities and discrimination resulting from algorithmic and model-based decision-making in healthcare. Fair machine learning has focused on mitigating such biases against disadvantaged or marginalised groups, mainly concentrating on tabular data or natural images. This work presents two novel intra-processing techniques based on fine-tuning and pruning an already-trained neural network. These methods are simple yet effective and can be readily applied post hoc in a setting where the protected attribute is unknown during the model development and test time. In addition, we compare several intra- and post-processing approaches applied to debiasing deep chest X-ray classifiers. To the best of our knowledge, this is one of the first efforts studying debiasing methods on chest radiographs. Our results suggest that the considered approaches successfully mitigate biases in fully connected and convolutional neural networks offering stable performance under various settings. The discussed methods can help achieve group fairness of deep medical image classifiers when deploying them in domains with different fairness considerations and constraints.

Authors

Ricards Marcinkevics, Ece Özkan Elsen, Julia E. Vogt

Submitted

The Seventh Machine Learning for Healthcare Conference, MLHC 2022

Date

05.08.2022

Ugne Klimiene^, Ricards Marcinkevics^, Patricia Reis Wolfertstetter, Ece Özkan Elsen, Alyssia Paschke, David Niederberger, Sven Wellmann, Christian Knorr, Julia E Vogt
^* denotes shared first authorshipMultiview Concept Bottleneck Models Applied to Diagnosing Pediatric AppendicitisOral spotlight at the 2nd Workshop on Interpretable Machine Learning in Healthcare (IMLH), ICML 2022

Abstract

Arguably, interpretability is one of the guiding principles behind the development of machine-learning-based healthcare decision support tools and computer-aided diagnosis systems. There has been a renewed interest in interpretable classification based on high-level concepts, including, among other model classes, the re-exploration of concept bottleneck models. By their nature, medical diagnosis, patient management, and monitoring require the assessment of multiple views and modalities to form a holistic representation of the patient's state. For instance, in ultrasound imaging, a region of interest might be registered from multiple views that are informative about different sets of clinically relevant features. Motivated by this, we extend the classical concept bottleneck model to the multiview classification setting by representation fusion across the views. We apply our multiview concept bottleneck model to the dataset of ultrasound images acquired from a cohort of pediatric patients with suspected appendicitis to predict the disease. The results suggest that auxiliary supervision from the concepts and aggregation across multiple views help develop more accurate and interpretable classifiers.

Authors

Ugne Klimiene^*, Ricards Marcinkevics^*, Patricia Reis Wolfertstetter, Ece Özkan Elsen, Alyssia Paschke, David Niederberger, Sven Wellmann, Christian Knorr, Julia E Vogt
^* denotes shared first authorship

Submitted

Oral spotlight at the 2nd Workshop on Interpretable Machine Learning in Healthcare (IMLH), ICML 2022

Date

23.07.2022

Alain Ryser, Laura Manduchi, Fabian Laumer, Holger Michel, Sven Wellmann, Julia E. VogtInterpretable Anomaly Detection in Echocardiograms with Dynamic Variational Trajectory ModelsPoster at the 2nd Workshop on Interpretable Machine Learning in Healthcare (IMLH), ICML 2022

Abstract

We propose a novel anomaly detection method for echocardiogram videos. The introduced method takes advantage of the periodic nature of the heart cycle to learn different variants of a variational latent trajectory model (TVAE). The models are trained on the healthy samples of an in-house dataset of infant echocardiogram videos consisting of multiple chamber views to learn a normative prior of the healthy population. During inference, maximum a posteriori (MAP) based anomaly detection is performed to detect out-ofdistribution samples in our dataset. The proposed method reliably identifies severe congenital heart defects, such as Ebstein’s Anomaly or Shonecomplex. Moreover, it achieves superior performance over MAP-based anomaly detection with standard variational autoencoders on the task of detecting pulmonary hypertension and right ventricular dilation. Finally, we demonstrate that the proposed method provides interpretable explanations of its output through heatmaps which highlight the regions corresponding to anomalous heart structures.

Authors

Alain Ryser, Laura Manduchi, Fabian Laumer, Holger Michel, Sven Wellmann, Julia E. Vogt

Submitted

Poster at the 2nd Workshop on Interpretable Machine Learning in Healthcare (IMLH), ICML 2022

Date

23.07.2022

Abstract

The algorithmic independence of conditionals, which postulates that the causal mechanism is algorithmically independent of the cause, has recently inspired many highly successful approaches to distinguish cause from effect given only observational data. Most popular among these is the idea to approximate algorithmic independence via two-part Minimum Description Length (MDL). Although intuitively sensible, the link between the original postulate and practical two-part MDL encodings is left vague. In this work, we close this gap by deriving a two-part formulation of this postulate, in terms of Kolmogorov complexity, which directly links to practical MDL encodings. To close the cycle, we prove that this formulation leads on expectation to the same inference result as the original postulate.

Authors

Alexander Marx, Jilles Vreeken

Submitted

AAAI'22 Workshop on Information-Theoretic Methods for Causal Inference and Discovery (ITCI’22)

Date

05.05.2022

Abstract

The recent introduction of portable, low-field MRI (LF-MRI) into the clinical setting has the potential to transform neuroimaging. However, LF-MRI is limited by lower resolution and signal-to-noise ratio, leading to incomplete characterization of brain regions. To address this challenge, recent advances in machine learning facilitate the synthesis of higher resolution images derived from one or multiple lower resolution scans. Here, we report the extension of a machine learning super-resolution (SR) algorithm to synthesize 1 mm isotropic MPRAGElike scans from LF-MRI T1-weighted and T2-weighted sequences. Our initial results on a paired dataset of LF and high-field (HF, 1.5T-3T) clinical scans show that: (i) application of available automated segmentation tools directly to LF-MRI images falters; but (ii) segmentation tools succeed when applied to SR images with high correlation to gold standard measurements from HF-MRI (e.g., r = 0.85 for hippocampal volume, r = 0.84 for the thalamus, r = 0.92 for the whole cerebrum). This work demonstrates proof-of-principle postprocessing image enhancement from lower resolution LF-MRI sequences. These results lay the foundation for future work to enhance the detection of normal and abnormal image findings at LF and ultimately improve the diagnostic performance of LF-MRI. Our tools are publicly available on FreeSurfer (surfer.nmr.mgh.harvard.edu/)

Authors

Juan Eugenio Iglesias, Riana Schleicher, Sonia Laguna, Benjamin Billot, Pamela Schaefer, Brenna McKaig, Joshua N Goldstein, Kevin N Sheth, Matthew S Rosen, W Taylor Kimberly

Submitted

arXiv preprint arXiv:2202.03564

Date

Abstract

Authors

Ricards Marcinkevics, Julia E. Vogt

Submitted

Interpretable Inductive Biases and Physically Structured Learning Workshop, NeurIPS 2020

Date

01.11.2020

Thomas M. Sutter, Imant Daunhawer, Julia E. VogtMultimodal Generative Learning Utilizing Jensen-Shannon-DivergenceNeurIPS 2020

Abstract

Learning from different data types is a long-standing goal in machine learning research, as multiple information sources co-occur when describing natural phenomena. However, existing generative models that approximate a multimodal ELBO rely on difficult or inefficient training schemes to learn a joint distribution and the dependencies between modalities. In this work, we propose a novel, efficient objective function that utilizes the Jensen-Shannon divergence for multiple distributions. It simultaneously approximates the unimodal and joint multimodal posteriors directly via a dynamic prior. In addition, we theoretically prove that the new multimodal JS-divergence (mmJSD) objective optimizes an ELBO. In extensive experiments, we demonstrate the advantage of the proposed mmJSD model compared to previous work in unsupervised, generative learning tasks.

Authors

Thomas M. Sutter, Imant Daunhawer, Julia E. Vogt

Submitted

NeurIPS 2020

Date

22.10.2020

Varaha Karthik Pattisapu, Imant Daunhawer, Thomas Weikert, Alexander Sauter, Bram Stieltjes, Julia E. VogtPET-guided Attention Network for Segmentation of Lung Tumors from PET/CT imagesGCPR 2020

Abstract

PET/CT imaging is the gold standard for the diagnosis and staging of lung cancer. However, especially in healthcare systems with limited resources, costly PET/CT images are often not readily available. Conventional machine learning models either process CT or PET/CT images but not both. Models designed for PET/CT images are hence restricted by the number of PET images, such that they are unable to additionally leverage CT-only data. In this work, we apply the concept of visual soft attention to efficiently learn a model for lung cancer segmentation from only a small fraction of PET/CT scans and a larger pool of CT-only scans. We show that our model is capable of jointly processing PET/CT as well as CT-only images, which performs on par with the respective baselines whether or not PET images are available at test time. We then demonstrate that the model learns efficiently from only a few PET/CT scans in a setting where mostly CT-only data is available, unlike conventional models.

Authors

Varaha Karthik Pattisapu, Imant Daunhawer, Thomas Weikert, Alexander Sauter, Bram Stieltjes, Julia E. Vogt

Submitted

GCPR 2020

Date

12.10.2020

S Ruipérez-Campillo, S Castrejón, M Martínez, R Cervigón, O Meste, JL Merino, J Millet, F CastellsSlow conduction regions as a valuable vectorcardiographic parameter for the non-invasive identification of atrial flutter typesIEEE Computing in Cardiology (47th CinC, 2020)

Abstract

The objective of this study is to non-invasively characterise a variety of atrial flutter (AFL) types, defined by a maroreentrant circuit. A vectorcardiographic approach is proposed to compare atrial macroreentrant circuits. Vectorcardiogram (VCG) arechetypes are computed so that parameters such as similarity among loops can be calculated. The methodology was employed in a set of artificial VCGs created from a computational simulation based on a mathematical model and in signals from real patients. Adenosine was used to block the ventricular contribution to the ECG signal, later transformed to a VCG analysed from different perspectives. Results demonstrate a high similarity for cases belonging to a group with its archetype in synthetic and real cases. Slow conduction velocity regions were found to be very well represented in VCGs, in accordance with AFL mechanisms and its importance when characterising atrial macroreentries. The conclusion is that our methodology allows differentiation between the most recurrent types of AFL through the analysis of its VCG representation, predicting the presence of slow conduction regions along the macroreentry. This can be very useful when planning in advance the ablation procedure.

Authors

S Ruipérez-Campillo, S Castrejón, M Martínez, R Cervigón, O Meste, JL Merino, J Millet, F Castells

Submitted

IEEE Computing in Cardiology (47th CinC, 2020)

Date

13.09.2020

Imant Daunhawer, Thomas M. Sutter, Ricards Marcinkevics, Julia E. VogtSelf-supervised Disentanglement of Modality-specific and Shared Factors Improves Multimodal Generative ModelsGCPR 2020

Abstract

Multimodal generative models learn a joint distribution over multiple modalities and thus have the potential to learn richer representations than unimodal models. However, current approaches are either inefficient in dealing with more than two modalities or fail to capture both modality-specific and shared variations. We introduce a new multimodal generative model that integrates both modality-specific and shared factors and aggregates shared information across any subset of modalities efficiently. Our method partitions the latent space into disjoint subspaces for modality-specific and shared factors and learns to disentangle these in a purely self-supervised manner. In extensive experiments, we show improvements in representation learning and generative performance compared to previous methods and showcase the disentanglement capabilities.

Authors

Imant Daunhawer, Thomas M. Sutter, Ricards Marcinkevics, Julia E. Vogt

Submitted

GCPR 2020

Date

10.09.2020

Abstract

Self-organizing maps (SOMs) have been widely used as a means to visualize latent structure in large amounts of heterogeneous data, in particular as a clustering method. Relatively little work, however, has focused on combining SOMs with deep generative networks for modeling health states, which arise for example in the intensive care unit (ICU). We present Temporal PSOM, a novel neural network architecture that jointly trains a Variational Autoencoder for feature extraction and a probabilistic version of SOM to achieve an interpretable discrete representation of patient health states in the ICU. Experiments on the publicly available eICU data set show significant improvements over state-of-the-art methods in terms of cluster enrichment for current APACHE physiology scores as well as prediction of future physiology states.

Authors

Laura Manduchi, Matthias Hueser, Gunnar Raetsch, Vincent Fortuin

Submitted

ML4H Workshop, NeurIPS 2019

Date

15.12.2019

Thomas Sutter, Imant Daunhawer, Julia E. VogtMultimodal Generative Learning Utilizing Jensen-Shannon-DivergenceVisually Grounded Interaction and Language Workshop, NeurIPS 2019

Abstract

Learning from different data types is a long standing goal in machine learning research, as multiple information sources co-occur when describing natural phenomena. Existing generative models that try to approximate a multimodal ELBO rely on difficult training schemes to handle the intermodality dependencies, as well as the approximation of the joint representation in case of missing data. In this work, we propose an ELBO for multimodal data which learns the unimodal and joint multimodal posterior approximation functions directly via a dynamic prior. We show that this ELBO is directly derived from a variational inference setting for multiple data types, resulting in a divergence term which is the Jensen-Shannon divergence for multiple distributions. We compare the proposed multimodal JS-divergence (mmJSD) model to state-of-the-art methods and show promising results using our model in unsupervised, generative learning using a multimodal VAE on two different datasets.

Authors

Thomas Sutter, Imant Daunhawer, Julia E. Vogt

Submitted

Visually Grounded Interaction and Language Workshop, NeurIPS 2019

Date

12.12.2019

Imant Daunhawer, Thomas Sutter, Julia E. VogtImproving Multimodal Generative Models with Disentangled Latent PartitionsBayesian Deep Learning Workshop, NeurIPS 2019

Abstract

Multimodal generative models learn a joint distribution of data from different modalities---a task which arguably benefits from the disentanglement of modality-specific and modality-invariant information. We propose a factorized latent variable model that learns named disentanglement on multimodal data without additional supervision. We demonstrate the disentanglement capabilities on simulated data, and show that disentangled representations can improve the conditional generation of missing modalities without sacrificing unconditional generation.

Authors

Imant Daunhawer, Thomas Sutter, Julia E. Vogt

Submitted

Bayesian Deep Learning Workshop, NeurIPS 2019

Date

Authors

Lisa Ruby, Sergio J. Sanabria, Katharina Martini, Konstantin J. Dedes, Denise Vorburger, Ece Özkan Elsen, Thomas Frauenfelder, Orcun Goksel, Marga B. Rominger

Submitted

Investigative Radiology

Date

30.06.2019

Sandhya Prabhakaran, Julia E. VogtBayesian Clustering For HIV1 Protease Inhibitor Contact MapsArtificial Intelligence in Medicine (AIME), Springer Lecture Notes in Artificial Intelligence, 2019

Abstract

We present a probabilistic model for clustering which enables the modeling of overlapping clusters where objects are only available as pairwise distances. Examples of such distance data are genomic string alignments, or protein contact maps. In our clustering model, an object has the freedom to belong to one or more clusters at the same time. By using an IBP process prior, there is no need to explicitly fix the number of clusters, as well as the number of overlapping clusters, in advance. In this paper, we demonstrate the utility of our model using distance data obtained from HIV1 protease inhibitor contact maps.

Authors

Sandhya Prabhakaran, Julia E. Vogt

Submitted

Artificial Intelligence in Medicine (AIME), Springer Lecture Notes in Artificial Intelligence, 2019

Date

29.05.2019

Stefan G. Stark, Stephanie L. Hyland, Melanie F. Pradier, Kjong Lehmann, Andreas Wicki, Fernando Perez Cruz, Julia E. Vogt, Gunnar RätschUnsupervised Extraction of Phenotypes from Cancer Clinical Notes for Association StudiesArxiv preprint

Abstract

The recent adoption of Electronic Health Records (EHRs) by health care providers has introduced an important source of data that provides detailed and highly specific insights into patient phenotypes over large cohorts. These datasets, in combination with machine learning and statistical approaches, generate new opportunities for research and clinical care. However, many methods require the patient representations to be in structured formats, while the information in the EHR is often locked in unstructured texts designed for human readability. In this work, we develop the methodology to automatically extract clinical features from clinical narratives from large EHR corpora without the need for prior knowledge. We consider medical terms and sentences appearing in clinical narratives as atomic information units. We propose an efficient clustering strategy suitable for the analysis of large text corpora and to utilize the clusters to represent information about the patient compactly. To demonstrate the utility of our approach, we perform an association study of clinical features with somatic mutation profiles from 4,007 cancer patients and their tumors. We apply the proposed algorithm to a dataset consisting of about 65 thousand documents with a total of about 3.2 million sentences. We identify 341 significant statistical associations between the presence of somatic mutations and clinical features. We annotated these associations according to their novelty, and report several known associations. We also propose 32 testable hypotheses where the underlying biological mechanism does not appear to be known but plausible. These results illustrate that the automated discovery of clinical features is possible and the joint analysis of clinical and genetic datasets can generate appealing new hypotheses.

Authors

Stefan G. Stark, Stephanie L. Hyland, Melanie F. Pradier, Kjong Lehmann, Andreas Wicki, Fernando Perez Cruz, Julia E. Vogt, Gunnar Rätsch

Submitted

Arxiv preprint

Date

02.05.2019

Melanie F. Pradier, Stephanie L. Hyland, Stefan G. Stark, Kjong Lehmann, Julia E. Vogt, Fernando Perez-Cruz, Gunnar RätschA Bayesian Nonparametric Approach to Discover Clinico-Genetic Associations across Cancer TypesBiorxiv preprint

Abstract

Motivation: Personalized medicine aims at combining genetic, clinical, and environmental data to improve medical diagnosis and disease treatment, tailored to each patient. This paper presents a Bayesian nonparametric (BNP) approach to identify genetic associations with clinical/environmental features in cancer. We propose an unsupervised approach to generate data-driven hypotheses and bring potentially novel insights about cancer biology. Our model combines somatic mutation information at gene-level with features extracted from the Electronic Health Record. We propose a hierarchical approach, the hierarchical Poisson factor analysis (H-PFA) model, to share information across patients having different types of cancer. To discover statistically significant associations, we combine Bayesian modeling with bootstrapping techniques and correct for multiple hypothesis testing. Results: Using our approach, we empirically demonstrate that we can recover well-known associations in cancer literature. We compare the results of H-PFA with two other classical methods in the field: case-control (CC) setups, and linear mixed models (LMMs).

Authors

Melanie F. Pradier, Stephanie L. Hyland, Stefan G. Stark, Kjong Lehmann, Julia E. Vogt, Fernando Perez-Cruz, Gunnar Rätsch

Submitted

Biorxiv preprint

Date

29.04.2019

Alexander Marx, Jilles VreekenTesting Conditional Independence on Discrete Data using Stochastic ComplexityProceedings of the International Conference on Artificial Intelligence and Statistics, AISTATS 2019

Abstract

Testing for conditional independence is a core aspect of constraint-based causal discovery. Although commonly used tests are perfect in theory, they often fail to reject independence in practice--especially when conditioning on multiple variables. We focus on discrete data and propose a new test based on the notion of algorithmic independence that we instantiate using stochastic complexity. Amongst others, we show that our proposed test, SCI, is an asymptotically unbiased as well as L2 consistent estimator for conditional mutual information (CMI). Further, we show that SCI can be reformulated to find a sensible threshold for CMI that works well on limited samples. Empirical evaluation shows that SCI has a lower type II error than commonly used tests. As a result, we obtain a higher recall when we use SCI in causal discovery algorithms, without compromising the precision.

Authors

Alexander Marx, Jilles Vreeken

Submitted

Proceedings of the International Conference on Artificial Intelligence and Statistics, AISTATS 2019

Date

01.04.2019

Imant Daunhawer, Severin Kasser, Gilbert Koch, Lea Sieber, Hatice Cakal, Janina Tütsch, Marc Pfister, Sven Wellman, Julia E. VogtEnhanced early prediction of clinically relevant neonatal hyperbilirubinemia with machine learningPediatric Research, 2019

Abstract

Background Machine learning models may enhance the early detection of clinically relevant hyperbilirubinemia based on patient information available in every hospital. Methods We conducted a longitudinal study on preterm and term born neonates with serial measurements of total serum bilirubin in the first two weeks of life. An ensemble, that combines a logistic regression with a random forest classifier, was trained to discriminate between the two classes phototherapy treatment vs. no treatment. Results Of 362 neonates included in this study, 98 had a phototherapy treatment, which our model was able to predict up to 48â€‰h in advance with an area under the ROC-curve of 95.20%. From a set of 44 variables, including potential laboratory and clinical confounders, a subset of just four (bilirubin, weight, gestational age, hours since birth) suffices for a strong predictive performance. The resulting early phototherapy prediction tool (EPPT) is provided as an open web application. Conclusion Early detection of clinically relevant hyperbilirubinemia can be enhanced by the application of machine learning. Existing guidelines can be further improved to optimize timing of bilirubin measurements to avoid toxic hyperbilirubinemia in high-risk patients while minimizing unneeded measurements in neonates who are at low risk.

Authors

Imant Daunhawer, Severin Kasser, Gilbert Koch, Lea Sieber, Hatice Cakal, Janina Tütsch, Marc Pfister, Sven Wellman, Julia E. Vogt

Submitted

Pediatric Research, 2019

Date

30.03.2019

Alvaro Gomariz, Weiye Li, Ece Özkan Elsen, Christine Tanner, Orcun GokselSiamese Networks With Location Prior for Landmark Tracking in Liver Ultrasound SequencesInternational Symposium on Biomedical Imaging (ISBI)

Authors

Alvaro Gomariz, Weiye Li, Ece Özkan Elsen, Christine Tanner, Orcun Goksel

Submitted

International Symposium on Biomedical Imaging (ISBI)

Date

06.02.2019

Ricards Marcinkevics, Steven Kelk, Carlo Galuzzi, Berthold StegemannDiscovery of Important Subsequences in Electrocardiogram Beats Using the Nearest Neighbour AlgorithmArxiv

Abstract

The classification of time series data is a well-studied problem with numerous practical applications, such as medical diagnosis and speech recognition. A popular and effective approach is to classify new time series in the same way as their nearest neighbours, whereby proximity is defined using Dynamic Time Warping (DTW) distance, a measure analogous to sequence alignment in bioinformatics. However, practitioners are not only interested in accurate classification, they are also interested in why a time series is classified a certain way. To this end, we introduce here the problem of finding a minimum length subsequence of a time series, the removal of which changes the outcome of the classification under the nearest neighbour algorithm with DTW distance. Informally, such a subsequence is expected to be relevant for the classification and can be helpful for practitioners in interpreting the outcome. We describe a simple but optimized implementation for detecting these subsequences and define an accompanying measure to quantify the relevance of every time point in the time series for the classification. In tests on electrocardiogram data we show that the algorithm allows discovery of important subsequences and can be helpful in detecting abnormalities in cardiac rhythms distinguishing sick from healthy patients.

Authors

Ricards Marcinkevics, Steven Kelk, Carlo Galuzzi, Berthold Stegemann

Submitted

Arxiv

Date

26.01.2019

Stefanie Ehrbar, Alexander Jöhl, Michael Kühni, Mirko Meboldt, Ece Özkan Elsen, Christine Tanner, Orcun Goksel, Stephan Klöck, Jan Unkelbach, Matthias Guckenberger, Stephanie Tanadini-LangELPHA: Dynamically deformable liver phantom for real-time motion-adaptive radiotherapy treatmentsMedical Physics

Authors

Stefanie Ehrbar, Alexander Jöhl, Michael Kühni, Mirko Meboldt, Ece Özkan Elsen, Christine Tanner, Orcun Goksel, Stephan Klöck, Jan Unkelbach, Matthias Guckenberger, Stephanie Tanadini-Lang

Submitted

Medical Physics

Date

03.01.2019

Sandhya Prabhakaran and Julia E. VogtA Bayesian Model for Overlapping Clusters from Distance DataAll of Bayesian Nonparametrics Workshop in Neural Information Processing Systems Conference 2018

Authors

Sandhya Prabhakaran and Julia E. Vogt

Submitted

All of Bayesian Nonparametrics Workshop in Neural Information Processing Systems Conference 2018

Date

02.12.2018

Jan A. Roth, Manuel Battegay, Fabrice Juchler, Julia E. Vogt, Andreas F. WidmerIntroduction to Machine Learning in Digital Healthcare EpidemiologyInfection Control & Hospital Epidemiology, 2018

Abstract

To exploit the full potential of big routine data in healthcare and to efficiently communicate and collaborate with information technology specialists and data analysts, healthcare epidemiologists should have some knowledge of large-scale analysis techniques, particularly about machine learning. This review focuses on the broad area of machine learning and its first applications in the emerging field of digital healthcare epidemiology.

Authors

Jan A. Roth, Manuel Battegay, Fabrice Juchler, Julia E. Vogt, Andreas F. Widmer

Submitted

Infection Control & Hospital Epidemiology, 2018

Date

04.11.2018

Sergio J Sanabria, Ece Özkan Elsen, Marga Rominger, Orcun GokselSpatial domain reconstruction for imaging speed-of-sound with pulse-echo ultrasound: simulation and in vivo studyPhysics in Medicine and Biology

Authors

Sergio J Sanabria, Ece Özkan Elsen, Marga Rominger, Orcun Goksel

Submitted

Physics in Medicine and Biology

Date

26.10.2018

Alexander Marx, Jilles VreekenCausal Inference on Multivariate and Mixed-Type DataProceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Data, ECMLPKDD 2018

Abstract

How can we discover whether X causes Y, or vice versa, that Y causes X, when we are only given a sample over their joint distribution? How can we do this such that X and Y can be univariate, multivariate, or of different cardinalities? And, how can we do so regardless of whether X and Y are of the same, or of different data type, be it discrete, numeric, or mixed? These are exactly the questions we answer. We take an information theoretic approach, based on the Minimum Description Length principle, from which it follows that first describing the data over cause and then that of effect given cause is shorter than the reverse direction. Simply put, if Y can be explained more succinctly by a set of classification or regression trees conditioned on X, than in the opposite direction, we conclude that X causes Y. Empirical evaluation on a wide range of data shows that our method, Crack, infers the correct causal direction reliably and with high accuracy on a wide range of settings, outperforming the state of the art by a wide margin. Code related to this paper is available at: http://eda.mmci.uni-saarland.de/crack.

Authors

Alexander Marx, Jilles Vreeken

Submitted

Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Data, ECMLPKDD 2018

Date

13.08.2018

DOI Code

Ece Özkan Elsen, Valery Vishnevsky, Orcun GokselInverse Problem of Ultrasound Beamforming with Sparsity Constraints and RegularizationIEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control

Authors

Ece Özkan Elsen, Valery Vishnevsky, Orcun Goksel

Submitted

IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control

Date

03.03.2018

Authors

Ece Özkan Elsen, Christine Tanner, Matej Kastelic, Oliver Mattausch, Maxim Makhinya, Orcun Goksel

Submitted

International Journal of Computer Assisted Radiology and Surgery

Date

22.03.2017

Leutheuser, H. and Lang, N. and Gradl, S. and Struck, M. and Tobola, A. and Hofmann, C. and Anneken, L. and Eskofier, B. M.Textile Integrated Wearable Technologies for Sports and Medical ApplicationsSmart Textiles: Fundamentals, Design, and Interaction

Abstract

Innovative and pervasive monitoring possibilities are given using textile integration of wearable computing components. We present the FitnessSHIRT (Fraunhofer IIS, Erlangen, Germany) as one example of a textile integrated wearable computing device. Using the FitnessSHIRT, the electric activity of the human heart and breathing characteristics can be determined. Within this chapter, we give an overview of the market situation, current application scenarios, and related work. We describe the technology and algorithms behind the wearable FitnessSHIRT as well as current application areas in sports and medicine. Challenges using textile integrated wearable devices are stated and addressed in experiments or in explicit recommendations. The applicability of the FitnessSHIRT is shown in user studies in sports and medicine. This chapter is concluded with perspectives for textile integrated wearable devices.

Authors

Leutheuser, H. and Lang, N. and Gradl, S. and Struck, M. and Tobola, A. and Hofmann, C. and Anneken, L. and Eskofier, B. M.

Submitted

Smart Textiles: Fundamentals, Design, and Interaction

Date

01.02.2017

Tobola, A. and Leutheuser, H. and Schmitz, B. and Hofmann, C. and Struck, M. and Weigand, C. and Eskofier, B. M. and Fischer, G.Battery runtime optimization toolbox for wearable biomedical sensorsIn Proc: IEEE-EMBS 13th International Conference on Wearable and Implantable Body Sensor Networks (BSN)

Abstract

Battery runtime is a critical concern for practical usage of wearable biomedical sensor systems. A long runtime requires an interdisciplinary low-power knowledge and appropriate design tools. We addressed this issue designing a toolbox in three parts: (1) Modular evaluation kit for development of wearable ultra-low-power biomedical sensors; (2) Miniaturized, wearable, and code compatible sensor system with the same properties as the development kit; (3) Web-based battery runtime calculator for our sensor systems. The purpose of the development kit is optimization of the power consumption. Once optimization is finished, the same embedded software can be transferred to the miniaturized body worn sensor. The web-based application supports development quantifying the effects of use case and design decisions on battery runtime. A sensor developer can select sensor modules, configure sensor parameters, enter use case specific requirements, and select a battery to predict the battery runtime for a specific application. Our concept adds value to development of ultra-low-power biomedical wearable sensors. The concept is effective for professional work and educational purposes.

Authors

Tobola, A. and Leutheuser, H. and Schmitz, B. and Hofmann, C. and Struck, M. and Weigand, C. and Eskofier, B. M. and Fischer, G.

Submitted

In Proc: IEEE-EMBS 13th International Conference on Wearable and Implantable Body Sensor Networks (BSN)

Date

14.06.2016

Leutheuser, H. and Gradl, S. and Anneken, L. and Arnold, M. and Lang, N. and Achenbach, S. and Eskofier, B. M.Instantaneous P- and T-wave detection: Assessment of three ECG fiducial points detection algorithmsIn Proc: IEEE-EMBS 13th International Conference on Wearable and Implantable Body Sensor Networks (BSN)

Abstract

Arrhythmia detection algorithms require the exact and instantaneous detection of fiducial points in the ECG signal. These fiducial points (QRS-complex, P- and T-wave) correspond to distinct cardiac contraction phases. The performance evaluation of different fiducial points detection algorithms require the existence of large databases (DBs) encompassing reference annotations. Up to last year, P- and T-wave annotations were only available for the QT DB. This was addressed by Elgendi et al. who provided P- and T-wave annotations to the MIT-BIH arrhythmia DB. A variety of ECG fiducial points detection algorithms exists in literature, whereas, to the best knowledge of the authors, we could not identify any single-lead algorithm ready for instantaneous P- and T-wave detection. In this work, we present three P- and T-wave detection algorithms: a revised version for QRS detection using line fitting capable to detect P- and T-wave, an expeditious version of a wavelet based ECG delineation algorithm, and a fast naive fiducial points detection algorithm. The fast naive fiducial points detection algorithm performed best on both DBs with sensitivities ranging from 73.0% (P-wave detection, error interval of ± 40 ms) to 89.4% (T-wave detection, error interval of ± 80 ms). As this algorithm detects a wave event in every search window, it has to be investigated how this affects arrhythmia detection algorithms. The reference Matlab implementations are available for download to encourage the development of high-accurate and automated ECG processing algorithms for the integration in daily life using mobile computers.

Authors

Leutheuser, H. and Gradl, S. and Anneken, L. and Arnold, M. and Lang, N. and Achenbach, S. and Eskofier, B. M.

Submitted

In Proc: IEEE-EMBS 13th International Conference on Wearable and Implantable Body Sensor Networks (BSN)

Date

14.06.2016

Ece Özkan Elsen, Gemma Roig, Orcun Goksel, Xavier BoixHerding Generalizes Diverse M -Best SolutionsarXiv

Authors

Ece Özkan Elsen, Gemma Roig, Orcun Goksel, Xavier Boix

Submitted

arXiv

Date

27.05.2016

Firat Ozdemir, Ece Özkan Elsen, Orcun GokselGraphical Modeling of Ultrasound Propagation in Tissue for Automatic Bone SegmentationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI)

Authors

Firat Ozdemir, Ece Özkan Elsen, Orcun Goksel

Submitted

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI)

Date

27.05.2016

Authors

Melanie F. Pradier, Theofanis Karaletsos, Stefan Stark, Julia E. Vogt, Gunnar Rätsch, and Fernando Perez-Cruz

Submitted

Accepted Abstract at Machine Learning for Healthcare Workshop in Neural Information Processing Systems Conference 2015

Date

06.12.2015

Mullan, P. J. and Kanzler, C. M. and Lorch, B. and Schröder, L. and Winkler, L. and Laich, L. H. and Riedel, F. and Richer, R. and Luckner, C. and Leutheuser, H. and Eskofier, B. M. and Pasluosta, C. F.Unobtrusive Heart Rate Estimation During Physical Exercise using Photoplethysmographic and Acceleration DataIn Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Abstract

Photoplethysmography (PPG) is a non-invasive, inexpensive and unobtrusive method to achieve heart rate monitoring during physical exercises. Motion artifacts during exercise challenge the heart rate estimation from wrist-type PPG signals. This paper presents a methodology to overcome these limitation by incorporating acceleration information. The proposed algorithm consisted of four stages: (1) A wavelet based denoising, (2) an acceleration based denoising, (3) a frequency based approach to estimate the heart rate followed by (4) a postprocessing step. Experiments with different movement types such as running and rehabilitation exercises were used for algorithm design and development. Evaluation of our heart rate estimation showed that a mean absolute error 1.96 bpm (beats per minute) with standard deviation of 2.86 bpm and a correlation of 0.98 was achieved with our method. These findings suggest that the proposed methodology is robust to motion artifacts and is therefore applicable for heart rate monitoring during sports and rehabilitation.

Authors

Mullan, P. J. and Kanzler, C. M. and Lorch, B. and Schröder, L. and Winkler, L. and Laich, L. H. and Riedel, F. and Richer, R. and Luckner, C. and Leutheuser, H. and Eskofier, B. M. and Pasluosta, C. F.

Submitted

In Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Date

25.08.2015

Gradl, S. and Leutheuser, H. and Elgendi, M. and Lang, N. and Eskofier, B. M.Temporal correction of detected R-peaks in the ECG signals: A crucial step to improve QRS detection algorithmsIn Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Abstract

In the last decade the interest for heart rate variability analysis has increased tremendously. Related algorithms depend on accurate temporal localization of the heartbeat, e.g. the R-peak in electrocardiogram signals, especially in the presence of arrhythmia. This localization can be delivered by numerous solutions found in the literature which all lack an exact specification of their temporal precision. We implemented three different state-of-the-art algorithms and evaluated the precision of their R-peak localization. We suggest a method to estimate the overall R-peak temporal inaccuracy-dubbed beat slackness-of QRS detectors with respect to normal and abnormal beats. We also propose a simple algorithm that can complement existing detectors to reduce this slackness. Furthermore we define improvements to one of the three detectors allowing it to be used in real-time on mobile devices or embedded hardware. Across the entire MIT-BIH Arrhythmia Database, the average slackness of all the tested algorithms was 9 ms for normal beats and 13 ms for abnormal beats. Using our complementing algorithm this could be reduced to 4 ms for normal beats and to 7 ms for abnormal beats. The presented methods can be used to significantly improve the precision of R-peak detection and provide an additional measurement for QRS detector performance.

Authors

Gradl, S. and Leutheuser, H. and Elgendi, M. and Lang, N. and Eskofier, B. M.

Submitted

In Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Date

25.08.2015

B. E. Heldberg, T. Kautz, H. Leutheuser, R. Hopfeng\"artner, B. Kasper, B. M. EskofierUsing Wearable sensors for semiology-independent seizure detection - towards ambulatory monitoring of epilepsyIn Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Abstract

Epilepsy is a disease of the central nervous system. Nearly 70% of people with epilepsy respond to a proper treatment, but for a successful therapy of epilepsy, physicians need to know if and when seizures occur. The gold standard diagnosis tool video-electroencephalography (vEEG) requires patients to stay at hospital for several days. A wearable sensor system, e.g. a wristband, serving as diagnostic tool or event monitor, would allow unobtrusive ambulatory long-term monitoring while reducing costs. Previous studies showed that seizures with motor symptoms such as generalized tonic-clonic seizures can be detected by measuring the electrodermal activity (EDA) and motion measuring acceleration (ACC). In this study, EDA and ACC from 8 patients were analyzed. In extension to previous studies, different types of seizures, including seizures without motor activity, were taken into account. A hierarchical classification approach was implemented in order to detect different types of epileptic seizures using data from wearable sensors. Using a k-nearest neighbor (kNN) classifier an overall sensitivity of 89.1% and an overall specificity of 93.1% were achieved, for seizures without motor activity the sensitivity was 97.1% and the specificity was 92.9%. The presented method is a first step towards a reliable ambulatory monitoring system for epileptic seizures with and without motor activity.

Authors

B. E. Heldberg, T. Kautz, H. Leutheuser, R. Hopfeng\"artner, B. Kasper, B. M. Eskofier

Submitted

In Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Date

25.08.2015

T. Cibis, B. Groh, H. Leutheuser, B. M. EskofierWearable Real-time ECG monitoring with emergency alert system for scuba divingIn Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Abstract

Medical diagnosis is the first level for recognition and treatment of diseases. To realize fast diagnosis, we propose a concept of a basic framework for the underwater monitoring of a diver’s ECG signal, including an alert system that warns the diver of predefined medical emergency situations. The framework contains QRS detection, heart rate calculation and an alert system. After performing a predefined study protocol, the algorithm’s accuracy was evaluated with 10 subjects in a dry environment and with 5 subjects in an underwater environment. The results showed that, in 3 out of 5 dives as well as in dry environment, data transmission remained stable. In these cases, the subjects were able to trigger the alert system. The evaluated data showed a clear ECG signal with a QRS detection accuracy of 90%. Thus, the proposed framework has the potential to detect and to warn of health risks. Further developments of this sample concept can imply an extension for monitoring different biomedical parameters.

Authors

T. Cibis, B. Groh, H. Leutheuser, B. M. Eskofier

Submitted

In Proc: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Date

25.08.2015

U. Jensen, H. Leutheuser, S. Hofmann, B. Schuepferling, G. Suttner, K. Seiler, J. Kornhuber, B. M EskofierA Wearable Real-Time Activity TrackerBiomed Eng Lett.

Abstract

Purpose Exercise and physical activity is a driving force for mental health. Major challenges in the treatment of psychological diseases are accurate activity profiles and the adherence to exercise intervention programs. We present the development and validation of CHRONACT, a wearable realtime activity tracker based on inertial sensor data to support mental health. Methods CHRONACT comprised a Human Activity Recognition (HAR) algorithm that determined activity levels based on their Metabolic Equivalent of Task (MET) with sensors on ankle and wrist. Special emphasis was put on wearability, real-time data analysis and runtime to be able to use the system as augmented feedback device. For the development, data of 47 healthy subjects performing clinical intervention program activities were collected to train different classification models. The most suitable model according to the accuracy and processing power tradeoff was selected for an embedded implementation on CHRONACT. Results A validation trial (six subjects, 6 h of data) showed the accuracy of the system with a classification rate of 85.6%. The main source of error was identified in acyclic activities that contained activity bouts of neighboring classes. The runtime of the system was more than 7 days and continuous result logging was available for 39 h. Conclusions In future applications, the CHRONACT system can be used to create accurate and unobtrusive patient activity profiles. Furthermore, the system is ready to assess the effects of individual augmented feedback for exercise adherence.

Authors

U. Jensen, H. Leutheuser, S. Hofmann, B. Schuepferling, G. Suttner, K. Seiler, J. Kornhuber, B. M Eskofier

Submitted

Biomed Eng Lett.

Date

18.07.2015

Julia E. Vogt, Marius Kloft, Stefan Stark, Sandhya Prabhakaran, Sudhir Raman, Volker Roth and Gunnar RätschProbabilistic Clustering of Time-Evolving Distance DataMachine Learning Journal, 2015

Abstract

We present a novel probabilistic clustering model for objects that are represented via pairwise distances and observed at different time points. The proposed method utilizes the information given by adjacent time points to find the underlying cluster structure and obtain a smooth cluster evolution. This approach allows the number of objects and clusters to differ at every time point, and no identification on the identities of the objects is needed. Further, the model does not require the number of clusters being specified in advance—they are instead determined automatically using a Dirichlet process prior. We validate our model on synthetic data showing that the proposed method is more accurate than state-of-the-art clustering methods. Finally, we use our dynamic clustering model to analyze and illustrate the evolution of brain cancer patients over time.

Authors

Julia E. Vogt, Marius Kloft, Stefan Stark, Sandhya Prabhakaran, Sudhir Raman, Volker Roth and Gunnar Rätsch

Submitted

Machine Learning Journal, 2015

Date

16.07.2015

Authors

Ece Özkan Elsen, Orcun Goksel

Submitted

International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Date

27.05.2015

Tobola, A. and Espig, C. and Streit, F. J. and Korpok, O. and Leutheuser, H. and Schmitz, B. and Hofmann, C. and Struck, M. and Weigand, C. and Eskofier, B. M. and Fischer, G.Scalable {ECG} Hardware and Algorithms for Extended Runtime of Wearable SensorsIn Proc: 10th Annual IEEE International Symposium on Medical Measurements and Applications (MeMeA)

Abstract

Everything in nature tries to reach the lowest possible energy level. Therefore any natural or artificial system must have the ability to adjust itself to the changing requirements of its surrounding environment. In this paper we address this issue by an ECG sensor designed to be adjustable during runtime, having the ability to reduce the power consumption at cost of the informational content. Accessible for everyone, standard ECG hardware and open source software has been used to realize an ECG processing system for wearable applications. The average power consumption has been measured for each mode of operation. Finally we take conclusion to conciser context-aware scaling as key feature to address the energy issue of wearable sensor systems.

Authors

Tobola, A. and Espig, C. and Streit, F. J. and Korpok, O. and Leutheuser, H. and Schmitz, B. and Hofmann, C. and Struck, M. and Weigand, C. and Eskofier, B. M. and Fischer, G.

Submitted

In Proc: 10th Annual IEEE International Symposium on Medical Measurements and Applications (MeMeA)

Date

07.05.2015

Julia E. VogtUnsupervised Structure Detection in Biomedical DataIEEE/ACM Transactions on Computational Biology and Bioinformatics (Volume: 12 , Issue: 4 , July-Aug. 1 2015)

Abstract

A major challenge in computational biology is to find simple representations of high-dimensional data that best reveal the underlying structure. In this work, we present an intuitive and easy-to-implement method based on ranked neighborhood comparisons that detects structure in unsupervised data. The method is based on ordering objects in terms of similarity and on the mutual overlap of nearest neighbors. This basic framework was originally introduced in the field of social network analysis to detect actor communities. We demonstrate that the same ideas can successfully be applied to biomedical data sets in order to reveal complex underlying structure. The algorithm is very efficient and works on distance data directly without requiring a vectorial embedding of data. Comprehensive experiments demonstrate the validity of this approach. Comparisons with state-of-the-art clustering methods show that the presented method outperforms hierarchical methods as well as density based clustering methods and model-based clustering. A further advantage of the method is that it simultaneously provides a visualization of the data. Especially in biomedical applications, the visualization of data can be used as a first pre-processing step when analyzing real world data sets to get an intuition of the underlying data structure. We apply this model to synthetic data as well as to various biomedical data sets which demonstrate the high quality and usefulness of the inferred structure.

Authors

Julia E. Vogt

Submitted

IEEE/ACM Transactions on Computational Biology and Bioinformatics (Volume: 12 , Issue: 4 , July-Aug. 1 2015)

Date

26.01.2015

Abstract

The normal oscillation of the heart rate is called Heart Rate Variability (HRV). HRV parameters change under different conditions like rest, physical exercise, mental stress, and body posture changes. However, results how HRV parameters adapt to physical exercise have been inconsistent. This study investigated how different HRV parameters changed during one hour of running. We used datasets of 295 athletes where each dataset had a total length of about 65 minutes. Data was divided in segments of five minutes and three HRV parameters and one kinematic parameter were calculated for each segment. We applied two different analysis of variance (ANOVA) models to analyze the differences in the means of each segment for every parameter. The two ANOVA models were univariate ANOVA with repeated measures and multivariate ANOVA with repeated measures. The obligatory post-hoc procedure consisted of multiple dependent t tests with Bonferroni correction. We investigated the last three segments of the parameters in more detail and detected a delay of one minute between varying running speed and measured heart rate. Hence, the circulatory system of our population needed one minute to adapt to a change in running speed. The method we provided can be used to further investigate more HRV parameters.

Authors

H. Leutheuser, B. M. Eskofier

Submitted

Int J Comp Sci Sport

Date

01.01.2013

Volker Roth, Thomas J. Fuchs, Julia E. Vogt, Sandhya Prabhakaran, Joachim M. BuhmannStructure Preserving Embedding of Dissimilarity DataSimilarity-Based Pattern Analysis and Recognition, 157-177

Abstract

Partitioning methods for observations represented by pairwise dissimilarities are studied. Particular emphasis is put on their properties when applied to dissimilarity matrices that do not admit a loss-free embedding into a vector space. Specifically, the Pairwise Clustering cost function is shown to exhibit a shift invariance property which basically means that any symmetric dissimilarity matrix can be modified to allow a vector-space representation without distorting the optimal group structure. In an approximate sense, the same holds true for a probabilistic generalization of Pairwise Clustering, the so-called Wishart–Dirichlet Cluster Process. This shift-invariance property essentially means that these clustering methods are “blind” against Euclidean or metric violations. From the application side, such blindness against metric violations might be seen as a highly desired feature, since it broadens the applicability of certain algorithms. From the viewpoint of theory building, however, the same property might be viewed as a “negative” result, since studying these algorithms will not lead to any new insights on the role of metricity in clustering problems.

Authors

Volker Roth, Thomas J. Fuchs, Julia E. Vogt, Sandhya Prabhakaran, Joachim M. Buhmann

Submitted

Similarity-Based Pattern Analysis and Recognition, 157-177

Date

31.12.2012

Z. Makowska, M. T. Dill, Julia E. Vogt, Magdalena Filipowicz Sinnreich, L. Terraciano, Volker Roth, M. H. HeimP139: Continuous exposure to PEG-IFN-Alpha only transiently activates JAK-stat signalling in human liverCytokine 59(3):563–564, 2012

Abstract

Introduction IFN-\alpha signals through the Jak-STAT pathway to induce expression of IFN-stimulated genes (ISGs) with antiviral functions. USP18 is an IFN-inducible negative regulator of the Jak-STAT pathway. Upregulation of USP18 results in a long-lasting desensitization of IFN-\alpha signalling. As a result of this IFN-induced refractoriness, ISG levels decrease back to baseline despite continuous presence of the cytokine. Pegylated forms of IFN-\alpha (pegIFN-\alpha) are currently in clinical use for treatment of chronic hepatitis C virus infection. PegIFN-\alphas show increased anti-hepatitis C virus efficacy compared to nonpegylated IFN-\alpha. This has been attributed to the significantly longer plasma half-life of the pegylated form. However, the underlying assumption that persistently high plasma levels obtained with pegIFN-\alpha therapy result in ongoing stimulation of ISGs in the liver has never been tested. In the present study we therefore investigated the kinetics of Jak-STAT pathway activation and ISG induction in the human liver at several time points during the first week of pegIFN-\alpha therapy. Methods 18 patients with chronic hepatitis C underwent a liver biopsy 4 h (n = 6), 16 h, 48 h, 96 h or 144 h (all n = 3) after the first injection of pegIFN-\alpha-2b. Additional 3 patients received pegIFN-\alpha-2a and were biopsied at 144 h. The activation of Jak-STAT signalling and USP18 upregulation were assessed by immunohistochemistry and Western blot. Gene expression analysis was performed using Human Genome U133 Plus 2.0 arrays and Bioconductor packages of R statistical environment. Results A single dose of pegIFN-\alpha-2b resulted in elevated IFN-\alpha plasma levels throughout the one-week dosing interval. Despite the continuous IFN-\alpha exposure, strong activation of the Jak-STAT pathway was only observed at early time points after administration. Almost 500 genes were significantly upregulated in the liver samples following pegIFN-\alpha stimulation. The breadth of transcriptional response to pegIFN-\alpha was maximal 16 h post-injection and decreased gradually, with only few genes significantly upregulated after 144 h of treatment. Bayesian clustering of the gene expression data revealed 4 distinct groups of the ISGs based on the temporal patterns of regulation. Of 494 upregulated ISGs, the expression of 474 peaked 4 h or 16 h after pegIFN-\alpha administration, followed by a steady decline of mRNA levels through the remaining 128 h of treatment. This transient activation of the Jak-STAT pathway coincided with elevated expression of USP18 on the protein level, which was first detectable 16 post-injection. Conclusion PegIFN-\alpha induces a transient activation of Jak-STAT signalling and ISG upregulation in human liver, in spite of persistent high serum concentrations. The short-lived STAT1 phosphorylation and gene induction can be explained by upregulation of USP18 and establishment of refractory state. The superior efficacy of pegIFN-\alpha compared to conventional IFN-\alpha for chronic hepatitis C therapy cannot be explained by persistent signalling and ISG induction during the one-week dosing interval.

Authors

Z. Makowska, M. T. Dill, Julia E. Vogt, Magdalena Filipowicz Sinnreich, L. Terraciano, Volker Roth, M. H. Heim

Submitted

Cytokine 59(3):563–564, 2012

Date

11.08.2012

Sandhya Prabhakaran, Sudhir Raman, Julia E. Vogt, Volker RothAutomatic Model Selection in Archetype AnalysisPattern Recognition: Joint 34th DAGM and 36th OAGM Symposium, Lecture Notes in Computer Science, 2012

Abstract

Archetype analysis involves the identification of representative objects from amongst a set of multivariate data such that the data can be expressed as a convex combination of these representative objects. Existing methods for archetype analysis assume a fixed number of archetypes a priori. Multiple runs of these methods for different choices of archetypes are required for model selection. Not only is this computationally infeasible for larger datasets, in heavy-noise settings model selection becomes cumbersome. In this paper, we present a novel extension to these existing methods with the specific focus of relaxing the need to provide a fixed number of archetypes beforehand. Our fast iterative optimization algorithm is devised to automatically select the right model using BIC scores and can easily be scaled to noisy, large datasets. These benefits are achieved by introducing a Group-Lasso component popular for sparse linear regression. The usefulness of the approach is demonstrated through simulations and on a real world application of document analysis for identifying topics.

Authors

Sandhya Prabhakaran, Sudhir Raman, Julia E. Vogt, Volker Roth

Submitted

Pattern Recognition: Joint 34th DAGM and 36th OAGM Symposium, Lecture Notes in Computer Science, 2012

Date

31.07.2012

Julia E. Vogt, Volker RothA Complete Analysis of the l_{1,p} Group-LassoICML 2012: Proceedings of the 29th international conference on Machine Learning

Abstract

The Group-Lasso is a well-known tool for joint regularization in machine learning methods. While the l_{1,2} and the l_{1,\infty} version have been studied in detail and efficient algorithms exist, there are still open questions regarding other l_{1,p} variants. We characterize conditions for solutions of the l_{1,p} Group-Lasso for all p-norms with 1 <= p <= \infty, and we present a unified active set algorithm. For all p-norms, a highly efficient projected gradient algorithm is presented. This new algorithm enables us to compare the prediction performance of many variants of the Group-Lasso in a multi-task learning setting, where the aim is to solve many learning problems in parallel which are coupled via the Group-Lasso constraint. We conduct large-scale experiments on synthetic data and on two real-world data sets. In accordance with theoretical characterizations of the different norms we observe that the weak-coupling norms with p between 1.5 and 2 consistently outperform the strong-coupling norms with p >> 2.

Authors

Julia E. Vogt, Volker Roth

Submitted

ICML 2012: Proceedings of the 29th international conference on Machine Learning

Date

17.06.2012

Michael T. Dill, Francois H.T. Duong, Julia E. Vogt, Stephanie Bibert, Pierre-Yves Bochud, Luigi Terracciano, Andreas Papassotiropoulos, Volker Roth and Markus H. HeimInterferon-Induced Gene Expression is a Stronger Predictor of Treatment Response Than IL28B Genotype in Patients With Hepatitis CGastroenterology, 2011 Mar;140(3):1021-1031.e10

Abstract

BACKGROUND & AIMS: The host immune response during the chronic phase of hepatitis C virus infection varies among individuals; some patients have a no interferon (IFN) response in the liver, whereas others have full activation of IFN-stimulated genes (ISGs). Preactivation of this endogenous IFN system is associated with nonresponse to pegylated IFN-\alpha (pegIFN-\alpha) and ribavirin. Genome-wide association studies have associated allelic variants near the IL28B (IFN\lambda3) gene with treatment response. We investigated whether IL28B genotype determines the constitutive expression of ISGs in the liver and compared the abilities of ISG levels and IL28B genotype to predict treatment outcome. METHODS: We genotyped 109 patients with chronic hepatitis C for IL28B allelic variants and quantified the hepatic expression of ISGs and of IL28B. Decision tree ensembles, in the form of a random forest classifier, were used to calculate the relative predictive power of these different variables in a multivariate analysis. RESULTS: The minor IL28B allele was significantly associated with increased expression of ISG. However, stratification of the patients according to treatment response revealed increased ISG expression in nonresponders, irrespective of IL28B genotype. Multivariate analysis of ISG expression, IL28B genotype, and several other factors associated with response to therapy identified ISG expression as the best predictor of treatment response. CONCLUSIONS: IL28B genotype and hepatic expression of ISGs are independent predictors of response to treatment with pegIFN-\alpha and ribavirin in patients with chronic hepatitis C. The most accurate prediction of response was obtained with a 4-gene classifier comprising IFI27, ISG15, RSAD2, and HTATIP2.

Authors

Michael T. Dill, Francois H.T. Duong, Julia E. Vogt, Stephanie Bibert, Pierre-Yves Bochud, Luigi Terracciano, Andreas Papassotiropoulos, Volker Roth and Markus H. Heim

Submitted

Gastroenterology, 2011 Mar;140(3):1021-1031.e10

Date

28.02.2011

Julia E. Vogt, Volker RothThe Group Lasso: l_{1,\infty} Regularization versus l_{1,2} RegularizationPattern Recognition: 32-nd DAGM Symposium, Lecture Notes in Computer Science, 2010

Abstract

The l_{1,\infty} norm and the l_{1,2} norm are well known tools for joint regularization in Group-Lasso methods. While the l_{1,2} version has been studied in detail, there are still open questions regarding the uniqueness of solutions and the efficiency of algorithms for the l_{1,\infty} variant. For the latter, we characterize the conditions for uniqueness of solutions, we present a simple test for uniqueness, and we derive a highly efficient active set algorithm that can deal with input dimensions in the millions. We compare both variants of the Group-Lasso for the two most common application scenarios of the Group-Lasso, one is to obtain sparsity on the level of groups in “standard” prediction problems, the second one is multi-task learning where the aim is to solve many learning problems in parallel which are coupled via the Group-Lasso constraint. We show that both version perform quite similar in “standard” applications. However, a very clear distinction between the variants occurs in multi-task settings where the l_{1,2} version consistently outperforms the l_{1,\infty} counterpart in terms of prediction accuracy.

Authors

Julia E. Vogt, Volker Roth

Submitted

Pattern Recognition: 32-nd DAGM Symposium, Lecture Notes in Computer Science, 2010

Date

31.07.2010

Julia E. Vogt, Sandhya Prabhakaran, Thomas J. Fuchs, Volker RothThe Translation-invariant Wishart-Dirichlet Process for Clustering Distance DataICML 2010: Proceedings of the 27th international conference on Machine Learning

Abstract

We present a probabilistic model for clustering of objects represented via pairwise dissimilarities. We propose that even if an underlying vectorial representation exists, it is better to work directly with the dissimilarity matrix hence avoiding unnecessary bias and variance caused by embeddings. By using a Dirichlet process prior we are not obliged to fix the number of clusters in advance. Furthermore, our clustering model is permutation-, scale- and translation-invariant, and it is called the Translation-invariant Wishart Dirichlet (TIWD) process. A highly efficient MCMC sampling algorithm is presented. Experiments show that the TIWD process exhibits several advantages over competing approaches.

Authors

Julia E. Vogt, Sandhya Prabhakaran, Thomas J. Fuchs, Volker Roth

Submitted

ICML 2010: Proceedings of the 27th international conference on Machine Learning

Date

20.06.2010