publications | Euxhen Hasanaj

2026

EMBO J
SenSet defines cell-type specific senescence signatures in the aged human lung

Euxhen Hasanaj, Delphine Beaulieu, Cankun Wang, and 26 more authors

EMBO J., Apr 2026

Abs Bib HTML

Cellular senescence is defined as an irreversible growth arrest observed when cells are exposed to a variety of stressors, including DNA damage, oxidative stress, or nutrient deprivation. Although senescence is a well-established driver of aging and age-related diseases, it is a highly heterogeneous process with significant variations across organisms, tissues, and cell types. The relatively low abundance of senescent cells in healthy aged tissues poses a major challenge to the longitudinal study of senescence in specific organs, including the human lung. To overcome this limitation, we developed a positive-unlabeled learning framework to generate a comprehensive list of senescence marker genes in human lungs (termed SenSet) using the largest publicly available single-cell lung dataset, the Human Lung Cell Atlas (HLCA). We validated SenSet in a highly complex ex vivo human 3D lung tissue culture model subjected to the senescence inducers bleomycin, doxorubicin, or irradiation, and established its sensitivity and accuracy in characterizing senescence. Using SenSet, we identified and validated cell-type-specific senescence signatures in distinct lung cell populations upon aging and environmental exposure. Our study provides a comprehensive analysis of senescent cells in the healthy aging lung, presenting fundamental implications for our understanding of major lung diseases, including cancer, fibrosis, chronic obstructive pulmonary disease, or asthma.
@article{Hasanaj2026-sq, title = {{SenSet} defines cell-type specific senescence signatures in the aged human lung}, author = {Hasanaj, Euxhen and Beaulieu, Delphine and Wang, Cankun and Hu, Qianjiang and Rosas, Lorena and Bueno, Marta and Sembrat, John C and Pineda, Ricardo H and Melo-Narvaez, Maria Camila and Cardenes, Nayra and Yanwu, Zhao and Yingze, Zhang and Lafyatis, Robert and Morris, Alison and Mora, Ana and Rojas, Mauricio and Li, Dongmei and Rahman, Irfan and Pryhuber, Gloria S and Lehmann, Mareike and Alder, Jonathan and Gurkar, Aditi and Finkel, Toren and Ma, Qin and Lugo-Martinez, Jose and Póczos, Barnabás and Bar-Joseph, Ziv and Eickelberg, Oliver and Königshoff, Melanie}, journal = {EMBO J.}, publisher = {Springer Science and Business Media LLC}, pages = {1--50}, month = apr, year = {2026}, language = {en} }
Preprint
Foundation models improve perturbation response prediction

Elijah Cole, Geert-Jan Huizing, Sohan Addagudi, and 17 more authors

bioRxiv, Feb 2026

Abs Bib HTML

Predicting cellular responses to genetic or chemical perturbations has been a long-standing goal in biology. Recent applications of foundation models to this task have yielded contradictory results regarding their superiority over simple baselines. We conducted an extensive analysis of over 600 different models across various prediction tasks and evaluation metrics, demonstrating that while some foundation models fail to outperform simple baselines, others significantly improve predictions for both genetic and chemical perturbations. Furthermore, we developed and evaluated methods for integrating multiple foundation mod- els for perturbation prediction. Our results show that with sufficient data, these models approach fundamental performance limits, confirming that foundation models can improve cellular response simulations.
@article{Cole2026-zt, title = {Foundation models improve perturbation response prediction}, author = {Cole, Elijah and Huizing, Geert-Jan and Addagudi, Sohan and Ho, Nicholas and Hasanaj, Euxhen and Kuijs, Merel and Johnstone, Toby and Carilli, Maria and Davi, Alec and Ellington, Caleb and Feinauer, Christoph and Li, Pan and Menegaux, Romain and Mohammadi, Shahin and Shao, Yanjun and Zhang, Josiah and Lundberg, Emma and Song, Le and Bar-Joseph, Ziv and Xing, Eric P}, journal = {bioRxiv}, institution = {bioRxiv}, pages = {2026.02.18.706454}, month = feb, year = {2026}, language = {en}, }

2025

ICML
Multimodal benchmarking of foundation model representations for cellular perturbation response prediction

Euxhen Hasanaj, Elijah Cole, Shahin Mohammadi, and 4 more authors

Proceedings of the 42nd International Conference on Machine Learning Workshops, Jun 2025

Abs Bib HTML PDF

The decreasing cost of single-cell RNA sequencing (scRNA-seq) has enabled the collection of massive scRNA-seq datasets, which are now being used to train transformer-based cell foundation models (FMs). One of the most promising applications of these FMs is perturbation response modeling. This task aims to forecast how cells will respond to drugs or genetic interventions. Accurate perturbation response models could drastically accelerate drug discovery by reducing the space of interventions that need to be tested in the wet lab. However, recent studies have shown that FM-based models often struggle to outperform simpler baselines for perturbation response prediction. A key obstacle is the lack of understanding of the components driving performance in FM-based perturbation response models. In this work, we conduct the first systematic pan-modal study of perturbation embeddings, with an emphasis on those derived from biological FMs. We benchmark their predictive accuracy, analyze patterns in their predictions, and identify the most successful representation learning strategies. Our findings offer insights into what FMs are learning and provide practical guidance for improving perturbation response modeling.
@article{Hasanaj2025-lm, title = {Multimodal benchmarking of foundation model representations for cellular perturbation response prediction}, author = {Hasanaj, Euxhen and Cole, Elijah and Mohammadi, Shahin and Addagudi, Sohan and Zhang, Xingyi and Song, Le and Xing, Eric P}, journal = {Proceedings of the 42nd International Conference on Machine Learning Workshops}, month = jun, year = {2025}, language = {en}, }
ISMB 2025
Recovering time-varying networks from single-cell data

Euxhen Hasanaj, Barnabás Póczos, and Ziv Bar-Joseph

ISMB, Jul 2025

Abs Bib HTML PDF

Gene regulation is a dynamic process that underlies all aspects of human development, disease response, and other key biological processes. The reconstruction of temporal gene regulatory networks has conventionally relied on regression analysis, graphical models, or other types of relevance networks. With the large increase in time series single-cell data, new approaches are needed to address the unique scale and nature of this data for reconstructing such networks. Here, we develop a deep neural network, Marlene, to infer dynamic graphs from time series single-cell gene expression data. Marlene constructs directed gene networks using a self-attention mechanism where the weights evolve over time using recurrent units. By employing meta learning, the model is able to recover accurate temporal networks even for rare cell types. In addition, Marlene can identify gene interactions relevant to specific biological responses, including COVID-19 immune response, fibrosis, and aging.
@article{Hasanaj2025-gh, title = {Recovering time-varying networks from single-cell data}, author = {Hasanaj, Euxhen and Póczos, Barnabás and Bar-Joseph, Ziv}, journal = {ISMB}, month = jul, year = {2025}, archiveprefix = {arXiv}, primaryclass = {q-bio.QM} }

2024

ISMB 2024
Integrating patients in time series clinical transcriptomics data

Euxhen Hasanaj, Sachin Mathur, and Ziv Bar-Joseph

Bioinformatics (ISMB Proceedings), Jun 2024

Abs Bib HTML PDF Code

Analysis of time series transcriptomics data from clinical trials is challenging. Such studies usually profile very few time points from several individuals with varying response patterns and dynamics. Current methods for these datasets are mainly based on linear, global orderings using visit times which do not account for the varying response rates and subgroups within a patient cohort. We developed a new method that utilizes multi-commodity flow algorithms for trajectory inference in large scale clinical studies. Recovered trajectories satisfy individual-based timing restrictions while integrating data from multiple patients. Testing the method on multiple drug datasets demonstrated an improved performance compared to prior approaches suggested for this task, while identifying novel disease subtypes that correspond to heterogeneous patient response patterns.
@article{Hasanaj2024-jb, title = {Integrating patients in time series clinical transcriptomics data}, author = {Hasanaj, Euxhen and Mathur, Sachin and Bar-Joseph, Ziv}, journal = {Bioinformatics (ISMB Proceedings)}, publisher = {Oxford University Press (OUP)}, volume = {40}, number = {ISMB Proceedings}, pages = {i151--i159}, month = jun, year = {2024}, language = {en}, }
Genome Biol.

scDOT: optimal transport for mapping senescent cells in spatial transcriptomics

Nam D Nguyen, Lorena Rosas, Timur Khaliullin, and 15 more authors

Genome Biol., Nov 2024

Abs HTML PDF

The low resolution of spatial transcriptomics data necessitates additional information for optimal use. We developed scDOT, which combines spatial transcriptomics and single cell RNA sequencing to improve the ability to reconstruct single cell resolved spatial maps and identify senescent cells. scDOT integrates optimal transport and expression deconvolution to learn non-linear couplings between cells and spots and to infer cell placements. Application of scDOT to lung spatial transcriptomics data improves on prior methods and allows the identification of the spatial organization of senescent cells, their neighboring cells and novel genes involved in cell-cell interactions that may be driving senescence.

2023

PMLR

AutoML Decathlon: Diverse Tasks, Modern Methods, and Efficiency at Scale

Nicholas Roberts, Samuel Guo, Cong Xu, and 24 more authors

In NeurIPS 2022 Competition Track, Aug 2023

HTML PDF

2022

NatureAging
NIH SenNet Consortium to Map Senescent Cells throughout the Human Lifespan to Understand Physiological Health

SenNet Consortium

Nature Aging, Dec 2022

Abs Bib HTML PDF

Cells respond to many stressors by senescing, acquiring stable growth arrest, morphologic and metabolic changes, and a proinflammatory senescence-associated secretory phenotype. The heterogeneity of senescent cells (SnCs) and senescence-associated secretory phenotype are vast, yet ill characterized. SnCs have diverse roles in health and disease and are therapeutically targetable, making characterization of SnCs and their detection a priority. The Cellular Senescence Network (SenNet), a National Institutes of Health Common Fund initiative, was established to address this need. The goal of SenNet is to map SnCs across the human lifespan to advance diagnostic and therapeutic approaches to improve human health. State-of-the-art methods will be applied to identify, define and map SnCs in 18 human tissues. A common coordinate framework will integrate data to create four-dimensional SnC atlases. Other key SenNet deliverables include innovative tools and technologies to detect SnCs, new SnC biomarkers and extensive public multi-omics datasets. This Perspective lays out the impetus, goals, approaches and products of SenNet.
@article{leeNIHSenNetConsortium2022b, title = {{{NIH SenNet Consortium}} to Map Senescent Cells throughout the Human Lifespan to Understand Physiological Health}, author = {Consortium, SenNet}, year = {2022}, month = dec, journal = {Nature Aging}, pages = {1--11}, publisher = {{Nature Publishing Group}}, copyright = {2022 Springer Nature America, Inc.}, langid = {english}, }
Cell R. M.
Multiset multicover methods for discriminative marker selection

Euxhen Hasanaj, Amir Alavi, Anupam Gupta, and 2 more authors

Cell Reports Methods, Oct 2022

Abs Bib HTML PDF Code

Markers are increasingly being used for several high throughput data analysis and experimental design tasks. Examples include the use of markers for assigning cell types in scRNA-seq studies, for deconvolving bulk gene expression data, and for selecting marker proteins in single cell spatial proteomics studies. Most marker selection methods focus on differential expression (DE) analysis. While such methods work well for data with a few non-overlapping marker sets, they are not appro- priate for large atlas-size datasets where several cell types and tissues are considered. To address this, we define the phenotype cover (PC) problem for marker selection and present algorithms that can improve the discriminative power of marker sets. Analysis of these sets on several marker se- lection tasks suggests that these methods can lead to solutions that accurately distinguish different phenotypes in the data.
@article{HasanajPC2022, title = {Multiset multicover methods for discriminative marker selection}, author = {Hasanaj, Euxhen and Alavi, Amir and Gupta, Anupam and Poczos, Barnabas and Bar-Joseph, Ziv}, journal = {Cell Reports Methods}, month = oct, year = {2022}, }
NatureComm
Interactive single-cell data analysis using Cellar

Euxhen Hasanaj, Jingtao Wang, Arjun Sarathi, and 2 more authors

Nature Communications 13:1, Apr 2022

Abs Bib HTML PDF Code Website

Cell type assignment is a major challenge for all types of high throughput single cell data. In many cases such assignment requires the repeated manual use of external and complementary data sources. To improve the ability to uniformly assign cell types across large consortia, platforms and modalities we developed Cellar, a software tool that provides interactive support to all the different steps involved in the assignment and dataset comparison process. We discuss the different methods implemented by Cellar, how these can be used with different data types, how to combine complementary data types and how to analyze and visualize spatial data. We demonstrate the advantages of Cellar by using it to annotate several HuBMAP datasets from multi-omics single-cell sequencing and spatial proteomics studies. Cellar is open-source and includes several annotated HuBMAP datasets. Availability https://cellar.cmu.hubmapconsortium.org/app/cellar
@article{Hasanaj2022, issn = {2041-1723}, issue = {1}, title = {Interactive single-cell data analysis using Cellar}, author = {Hasanaj, Euxhen and Wang, Jingtao and Sarathi, Arjun and Ding, Jun and Bar-Joseph, Ziv}, journal = {Nature Communications 13:1}, month = apr, year = {2022}, pages = {1-6}, volume = {13}, publisher = {Nature Publishing Group}, }