Quantitative Biology
See recent articles
Showing new listings for Friday, 2 May 2025
- [1] arXiv:2505.00043 [pdf, other]
-
Title: Enhancing Echocardiogram Video Quality via Latent Space EditingSubjects: Quantitative Methods (q-bio.QM)
Echocardiography (echo), or cardiac ultrasound, is the most widely used imaging modality for cardiac form and function due to its relatively low cost, rapid acquisition, and non-invasive nature. However, ultrasound acquisitions are often limited by artifacts, noise, and low-quality acquisitions that hinder diagnostic interpretation. Existing techniques for enhancing echos consist of traditional filter-based algorithms, deep-learning approaches developed on radiofrequency (RF) data, or approaches that have strong priors such as manual segmentation labels which limits both clinical applicability and scalability. To address these limitations, we introduce a data-driven approach for enhancing echo videos using a generative model trained on historical images. We learn a latent space representation for echo images using a generative model and use self-supervision from a synthetic dataset of high quality (HQ) and simulated low quality (LQ) image pairs to estimate a direction vector in the latent space from the LQ to HQ domain. In both held-out internal and external test sets, our approach resulted in echo videos with higher gCNR (0.60-0.62 vs. 0.48-0.53) and quality score (0.99-0.99 vs. 0.92-0.96) compared to original LQ videos. Furthermore, we leverage previously developed models for echo to show preservation of key clinical characteristics such as LVEF (MAE 4.74-6.82) and left ventricle segmentation (Dice 0.92-0.93), suggesting potential for future clinical use to improve the quality of echo videos.
- [2] arXiv:2505.00128 [pdf, html, other]
-
Title: Routing functions for parameter space decomposition to describe stability landscapes of ecological modelsComments: 20 pages, 4 figures, 2 tables, 1 algorithmSubjects: Populations and Evolution (q-bio.PE); Algebraic Geometry (math.AG)
Changes in environmental or system parameters often drive major biological transitions, including ecosystem collapse, disease outbreaks, and tumor development. Analyzing the stability of steady states in dynamical systems provides critical insight into these transitions. This paper introduces an algebraic framework for analyzing the stability landscapes of ecological models defined by systems of first-order autonomous ordinary differential equations with polynomial or rational rate functions. Using tools from real algebraic geometry, we characterize parameter regions associated with steady-state feasibility and stability via three key boundaries: singular, stability (Routh-Hurwitz), and coordinate boundaries. With these boundaries in mind, we employ routing functions to compute the connected components of parameter space in which the number and type of stable steady states remain constant, revealing the stability landscape of these ecological models. As case studies, we revisit the classical Levins-Culver competition-colonization model and a recent model of coral-bacteria symbioses. In the latter, our method uncovers complex stability regimes, including regions supporting limit cycles, that are inaccessible via traditional techniques. These results demonstrate the potential of our approach to inform ecological theory and intervention strategies in systems with nonlinear interactions and multiple stable states.
- [3] arXiv:2505.00219 [pdf, html, other]
-
Title: Real-Time Brain-Computer Interface Control of Walking Exoskeleton with Bilateral Sensory FeedbackJeffrey Lim, Po T. Wang, Won Joon Sohn, Derrick Lin, Shravan Thaploo, Luke Bashford, David Bjanes, Angelica Nguyen, Hui Gong, Michelle Armacost, Susan J. Shaw, Spencer Kellis, Brian Lee, Darrin Lee, Payam Heydari, Richard A. Andersen, Zoran Nenadic, Charles Y. Liu, An H. DoComments: Main text of pre-print and supplementary information includedSubjects: Neurons and Cognition (q-bio.NC); Human-Computer Interaction (cs.HC)
Invasive brain-computer interface (BCI) technology has demonstrated the possibility of restoring brain-controlled walking in paraplegic spinal cord injury patients. However, current implementations of BCI-controlled walking still have significant drawbacks. In particular, prior systems are unidirectional and lack sensory feedback for insensate patients, have suboptimal reliance on brain signals from the bilateral arm areas of the motor cortex, and depend on external systems for signal processing. Motivated by these shortcomings, this study is the first time a bidirectional brain-computer interface (BDBCI) has demonstrated the restoration of both brain-controlled walking and leg sensory feedback while utilizing the bilateral leg motor and sensory cortices. Here, a subject undergoing subdural electrocorticogram electrode implantation for epilepsy surgery evaluation leveraged the leg representation areas of the bilateral interhemispheric primary motor and sensory cortices to operate a BDBCI with high performance. Although electrode implantation in the interhemispheric region is uncommon, electrodes can be safely implanted in this region to access rich leg motor information and deliver bilateral leg sensory feedback. Finally, we demonstrated that all BDBCI operations can be executed on a dedicated, portable embedded system. These results indicate that BDBCIs can potentially provide brain-controlled ambulation and artificial leg sensation to people with paraplegia after spinal cord injury in a manner that emulates full-implantability and is untethered from any external systems.
- [4] arXiv:2505.00572 [pdf, other]
-
Title: A Bioinformatic Study of Genetics Involved in Determining Mild Traumatic Brain Injury Severity and RecoveryComments: This paper consists of 43 pages, including references. It contains seven figures and four tables. Details regarding coding and analysis are available upon requestSubjects: Genomics (q-bio.GN); Neurons and Cognition (q-bio.NC)
Aim: This in silico study sought to identify specific biomarkers for mild traumatic brain injury (mTBI) through the analysis of publicly available gene and miRNA databases, hypothesizing their influence on neuronal structure, axonal integrity, and regeneration. Methods: This study implemented a three-step process: (1) Data searching for mTBI-related genes in Gene and MalaCard databases and literature review ; (2) Data analysis involved performing functional annotation through GO and KEGG, identifying hub genes using Cytoscape, mapping protein-protein interactions via DAVID and STRING, and predicting miRNA targets using miRSystem, miRWalk2.0, and mirDIP (3) RNA-sequencing analysis applied to the mTBI dataset GSE123336. Results: Eleven candidate hub genes associated with mTBI outcome were identified: APOE, S100B, GFAP, BDNF, AQP4, COMT, MBP, UCHL1, DRD2, ASIC1, and CACNA1A. Enrichment analysis linked these genes to neuron projection regeneration and synaptic plasticity. miRNAs linked to the mTBI candidate genes were hsa-miR-9-5p, hsa-miR-204-5p, hsa-miR-1908-5p, hsa-miR-16-5p, hsa-miR-10a-5p, has-miR-218-5p, has-miR-34a-5p, and has-miR-199b-5p. The RNA sequencing revealed 2664 differentially expressed miRNAs post-mTBI, with 17 showing significant changes at the time of injury and 48 hours post-injury. Two miRNAs were positively correlated with direct head hits. Conclusion: Our study indicates that specific genes and miRNAs, particularly hsa-miR-10a-5p, may influence mTBI outcomes. Our research may guide future mTBI diagnostics, emphasizing the need to measure and track these specific genes and miRNAs in diverse cohorts.
- [5] arXiv:2505.00600 [pdf, other]
-
Title: Frustration, dynamics and catalysisSubjects: Biomolecules (q-bio.BM); Soft Condensed Matter (cond-mat.soft); Biological Physics (physics.bio-ph)
The controlled dissipation of chemical potentials is the fundamental way cells make a living. Enzyme-mediated catalysis allows the various transformations to proceed at biologically relevant rates with remarkable precision and efficiency. Theory, experiments and computational studies coincide to show that local frustration is a useful concept to relate protein dynamics with catalytic power. Local frustration gives rise to the asperities of the energy landscapes that can harness the thermal fluctuations to guide the functional protein motions. We review here recent advances into these relationships from various fields of protein science. The biologically relevant dynamics is tuned by the evolution of protein sequences that modulate the local frustration patterns to near optimal values.
- [6] arXiv:2505.00644 [pdf, html, other]
-
Title: Predatory dynamics in susceptible and resistant $\textit{Eriopis connexa}$ populationsAnna Mara Ferreira Maciel, Gabriel Rodrigues Palma, Lucas dos Anjos, Lucas Santos Canuto, Wesley Augusto Conde Godoy, Rafael de Andrade MoralComments: 50 pagesSubjects: Populations and Evolution (q-bio.PE); Quantitative Methods (q-bio.QM)
The ladybird $\textit{Eriopis connexa}$ (Germar, 1824), a voracious aphid predator, faces challenges from insecticide applications, compromising biological control. As a result, there has been an increase in the number of studies analysing the resistance and susceptibility of ladybirds. Some studies have found that resistant populations exhibit distinct predation and foraging behaviour compared to susceptible ones. This study models the population dynamics of resistant and susceptible $\textit{E. connexa}$ preying on $\textit{Aphis gossypii}$ Glover, 1877 and $\textit{Myzus persicae}$ (Sulzer, 1776). We constructed a logistic model with density dependence and type-II functional response to analyse predation dynamics, incorporating bifurcation analysis on predation parameters (attack rate and handling time) and the mortality rate of susceptible ladybirds. We simulated scenarios with/without insecticide application and with/without aphid resistance. To simulate the effects of insecticide applications, the parameters related to aphids' intrinsic growth rate ($r_1$ and $r_2$) change to reflect the responses of susceptible and resistant populations. The same approach is used concerning the mortality rate of ladybirds ($d_2$ and $d_3$). Our results demonstrate that mortality, attack rate, and handling time are critical in shaping predator-prey interactions. Temporal simulations revealed fluctuating abundances, highlighting the fragility of these interactions under insecticide stress. Therefore, this study contributes to understanding the ecological implications of insecticides, which disrupt natural predation dynamics, and shows how variations in behavioural rates can impact prey control. This research demonstrated the importance of integrated strategies that balance insecticide applications with preserving natural enemies and promoting sustainable agricultural practices.
New submissions (showing 6 of 6 entries)
- [7] arXiv:2505.00037 (cross-list from quant-ph) [pdf, other]
-
Title: Can a Quantum Support Vector Machine algorithm be utilized to identify Key Biomarkers from Multi-Omics data of COVID19 patients?Junggu Choi, Chansu Yu, Kyle L. Jung, Suan-Sin Foo, Weiqiang Chen, Suzy AA Comhair, Serpil C. Erzurum, Lara Jehi, Jae U. JungComments: 70 pages, 6 figuresSubjects: Quantum Physics (quant-ph); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Identifying key biomarkers for COVID-19 from high-dimensional multi-omics data is critical for advancing both diagnostic and pathogenesis research. In this study, we evaluated the applicability of the Quantum Support Vector Machine (QSVM) algorithm for biomarker-based classification of COVID-19. Proteomic and metabolomic biomarkers from two independent datasets were ranked by importance using ridge regression and grouped accordingly. The top- and bottom-ranked biomarker sets were then used to train and evaluate both classical SVM (CSVM) and QSVM models, serving as predictive and negative control inputs, respectively. The QSVM was implemented with multiple quantum kernels, including amplitude encoding, angle encoding, the ZZ feature map, and the projected quantum kernel. Across various experimental settings, QSVM consistently achieved classification performance that was comparable to or exceeded that of CSVM, while reflecting the importance rankings by ridge regression. Although the experiments were conducted in numerical simulation, our findings highlight the potential of QSVM as a promising approach for multi-omics data analysis in biomedical research.
- [8] arXiv:2505.00196 (cross-list from cs.LG) [pdf, html, other]
-
Title: Mapping minds not averages: a scalable subject-specific manifold learning framework for neuroimaging dataComments: 20 pages, 6 figuresSubjects: Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
Mental and cognitive representations are believed to reside on low-dimensional, non-linear manifolds embedded within high-dimensional brain activity. Uncovering these manifolds is key to understanding individual differences in brain function, yet most existing machine learning methods either rely on population-level spatial alignment or assume data that is temporally structured, either because data is aligned among subjects or because event timings are known. We introduce a manifold learning framework that can capture subject-specific spatial variations across both structured and temporally unstructured neuroimaging data. On simulated data and two naturalistic fMRI datasets (Sherlock and Forrest Gump), our framework outperforms group-based baselines by recovering more accurate and individualized representations. We further show that the framework scales efficiently to large datasets and generalizes well to new subjects. To test this, we apply the framework to temporally unstructured resting-state fMRI data from individuals with schizophrenia and healthy controls. We further apply our method to a large resting-state fMRI dataset comprising individuals with schizophrenia and controls. In this setting, we demonstrate that the framework scales efficiently to large populations and generalizes robustly to unseen subjects. The learned subject-specific spatial maps our model finds reveal clinically relevant patterns, including increased activation in the basal ganglia, visual, auditory, and somatosensory regions, and decreased activation in the insula, inferior frontal gyrus, and angular gyrus. These findings suggest that our framework can uncover clinically relevant subject-specific brain activity patterns. Our approach thus provides a scalable and individualized framework for modeling brain activity, with applications in computational neuroscience and clinical research.
- [9] arXiv:2505.00316 (cross-list from cs.LG) [pdf, html, other]
-
Title: Surrogate modeling of Cellular-Potts Agent-Based Models as a segmentation task using the U-Net neural network architectureTien Comlekoglu, J. Quetzalcóatl Toledo-Marín, Tina Comlekoglu, Douglas W. DeSimone, Shayn M. Peirce, Geoffrey Fox, James A. GlazierSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
The Cellular-Potts model is a powerful and ubiquitous framework for developing computational models for simulating complex multicellular biological systems. Cellular-Potts models (CPMs) are often computationally expensive due to the explicit modeling of interactions among large numbers of individual model agents and diffusive fields described by partial differential equations (PDEs). In this work, we develop a convolutional neural network (CNN) surrogate model using a U-Net architecture that accounts for periodic boundary conditions. We use this model to accelerate the evaluation of a mechanistic CPM previously used to investigate \textit{in vitro} vasculogenesis. The surrogate model was trained to predict 100 computational steps ahead (Monte-Carlo steps, MCS), accelerating simulation evaluations by a factor of 590 times compared to CPM code execution. Over multiple recursive evaluations, our model effectively captures the emergent behaviors demonstrated by the original Cellular-Potts model of such as vessel sprouting, extension and anastomosis, and contraction of vascular lacunae. This approach demonstrates the potential for deep learning to serve as efficient surrogate models for CPM simulations, enabling faster evaluation of computationally expensive CPM of biological processes at greater spatial and temporal scales.
- [10] arXiv:2505.00518 (cross-list from physics.chem-ph) [pdf, html, other]
-
Title: An evaluation of unconditional 3D molecular generation methodsComments: Published at the GEM workshop, ICLR 2025Subjects: Chemical Physics (physics.chem-ph); Quantitative Methods (q-bio.QM)
Unconditional molecular generation is a stepping stone for conditional molecular generation, which is important in \emph{de novo} drug design. Recent unconditional 3D molecular generation methods report saturated benchmarks, suggesting it is time to re-evaluate our benchmarks and compare the latest models. We assess five recent high-performing 3D molecular generation methods (EQGAT-diff, FlowMol, GCDM, GeoLDM, and SemlaFlow), in terms of both standard benchmarks and chemical and physical validity. Overall, the best method, SemlaFlow, has a success rate of 87% in generating valid, unique, and novel molecules without post-processing and 92.4% with post-processing.
- [11] arXiv:2505.00530 (cross-list from cs.LG) [pdf, html, other]
-
Title: Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning FrameworksComments: 17 pages, 5 main figures, 2 appendix figures. Submitted to ICML 2025Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Biomolecules (q-bio.BM)
SMILES-based molecule generation has emerged as a powerful approach in drug discovery. Deep reinforcement learning (RL) using large language model (LLM) has been incorporated into the molecule generation process to achieve high matching score in term of likelihood of desired molecule candidates. However, a critical challenge in this approach is catastrophic forgetting during the RL phase, where knowledge such as molecule validity, which often exceeds 99\% during pretraining, significantly deteriorates. Current RL algorithms applied in drug discovery, such as REINVENT, use prior models as anchors to retian pretraining knowledge, but these methods lack robust exploration mechanisms. To address these issues, we propose Partial SMILES Validation-PPO (PSV-PPO), a novel RL algorithm that incorporates real-time partial SMILES validation to prevent catastrophic forgetting while encouraging exploration. Unlike traditional RL approaches that validate molecule structures only after generating entire sequences, PSV-PPO performs stepwise validation at each auto-regressive step, evaluating not only the selected token candidate but also all potential branches stemming from the prior partial sequence. This enables early detection of invalid partial SMILES across all potential paths. As a result, PSV-PPO maintains high validity rates even during aggressive exploration of the vast chemical space. Our experiments on the PMO and GuacaMol benchmark datasets demonstrate that PSV-PPO significantly reduces the number of invalid generated structures while maintaining competitive exploration and optimization performance. While our work primarily focuses on maintaining validity, the framework of PSV-PPO can be extended in future research to incorporate additional forms of valuable domain knowledge, further enhancing reinforcement learning applications in drug discovery.
- [12] arXiv:2505.00578 (cross-list from eess.IV) [pdf, html, other]
-
Title: AI-Driven High-Resolution Cell Segmentation and Quantitative AnalysisSubjects: Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
Studying the growth and metabolism of microbes provides critical insights into their evolutionary adaptations to harsh environments, which are essential for microbial research and biotechnology applications. In this study, we developed an AI-driven image analysis system to efficiently segment individual cells and quantitatively analyze key cellular features. This system is comprised of four main modules. First, a denoising algorithm enhances contrast and suppresses noise while preserving fine cellular details. Second, the Segment Anything Model (SAM) enables accurate, zero-shot segmentation of cells without additional training. Third, post-processing is applied to refine segmentation results by removing over-segmented masks. Finally, quantitative analysis algorithms extract essential cellular features, including average intensity, length, width, and volume. The results show that denoising and post-processing significantly improved the segmentation accuracy of SAM in this new domain. Without human annotations, the AI-driven pipeline automatically and efficiently outlines cellular boundaries, indexes them, and calculates key cellular parameters with high accuracy. This framework will enable efficient and automated quantitative analysis of high-resolution fluorescence microscopy images to advance research into microbial adaptations to grow and metabolism that allow extremophiles to thrive in their harsh habitats.
- [13] arXiv:2505.00601 (cross-list from math.PR) [pdf, html, other]
-
Title: A stochastic epidemic model with memory of the last infection and waning immunityComments: Stochastic epidemic model with memory; age-structured model; varying infectivity; varying immunity/susceptibility; endemicity; local stabilitySubjects: Probability (math.PR); Populations and Evolution (q-bio.PE)
We adapt the article of Forien, Pang, Pardoux and Zotsa: Arxiv preprint Arxiv2210.04667(2022), on epidemic models with varying infectivity and waning immunity, to incorporate the memory of the last infection. To this end, we introduce a parametric approach and consider a piecewise deterministic Markov process modeling both the evolution of the parameter, also called the trait, and the age of infection of individuals over time. At each new infection, a new trait is randomly chosen for the infected individual according to a Markov kernel, and their age is reset to zero. In the large population limit, we derive a partial differential equation (PDE) that describes the density of traits and ages. The main goal is to study the conditions under which endemic equilibria exist for the deterministic PDE model and to establish an endemicity threshold that depends on the model parameters. The local stability of these equilibria is also analyzed. The endemicity threshold is computed for several examples, including models that incorporate a vaccination policy, and a local stability result is obtained for a memory-free SIS-type model.
- [14] arXiv:2505.00650 (cross-list from cs.LG) [pdf, html, other]
-
Title: OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival StratificationComments: Code available at: this http URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM)
Unsupervised learning of disease subtypes from multi-omics data presents a significant opportunity for advancing personalized medicine. We introduce OmicsCL, a modular contrastive learning framework that jointly embeds heterogeneous omics modalities-such as gene expression, DNA methylation, and miRNA expression-into a unified latent space. Our method incorporates a survival-aware contrastive loss that encourages the model to learn representations aligned with survival-related patterns, without relying on labeled outcomes. Evaluated on the TCGA BRCA dataset, OmicsCL uncovers clinically meaningful clusters and achieves strong unsupervised concordance with patient survival. The framework demonstrates robustness across hyperparameter configurations and can be tuned to prioritize either subtype coherence or survival stratification. Ablation studies confirm that integrating survival-aware loss significantly enhances the predictive power of learned embeddings. These results highlight the promise of contrastive objectives for biological insight discovery in high-dimensional, heterogeneous omics data.
Cross submissions (showing 8 of 8 entries)
- [15] arXiv:2307.05837 (replaced) [pdf, html, other]
-
Title: Geometrical Structure of Bifurcations during Spatial Decision-MakingComments: 20 pages, 10 figures; published versionSubjects: Neurons and Cognition (q-bio.NC); Statistical Mechanics (cond-mat.stat-mech); Biological Physics (physics.bio-ph)
Animals must constantly make decisions on the move, such as when choosing among multiple options, or "targets", in space. Recent evidence suggests that this results from a recursive feedback between the (vectorial) neural representation of the targets and the resulting motion defined by this consensus, which then changes the egocentric neural representation of the the options, and so on. Here we employ a simple model of this process to both explore how its dynamics account for the experimentally-observed abruptly-branching trajectories exhibited by animals during spatial decision-making, and to provide new insights into spatiotemporal computation. Essential neural dynamics, notably local excitation and long-range inhibition, are captured in our model via spin-system dynamics, with groups of Ising-spins representing neural "activity bumps" corresponding to target directions. Analysis, employing a novel "mean-field trajectory" approach, reveals the nature of the spontaneous symmetry breaking - bifurcations in the model that result in literal bifurcations in trajectory space and how it results in new geometric principles for spatiotemporal decision-making. We find that all bifurcation points, beyond the very first, fall on a small number of "bifurcation curves". It is the spatial organization of these curves that is shown to be key to determining the shape of the trajectories, such as self-similar or space filling, exhibited during decision-making, irrespective of the trajectory's starting point. Furthermore, we find that a non-Euclidean representation of space considerably reduces the number of bifurcation points in many geometrical configurations, preventing endless indecision and promoting effective spatial decision-making. This suggests that a non-Euclidean neural representation of space may be expected to have evolved across species in order to facilitate spatial decision-making.
- [16] arXiv:2503.11347 (replaced) [pdf, html, other]
-
Title: Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-seq Data AnalysisJournal-ref: Entropy-2025Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Biological Physics (physics.bio-ph)
Understanding the dynamic nature of biological systems is fundamental to deciphering cellular behavior, developmental processes, and disease progression. Single-cell RNA sequencing (scRNA-seq) has provided static snapshots of gene expression, offering valuable insights into cellular states at a single time point. Recent advancements in temporally resolved scRNA-seq, spatial transcriptomics (ST), and time-series spatial transcriptomics (temporal-ST) have further revolutionized our ability to study the spatiotemporal dynamics of individual cells. These technologies, when combined with computational frameworks such as Markov chains, stochastic differential equations (SDEs), and generative models like optimal transport and Schrödinger bridges, enable the reconstruction of dynamic cellular trajectories and cell fate decisions. This review discusses how these dynamical system approaches offer new opportunities to model and infer cellular dynamics from a systematic perspective.
- [17] arXiv:2502.11141 (replaced) [pdf, html, other]
-
Title: Cognitive Neural Architecture Search Reveals Hierarchical EntailmentSubjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
Recent research has suggested that the brain is more shallow than previously thought, challenging the traditionally assumed hierarchical structure of the ventral visual pathway. Here, we demonstrate that optimizing convolutional network architectures for brain-alignment via evolutionary neural architecture search results in models with clear representational hierarchies. Despite having random weights, the identified models achieve brain-alignment scores surpassing even those of pretrained classification models - as measured by both regression and representational similarity analysis. Furthermore, through traditional supervised training, architectures optimized for alignment with late ventral regions become competitive classification models. These findings suggest that hierarchical structure is a fundamental mechanism of primate visual processing. Finally, this work demonstrates the potential of neural architecture search as a framework for computational cognitive neuroscience research that could reduce the field's reliance on manually designed convolutional networks.
- [18] arXiv:2504.18506 (replaced) [pdf, html, other]
-
Title: Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup FunctionalComments: ICML 2025Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci); Chemical Physics (physics.chem-ph); Biomolecules (q-bio.BM)
Transition path sampling (TPS), which involves finding probable paths connecting two points on an energy landscape, remains a challenge due to the complexity of real-world atomistic systems. Current machine learning approaches use expensive, task-specific, and data-free training procedures, limiting their ability to benefit from recent advances in atomistic machine learning, such as high-quality datasets and large-scale pre-trained models. In this work, we address TPS by interpreting candidate paths as trajectories sampled from stochastic dynamics induced by the learned score function of pre-trained generative models, specifically denoising diffusion and flow matching. Under these dynamics, finding high-likelihood transition paths becomes equivalent to minimizing the Onsager-Machlup (OM) action functional. This enables us to repurpose pre-trained generative models for TPS in a zero-shot manner, in contrast with bespoke, task-specific TPS models trained in previous work. We demonstrate our approach on varied molecular systems, obtaining diverse, physically realistic transition pathways and generalizing beyond the pre-trained model's original training dataset. Our method can be easily incorporated into new generative models, making it practically relevant as models continue to scale and improve with increased data availability.