Peer-reviewed publications, patent portfolio, and technical documentation underlying the DNAI physics-constrained cancer digital twin platform.
Original research from the DNAI project. All preprints are freely available for download. Journal submissions in progress.
Feb 2026
We challenge the harmonization paradigm, demonstrating that site-specific variance encodes critical prognostic information. Group DRO with Pooled Cox achieves C=0.718 on CPTAC, outperforming all baselines including ComBat and CORAL.
Feb 2026
We introduce Plural Twins, a set-valued framework where each patient is represented as a distribution of outcomes. 82.9% of patients show policy instability; for 1 in 6, the optimal treatment depends on the algorithm's risk tolerance.
Feb 2026
We report a counterintuitive data scaling paradox: expanding PDX training data from 128 to 573 samples degrades clinical prediction. Multi-cancer alignment erases biology-preserving variance through shortcut domain adaptation.
Feb 2026
A unified framework for certifying when a clinical AI prediction is reliable enough for decision-making. Per-patient transportability certificates, structured abstention, and evidence-completion recommendations.
Feb 2026
We identify a clinically actionable phenotype defined by the intersection of high predicted treatment benefit and low robustness to perturbation. These patients (7.0%) show median OS of 478d with 71.1% event rate — the worst outcomes despite high expected benefit.
Feb 2026
We report a fundamental identifiability failure: drug sensitivity (beta) occupies 0.14% of its allowed range under survival-only supervision. Rather than treating this as a defect, we introduce the Therapeutic Controllability Index to quantify treatment authority per patient.
Mar 2026
A complete pipeline bridging VAF-based clonal deconvolution with physics-constrained tumor simulation. Introduces the Resistance Sentinel strategy that preserves clinically critical minor subclones in fixed-compartment ODE models, and knowledge-grounded per-clone drug sensitivity from OncoKB/CIViC replacing under-identified learned parameters.
Mar 2026
A novel architecture extending variational autoencoders to jointly decompose bulk tumor omics into per-subpopulation biological states in structured latent space. Unlike mutation-only deconvolution, LSCD infers per-clone pathway activities, growth rates, and epigenetic states — resolving the under-identification of drug sensitivity parameters.
Mar 2026
A pre-registered validation protocol for testing clone-aware digital twin predictions against longitudinal patient data. Compares predicted resistance emergence timing and dominant resistant clone identity against observed outcomes in patients with serial biopsies from GLASS, TRACERx, and GENIE BPC cohorts.
Mar 2026
We distill domain-specific foundation models (Geneformer, ESM-2) into the structured latent space of a multi-omics VAE, achieving R^2=0.774 on full latent reconstruction. The complementary distillation strategy preserves biological interpretability while incorporating protein structure and gene regulatory knowledge.
Mar 2026
We predict drug combination efficacy from monotherapy data alone using pathway sensitivity orthogonality. Validated on 1,209 GDSC drug pairs, the method achieves Spearman rho=0.800 between predicted and actual combination patterns, enabling zero-shot discovery of synergistic combinations without combination screens.
Mar 2026
We demonstrate that standard drug synergy benchmarks suffer from identity memorization: models learn cell-line fingerprints rather than genuine drug interaction biology. Cell-line-holdout evaluation reveals dramatic performance drops compared to random splits, calling for stricter evaluation protocols in combination prediction.
Apr 2026
We demonstrate that mechanistic Neural ODEs trained on observational survival data suffer from a fundamental identifiability failure: drug response parameters are either excluded from the computation graph or hijacked as prognostic proxies. Our V4.1 architecture enforces untreated gradient isolation via treatment-gated graph surgery, hard biological parameterization, and two-stage PDX system identification, achieving causally identified drug sensitivity (Spearman rho=0.416 vs RECIST, p<0.0001) without sacrificing survival prediction (C-index 0.742).
Apr 2026
We demonstrate that the within-cancer survival prediction ceiling (C-index 0.57) is caused by institutional domain shift, not feature poverty. Discrete-time hazard modeling (DeepHit) with multi-institutional Group DRO across 6 environments (17,526 patients) improved the held-out CGGA within-cancer C-index from 0.548 to 0.630. Foundation Model injection (-0.043) and proteomics LUPI (-0.041) both degraded external generalization. Adding diverse training cohorts (+0.045) was 10× more effective than adding features.
13 U.S. Provisional Applications filed. Covering physics-constrained simulation, domain adaptation, uncertainty quantification, runtime safety, risk-averse optimization, knowledge-grounded annotation, and drug combination prediction.
Physics-Constrained Sim-to-Real Transfer Learning
Preventing Metabolic Scaling-Induced Collapse
Uncertainty-Calibrated Missing Modality Imputation
Ontology-Guided Autogradient Modulation
Adjoint Sensitivity & Physics-Constrained Gradient Topologies
Distributionally Robust Training (DRO)
Stabilized Stochastic Inference and Risk-Averse Optimization in Physics-Constrained Oncology Digital Twins
Cryptographically Enforced Runtime Resource Gating in Differential Equation Solvers via Memory Allocation Interlock
System and Method for Stabilized Stochastic Inference and Fail-Closed Integrity Gating in Neural Networks
Systems and Methods for Knowledge-Grounded Parameter Annotation and Execution Control of Computational Dynamics Simulators
System and Method for Cryptographically Enforced Execution Mode and Automated Safety Assurance of Calibrated Machine Learning Pipelines
System and Method for GPU Execution Control via Privilege-Separated Resistance Sentinel Architecture in Fixed-Compartment Tumor History Modeling
System and Method for Predicting Drug Combination Efficacy Using Symmetric Bilinear Interaction of Monotherapy-Derived Pathway Sensitivity Profiles
System and Method for Cross-Domain Interventional Parameter Anchoring for Mechanistic Digital Twin Calibration
Runtime Execution Controller for Differentiable Simulators with Autodiff-Verified Parameter Exclusion and Fail-Closed Enforcement
Deep-dive technical documentation on the architecture, algorithms, and validated performance of the platform.
Neuro-Symbolic architecture overview. H-BDVAE, dual-path system, and physics-constrained simulation.
Why genomics alone cannot predict resistance. Non-Mutational Resistance and the failure of standard AI.
Split-Source Transfer Learning for predictive oncology. From organoids to patients via domain separation.
Visual walkthroughs of the platform architecture and patent portfolio.
15-slide visual walkthrough of the sim-to-real architecture, separated-state ODEs, and patent portfolio.
AI-generated podcast covering patent innovations, cross-species transfer learning, and clinical implications.
Key metrics across internal and external cohorts
Research Use Only
All publications and methods described here are for research purposes only and have not been cleared or approved by any regulatory authority for clinical use. The DNAI platform is not a medical device. Patent applications are U.S. Provisional Applications; no patents have been granted.
We welcome academic collaborations, validation partnerships, and licensing discussions.