CROSS-REFERENCES
CROSS-REFERENCE TO RELATED APPLICATIONS
This application is related to U.S. Provisional Application No. 63/967,576, filed January 25, 2026, entitled “SYSTEM AND METHOD FOR PHYSICS-CONSTRAINED SIM-TO-REAL TRANSFER LEARNING IN COMPUTATIONAL ONCOLOGY.” The present application provides additional disclosure directed to preventing metabolic scaling-induced representational collapse in cross-species oncology models, including selective domain alignment, variance collapse prevention, and related systems and methods.
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH
Not applicable.
I.FIELD OF THE INVENTION
The present invention relates generally to computational oncology, machine learning, and quantitative systems pharmacology (QSP). More specifically, the invention provides computer-implemented systems and methods for training cross-species tumor trajectory simulators that prevent representational collapse, a failure mode wherein latent information capacity is not readily remedied without substantial retraining or architectural modification, by selectively excluding metabolically scaled proliferation signals from adversarial domain alignment, thereby improving numerical stability of downstream ordinary differential equation (ODE) solvers and improving predictive validity in coupled representation-dynamical systems.
II.BACKGROUND OF THE INVENTION
1.Representational Collapse in Coupled ML-to-ODE Pipelines
Preclinical oncology models, including patient-derived xenografts (PDX), provide dense longitudinal tumor measurements, while human clinical datasets are comparatively sparse and irregular. Standard domain adaptation techniques apply adversarial training uniformly across all representation channels to enforce domain invariance between a source domain and a target domain.
The inventors identified a failure mode termed representational collapse that is particularly harmful in coupled machine learning and dynamical simulation pipelines. When proliferation-associated signals, which correlate with interspecies metabolic scaling differences, are forced into adversarial domain alignment, the induced gradient conflicts can trigger: (i) variance collapse in modality-specific latent channels, (ii) degradation in downstream hypernetwork parameter predictions, and (iii) numerical instability in ODE integration due to inadmissible or stiff parameterizations. Such collapse can degrade computational functionality of the simulator and is not readily remedied without substantial retraining or architectural modification.
2.Distinction from Statistical Partial Domain Adaptation
Prior art in partial domain adaptation and feature disentanglement teaches various methods for excluding or down-weighting statistically dissimilar subsets during alignment. Such methods generally rely on distributional metrics or task-driven heuristics and do not account for biophysical constraints that arise in cross-species oncology modeling.
In cross-species tumor modeling, proliferation markers can appear statistically similar across domains while remaining mechanistically incompatible due to allometric scaling laws. As a result, purely statistical selection criteria can fail to identify which signals must be protected from alignment to preserve stability of coupled dynamical solvers.
3.Need for Biophysically Grounded Selective Alignment for Solver-Stable Simulation
There is a need for systems that procedurally identify which latent representation channels should remain domain-private based on biophysically grounded criteria, and that structurally exclude those channels from adversarial gradients so that downstream dynamical parameter generation and numerical integration remain stable.
III.SUMMARY OF THE INVENTION
The invention provides computer-implemented systems and methods for preventing representational collapse by combining proliferation-aware selective adversarial alignment with constrained dynamical parameter generation and solver instrumentation.
In embodiments, the system:
- Computes a proliferation metric indicating cell-cycle activity;
- Determines a non-aligned subset of latent representation channels using a selection rule based on the proliferation metric;
- Encodes molecular features into an aligned subset and a non-aligned subset of a latent representation;
- Applies adversarial alignment to the aligned subset using a discriminator loss, while structurally excluding the non-aligned subset such that discriminator-loss gradients do not propagate to parameters that generate the non-aligned subset;
- Generates ODE parameters via a parameter generator constrained to predetermined ranges stored in memory; and
- Numerically integrates an ODE while logging solver instrumentation data and, in some embodiments, automatically updates solver control parameters for subsequent trajectory generation cycles based on the logged instrumentation data.
In evaluated implementations, the invention prevents representational collapse and improves numerical stability relative to uniform alignment approaches. Specific evaluated metrics are provided below.
IV.BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1: Block diagram of a coupled representation-dynamical system with selective adversarial alignment.
FIG. 2: Structural implementations of a private pathway (gradient stopping, architectural separation, masking, discriminator input restriction).
FIG. 3: Latent variance trajectories showing collapse under uniform alignment and preservation under selective alignment.
FIG. 4: Training procedure including adversarial loss and independence regularization.
FIG. 5: Inference procedure including constrained parameter generation, numerical integration, and solver instrumentation feedback.
V.DETAILED DESCRIPTION OF THE INVENTION
System Overview and Mathematical Formalization
[0001] A computer-implemented system 100 comprises:
- an encoder : mapping molecular features to a latent representation ;
- a partitioning mechanism that produces and ;
- a domain discriminator : ;
- a parameter generator (hypernetwork) : ;
- a constraint mapping ;
- an ODE solver integrating with ;
- and a solver instrumentation module 140 that records integration diagnostics to non-transitory memory.
Definitions and Terminology
[0002]The following terms are used throughout this application:
- Source domain
- preclinical tumor models including PDX.
- Target domain
- human clinical or population-scale datasets.
- Latent representation channels
- latent dimensions, tensor slices, encoder heads, or indexed feature groups. In one embodiment, and channels correspond to indices .
- Representational collapse
- loss of effective information capacity in a subset of channels during training, evidenced by one or more of: (i) variance collapse in one or more channels, (ii) degraded downstream parameter prediction, or (iii) numerical instability during ODE integration.
- Non-aligned subset / private subset
- channels designated to be excluded from adversarial alignment. In index form, the private subset corresponds to , and the aligned subset corresponds to . In gating form, a gate vector or indicates membership.
- Private pathway
- a structural mechanism ensuring that gradients of a discriminator loss do not propagate to parameters that generate the private subset. Implementations include gradient stopping, architectural separation, and input masking, and can also include restricting discriminator input to aligned channels.
- Parameters that generate the private subset
- parameters of the encoder component(s) whose outputs define . In embodiments with separate encoder heads, these parameters comprise weights exclusive to the private head and do not include weights exclusive to the aligned head.
- Predetermined range
- a bound stored in non-transitory memory (for example in configuration memory, model metadata, or a database) used by differentiable constraints to enforce admissibility and improve solver stability.
Metabolic Scaling Rationale and Mechanistic Integration
[0003] Interspecies tumor growth kinetics can be influenced by allometric scaling. In one embodiment, metabolic rate scales with body mass according to Kleiber's law , and rate-like biological processes scale approximately with . As a result, proliferation-associated signals can exhibit mechanistically different scaling regimes across species even when pathway activation appears superficially similar.
The system uses to modulate the selection rule. In one embodiment, the size of the private subset, the exclusion threshold, or a rank cutoff changes monotonically with , which provides consistent behavior under swapping of source and target domains.
Mass sourcing: In embodiments, and are obtained from stored species metadata in non-transitory memory, user-provided configuration, or a database of typical body masses.
Mechanistic integration into method: in one embodiment, is used to adjust an exclusion threshold or a selected proportion of channels designated as private, such that larger interspecies scaling disparities yield stronger exclusion of proliferation-linked channels from adversarial gradients.
Proliferation Metric and Selection Rule
[0004] The system computes a proliferation metric indicating cell-cycle activity. Embodiments include:
- gene set scoring using cell-cycle genes including MKI67, PCNA, TOP2A, MCM6, CDK1, CCNB1, CCNA2, CCNE1, PLK1, AURKA, and BUB1;
- a learned latent component supervised to correlate with Ki67 expression.
A selection rule maps the proliferation metric to a private-channel designation. Example embodiments include:
- Thresholding: where is an attribution or dependence score and is a threshold;
- Ranking: selecting top- channels by dependence with ;
- Gating: learning and designating channels as private based on gate values.
Fully specified example algorithm (enablement anchor):
- Compute proliferation score from a cell-cycle gene set score.
- Compute channel dependence scores where is mutual information or a gradient-based attribution magnitude relating channel to .
- Define as the indices of the top- channels by , or all channels where .
- Optionally set or as a function of , such that larger scaling disparities produce a larger private subset or stricter exclusion.
Adversarial Alignment and Structural Exclusion
[0005] The system trains a discriminator to predict domain labels from discriminator input . The discriminator loss is denoted . An adversarial objective is implemented via gradient reversal or equivalent sign inversion applied to encoder gradients for aligned channels.
Structural exclusion requirement:
The private subset is structurally excluded from adversarial alignment such that gradients of the discriminator loss do not propagate to parameters that generate the private subset.
Embodiments include:
Embodiment A — Discriminator input restricted to aligned subset:
so the discriminator loss is computed from aligned subset only.
Embodiment B — Gradient stopping (stop-gradient):
where blocks gradients from to parameters producing .
Embodiment C — Input masking:
with for private channels and for aligned channels, reducing discriminator-loss gradients with respect to private channels at the discriminator input.
Clarification for discriminator inputs that include private channels: In embodiments where discriminator input includes the private subset (for example via concatenation), the private subset is provided through a gradient-blocking operation such that discriminator-loss gradients do not update parameters that generate the private subset.
Explicit Gradient Routing and Parameter Updates
[0006] In one embodiment, training uses losses that update disjoint parameter subsets:
- task loss updates parameters of the encoder generating and , and updates the parameter generator ;
- discriminator loss updates discriminator parameters ;
- adversarial gradients derived from update only parameters that generate , and are structurally blocked from updating parameters that generate .
This explicit routing is implemented by at least one of: discriminator input restriction, stop-gradient, masking, or separate encoder heads, as described herein.
Independence Enforcement
[0007] To reduce leakage between aligned and private subsets, the system may include independence regularization, including:
- HSIC regularization between and ;
- orthogonality penalties or covariance penalties between and ;
- probe-based minimization where a probe attempts to predict proliferation from aligned channels and is discouraged.
Parameter Generation and Differentiable Constraints
[0008] A parameter generator outputs dynamical parameters which are constrained by differentiable mappings to predetermined ranges stored in memory. In one embodiment, comprises parameter heads, including , , and , and outputs parameters that are constrained as follows:
| Parameter | Constraint | Range |
|---|---|---|
| growth rate | day | |
| drug sensitivity | ||
| immune clearance |
In embodiments where inclusive endpoints are desired, the system additionally applies a clamp, projection, or stored-range saturation rule to enforce membership in a closed interval stored in memory.
ODE Families
[0009] The ODE integrated by solver may be selected from a family including:
Logistic growth:
Gompertz:
von Bertalanffy:
Immune-mediated tumor dynamics:
where is an immune effector state variable, is a saturation parameter, and is a decay or loss-rate parameter.
Coupled-System Stability Narrative
[0010] In coupled representation-dynamical systems, instability can manifest as a compound failure mode: adversarial alignment modifies latent geometry, which perturbs constrained parameter generation, which increases stiffness or inadmissibility of ODE parameterizations, which then triggers numerical instability (for example step rejections, excessive function evaluations, or integration failures). The invention addresses this coupled failure mode by mechanistically selecting proliferation-linked channels for structural exclusion from adversarial gradients and by constraining ODE parameters to predetermined ranges stored in memory, thereby improving numerical solver stability.
Solver Instrumentation and Feedback Control Policy
[0011] The solver instrumentation module 140 logs to non-transitory memory at least two of:
- number of function evaluations (NFE);
- wall-clock integration time;
- step rejection events;
- constraint violation events.
Concrete feedback policy example (technical loop):
In one embodiment, logged instrumentation is used to automatically update solver control parameters for a subsequent trajectory generation cycle, within bounds or thresholds stored in memory, for example:
- if step rejections exceed a stored threshold , decrease a maximum step size parameter selected from a stored set, or adjust an error-control heuristic within a stored envelope;
- if NFE exceeds a stored threshold , switch solver selection among a predefined set stored in memory, or adjust tolerances within a bounded envelope;
- if constraint violations are detected, adjust constraint application to occur earlier in the parameter-generation pipeline or clamp offending parameters before integration.
Experimental Evidence and Data Integrity
[0012] The following metrics are provided from evaluated implementations and are included as concrete indicators of technical effect. Where comparisons are described, the invention is not limited to a particular baseline implementation.
Collapse evidence (cross-version):
- DNA methylation latent variance collapse: (selective alignment version vs forced full alignment version).
- ODE growth-rate prediction degradation: (selective alignment version vs forced full alignment version).
- Model-internal implied PDX doubling-time divergence: median 4.49 days vs 939 days (raw selective vs standardized/aligned). This doubling time value is a model-internal implied timescale under a standardized alignment transform and is not a biological measurement.
Solver efficiency and stability (evaluated):
Solver
dopri5
adaptive Runge-Kutta
Tolerances
rtol 1e-4, atol 1e-6
Simulation horizon
365 days
50 output points
NFE mean +/- std
381.0 +/- 13.5
Wall time mean +/- std
26.6 +/- 1.6 ms
Solver failure rate
0/100 (0%)
No other numerical comparator metrics are asserted in this section.
VI.CLAIMS
What is claimed is:
Claim 1. (Structural Training System)
A computer-implemented system for training a cross-species tumor trajectory simulator, comprising one or more processors configured to:
- receive molecular feature data from preclinical tumor models (source domain) and human clinical data (target domain);
- compute a proliferation metric from the molecular feature data, wherein the proliferation metric indicates cell-cycle activity;
- determine, using a selection rule based on the proliferation metric, a non-aligned subset of latent representation channels;
- encode the molecular feature data into a latent representation comprising an aligned subset and the non-aligned subset;
- train a domain discriminator to distinguish the source domain from the target domain using discriminator input derived at least from the aligned subset;
- apply adversarial alignment to the aligned subset using a gradient reversal operation applied to gradients of a discriminator loss computed using the domain discriminator; and
- structurally exclude the non-aligned subset from adversarial alignment by configuring the system such that gradients of the discriminator loss do not propagate to parameters that generate the non-aligned subset, including by applying one or more of: (i) restricting discriminator input to the aligned subset, (ii) applying a stop-gradient operation with respect to the discriminator loss, (iii) applying a multiplicative mask at a discriminator input, or (iv) using a separate encoder head for the non-aligned subset.
Claim 2. (Inference and Simulation System)
A computer-implemented system for generating a tumor trajectory, comprising one or more processors configured to:
- obtain molecular feature data associated with a tumor sample;
- compute a proliferation metric from the molecular feature data;
- determine, using a selection rule based on the proliferation metric, a non-aligned subset of latent representation channels and an aligned subset of latent representation channels;
- compute a latent representation comprising the aligned subset and the non-aligned subset;
- generate dynamical model parameters from the latent representation using a parameter generation network;
- constrain at least one generated parameter to a predetermined range stored in memory using a differentiable constraint function;
- numerically integrate an ordinary differential equation selected from the group consisting of logistic growth, Gompertz growth, von Bertalanffy growth, and immune-mediated tumor dynamics, using the constrained parameters to compute a tumor trajectory;
- log, during the numerical integration, solver instrumentation data comprising at least two of: number of function evaluations, wall-clock integration time, step rejection events, or constraint violation events; and
- output the tumor trajectory and the solver instrumentation data.
Claim 3. (Cell-Cycle Genes)
The system of claim 1, wherein the proliferation metric is derived from expression of cell-cycle associated genes comprising MKI67, PCNA, TOP2A, MCM6, CDK1, CCNB1, CCNA2, CCNE1, PLK1, AURKA, and BUB1.
Claim 4. (Learned Proliferation Latent)
The system of claim 1, wherein the proliferation metric comprises a learned latent variable trained to correlate with Ki67 expression.
Claim 5. (Selection Rules)
The system of claim 1, wherein the selection rule comprises at least one of thresholding channel attribution, ranking channels by attribution, or attribution-based selection including integrated gradients.
Claim 6. (Gate Network)
The system of claim 1, wherein the selection rule comprises computing per-channel gate values using a differentiable gate network.
Claim 7. (Discriminator Input Restricted)
The system of claim 1, wherein structurally excluding the non-aligned subset comprises providing only the aligned subset as input to the domain discriminator.
Claim 8. (Gradient Stopping)
The system of claim 1, wherein structurally excluding the non-aligned subset comprises applying a stop-gradient operation to the non-aligned subset with respect to the discriminator loss.
Claim 9. (Architectural Separation)
The system of claim 1, wherein the non-aligned subset is generated by a physically separate encoder head with weight matrices distinct from weight matrices generating the aligned subset, and wherein the weight matrices generating the non-aligned subset do not receive gradients from the discriminator loss.
Claim 10. (Input Masking)
The system of claim 1, wherein structurally excluding the non-aligned subset comprises applying a multiplicative mask to reduce discriminator-loss gradients with respect to non-aligned channels.
Claim 11. (Persistence)
The system of claim 1, further comprising storing, in non-transitory memory, data identifying the non-aligned subset or gate values, and reusing the stored data during inference.
Claim 12. (HSIC Regularization)
The system of claim 1, further comprising training the latent representation using an HSIC regularization term configured to reduce statistical dependence between the proliferation metric and the aligned subset.
Claim 13. (Metabolic Scaling Factor Used in Selection)
The system of claim 1, further comprising computing an allometric scaling factor based on a body-mass ratio between a source species and a target species, and using the allometric scaling factor to set or modulate a threshold, rank cutoff, or gate criterion of the selection rule.
Claim 14. (Constraint Functions)
The system of claim 2, wherein the differentiable constraint function comprises at least one of sigmoid scaling to a stored range, softplus, rescaled tanh, or clipped linear constraints.
Claim 15. (Specific Constraints)
The system of claim 2, wherein: a growth rate parameter is constrained to day via sigmoid scaling and optionally clamped to a stored closed interval; an immune clearance parameter is constrained to positive values via softplus; and a drug sensitivity parameter is constrained to via sigmoid and optionally clamped to a stored closed interval.
Claim 16. (Variance Monitoring)
The system of claim 1, further comprising monitoring latent variance of one or more channels and comparing the monitored variance to a stored threshold indicative of representational collapse.
Claim 17. (Dynamic Recomputation)
The system of claim 2, wherein the selection rule is reapplied during inference at predetermined time intervals or upon detection of distribution shift.
Claim 18. (Method - Training)
A computer-implemented method for training a cross-species tumor trajectory simulator, comprising performing the steps of claim 1.
Claim 19. (Method - Inference)
A computer-implemented method for generating a tumor trajectory, comprising performing the steps of claim 2.
Claim 20. (Computer-Readable Medium - Training)
A non-transitory computer-readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to perform the method of claim 18.
Claim 21. (Computer-Readable Medium - Inference)
A non-transitory computer-readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to perform the method of claim 19.
Claim 22. (Planning with Feedback Control)
A computer-implemented method comprising:
- obtaining molecular feature data from a tumor sample;
- generating, for each of a plurality of candidate parameterizations, a simulated tumor trajectory by:
- computing a proliferation metric and applying a selection rule to determine aligned and non-aligned latent representation channels;
- generating constrained dynamical parameters using a parameter generation network with differentiable constraint functions;
- numerically integrating an ordinary differential equation using the constrained parameters; and
- logging solver instrumentation data comprising at least number of function evaluations and constraint violation events;
- writing, to non-transitory memory, a planning report comprising the simulated trajectories and the solver instrumentation data;
- automatically updating at least one numerical integration control parameter selected from a predefined set stored in memory for use during a subsequent trajectory generation cycle by: (i) comparing at least one logged solver diagnostic to one or more thresholds stored in memory, and (ii) modifying the at least one numerical integration control parameter within a stored envelope based on the comparison; and
- transmitting the planning report to a client device.
Claim 23. (Control Parameter Types and Policy)
The method of claim 22, wherein the numerical integration control parameter comprises at least one of solver selection, relative tolerance (rtol), absolute tolerance (atol), or maximum step size, and wherein the automatic updating comprises modifying the numerical integration control parameter according to a policy that compares at least one of step rejection events or number of function evaluations to the one or more thresholds stored in memory.
VII.ABSTRACT OF THE DISCLOSURE
A computer-implemented system prevents metabolic scaling-induced representational collapse in cross-species oncology models by selectively excluding proliferation-associated signals from adversarial domain alignment. The system computes a proliferation metric, determines a non-aligned subset using a selection rule, and applies adversarial alignment to an aligned subset while structurally excluding the non-aligned subset such that discriminator-loss gradients do not propagate to parameters generating the non-aligned subset. The system generates ordinary differential equation parameters constrained to predetermined ranges stored in memory, integrates an ODE using an adaptive solver while logging solver instrumentation data, and outputs tumor trajectories with integration diagnostics. Evaluated implementations demonstrate variance collapse prevention, parameter prediction preservation, and stable solver operation.
[End of Application]
Related Documentation
Patent 1: Sim-to-Real
Physics-Constrained Sim-to-Real Transfer Learning in Computational Oncology.
Patent 3: Uncertainty & Modalities
Uncertainty-calibrated missing modality handling and multi-layer UQ.
Patent 4: Autogradient Modulation
Ontology-guided autogradient modulation for domain adaptation.
Patent 5: Adjoint Sensitivity
Control of adjoint sensitivity computation graphs in differentiable digital twins.
Patent 6: Robust Training (DRO)
Distributionally robust training using dual-mode risk set construction.
Foundation Whitepaper
Complete technical documentation of the DNAI neuro-symbolic architecture.
Interested in licensing this technology?
Contact us to discuss partnership and licensing opportunities for the Selective Alignment System.