Treatment Designv1.0

Treatment Optimizer

Ranks treatment regimens using counterfactual reasoning

Architecture
Bilinear Drug-Cell Interaction + TARNet Counterfactual
Phase 1 Spearman ρ
0.727

Cell-holdout rank correlation on GDSC drug sensitivity

Target: > 0.60
Phase 2 C-index
0.715

Factual concordance of TARNet counterfactual model

Target: > 0.65
E2E Tests
11/11

All end-to-end pipeline tests passing

Target: 11/11
Status
Shadow

Not deployed for clinical use — needs OPE validation + external treatment data

Target: Shadow mode

Overview

A counterfactual treatment ranking engine that combines cell-line drug sensitivity data with patient-level survival modeling to rank treatment regimens. Phase 1 learns drug-cell interactions from GDSC (685 cell lines, 295 drugs). Phase 2 uses TARNet with inverse propensity weighting to estimate conditional average treatment effects from TCGA survival data. Includes abstention gating, Pareto optimization across efficacy/toxicity/resistance, and sequential treatment planning with beam search. Currently in shadow mode — not deployed for clinical recommendations.

Inputs

3 inputs
z_cell / z_patient328

Full VAE latent representing cell line or patient biology

Source: vae
drug_emb128

Drug molecular embeddings

Source: GDSC drug library
cancer_type32 (embedding)

Cancer type embedding for treatment-specific heads

Source: Clinical metadata

Outputs

3 outputs
IC50 predictions[295]

Predicted drug sensitivity for 295 compounds

Treatment ranking[49 regimens]

Pareto-ranked regimens with abstention flags

Sequential plan[3 lines]

Beam-searched 3-line treatment sequence with resistance penalties

Mathematical Formulation

Drug Interaction

Bilinear drug-cell sensitivity prediction

TARNet CATE

Conditional average treatment effect from factual/counterfactual heads

IPW Deconfounding

Inverse propensity weighting for observational confounding

Key Features

  • Phase 1: Bilinear drug-cell interaction model pretrained on GDSC (242K pairs)
  • Phase 2: TARNet with per-treatment FiLM heads and IPW deconfounding
  • Phase 3: 49 regimen registry with DrugMapper (GDSC↔TCGA↔clinical)
  • Phase 4: Pareto optimization with abstention gate and propensity safety
  • Phase 5: Sequential beam search over 3 treatment lines with resistance penalty

Key Innovations

  • 1Cross-domain drug sensitivity transfer (cell line → patient)
  • 2Counterfactual treatment effect estimation with TARNet
  • 3Abstention gating for uncertain treatment recommendations
  • 4Sequential treatment planning with pathway-based resistance modeling

Hyperparameters

Phase 1 LR
1e-3
Phase 2 LR
5e-4
Drug Embed Dim
128
TARNet Hidden
256
Regimens
49 (10 types + default)
Beam Width
5 (sequential planner)

Training Details

Phase 1: GDSC pretrained on 242K drug-cell pairs (685 cell lines with VAE latents, 295 drugs). Phase 2: TARNet trained on TCGA survival with IPW. Phase 4: Pareto ranking across efficacy, toxicity, resistance scores. Expert consensus: beta is under-identified, propensity uncalibrated for rare classes. Needs OPE validation.

Pipeline Position

Treatment Optimizer
Final Predictions