Back to Pipeline Overview

Treatment Designv1.0

Treatment Optimizer

Ranks treatment regimens using counterfactual reasoning

Architecture

Bilinear Drug-Cell Interaction + TARNet Counterfactual

Phase 1 Spearman ρ

0.727

Cell-holdout rank correlation on GDSC drug sensitivity

Target: > 0.60

Phase 2 C-index

0.715

Factual concordance of TARNet counterfactual model

Target: > 0.65

E2E Tests

11/11

All end-to-end pipeline tests passing

Target: 11/11

Status

Shadow

Not deployed for clinical use — needs OPE validation + external treatment data

Target: Shadow mode

Overview

A counterfactual treatment ranking engine that combines cell-line drug sensitivity data with patient-level survival modeling to rank treatment regimens. Phase 1 learns drug-cell interactions from GDSC (685 cell lines, 295 drugs). Phase 2 uses TARNet with inverse propensity weighting to estimate conditional average treatment effects from TCGA survival data. Includes abstention gating, Pareto optimization across efficacy/toxicity/resistance, and sequential treatment planning with beam search. Currently in shadow mode — not deployed for clinical recommendations.

Inputs

3 inputs

z_cell / z_patient328

Full VAE latent representing cell line or patient biology

Source: vae

drug_emb128

Drug molecular embeddings

Source: GDSC drug library

cancer_type32 (embedding)

Cancer type embedding for treatment-specific heads

Source: Clinical metadata

Outputs

3 outputs

IC50 predictions[295]

Predicted drug sensitivity for 295 compounds

Treatment ranking[49 regimens]

Pareto-ranked regimens with abstention flags

Sequential plan[3 lines]

Beam-searched 3-line treatment sequence with resistance penalties

Mathematical Formulation

Drug Interaction

Bilinear drug-cell sensitivity prediction

TARNet CATE

Conditional average treatment effect from factual/counterfactual heads

IPW Deconfounding

Inverse propensity weighting for observational confounding

Key Features

Phase 1: Bilinear drug-cell interaction model pretrained on GDSC (242K pairs)
Phase 2: TARNet with per-treatment FiLM heads and IPW deconfounding
Phase 3: 49 regimen registry with DrugMapper (GDSC↔TCGA↔clinical)
Phase 4: Pareto optimization with abstention gate and propensity safety
Phase 5: Sequential beam search over 3 treatment lines with resistance penalty
Treatment labels extracted for 9,415 TCGA patients from GDC
Knowledge-grounded per-clone drug sensitivity from OncoKB/CIViC

Key Innovations

1Cross-domain drug sensitivity transfer (cell line → patient)
2Counterfactual treatment effect estimation with TARNet
3Abstention gating for uncertain treatment recommendations
4Sequential treatment planning with pathway-based resistance modeling

Hyperparameters

Phase 1 LR

1e-3

Phase 2 LR

5e-4

Drug Embed Dim

128

TARNet Hidden

256

Regimens

49 (10 types + default)

Beam Width

5 (sequential planner)

Training Details

Phase 1: GDSC pretrained on 242K drug-cell pairs (685 cell lines with VAE latents, 295 drugs). Phase 2: TARNet trained on TCGA survival with IPW. Phase 4: Pareto ranking across efficacy, toxicity, resistance scores. Expert consensus: beta is under-identified, propensity uncalibrated for rare classes. Needs OPE validation.

Pipeline Position

Multi-modal VAE

Treatment Optimizer

Final Predictions