Toward Generative Virtual Cells: Co-Evolving World Models and Perturbation Planners
Abstract
Data-driven models can predict cellular responses to perturbations, yet they rarely help design the next experiment. Conversely, experiment-design policies typically assume a fixed surrogate model. We propose an adaptive agent that jointly evolves a virtual cell world model and a perturbation-design policy through uncertainty-aware experiment design, with each component proposing changes that improve the other. We introduce a minimal synthetic biology testbed: a five-gene regulatory network with CRISPR-like and environmental perturbations, returning noisy single-cell readouts. Within a co-evolution loop, a Bayesian-optimization-style planner chooses perturbations based on world model predictions, while an outer loop allows the agent to modify its world-model architecture under validation-and-capacity gating. On this testbed, continual retraining within the co-evolution loop is the dominant factor driving a 95%+ reduction in in-distribution prediction error; we also identify adaptation-induced distribution shift—planner bias narrows the training distribution, degrading out-of-distribution generalization by 4×. The architecture search component correctly identifies the initial architecture as sufficient for this problem scale, demonstrating capacity-aware null detection: a desirable conservative property absent from unconstrained self-modification. The framework provides a concrete, safety-aware testbed for studying model–planner coupling in scientific domains.