This paper introduces
Scouter, a novel computational tool designed to predict how cells respond to
genetic perturbations using transcriptomic data. Unlike previous models that depend on sparse and incomplete biological graphs, this method leverages
large language model embeddings to capture complex gene relationships from textual descriptions. By integrating these dense vectors with a
compressor-generator neural network, Scouter effectively simulates cellular reactions to both single and combined gene changes. The researchers demonstrate that this framework significantly
outperforms existing benchmarks, reducing prediction errors by approximately half across multiple datasets. Furthermore, the model is notable for its
efficiency and accessibility, as it runs on standard hardware without the need for extensive pretraining or specialized graph-based architectures.
References:
- Zhu O, Li J. Scouter predicts transcriptional responses to genetic perturbations with large language model embeddings[J]. Nature Computational Science, 2025: 1-8.