The paper introduces
PaperBanana, an advanced multi-agent framework designed to automate the creation of professional
academic methodology diagrams and statistical plots. The system utilizes a specialized workflow involving
retriever, planner, visualizer, and critic agents that collaborate to transform technical text into high-quality illustrations through iterative refinement. To rigorously measure performance, the researchers developed
PaperBananaBench, a benchmark derived from modern AI publications that evaluates images based on
faithfulness, conciseness, readability, and aesthetics. The study demonstrates that this agentic approach significantly outperforms standard generative models, producing results that align closely with human-drawn figures. Additionally, the documentation provides
comprehensive style guides and evaluation protocols aimed at standardizing the visual language of scientific discovery. The framework ultimately seeks to
democratize design resources for scientists while maintaining technical accuracy through human-in-the-loop oversight.
References:
- Zhu D, Meng R, Song Y, et al. PaperBanana: Automating Academic Illustration for AI Scientists[J]. arXiv preprint arXiv:2601.23265, 2026.