[1] Sébastien Bubeck, Christian Coester, Ronen Eldan, et al. Early science acceleration experiments with GPT-5, 2025. URL https://arxiv.org/abs/2511.16072.
[2] Andres M. Bran, Sam Cox, Oliver Schilter, et al. Augmenting large language models with chemistry tools. Nature Machine Intelligence, 6(5):525–535, 2024. DOI: 10.1038/s42256-024-00832-8. URL https://doi.org/10.1038/s42256-024-00832-8.
[3] Zifeng Wang, Lang Cao, Benjamin Danek, et al. Accelerating clinical evidence synthesis with large language models. npj Digital Medicine, 8:509, 2025. DOI: 10.1038/s41746-025-01840-7. URL https://doi.org/10.1038/s41746-025-01840-7.
[4] Michael Y. Li, Emily B. Fox, and Noah D. Goodman. Automated statistical model discovery with language models, 2024. URL https://arxiv.org/abs/2402.17879.
[5] Alfredo Guevara, Alexandru Lupsasca, David Skinner, et al. Single-minus graviton tree amplitudes are nonzero, 2026. URL https://cdn.openai.com/pdf/graviton.pdf. OpenAI preprint PDF.
[6] Michael P. Brenner, Vincent Cohen-Addad, and David Woodruff. Solving an open problem in theoretical physics using AI-assisted discovery, 2026. URL https://arxiv.org/abs/2603.04735.
[7] Sirui Lu, Zhijing Jin, Terry Jingchen Zhang, et al. Can theoretical physics research benefit from language agents?, 2025. URL https://arxiv.org/abs/2506.06214.
[8] Samuel Schmidgall, Yusheng Su, Ze Wang, et al. Agent laboratory: Using LLM agents as research assistants. In Findings of the Association for Computational Linguistics: EMNLP 2025, 2025. URL https://aclanthology.org/2025.findings-emnlp.320/.
[9] Erzhuo Shao, Yifang Wang, Yifan Qian, et al. SciSciGPT: Advancing human-AI collaboration in the science of science. Nature Computational Science, 2025. DOI: 10.1038/s43588-025-00906-6. URL https://doi.org/10.1038/s43588-025-00906-6.
[10] Yi Zhou. From paper to program: A multi-stage LLM-assisted workflow for accelerating quantum many-body algorithm development, 2026. URL https://arxiv.org/abs/2604.04089.
[11] Ken Deng, Xiangfei Wang, Guijing Duan, et al. Towards verifiable and self-correcting AI physicists for quantum many-body simulations, 2026. URL https://arxiv.org/abs/2604.00149.
[12] Jiaxuan Liu, Tiannian Zhu, Caiyuan Ye, et al. VASPilot: MCP-facilitated multi-agent intelligence for autonomous VASP simulations, 2025. URL https://arxiv.org/abs/2508.07035.
[13] Tiannian Zhu, Zhong Fang, Quansheng Wu, and Hongming Weng. MaterialsGalaxy: A platform fusing experimental and theoretical data in condensed matter physics. Chinese Physics B, 34(12):120702, 2025.
[14] Juraj Gottweis, Wei-Hung Weng, Alexander Daryin, Tao Tu, Anil Palepu, Petar Sirkovic, et al. Towards an AI co-scientist, 2025. URL https://arxiv.org/abs/2502.18864.
[15] Linfeng Zhang, Siheng Chen, Yuzhu Cai, et al. Bohrium + SciMaster: Building the infrastructure and ecosystem for agentic science at scale, 2025. URL https://arxiv.org/abs/2512.20469.
[16] Jingyi Chai, Shuo Tang, Rui Ye, Yuwen Du, Xinyu Zhu, Mengcheng Zhou, Yanfeng Wang, Yuzhi Zhang, Linfeng Zhang, Siheng Chen, et al. SciMaster: Towards general-purpose scientific AI agents, part I. X-Master as foundation: Can we lead on Humanity’s Last Exam?, 2025. URL https://arxiv.org/abs/2507.05241.
[17] Chris Lu, Cong Lu, Robert Tjarko Lange, et al. The AI scientist: Towards fully automated open-ended scientific discovery, 2024. URL https://arxiv.org/abs/2408.06292.
[18] Federico Bianchi, Owen Queen, Nitya Thakkar, Eric Sun, James Zou, et al. Exploring the use of AI authors and reviewers at Agents4Science. Nature Biotechnology, 44:11–14, 2026. DOI: 10.1038/s41587-025-02963-8. URL https://doi.org/10.1038/s41587-025-02963-8.
[19] Riccardo Bertolo and Alessandro Antonelli. Generative AI in scientific publishing: Disruptive or destructive? Nature Reviews Urology, 21:1–2, 2024. DOI: 10.1038/s41585-023-00836-w. URL https://doi.org/10.1038/s41585-023-00836-w.
[20] Keigo Kusumegi, Xinyu Yang, Paul Ginsparg, et al. Scientific production in the era of large language models. Science, 390(6779):1240–1243, 2025. DOI: 10.1126/science.adw3000. URL https://doi.org/10.1126/science.adw3000.
[21] Weixin Liang, Yaohui Zhang, Zhengxuan Wu, et al. Quantifying large language model usage in scientific papers. Nature Human Behaviour, 9:2599–2609, 2025. DOI: 10.1038/s41562-025-02273-8. URL https://doi.org/10.1038/s41562-025-02273-8.
[22] Anthropic. Model context protocol, 2024. URL https://modelcontextprotocol.io/docs/getting-started/intro.
[23] Anthropic. Agent skills protocol, 2025. URL https://agentskills.io/home.
[24] Hao Cui, Zahra Shamsi, Gowoon Cheon, et al. CURIE: Evaluating LLMs on multitask scientific long context understanding and reasoning, 2025. URL https://arxiv.org/abs/2503.13517.
[25] Haining Pan, James V. Roggeveen, Erez Berg, et al. CMT-benchmark: A benchmark for condensed matter theory built by expert researchers, 2025. URL https://arxiv.org/abs/2510.05228.
[26] Haoyu Guo, Maria Tikhanovskaya, Paul Raccuglia, et al. Expert evaluation of LLM world models: A high-Tc superconductivity case study, 2025. URL https://arxiv.org/abs/2511.03782.
[27] Yanzhen Wang, Yiyang Jiang, Diana Golovanova, Kamal Das, Hyeonhu Bae, Yufei Zhao, Huu-Thong Le, Abhinava Chatterjee, Yunzhe Liu, Chao-Xing Liu, et al. QMBench: A research level benchmark for quantum materials research, 2025. URL https://arxiv.org/abs/2512.19753.
[28] Weida Wang, Dongchen Huang, Jiatong Li, Tengchao Yang, Ziyang Zheng, Di Zhang, Dong Han, Benteng Chen, Binzhao Luo, Zhiyu Liu, et al. CMPhysBench: A benchmark for evaluating large language models in condensed matter physics, 2025. URL https://arxiv.org/abs/2508.18124.
[29] Ken Deng, Xiangfei Wang, Guijing Duan, Chen Mo, Junkun Huang, Runqing Zhang, Ling Qian, Zhiguo Huang, Jize Han, and Di Luo. Towards verifiable and self-correcting AI physicists for quantum many-body simulations, 2026. URL https://arxiv.org/abs/2604.00149.
[30] Tongtong Wu, Linhao Luo, Yuan-Fang Li, et al. Continual learning for large language models: A survey, 2024. URL https://arxiv.org/abs/2402.01364.
[31] Dawei Wang, Difang Huang, Haipeng Shen, and Brian Uzzi. A large-scale comparison of divergent creativity in humans and large language models. Nature Human Behaviour, 2025. DOI: 10.1038/s41562-025-02331-1. URL https://doi.org/10.1038/s41562-025-02331-1.
[32] Qianyue Hao, Fengli Xu, Yong Li, James Evans, et al. Artificial intelligence tools expand scientists’ impact but contract science’s focus. Nature, 649:1237–1243, 2026. DOI: 10.1038/s41586-025-09922-y. URL https://doi.org/10.1038/s41586-025-09922-y.
[33] Xiao-Liang Qi. The agentification of scientific research: A physicist’s perspective, 2026. URL https://arxiv.org/abs/2604.14718.
[34] Xiao-Liang Qi. Time, information and artificial intelligence. Physics, 2024. DOI: 10.7693/wl20240601. URL https://wuli.iphy.ac.cn/cn/article/doi/10.7693/wl20240601. Chinese article; page title also gives the English title “Time, information and artificial intelligence”.
[35] Xiao-Liang Qi. Teaching and mentoring the AI scientists, April 2025. URL https://pirsa.org/25040066. PIRSA:25040066.
[36] Xiao-Liang Qi. Teaching and mentoring the AI scientists. YouTube video, October 2025. URL https://www.youtube.com/watch?v=vYkYT1aBlVo. Title inferred from the corresponding PIRSA lecture link supplied by the author.
[37] Xiao-Liang Qi. A brief perspective on the artificial intelligence revolution. ai4.science discussion forum post, January 2026. URL https://forum.ai4.science/t/a-br ... ence-revolution/65. Posted January 19, 2026.