Sleep research article
Awakening the Sleeping Agent: Lean-Specific Agentic Data Reactivates General Tool Use in Goedel Prover
Authors: Jui-Hui Chung , Hongzhou Lin , Lai Jiang , Shange Tang , Chi Jin
One-line summary
A sleep science research article on Awakening the Sleeping Agent: Lean-Specific Agentic Data Reactivates General Tool Use in Goedel Prover.
Sleep health notes
Sleep health notes will be added by the Sleepatch editorial team.
中文解读
中文解读待补充:本站会优先为失眠研究、睡眠质量改善、昼夜节律等高价值睡眠研究添加中文说明。
Original abstract
Heavy supervised fine-tuning on a target domain can strongly suppress capabilities that were present in the base model. We study this phenomenon in formal mathematics using Goedel-Prover-V2, an open-source model heavily trained on 1.8 million formal-math examples. After domain specialization, the model almost completely loses its ability to produce valid tool calls, even when explicitly instructed to use tools, dropping from 89.4% function-calling accuracy in the base model to nearly 0%. We ask whether this agentic collapse is permanent or instead reversible. To answer this question, we fine-tune the specialized model on a small amount of Lean-specific tool-use data. Remarkably, as few as 100 agentic traces are sufficient to restore strong tool-calling behavior. Importantly, this recovery is not the result of reward hacking or benchmark-specific optimization: the recovery data is entirely drawn from the Lean setting, where the model uses natural-language queries to search the Mathlib library for relevant theorems and lemmas, yet the regained capability transfers well beyond that domain. In particular, these same 100 Lean-specific traces improve performance on the Berkeley Function Calling Leaderboard from near zero to 83.8%, approaching the base model's 89.4% despite the mismatch in task distribution and protocol. The recovered capability is also practically useful in-domain. On ProofNet, pass@32 improves from 21.51% to 25.81%. Together, these results show that heavy domain supervised fine-tuning can suppress general tool-use ability without permanently erasing it, and that a small amount of domain-specific agentic data can awaken dormant tool-use capabilities.
Links and sources
This content is provided for informational and educational purposes only and does not constitute medical advice, diagnosis, or treatment. Sleep disorders, chronic insomnia, sleep apnea, and other conditions must be evaluated and treated by a qualified healthcare professional. If you experience persistent or severe sleep problems, consult a licensed physician or sleep specialist. Research cited refers to peer-reviewed studies; individual results may vary. Sleepatch does not endorse any specific medication, supplement, or therapy.
Want a personalized sleep improvement plan?
Sleepatch can prepare a customized sleep wellness program, insomnia relief guide, and evidence-based sleep coaching based on your needs.
Explore sleep services
Comments