Jiahao Wu, Zhongwen Xu, Qiang Fu, and Wei Yang

Tencent · TEG · AIPD

December 2025


TL;DR


News


1. The Research Bottleneck for Multi-Turn Search Agents

Recently, LLM reasoning and tool-use capabilities have improved rapidly: on the one hand, the open-source community has released many strong models [2,8]; on the other hand, when moving to multi-turn, long-horizon search-and-verify settings (multiple retrievals, iterative disambiguation, evidence aggregation, and converging to a final answer), there is still a clear gap, where smaller open-source models often lag far behind closed-source commercial models in both effective use of search turns and final accuracy.