标签: 论文解读

Reflexion 论文深度解读：用语言代替梯度，让Agent学会自我反思

Reflexion = Verbal Reinforcement Learning
论文：Shinn et al., 2023, Northeastern + MIT + Princeton
原文链接：https://arxiv.org/abs/2303.11366
代码：https://github.com/noahshinn024/reflexion
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月11日...大约 12 分钟

AgentBench 论文深度解读：第一个系统化评估 LLM 作为 Agent 能力的基准

AgentBench: Evaluating LLMs as Agents
论文：Liu, Xu et al., 清华 + 上交 + UC Berkeley + Microsoft + Stanford 等
原文链接：https://arxiv.org/abs/2308.03688
发表：2023.8 | 引用：1000+（Semantic Scholar）
开源：https://github.com/alibabaagents/agentbench
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月6日...大约 11 分钟

ChatDev 论文深度解读：AI 驱动的多 Agent 软件开发虚拟公司

ChatDev: Communicative Agents for Software Development
论文：Chen Qian, Liu Wei et al., 清华 + 北大 + 微软亚洲研究院
原文链接：https://arxiv.org/abs/2307.07924
发表：2023.8 | 引用：500+（Semantic Scholar）
开源：https://github.com/OpenBMB/ChatDev
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月6日...大约 12 分钟

Generative Agents 论文深度解读：AI 模拟人类社会行为的开创性实验

Generative Agents: Interactive Simulacra of Human Behavior
论文：Stanford University + Google DeepMind
原文链接：https://arxiv.org/abs/2304.03442
发表：2023.4（arXiv）| UIST 2023 正式发表 | 引用：1000+
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月6日...大约 12 分钟

Agentic RAG 深度解读：检索增强与 Agent 能力的深度结合

Agentic RAG: Combining Retrieval-Augmented Generation with Agent Capabilities
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月4日...大约 10 分钟

Computer Use 论文深度解读：AI Agent 操控操作系统的多模态突破

Computer Use: Anthropic's Breakthrough in Native GUI Control for AI Agents
论文：Anthropic，2024
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月4日...大约 9 分钟

Self-Discovering 论文深度解读：LLM 自我组合推理结构的突破性方法

Self-Discover: Large Language Models Self-Compose Reasoning Structures
论文：Google DeepMind（Zhou et al.），2024
原文链接：https://arxiv.org/abs/2402.03620
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月4日...大约 11 分钟

MemGPT 论文深度解读：突破 LLM 上下文窗口限制的层级记忆管理

MemGPT: Towards LLMs as Operating Systems
论文：UC Berkeley Packer, Wooders, Lin, Fang, Patil, Stoica, Gonzalez，2023
本文记录我的论文学习过程与核心理解

Mr.Sun2026年5月1日...大约 8 分钟

Toolformer 论文深度解读：LLM 自学使用工具

Toolformer: Language Models Can Teach Themselves to Use Tools
论文：Timo Schick et al., Meta AI, 2023
本文记录我的论文学习过程与核心理解

Mr.Sun2026年4月29日...大约 8 分钟

ReAct 论文深度解读：让大模型学会"边想边做"

ReAct = Reasoning + Acting
论文：Yao et al., 2022, Google Research + Princeton
原文链接：https://arxiv.org/abs/2210.03629
本文记录我的论文学习过程与核心理解

Mr.Sun2026年4月28日...大约 5 分钟