AGI-Edgerunners/LLM-Agents-Papers

A repo lists papers related to LLM based agent

agentslarge-language-modelsllm-agentpaper-list

GAI 中文摘要

LLM-Agents-Papers 是一个精心整理的开源项目，致力于系统化地收集并分类与基于大语言模型的智能体相关的学术论文。通过提供结构化的研究索引，该项目帮助开发者与研究者快速追踪智能体领域的最新技术进展与前沿研究成果。

该项目涵盖了智能体增强技术，包括规划、记忆机制、反馈反射及检索增强生成等核心组件。详细梳理了智能体在角色扮演、工具使用、游戏模拟及人机交互等场景下的互动模式。系统分类了智能体在数学、物理、生物及金融等垂直领域的具体应用方案。深入讨论了模型训练优化、多智能体协同框架以及安全性与幻觉评估等行业关键议题。收录了包括基准测试、评估环境及相关数据集在内的基础设施建设研究。

适用于人工智能领域的研究人员、算法工程师以及对自主智能体开发感兴趣的开发者。该项目特别适合需要快速进行文献综述、寻找技术实现思路或追踪智能体领域最新技术趋势的使用场景。

⭐

2.3k

Stars

🔱

149

Forks

👁

Watchers

📋

Issues

Python创建于 2023/5/31更新于今天

在 GitHub 上查看

README

由 Gemini 翻译整理

LLM-Agents-Papers

:writing_hand: 项目描述

最后更新时间：2025/7/12

这是一个整理基于 LLM（大语言模型）的 Agent 相关论文的仓库。内容包括：

综述 (Survey)
增强技术 (Technique For Enhancement)
交互 (Interaction)
应用 (Application)
自动化 (Automation)
- 工作流 (Workflow)
- 自动评估 (Automatic Evaluation)
训练 (Training)
规模化 (Scaling)
- 单智能体框架 (Single-Agent Framework)
- 多智能体系统 (Multi-Agent System)
稳定性 (Stability)
基础设施 (Infrastructure)
其他 (Others)

:yellow_heart: 推荐阅读

为了获得更全面的了解，我们也推荐其他论文列表：

zjunlp/LLMAgentPapers: LLM Agent 必读论文集。
teacherpeterpan/self-correction-llm-papers: 关于 LLM 自动反馈与自我修正的研究论文集。
Paitesanshi/LLM-Agent-Survey: 关于基于 LLM 的自主智能体的综述。
woooodyy/llm-agent-paper-list: LLM Agent 领域必读论文。
git-disl/awesome-LLM-game-agent-papers: LLM 游戏智能体必读论文。

:newspaper: 论文列表

综述 (Survey)

[2025/06/10] Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents | [paper] | [code]
[2025/06/06] Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey | [paper] | [code]
[2025/05/27] Creativity in LLM-based Multi-Agent Systems: A Survey | [paper] | [code]
[2025/05/24] Multi-Party Conversational Agents: A Survey | [paper] | [code]
[2025/05/16] A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron? | [paper] | [code]
[2025/05/02] AI agents may be worth the hype but not the resources (yet): An initial exploration of machine translation quality and costs in three language pairs in the legal and news domains | [paper] | [code]
[2025/05/01] A Survey on Large Language Model based Human-Agent Systems | [paper] | [code]
[2025/04/30] Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications | [paper] | [code]
[2025/04/22] A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment | [paper] | [code]
[2025/04/20] Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey | [paper] | [code]
[2025/04/14] A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science | [paper] | [code]
[2025/04/12] A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason, and Agentic Systems | [paper] | [code]
[2025/03/28] Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey | [paper] | [code]
[2025/03/27] Large Language Model Agent: A Survey on Methodology, Applications and Challenges | [paper] | [code]
[2025/03/27] A Survey on (M)LLM-Based GUI Agents | [paper] | [code]
[2025/03/24] A Survey of Large Language Model Agents for Question Answering | [paper]