đ
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks paper by Xiyang Wu, Zongxia Li, Guangyao Shi
â 47
đ
Paper
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
by Xiyang Wu, Zongxia Li, Guangyao Shi arxiv-paper--unknown--2604.20987
Nexus Index
47.0 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 0
R: Recency 100
Q: Quality 45
Tech Context
Vital Performance 0.0%
0 DL / 30D
High Impact 0 Citations
2024 Year
ArXiv Venue
- FNI Rank
| Entity Passport | |
| Registry ID | arxiv-paper--unknown--2604.20987 |
| License | ArXiv |
| Provider | hf |
đ
Cite this paper
Academic & Research Attribution
BibTeX
@misc{arxiv_paper__unknown__2604.20987,
author = {Xiyang Wu, Zongxia Li, Guangyao Shi},
title = {Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks Paper},
year = {2026},
howpublished = {\url{https://free2aitools.com/paper/arxiv-paper--unknown--2604.20987}},
note = {Accessed via Free2AITools Knowledge Fortress}
} APA Style
Xiyang Wu, Zongxia Li, Guangyao Shi. (2026). Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks [Paper]. Free2AITools. https://free2aitools.com/paper/arxiv-paper--unknown--2604.20987
đŦTechnical Deep Dive
Full Specifications [+]âž
âī¸ Nexus Index V2.0
47.0
TOP 100% SYSTEM IMPACT
Semantic (S) 50
Authority (A) 0
Popularity (P) 0
Recency (R) 100
Quality (Q) 45
đŦ Index Insight
FNI V2.0 for Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:100), Quality (Q:45).
Free2AITools Nexus Index
Verification Authority
Unbiased Data
Node Refresh: VFS Live
đ Executive Summary
"Technical abstract for this publication is currently being indexed."
â Cite Node
@article{Unknown2026Co-Evolving,
title={Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks},
author={},
journal={arXiv preprint arXiv:arxiv-paper--unknown--2604.20987},
year={2026}
} Abstract & Analysis
VFS Recovering Reference...
[2604.20987] Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
-->
Computer Science > Artificial Intelligence
text
arXiv:2604.20987 (cs)
[Submitted on 22 Apr 2026]
Title: Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
text
Authors: Xiyang Wu , Zongxia Li , Guangyao Shi , Alexander Duffy , Tyler Marques , Matthew Lyle Olson , Tianyi Zhou , Dinesh Manocha View a PDF of the paper titled Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks, by Xiyang Wu and 7 other authors
View PDF
Abstract: Long horizon interactive environments are a testbed for evaluating agents skill usage abilities. These environments demand multi step reasoning, the chaining of multiple skills over many timesteps, and robust decision making under delayed rewards and partial observability. Games are a good testbed for evaluating agent skill usage in environments. Large Language Models (LLMs) offer a promising alternative as game playing agents, but they often struggle with consistent long horizon decision making because they lack a mechanism to discover, retain, and reuse structured skills across episodes. We present COSPLAY, a co evolution framework in which an LLM decision agent retrieves skills from a learnable skill bank to guide action taking, while an agent managed skill pipeline discovers reusable skills from the agents unlabeled rollouts to form a skill bank. Our framework improves both the decision agent to learn better skill retrieval and action generation, while the skill bank agent continually extracts, refines, and updates skills together with their contracts. Experiments across six game environments show that COSPLAY with an 8B base model achieves over 25.1 percent average reward improvement against four frontier LLM baselines on single player game benchmarks while remaining competitive on multi player social reasoning games.
Comments:
26 pages
Subjects:
Artificial Intelligence (cs.AI)
Cite as:
arXiv:2604.20987 [cs.AI]
(or
arXiv:2604.20987v1 [cs.AI] for this version)
https://doi.org/10.48550/arXiv.2604.20987
Focus to learn more
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Zongxia Li [ view email ] [v1] Wed, 22 Apr 2026 18:17:17 UTC (1,461 KB)
text
Full-text links:
Access Paper:
View a PDF of the paper titled Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks, by Xiyang Wu and 7 other authors View PDF TeX Source
text
view license
Current browse context:
cs.AI
text
new
|
recent
| 2026-04
Change to browse by:
cs
References & Citations
text
NASA ADS Google Scholar
Semantic Scholar
export BibTeX citation
Loading...
BibTeX formatted citation
text
×
loading...
Data provided by:
Bookmark
text
Bibliographic Tools
Bibliographic and Citation Tools
text
Bibliographic Explorer Toggle
Bibliographic Explorer ( What is the Explorer? )
Connected Papers Toggle
Connected Papers ( What is Connected Papers? )
Litmaps Toggle
Litmaps ( What is Litmaps? )
scite.ai Toggle
scite Smart Citations ( What are Smart Citations? )
Code, Data, Media
Code, Data and Media Associated with this Article
text
alphaXiv Toggle
alphaXiv ( What is alphaXiv? )
Links to Code Toggle
CatalyzeX Code Finder for Papers ( What is CatalyzeX? )
DagsHub Toggle
DagsHub ( What is DagsHub? )
GotitPub Toggle
Gotit.pub ( What is GotitPub? )
Huggingface Toggle
Hugging Face ( What is Huggingface? )
ScienceCast Toggle
ScienceCast ( What is ScienceCast? )
Demos
Demos
text
Replicate Toggle
Replicate ( What is Replicate? )
Spaces Toggle
Hugging Face Spaces ( What is Spaces? )
Spaces Toggle
TXYZ.AI ( What is TXYZ.AI? )
Related Papers
Recommenders and Search Tools
text
Link to Influence Flower
Influence Flower ( What are Influence Flowers? )
Core recommender toggle
CORE Recommender ( What is CORE? )
Author
Venue
Institution
Topic
About arXivLabs
arXivLabs: experimental projects with community collaborators
text
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs .
Which authors of this paper are endorsers? |
Disable MathJax ( What is MathJax? )
đĻData Source: hf
đ Daily sync (03:00 UTC)AI Summary: Based on hf metadata. Not a recommendation.
đĄī¸ Paper Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
đ Identity & Source
- id
- arxiv-paper--unknown--2604.20987
- slug
- unknown--2604.20987
- source
- hf
- author
- Xiyang Wu, Zongxia Li, Guangyao Shi
- license
- ArXiv
- tags
- paper, research, llm
âī¸ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
đ Engagement & Metrics
- downloads
- 0
- stars
- 0
- forks
- 0
Data indexed from public sources. Updated daily.