đ
HippoCamp: Benchmarking Contextual Agents on Personal Computers paper by Zhe Yang, Shulin Tian, Kairui Hu
â 37
đ
Paper
HippoCamp: Benchmarking Contextual Agents on Personal Computers
by Zhe Yang, Shulin Tian, Kairui Hu arxiv-paper--unknown--2604.01221
Nexus Index
37.0 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 0
R: Recency 100
Q: Quality 45
Tech Context
Vital Performance 0.0%
0 DL / 30D
High Impact 0 Citations
2024 Year
ArXiv Venue
- FNI Rank
| Entity Passport | |
| Registry ID | arxiv-paper--unknown--2604.01221 |
| License | ArXiv |
| Provider | hf |
đ
Cite this paper
Academic & Research Attribution
BibTeX
@misc{arxiv_paper__unknown__2604.01221,
author = {Zhe Yang, Shulin Tian, Kairui Hu},
title = {HippoCamp: Benchmarking Contextual Agents on Personal Computers Paper},
year = {2026},
howpublished = {\url{https://free2aitools.com/paper/arxiv-paper--unknown--2604.01221}},
note = {Accessed via Free2AITools Knowledge Fortress}
} APA Style
Zhe Yang, Shulin Tian, Kairui Hu. (2026). HippoCamp: Benchmarking Contextual Agents on Personal Computers [Paper]. Free2AITools. https://free2aitools.com/paper/arxiv-paper--unknown--2604.01221
đŦTechnical Deep Dive
Full Specifications [+]âž
âī¸ Nexus Index V2.0
37.0
TOP 100% SYSTEM IMPACT
Semantic (S) 50
Authority (A) 0
Popularity (P) 0
Recency (R) 100
Quality (Q) 45
đŦ Index Insight
FNI V2.0 for HippoCamp: Benchmarking Contextual Agents on Personal Computers: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:100), Quality (Q:45).
Free2AITools Nexus Index
Verification Authority
Unbiased Data
Node Refresh: VFS Live
đ Executive Summary
"Technical abstract for this publication is currently being indexed."
â Cite Node
@article{Unknown2026HippoCamp:,
title={HippoCamp: Benchmarking Contextual Agents on Personal Computers},
author={},
journal={arXiv preprint arXiv:arxiv-paper--unknown--2604.01221},
year={2026}
} Abstract & Analysis
VFS Recovering Reference...
[2604.01221] HippoCamp: Benchmarking Contextual Agents on Personal Computers
-->
Computer Science > Artificial Intelligence
text
arXiv:2604.01221 (cs)
[Submitted on 1 Apr 2026]
Title: HippoCamp: Benchmarking Contextual Agents on Personal Computers
text
Authors: Zhe Yang , Shulin Tian , Kairui Hu , Shuai Liu , Hoang-Nhat Nguyen , Yichi Zhang , Zujin Guo , Mengying Yu , Zinan Zhang , Jingkang Yang , Chen Change Loy , Ziwei Liu View a PDF of the paper titled HippoCamp: Benchmarking Contextual Agents on Personal Computers, by Zhe Yang and 11 other authors
View PDF
Abstract: We present HippoCamp, a new benchmark designed to evaluate agents' capabilities on multimodal file management. Unlike existing agent benchmarks that focus on tasks like web interaction, tool use, or software automation in generic settings, HippoCamp evaluates agents in user-centric environments to model individual user profiles and search massive personal files for context-aware reasoning. Our benchmark instantiates device-scale file systems over real-world profiles spanning diverse modalities, comprising 42.4 GB of data across over 2K real-world files. Building upon the raw files, we construct 581 QA pairs to assess agents' capabilities in search, evidence perception, and multi-step reasoning. To facilitate fine-grained analysis, we provide 46.1K densely annotated structured trajectories for step-wise failure diagnosis. We evaluate a wide range of state-of-the-art multimodal large language models (MLLMs) and agentic methods on HippoCamp. Our comprehensive experiments reveal a significant performance gap: even the most advanced commercial models achieve only 48.3% accuracy in user profiling, struggling particularly with long-horizon retrieval and cross-modal reasoning within dense personal file systems. Furthermore, our step-wise failure diagnosis identifies multimodal perception and evidence grounding as the primary bottlenecks. Ultimately, HippoCamp exposes the critical limitations of current agents in realistic, user-centric environments and provides a robust foundation for developing next-generation personal AI assistants.
Comments:
Project Page: this https URL
Subjects:
Artificial Intelligence (cs.AI) ; Computer Vision and Pattern Recognition (cs.CV)
Cite as:
arXiv:2604.01221 [cs.AI]
(or
arXiv:2604.01221v1 [cs.AI] for this version)
https://doi.org/10.48550/arXiv.2604.01221
Focus to learn more
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Shulin Tian [ view email ] [v1] Wed, 1 Apr 2026 17:58:33 UTC (24,493 KB)
text
Full-text links:
Access Paper:
View a PDF of the paper titled HippoCamp: Benchmarking Contextual Agents on Personal Computers, by Zhe Yang and 11 other authors View PDF TeX Source
text
view license
Current browse context: cs.AI
new
|
recent
| 2026-04
Change to browse by:
cs
cs.CV
References & Citations
text
NASA ADS Google Scholar
Semantic Scholar
export BibTeX citation
Loading...
BibTeX formatted citation
text
×
loading...
Data provided by:
Bookmark
text
Bibliographic Tools
Bibliographic and Citation Tools
text
Bibliographic Explorer Toggle
Bibliographic Explorer ( What is the Explorer? )
Connected Papers Toggle
Connected Papers ( What is Connected Papers? )
Litmaps Toggle
Litmaps ( What is Litmaps? )
scite.ai Toggle
scite Smart Citations ( What are Smart Citations? )
Code, Data, Media
Code, Data and Media Associated with this Article
text
alphaXiv Toggle
alphaXiv ( What is alphaXiv? )
Links to Code Toggle
CatalyzeX Code Finder for Papers ( What is CatalyzeX? )
DagsHub Toggle
DagsHub ( What is DagsHub? )
GotitPub Toggle
Gotit.pub ( What is GotitPub? )
Huggingface Toggle
Hugging Face ( What is Huggingface? )
Links to Code Toggle
Papers with Code ( What is Papers with Code? )
ScienceCast Toggle
ScienceCast ( What is ScienceCast? )
Demos
Demos
text
Replicate Toggle
Replicate ( What is Replicate? )
Spaces Toggle
Hugging Face Spaces ( What is Spaces? )
Spaces Toggle
TXYZ.AI ( What is TXYZ.AI? )
Related Papers
Recommenders and Search Tools
text
Link to Influence Flower
Influence Flower ( What are Influence Flowers? )
Core recommender toggle
CORE Recommender ( What is CORE? )
Author
Venue
Institution
Topic
About arXivLabs
arXivLabs: experimental projects with community collaborators
text
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs .
Which authors of this paper are endorsers? |
Disable MathJax ( What is MathJax? )
đĻData Source: hf
đ Daily sync (03:00 UTC)AI Summary: Based on hf metadata. Not a recommendation.
đĄī¸ Paper Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
đ Identity & Source
- id
- arxiv-paper--unknown--2604.01221
- slug
- unknown--2604.01221
- source
- hf
- author
- Zhe Yang, Shulin Tian, Kairui Hu
- license
- ArXiv
- tags
- paper, research
âī¸ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
đ Engagement & Metrics
- downloads
- 0
- stars
- 0
- forks
- 0
Data indexed from public sources. Updated daily.