LifeAgentBench: A Multi-dimensional Benchmark and Agent for Personal Health Assistants in Digital Health
Published in International Joint Conference on Artificial Intelligence (IJCAI) (under review), 2026
We introduce LifeAgentBench, a large-scale QA benchmark (22,573 questions) for long-horizon, cross-dimensional, and multi-user lifestyle health reasoning over structured health records (diet, activity, sleep, and emotion). We provide a standardized evaluation protocol, evaluate 11 leading LLMs, and propose LifeAgent, a training-free tool-calling baseline that improves performance via multi-step evidence retrieval and deterministic aggregation.
Recommended citation: Tian, Y.$^*$, Wang, Z.$^*$, Gungor, O., Fan, X., & Rosing, T. (2026). LifeAgentBench: A Multi-dimensional Benchmark and Agent for Personal Health Assistants in Digital Health. arXiv:2601.13880. Under review at IJCAI. https://arxiv.org/abs/2601.13880
