-
CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation
Paper • 2401.01275 • Published • 1 -
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper • 2402.17753 • Published • 20 -
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering
Paper • 2402.16288 • Published • 1 -
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
Paper • 2502.14802 • Published • 13
Soulter
Soulter
·
AI & ML interests
None yet
Recent Activity
liked
a model
14 days ago
Menlo/Jan-nano-128k
liked
a model
about 1 month ago
deepseek-ai/DeepSeek-R1-0528