Yoai
's Collections
Agents
updated
A Zero-Shot Language Agent for Computer Control with Structured
Reflection
Paper
•
2310.08740
•
Published
•
14
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Paper
•
2310.12823
•
Published
•
35
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring
Emergent Behaviors
Paper
•
2308.10848
•
Published
•
1
CLEX: Continuous Length Extrapolation for Large Language Models
Paper
•
2310.16450
•
Published
•
9
An Early Evaluation of GPT-4V(ision)
Paper
•
2310.16534
•
Published
•
21
Personas as a Way to Model Truthfulness in Language Models
Paper
•
2310.18168
•
Published
•
5
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation
Paper
•
2311.00272
•
Published
•
9
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
Paper
•
2311.02262
•
Published
•
10
Ultra-Long Sequence Distributed Transformer
Paper
•
2311.02382
•
Published
•
2
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning
Paper
•
2311.02303
•
Published
•
4
Prompt Engineering a Prompt Engineer
Paper
•
2311.05661
•
Published
•
20
Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized
Model Responses
Paper
•
2312.00763
•
Published
•
19
Merlin:Empowering Multimodal LLMs with Foresight Minds
Paper
•
2312.00589
•
Published
•
24
Instruction-tuning Aligns LLMs to the Human Brain
Paper
•
2312.00575
•
Published
•
11
DeepCache: Accelerating Diffusion Models for Free
Paper
•
2312.00858
•
Published
•
21
PathFinder: Guided Search over Multi-Step Reasoning Paths
Paper
•
2312.05180
•
Published
•
9
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
•
2312.10003
•
Published
•
35
Faithful Persona-based Conversational Dataset Generation with Large
Language Models
Paper
•
2312.10007
•
Published
•
6
Supervised Knowledge Makes Large Language Models Better In-context
Learners
Paper
•
2312.15918
•
Published
•
8
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention
and Distributed KVCache
Paper
•
2401.02669
•
Published
•
14
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper
•
2401.05033
•
Published
•
15
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model
Paper
•
2401.02330
•
Published
•
14
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual
Perception
Paper
•
2401.16158
•
Published
•
17
Weaver: Foundation Models for Creative Writing
Paper
•
2401.17268
•
Published
•
42
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper
•
2402.05140
•
Published
•
20
More Agents Is All You Need
Paper
•
2402.05120
•
Published
•
51
Premise Order Matters in Reasoning with Large Language Models
Paper
•
2402.08939
•
Published
•
25
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Paper
•
2402.09727
•
Published
•
35
Chain-of-Thought Reasoning Without Prompting
Paper
•
2402.10200
•
Published
•
99
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs
Miss
Paper
•
2402.10790
•
Published
•
40
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper
•
2402.17753
•
Published
•
18
GAIA: a benchmark for General AI Assistants
Paper
•
2311.12983
•
Published
•
183
Learning to Decode Collaboratively with Multiple Language Models
Paper
•
2403.03870
•
Published
•
18
SOTOPIA-π: Interactive Learning of Socially Intelligent Language
Agents
Paper
•
2403.08715
•
Published
•
20
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for
Large Language Models
Paper
•
2403.12881
•
Published
•
16
Evolutionary Optimization of Model Merging Recipes
Paper
•
2403.13187
•
Published
•
50
LLM Agent Operating System
Paper
•
2403.16971
•
Published
•
65
AgentLite: A Lightweight Library for Building and Advancing
Task-Oriented LLM Agent System
Paper
•
2402.15538
•
Published
•
6
LLMs Simulate Big Five Personality Traits: Further Evidence
Paper
•
2402.01765
•
Published
LLM Agents in Interaction: Measuring Personality Consistency and
Linguistic Alignment in Interacting Populations of Large Language Models
Paper
•
2402.02896
•
Published
Is Cognition and Action Consistent or Not: Investigating Large Language
Model's Personality
Paper
•
2402.14679
•
Published
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large
Language Models
Paper
•
2403.02246
•
Published
LLM Multi-Agent Systems: Challenges and Open Problems
Paper
•
2402.03578
•
Published
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper
•
2402.14034
•
Published
•
12
Social Skill Training with Large Language Models
Paper
•
2404.04204
•
Published
•
15
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of
Diverse Models
Paper
•
2404.18796
•
Published
•
68
Prometheus 2: An Open Source Language Model Specialized in Evaluating
Other Language Models
Paper
•
2405.01535
•
Published
•
116
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
•
2406.04692
•
Published
•
55
Octo-planner: On-device Language Model for Planner-Action Agents
Paper
•
2406.18082
•
Published
•
47
ROS-LLM: A ROS framework for embodied AI with task feedback and
structured reasoning
Paper
•
2406.19741
•
Published
•
59
LiteSearch: Efficacious Tree Search for LLM
Paper
•
2407.00320
•
Published
•
37
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
•
2407.01489
•
Published
•
42
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for
LLM Agents
Paper
•
2407.04363
•
Published
•
26
Stark: Social Long-Term Multi-Modal Conversation with Persona
Commonsense Knowledge
Paper
•
2407.03958
•
Published
•
18
LAMBDA: A Large Model Based Data Agent
Paper
•
2407.17535
•
Published
•
34
PERSONA: A Reproducible Testbed for Pluralistic Alignment
Paper
•
2407.17387
•
Published
•
17
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Paper
•
2408.01584
•
Published
•
7
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in
Long-Horizon Tasks
Paper
•
2408.03615
•
Published
•
30
Generating novel experimental hypotheses from language models: A case
study on cross-dative generalization
Paper
•
2408.05086
•
Published
•
4
The AI Scientist: Towards Fully Automated Open-Ended Scientific
Discovery
Paper
•
2408.06292
•
Published
•
115
Benchmarking Agentic Workflow Generation
Paper
•
2410.07869
•
Published
•
25
marcelbinz/Llama-3.1-Centaur-70B
Text Generation
•
Updated
•
391
•
12