-
Retentive Network: A Successor to Transformer for Large Language Models
Paper • 2307.08621 • Published • 171 -
LLM4SR: A Survey on Large Language Models for Scientific Research
Paper • 2501.04306 • Published • 33 -
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 81 -
On the Measure of Intelligence
Paper • 1911.01547 • Published • 3
Collections
Discover the best community collections!
Collections including paper arxiv:2501.04227
-
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 53 -
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale
Paper • 2309.06497 • Published • 5 -
MindAgent: Emergent Gaming Interaction
Paper • 2309.09971 • Published • 11 -
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 84