- Attention Is All You Need
  Paper • 1706.03762 • Published • 68
- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 14
- GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
  Paper • 2305.13245 • Published • 5
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 242
Eli Chen
elichen3051
AI & ML interests
Learning Algorithms, Reinforcement Learning, Data Synthesis, Benchmarking
Recent Activity
published a dataset 22 days ago: elichen-skymizer/lm-eval-ruler-results-private-32K
updated a model 25 days ago: elichen3051/Llama-3.1-8B-GGUF
updated a model 25 days ago: elichen3051/Llama-3.2-1B-GGUF-fp16