qwerty87
's Collections
Interesting AI papers
updated
Attention Is All You Need
Paper
•
1706.03762
•
Published
•
44
BERT: Pre-training of Deep Bidirectional Transformers for Language
Understanding
Paper
•
1810.04805
•
Published
•
14
Universal Language Model Fine-tuning for Text Classification
Paper
•
1801.06146
•
Published
•
6
Language Models are Few-Shot Learners
Paper
•
2005.14165
•
Published
•
11
EELBERT: Tiny Models through Dynamic Embeddings
Paper
•
2310.20144
•
Published
•
3
Scaling Laws for Neural Language Models
Paper
•
2001.08361
•
Published
•
6
Training Compute-Optimal Large Language Models
Paper
•
2203.15556
•
Published
•
10
BloombergGPT: A Large Language Model for Finance
Paper
•
2303.17564
•
Published
•
20
MARRS: Multimodal Reference Resolution System
Paper
•
2311.01650
•
Published
•
2
Scaling Instruction-Finetuned Language Models
Paper
•
2210.11416
•
Published
•
7
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
Understanding
Paper
•
1804.07461
•
Published
•
4
SuperGLUE: A Stickier Benchmark for General-Purpose Language
Understanding Systems
Paper
•
1905.00537
•
Published
•
2
Measuring Massive Multitask Language Understanding
Paper
•
2009.03300
•
Published
•
3
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
Paper
•
2303.15647
•
Published
•
4
LoRA: Low-Rank Adaptation of Large Language Models
Paper
•
2106.09685
•
Published
•
30
QLoRA: Efficient Finetuning of Quantized LLMs
Paper
•
2305.14314
•
Published
•
45
The Power of Scale for Parameter-Efficient Prompt Tuning
Paper
•
2104.08691
•
Published
•
9
Learning to summarize from human feedback
Paper
•
2009.01325
•
Published
•
4
ReAct: Synergizing Reasoning and Acting in Language Models
Paper
•
2210.03629
•
Published
•
14
Training language models to follow instructions with human feedback
Paper
•
2203.02155
•
Published
•
15
Proximal Policy Optimization Algorithms
Paper
•
1707.06347
•
Published
•
3
Direct Preference Optimization: Your Language Model is Secretly a Reward
Model
Paper
•
2305.18290
•
Published
•
48
Constitutional AI: Harmlessness from AI Feedback
Paper
•
2212.08073
•
Published
•
2
Automatic Chain of Thought Prompting in Large Language Models
Paper
•
2210.03493
•
Published
•
2
PAL: Program-aided Language Models
Paper
•
2211.10435
•
Published
•
4