-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 83 -
SlimPajama-DC: Understanding Data Combinations for LLM Training
Paper • 2309.10818 • Published • 11 -
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Paper • 2309.08968 • Published • 23 -
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 39
Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted
a
paper
about 20 hours ago
Model Context Protocol (MCP): Landscape, Security Threats, and Future
Research Directions
upvoted
a
paper
about 20 hours ago
MCP Safety Audit: LLMs with the Model Context Protocol Allow Major
Security Exploits
upvoted
a
paper
1 day ago
Scalable Chain of Thoughts via Elastic Reasoning
Organizations
Collections
3
models
6
demolei/qwen2_5_vl_7b_grpo_chartqa_filtered_40
Updated
•
3
demolei/Qwen2.5-VL-7B-Instruct-chartqa_filtered_240
Updated
•
4
demolei/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
4
demolei/Qwen-2.5-7B-Simple-RL
Text Generation
•
Updated
•
3
demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
demolei/sft_openassistant-guanaco
Updated
datasets
0
None public yet