arxiv:2501.19324
Yuhui Xu
yuhuixu
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
authored
a paper
2 days ago
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
upvoted
a
paper
2 days ago
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Organizations
None yet
Papers
12
models
14
yuhuixu/merged_model_linear_0.6_0.4
Text Generation
•
Updated
•
6
yuhuixu/merged_model_linear_0.5_0.5
Text Generation
•
Updated
•
4
yuhuixu/merged_model_linear_0.4_0.6
Text Generation
•
Updated
•
6
yuhuixu/llama_linear_merge_inst_base_0.5_0.5
Text Generation
•
Updated
•
40
yuhuixu/llama_linear_merge_inst_base_0.8_0.2
Text Generation
•
Updated
•
37
yuhuixu/llama_linear_merge_inst_base_0.9_0.1
Text Generation
•
Updated
•
47
yuhuixu/mistral-bias-0.85
Text Generation
•
Updated
•
1
yuhuixu/mistral-bias-0.9
Text Generation
•
Updated
•
2
yuhuixu/mistral-experts-calme
Text Generation
•
Updated
•
3
yuhuixu/dpo_mistralai-Mistral-7B-Instruct-v0.2-experts
Text Generation
•
Updated
•
4
datasets
None public yet