Hugging Face
Jue Wang
juewang
6 followers · 4 following
https://juewang.me/about/
JueWANG26088228
LorrinWWW
AI & ML interests
None yet
Recent Activity
published a model 6 days ago: juewang/deepseek_r1_0528_mtp_fp8
published a model 6 days ago: togethercomputer/Llama-3.1-8B-Instruct-MoAA-DPO
published a model 6 days ago: togethercomputer/Llama-3.1-8B-Instruct-MoAA-SFT
Organizations
Papers
5
arxiv:2406.04692
arxiv:2310.17157
arxiv:2309.08168
arxiv:2307.14430
models
14
juewang/deepseek_r1_0528_mtp_fp8 • Updated 6 days ago
juewang/llama-3.1-8b-test-lora • Updated Nov 13, 2024
juewang/deepseek-coder-6.7b-base-trt-int4-g64-hf • Text Generation • Updated May 10, 2024 • 13
juewang/deepseek-coder-1.3b-base-trt-int4-g64-hf • Text Generation • Updated May 10, 2024 • 14
juewang/deepseek-coder-1.3b-instruct-trt-int4-g64-hf • Text Generation • Updated May 10, 2024 • 17
juewang/deepseek-coder-6.7b-instruct-trt-int4-g64-hf • Text Generation • Updated May 9, 2024 • 14
juewang/deepseek-coder-6.7b-instruct-trt-int8-g64-hf • Text Generation • Updated May 9, 2024 • 12
juewang/deepseek-coder-6.7b-instruct-trt-int8-g32-hf • Text Generation • Updated May 9, 2024 • 13
juewang/deepseek-coder-6.7b-instruct-trt-int8-g128-hf • Text Generation • Updated May 8, 2024 • 16
juewang/Meta-Llama-3-2B-mlp-layer-pruned • Text Generation • Updated Apr 24, 2024 • 18
datasets
1
juewang/misc-data • Viewer • Updated Oct 11, 2023 • 861k • 267