No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper: arXiv 2412.11768
datatrove, for all things web-scale data preparation: https://github.com/huggingface/datatrove
nanotron, for lightweight 4D parallelism LLM training: https://github.com/huggingface/nanotron
lighteval, for in-training fast parallel LLM evaluations: https://github.com/huggingface/lighteval