1 5 33

Burning ray

adarksky

aeryskyB

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

sczhou/CodeFormer

liked a model 12 days ago

Intelligent-Internet/II-Medical-8B

liked a model 16 days ago

Lightricks/LTX-Video

View all activity

Organizations

adarksky's activity

liked a Space 1 day ago

2.05k

CodeFormer

🐼

Enhance and restore old photos with faces

liked a model 12 days ago

Intelligent-Internet/II-Medical-8B

Text Generation • Updated 12 days ago • 11.2k • • 136

liked a model 16 days ago

Lightricks/LTX-Video

Text-to-Video • Updated 7 days ago • 354k • • 1.61k

liked 2 models 28 days ago

Qwen/Qwen3-235B-A22B-FP8

Text Generation • Updated 6 days ago • 57.2k • • 68

Qwen/Qwen3-0.6B-FP8

Text Generation • Updated 7 days ago • 6.73k • 44

liked a dataset about 1 month ago

nvidia/OpenMathReasoning

Viewer • Updated 3 days ago • 5.68M • 45.7k • 266

liked a model about 2 months ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

Text Generation • Updated 19 days ago • 16.5k • • 302

upvoted a collection about 2 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated 28 days ago • 516

liked a dataset 3 months ago

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 40.8k • 175

updated a dataset 3 months ago

adarksky/WMT12-UN-es-en

Viewer • Updated Feb 28 • 11.2M • 13

published a dataset 3 months ago

adarksky/WMT12-UN-es-en

Viewer • Updated Feb 28 • 11.2M • 13

liked a Space 3 months ago

2.62k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

updated a model 4 months ago

adarksky/Qwen2.5-0.5B-sft-lora-rel-therapy

Text2Text Generation • Updated Feb 4

published a model 4 months ago

adarksky/Qwen2.5-0.5B-sft-lora-rel-therapy

Text2Text Generation • Updated Feb 4

liked a model 4 months ago

openai/whisper-tiny

Automatic Speech Recognition • Updated Feb 29, 2024 • 434k • 327

upvoted 2 papers 4 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 121

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 75

liked 2 models 4 months ago

deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated Feb 1 • 24.1k • 442

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 686k • • 12.2k

updated a model 4 months ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10 • 1.93M • • 4.41k