Sunyoung Hwang's picture

Sunyoung Hwang PRO

sosoai

·

https://sosohajalab.com

AI & ML interests

llm, vision, transformers, megabytes

Recent Activity

liked a model 4 days ago

apple/DiffuCoder-7B-cpGRPO

liked a model 4 days ago

Qwen/Qwen2.5-Coder-7B-Instruct

liked a model 6 days ago

agentica-org/DeepSWE-Preview

View all activity

Organizations

upvoted a collection 7 days ago

GLM-4.1V-Thinking

5 items • Updated 7 days ago • 40

upvoted a collection 9 days ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 6 days ago • 143

upvoted a paper 21 days ago

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Paper • 2506.13654 • Published 22 days ago • 43

upvoted a collection about 1 month ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 17 days ago • 66

upvoted an article about 1 month ago

Article

Interactive Tools for machine learning, deep learning, and math

By

•

May 26

• 44

upvoted 3 collections about 2 months ago

Perception Encoder

9 items • Updated Apr 17 • 61

Qwen3

72 items • Updated 23 days ago • 835

INTELLECT-2

INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated 14 days ago • 23

upvoted 2 collections 2 months ago

CCI4.0

5 items • Updated 29 days ago • 11

Qwen3

21 items • Updated Apr 29 • 29

upvoted an article 2 months ago

Article

How to Build an MCP Server with Gradio

By

and 1 other •

Apr 30

• 177

upvoted 2 collections 2 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 7 days ago • 162

Perception LM

7 items • Updated Apr 17 • 57

upvoted an article 3 months ago

Article

Cohere on Hugging Face Inference Providers 🔥

By

and 6 others •

Apr 16

• 126

upvoted 3 collections 3 months ago

InternVL3

34 items • Updated Apr 20 • 72

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated 8 days ago • 68

Cogito v1 Preview

5 items • Updated Apr 8 • 116

upvoted a collection 4 months ago

Gemma 3 Release

24 items • Updated May 30 • 399

upvoted an article 4 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 294

upvoted a collection 4 months ago

Sky-T1-7B

A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated Feb 14 • 7