soke's picture

1 9 4

soke

sode-k

·

https://gravatar.com/sodek28

AI & ML interests

LLM, NLP, LSTM I often use optuna for Hyperparameter automatic optimization.

Recent Activity

upvoted a paper about 1 month ago

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

upvoted a paper 4 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

liked a model 6 months ago

google/gemma-2-2b-jpn-it

View all activity

Organizations

sode-k's activity

upvoted a paper about 1 month ago

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

Paper • 2504.04718 • Published Apr 7 • 41

upvoted a paper 4 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 391

upvoted a collection 6 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 77

upvoted 6 papers 6 months ago

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 39

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 125

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 67

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 50

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16, 2024 • 47

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 78