Hasan Can Solakoğlu's picture

16 332

Hasan Can Solakoğlu PRO

hcsolakoglu

·

AI & ML interests

NLP, Vision, Data Science

Recent Activity

liked a dataset about 9 hours ago

sap-ai-research/diaforge-utc-r-0725

liked a model about 22 hours ago

OmniGen2/OmniGen2

upvoted a paper 2 days ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

View all activity

Organizations

upvoted a paper 2 days ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published 19 days ago • 17

upvoted a paper 4 days ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published 7 days ago • 48

upvoted a collection 5 days ago

Reward Models

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated about 22 hours ago • 11

upvoted a collection 9 days ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 5 days ago • 143

upvoted a paper 11 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 13 days ago • 58

upvoted a paper 14 days ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published 15 days ago • 56

upvoted a collection 21 days ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated about 22 hours ago • 13

upvoted a collection 28 days ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 16 days ago • 66

upvoted 2 collections 3 months ago

Nemotron-H

Mamba-Transformer hybrid models • 10 items • Updated about 22 hours ago • 29

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 565

upvoted a collection 4 months ago

Gemma 3 Release

24 items • Updated May 30 • 399

upvoted 3 collections 5 months ago

CodeI/O

Collection for CodeI/O @ https://codei-o.github.io/ • 16 items • Updated May 6 • 7

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 14 days ago • 62

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 500

upvoted a collection 6 months ago

DeepSeek-R1

10 items • Updated May 29 • 741