Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? Paper • 2502.11895 • Published Feb 17, 2025 • 2
An Extra RMSNorm is All You Need for Fine Tuning to 1.58 Bits Paper • 2505.08823 • Published May 12, 2025 • 2
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 622
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published Jun 2025 • 73
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 106
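Several of the entries above (the continual QAT, extra-RMSNorm, and 1-bit LLM papers) revolve around BitNet-style 1.58-bit weights, i.e. ternary values in {-1, 0, +1} obtained with an absmean quantizer. Below is a minimal sketch of that quantizer as described in "The Era of 1-bit LLMs"; the function name, the epsilon default, and returning a dequantized tensor are illustrative assumptions, not the papers' released code.

```python
import torch

def absmean_ternary_quant(w: torch.Tensor, eps: float = 1e-5) -> torch.Tensor:
    """Sketch of BitNet b1.58-style weight quantization:
    scale by the mean absolute value, then round-and-clip to {-1, 0, +1}."""
    gamma = w.abs().mean()                                  # per-tensor absmean scale
    w_scaled = w / (gamma + eps)                            # normalize weights by the scale
    w_ternary = torch.clamp(w_scaled.round(), -1.0, 1.0)    # ternary values in {-1, 0, +1}
    return w_ternary * gamma                                # dequantized view for inspection

# Example: quantize a random weight matrix and check the value set.
w = torch.randn(256, 256)
wq = absmean_ternary_quant(w)
print(torch.unique(wq / w.abs().mean()))                    # roughly {-1., 0., 1.}
```

In the actual BitNet recipe the ternary weights are kept as-is and the scale is carried into the low-bit matmul rather than multiplied back in; the dequantized return value here is only for inspecting the rounding behaviour.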