Steven Goldfeather

treehugg3

AI & ML interests

Messing with LLM weights, LLM alignment techniques

Recent Activity

new activity about 1 month ago

jukofyork/creative-writing-control-vectors-v3.0:“The doom lies in yourself, not in your name.”

new activity 5 months ago

jukofyork/creative-writing-control-vectors-v3.0:Wur doomed!

new activity 5 months ago

nvidia/Nemotron-CC-v2:Access Request

Organizations

None yet

New activity in jukofyork/creative-writing-control-vectors-v3.0 about 1 month ago

“The doom lies in yourself, not in your name.”

👀 6

256

#15 opened 4 months ago by

jukofyork

New activity in jukofyork/creative-writing-control-vectors-v3.0 5 months ago

Wur doomed!

566

#14 opened 12 months ago by

jukofyork

New activity in nvidia/Nemotron-CC-v2 5 months ago

Access Request

➕ 9

#3 opened 5 months ago by

muchanem

New activity in Arki05/Grok-1-GGUF 5 months ago

``` 🥲 Failed to load model error loading model: missing tensor 'blk.0.ffn_down_exps.weight' ```

#18 opened 7 months ago by

MAGIC000

New activity in ByteDance-Seed/Seed-OSS-36B-Base-woSyn 5 months ago

What was the underlying training data distribution?

👍 1

#2 opened 5 months ago by

treehugg3

New activity in mradermacher/model_requests 5 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

#1303 opened 5 months ago by

Poro7

https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Base-woSyn

#1317 opened 5 months ago by

treehugg3

New activity in ggml-org/gpt-oss-120b-GGUF 5 months ago

What quantization level does this model use?

#2 opened 5 months ago by

ernestr

New activity in nvidia/Nemotron-CC-v2 5 months ago

license may be too restrictive for its purposes

👍 1

#2 opened 5 months ago by

huu-ontocord

upvoted an article 5 months ago

Article

NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual

Aug 18, 2025

New activity in nvidia/Nemotron-CC-v2 5 months ago

Which parts of the dataset are synthetic?

#1 opened 5 months ago by

treehugg3

replied to RakshitAralimatti's post 5 months ago

Thanks for answering. There's a severe lack of high quality technical references in the AI space, so hopefully you will help fill the void.

replied to RakshitAralimatti's post 5 months ago

Not a bad start, but it needs compression and doesn't include examples or quantitative justifications for the areas that benefit from reasoning. Doesn't even mention backtracking, which is something in reasoning models I'd like to learn more about. What are examples of "novel" reasoning paths?

What was the research paper which introduced reasoning? You don't list any sources or external links.

New activity in microsoft/Phi-tiny-MoE-instruct 5 months ago