realy?
Alifian candra
Alian95
AI & ML interests
None yet
Recent Activity
replied to
burtenshaw's
post
9 months ago
NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.
🔗 https://huggingface.co/reasoning-course
This unit is super useful if you’re tuning models with reinforcement learning. It will help with:
- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions
This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth.
📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.
new activity
9 months ago
huggingface/InferenceSupport:Alian95/Alian95
published
a model
9 months ago
Alian95/Alian95
Organizations
None yet
replied to
burtenshaw's
post
9 months ago
Alian95/Alian95
1
#132 opened 9 months ago
by
Alian95
reacted to
burtenshaw's
post with ❤️
9 months ago
Post
3498
NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.
🔗
reasoning-course
This unit is super useful if you’re tuning models with reinforcement learning. It will help with:
- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions
This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth.
📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.
🔗
This unit is super useful if you’re tuning models with reinforcement learning. It will help with:
- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions
This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth.
📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.
reacted to
clem's
post with 👀
9 months ago
Post
2309
Very interesting security section by
@yjernite
@lvwerra
@reach-vb
@dvilasuero
& the team replicating R1. Broadly applicable to most open-source models & some to APIs (but APIs have a lot more additional risks because you're not in control of the underlying system):
https://huggingface.co/blog/open-r1/update-4#is-it-safe
https://huggingface.co/blog/open-r1/update-4#is-it-safe
reacted to
AdinaY's
post with 🤗🚀
9 months ago
Post
2532
Let's check out the latest releases from the Chinese community in March!
👉 https://huggingface.co/collections/zh-ai-community/march-2025-releases-from-the-chinese-community-67c6b479ebb87abbdf8e2e76
✨MLLM
> R1 Omni by Alibaba Tongyi - 0.5B
> Qwen2.5 Omni by Alibaba Qwen - 7B with apache2.0
🖼️Video
> CogView-4 by ZhipuAI - Apacha2.0
> HunyuanVideo-I2V by TencentHunyuan
> Open Sora2.0 - 11B with Apache2.0
> Stepvideo TI2V by StepFun AI - 30B with MIT license
🎵Audio
> DiffDiffRhythm - Apache2.0
> Spark TTS by SparkAudio - 0.5B
⚡️Image/3D
> Hunyuan3D 2mv/2mini (0.6B) by @TencentHunyuan
> FlexWorld by ByteDance - MIT license
> Qwen2.5-VL-32B-Instruct by Alibaba Qwen - Apache2.0
> Tripo SG (1.5B)/SF by VastAIResearch - MIT license
> InfiniteYou by ByteDance
> LHM by Alibaba AIGC team - Apache2.0
> Spatial LM by ManyCore
🧠Reasoning
> QwQ-32B by Alibaba Qwen - Apache2.0
> Skywork R1V - 38B with MIT license
> RWKV G1 by RWKV AI - 0.1B pure RNN reasoning model with Apache2.0
> Fin R1 by SUFE AIFLM Lab - financial reasoning
🔠LLM
> DeepSeek v3 0324 by DeepSeek -MIT license
> Babel by Alibaba DAMO - 9B/83B/25 languages
👉 https://huggingface.co/collections/zh-ai-community/march-2025-releases-from-the-chinese-community-67c6b479ebb87abbdf8e2e76
✨MLLM
> R1 Omni by Alibaba Tongyi - 0.5B
> Qwen2.5 Omni by Alibaba Qwen - 7B with apache2.0
🖼️Video
> CogView-4 by ZhipuAI - Apacha2.0
> HunyuanVideo-I2V by TencentHunyuan
> Open Sora2.0 - 11B with Apache2.0
> Stepvideo TI2V by StepFun AI - 30B with MIT license
🎵Audio
> DiffDiffRhythm - Apache2.0
> Spark TTS by SparkAudio - 0.5B
⚡️Image/3D
> Hunyuan3D 2mv/2mini (0.6B) by @TencentHunyuan
> FlexWorld by ByteDance - MIT license
> Qwen2.5-VL-32B-Instruct by Alibaba Qwen - Apache2.0
> Tripo SG (1.5B)/SF by VastAIResearch - MIT license
> InfiniteYou by ByteDance
> LHM by Alibaba AIGC team - Apache2.0
> Spatial LM by ManyCore
🧠Reasoning
> QwQ-32B by Alibaba Qwen - Apache2.0
> Skywork R1V - 38B with MIT license
> RWKV G1 by RWKV AI - 0.1B pure RNN reasoning model with Apache2.0
> Fin R1 by SUFE AIFLM Lab - financial reasoning
🔠LLM
> DeepSeek v3 0324 by DeepSeek -MIT license
> Babel by Alibaba DAMO - 9B/83B/25 languages
reacted to
jasoncorkill's
post with 👀
9 months ago
Post
2278
🔥 It's out! We published the dataset for our evaluation of
@OpenAI
's new 4o image generation model.
Rapidata/OpenAI-4o_t2i_human_preference
Yesterday we published the first large evaluation of the new model, showing that it absolutely leaves the competition in the dust. We have now made the results and data available here! Please check it out and ❤️ !
Rapidata/OpenAI-4o_t2i_human_preference
Yesterday we published the first large evaluation of the new model, showing that it absolutely leaves the competition in the dust. We have now made the results and data available here! Please check it out and ❤️ !
reacted to
samihalawa's
post with 👀🔥👍
9 months ago
Post
3518
🧠 PROMPT FOR CONVERTING ANY MODEL IN REASONING "THINKING" MODEL🔥🤖
Convert any model to Deepseek R1 like "thinking" model. 💭
Convert any model to Deepseek R1 like "thinking" model. 💭
You're now a thinking-first LLM. For all inputs:
1. Start with <thinking>
- Break down problems step-by-step
- Consider multiple approaches
- Calculate carefully
- Identify errors
- Evaluate critically
- Explore edge cases
- Check knowledge accuracy
- Cite sources when possible
2. End with </thinking>
3. Then respond clearly based on your thinking.
The <thinking> section is invisible to users and helps you produce better answers.
For math: show all work and verify
For coding: reason through logic and test edge cases
For facts: verify information and consider reliability
For creative tasks: explore options before deciding
For analysis: examine multiple interpretations
Example:
<thinking>
[Step-by-step analysis]
[Multiple perspectives]
[Self-critique]
[Final conclusion]
</thinking>
[Clear, concise response to user]great
reacted to
Keltezaa's
post with 🔥
9 months ago
Post
9888
Dear HF Staff and pro Users.
Why did you remove the "Regen" feature from the ZeroGPU feature?
Is this an error or intended?
I am now limited to 13 images per 24 hrs. Using my space.
When I upgraded to Pro, it was exclusively for the 5x more usage and the faster regen.
The reason I spend my hard earned money on your site was for this feature.
This is totally unacceptable.
########
Other Pro Users please reply and tag others
IF YOU AGREE or DISAGREE.
########
@Always-cheating ,@anonymous111110987654321 ,@Arshili @bedspirit @blackedguy @John6666 ,@DavidBaloches @E-07 ,@f-14 @mindfulpeoples @multimodalart
Why did you remove the "Regen" feature from the ZeroGPU feature?
Is this an error or intended?
I am now limited to 13 images per 24 hrs. Using my space.
When I upgraded to Pro, it was exclusively for the 5x more usage and the faster regen.
The reason I spend my hard earned money on your site was for this feature.
This is totally unacceptable.
########
Other Pro Users please reply and tag others
IF YOU AGREE or DISAGREE.
########
@Always-cheating ,@anonymous111110987654321 ,@Arshili @bedspirit @blackedguy @John6666 ,@DavidBaloches @E-07 ,@f-14 @mindfulpeoples @multimodalart
reacted to
AdinaY's
post with 👍
9 months ago
Post
1799
Exciting release from 3D-focused startup - VastAIResearch
They just dropped 2 open 3D models on the hub 🚀
✨TripoSG: 1.5B MoE Transformer 3D model
Model: VAST-AI/TripoSG
Paper: TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models (2502.06608)
✨ TripoSF: 3D shape modeling with SparseFlex, enabling high-resolution reconstruction (up to 1024³)
Model: VAST-AI/TripoSF
Paper: SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling (2503.21732)
They just dropped 2 open 3D models on the hub 🚀
✨TripoSG: 1.5B MoE Transformer 3D model
Model: VAST-AI/TripoSG
Paper: TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models (2502.06608)
✨ TripoSF: 3D shape modeling with SparseFlex, enabling high-resolution reconstruction (up to 1024³)
Model: VAST-AI/TripoSF
Paper: SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling (2503.21732)
reacted to
jasoncorkill's
post with 🚀
9 months ago
Post
2278
🔥 It's out! We published the dataset for our evaluation of
@OpenAI
's new 4o image generation model.
Rapidata/OpenAI-4o_t2i_human_preference
Yesterday we published the first large evaluation of the new model, showing that it absolutely leaves the competition in the dust. We have now made the results and data available here! Please check it out and ❤️ !
Rapidata/OpenAI-4o_t2i_human_preference
Yesterday we published the first large evaluation of the new model, showing that it absolutely leaves the competition in the dust. We have now made the results and data available here! Please check it out and ❤️ !
reacted to
AdinaY's
post with 🔥
9 months ago
Post
2532
Let's check out the latest releases from the Chinese community in March!
👉 https://huggingface.co/collections/zh-ai-community/march-2025-releases-from-the-chinese-community-67c6b479ebb87abbdf8e2e76
✨MLLM
> R1 Omni by Alibaba Tongyi - 0.5B
> Qwen2.5 Omni by Alibaba Qwen - 7B with apache2.0
🖼️Video
> CogView-4 by ZhipuAI - Apacha2.0
> HunyuanVideo-I2V by TencentHunyuan
> Open Sora2.0 - 11B with Apache2.0
> Stepvideo TI2V by StepFun AI - 30B with MIT license
🎵Audio
> DiffDiffRhythm - Apache2.0
> Spark TTS by SparkAudio - 0.5B
⚡️Image/3D
> Hunyuan3D 2mv/2mini (0.6B) by @TencentHunyuan
> FlexWorld by ByteDance - MIT license
> Qwen2.5-VL-32B-Instruct by Alibaba Qwen - Apache2.0
> Tripo SG (1.5B)/SF by VastAIResearch - MIT license
> InfiniteYou by ByteDance
> LHM by Alibaba AIGC team - Apache2.0
> Spatial LM by ManyCore
🧠Reasoning
> QwQ-32B by Alibaba Qwen - Apache2.0
> Skywork R1V - 38B with MIT license
> RWKV G1 by RWKV AI - 0.1B pure RNN reasoning model with Apache2.0
> Fin R1 by SUFE AIFLM Lab - financial reasoning
🔠LLM
> DeepSeek v3 0324 by DeepSeek -MIT license
> Babel by Alibaba DAMO - 9B/83B/25 languages
👉 https://huggingface.co/collections/zh-ai-community/march-2025-releases-from-the-chinese-community-67c6b479ebb87abbdf8e2e76
✨MLLM
> R1 Omni by Alibaba Tongyi - 0.5B
> Qwen2.5 Omni by Alibaba Qwen - 7B with apache2.0
🖼️Video
> CogView-4 by ZhipuAI - Apacha2.0
> HunyuanVideo-I2V by TencentHunyuan
> Open Sora2.0 - 11B with Apache2.0
> Stepvideo TI2V by StepFun AI - 30B with MIT license
🎵Audio
> DiffDiffRhythm - Apache2.0
> Spark TTS by SparkAudio - 0.5B
⚡️Image/3D
> Hunyuan3D 2mv/2mini (0.6B) by @TencentHunyuan
> FlexWorld by ByteDance - MIT license
> Qwen2.5-VL-32B-Instruct by Alibaba Qwen - Apache2.0
> Tripo SG (1.5B)/SF by VastAIResearch - MIT license
> InfiniteYou by ByteDance
> LHM by Alibaba AIGC team - Apache2.0
> Spatial LM by ManyCore
🧠Reasoning
> QwQ-32B by Alibaba Qwen - Apache2.0
> Skywork R1V - 38B with MIT license
> RWKV G1 by RWKV AI - 0.1B pure RNN reasoning model with Apache2.0
> Fin R1 by SUFE AIFLM Lab - financial reasoning
🔠LLM
> DeepSeek v3 0324 by DeepSeek -MIT license
> Babel by Alibaba DAMO - 9B/83B/25 languages
reacted to
BFFree's
post with 😎
9 months ago
Post
1399
The handheld point and shoot digital camera is close to my heart cause my Dad always had his in the front pocket of his shirt. Love the lines, simplicity and compact greatness. Some fun mashups
reacted to
JLouisBiz's
post with 🔥
9 months ago
Post
1656
In this exciting demonstration, we explore how you can enhance your productivity with cutting-edge features right at your fingertips. Experience seamless speech recognition and automatic text correction on GNU/Linux systems using just a couple of mouse clicks!
https://www.youtube.com/watch?v=51jEUtjrARo
What You'll Discover:
Speech Recognition: Activate by pressing *Mouse Button 9*. Say goodbye to typing fatigue as our system effortlessly converts spoken words into digital text.
Automatic LLM Text Correction: Press Mouse Button 8 for instant, intelligent corrections. Our advanced language model ensures your writing is polished and precise.
Why You Should Watch:
✅ Boost Your Efficiency
🔍 Simplify Complex Tasks
💡 Enhance Writing Quality
Whether you're a developer looking to streamline coding or someone who spends hours typing reports, this demonstration will show how these features can transform the way you work.
Don't miss out on discovering an innovative approach that integrates speech recognition and text correction into your daily routine with ease!
💬 Drop a comment below if you have questions or want to share how these features could benefit your workflow.
https://www.youtube.com/watch?v=51jEUtjrARo
What You'll Discover:
Speech Recognition: Activate by pressing *Mouse Button 9*. Say goodbye to typing fatigue as our system effortlessly converts spoken words into digital text.
Automatic LLM Text Correction: Press Mouse Button 8 for instant, intelligent corrections. Our advanced language model ensures your writing is polished and precise.
Why You Should Watch:
✅ Boost Your Efficiency
🔍 Simplify Complex Tasks
💡 Enhance Writing Quality
Whether you're a developer looking to streamline coding or someone who spends hours typing reports, this demonstration will show how these features can transform the way you work.
Don't miss out on discovering an innovative approach that integrates speech recognition and text correction into your daily routine with ease!
💬 Drop a comment below if you have questions or want to share how these features could benefit your workflow.