enzo's picture

enzo PRO

enzostvs

AI & ML interests

β€” building cool stuffs ✨

Recent Activity

Organizations

Hugging Face's profile picture Blog-explorers's profile picture Hugging Face Tools's profile picture Devart.bio's profile picture Social Post Explorers's profile picture Dev Mode Explorers's profile picture Hugging Face Discord Community's profile picture

enzostvs's activity

New activity in enzostvs/deepsite 2 days ago
New activity in enzostvs/deepsite 4 days ago

Upload index.html

#113 opened 4 days ago by
Hungtruthguard
New activity in enzostvs/deepsite 5 days ago

Upload Main Script.zip

#111 opened 5 days ago by
Silver25
New activity in enzostvs/deepsite 7 days ago

Fdgh

#110 opened 7 days ago by
YemeniInMars
reacted to openfree's post with πŸ”₯ 8 days ago
view post
Post
2538
πŸš€ Introducing Phi-4-reasoning-plus: Powerful 14B Reasoning Model by Microsoft!

VIDraft/phi-4-reasoning-plus

🌟 Key Highlights
Compact Size (14B parameters): Efficient for use in environments with limited computing resources, yet powerful in performance.

Extended Context (32k tokens): Capable of handling lengthy and complex input sequences.

Enhanced Reasoning: Excels at multi-step reasoning, particularly in mathematics, science, and coding challenges.

Chain-of-Thought Methodology: Provides a detailed reasoning process, followed by concise, accurate summaries.

πŸ… Benchmark Achievements
Despite its smaller size, Phi-4-reasoning-plus has delivered impressive results, often surpassing significantly larger models:

Mathematical Reasoning (AIME 2025): Achieved an accuracy of 78%, significantly outperforming larger models like DeepSeek-R1 Distilled (51.5%) and Claude-3.7 Sonnet (58.7%).

Olympiad-level Math (OmniMath): Strong performance with an accuracy of 81.9%, surpassing DeepSeek-R1 Distilled's 63.4%.

Graduate-Level Science Questions (GPQA-Diamond): Delivered competitive performance at 68.9%, close to larger models and demonstrating its capabilities in advanced scientific reasoning.

Coding Challenges (LiveCodeBench): Scored 53.1%, reflecting strong performance among smaller models, though slightly behind specialized coding-focused models.

πŸ›‘οΈ Safety and Robustness
Comprehensive safety evaluation completed through Microsoft's independent AI Red Team assessments.

High standards of alignment and responsible AI compliance validated through extensive adversarial testing.

🎯 Recommended Applications
Phi-4-reasoning-plus is especially suitable for:
Systems with limited computational resources.
Latency-sensitive applications requiring quick yet accurate responses.

πŸ“œ License
Freely available under the MIT License for broad accessibility and flexible integration into your projects.
  • 2 replies
Β·
New activity in enzostvs/deepsite 8 days ago

Create Gerador de sinais

#109 opened 8 days ago by
Ndikumukiza
New activity in enzostvs/qwensite 8 days ago

Have ploblem

1
#1 opened 8 days ago by
timoon811
New activity in enzostvs/deepsite 8 days ago
New activity in enzostvs/deepsite 10 days ago
New activity in enzostvs/deepsite 11 days ago

Teste

#102 opened 11 days ago by
Rwhehhehe