131 9 97

enzo PRO

enzostvs

https://en-zo.dev

AI & ML interests

— building cool stuffs ✨

Recent Activity

new activity 2 days ago

enzostvs/deepsite:How do I add my own images?

new activity 3 days ago

enzostvs/deepsite:Upload WhatsApp Image 2025-05-06 at 20.15.30.jpeg

new activity 4 days ago

enzostvs/deepsite:Upload index.html

View all activity

Organizations

enzostvs's activity

New activity in enzostvs/deepsite 2 days ago

How do I add my own images?

#115 opened 3 days ago by

AnderSmithMusic

New activity in enzostvs/deepsite 3 days ago

Upload WhatsApp Image 2025-05-06 at 20.15.30.jpeg

#114 opened 3 days ago by

Mendezmartinok

New activity in enzostvs/deepsite 4 days ago

Upload index.html

#113 opened 4 days ago by

Hungtruthguard

New activity in enzostvs/deepsite 5 days ago

Upload Main Script.zip

#111 opened 5 days ago by

Silver25

New activity in enzostvs/deepsite 7 days ago

Fdgh

#110 opened 7 days ago by

YemeniInMars

reacted to openfree's post with 🔥 8 days ago

Post

2538

🚀 Introducing Phi-4-reasoning-plus: Powerful 14B Reasoning Model by Microsoft!

VIDraft/phi-4-reasoning-plus

🌟 Key Highlights
Compact Size (14B parameters): Efficient for use in environments with limited computing resources, yet powerful in performance.

Extended Context (32k tokens): Capable of handling lengthy and complex input sequences.

Enhanced Reasoning: Excels at multi-step reasoning, particularly in mathematics, science, and coding challenges.

Chain-of-Thought Methodology: Provides a detailed reasoning process, followed by concise, accurate summaries.

🏅 Benchmark Achievements
Despite its smaller size, Phi-4-reasoning-plus has delivered impressive results, often surpassing significantly larger models:

Mathematical Reasoning (AIME 2025): Achieved an accuracy of 78%, significantly outperforming larger models like DeepSeek-R1 Distilled (51.5%) and Claude-3.7 Sonnet (58.7%).

Olympiad-level Math (OmniMath): Strong performance with an accuracy of 81.9%, surpassing DeepSeek-R1 Distilled's 63.4%.

Graduate-Level Science Questions (GPQA-Diamond): Delivered competitive performance at 68.9%, close to larger models and demonstrating its capabilities in advanced scientific reasoning.

Coding Challenges (LiveCodeBench): Scored 53.1%, reflecting strong performance among smaller models, though slightly behind specialized coding-focused models.

🛡️ Safety and Robustness
Comprehensive safety evaluation completed through Microsoft's independent AI Red Team assessments.

High standards of alignment and responsible AI compliance validated through extensive adversarial testing.

🎯 Recommended Applications
Phi-4-reasoning-plus is especially suitable for:
Systems with limited computational resources.
Latency-sensitive applications requiring quick yet accurate responses.

📜 License
Freely available under the MIT License for broad accessibility and flexible integration into your projects.