Ali WALEED's picture

1 18 2

Ali WALEED

ali-hagrassy

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Multi-Token Attention

upvoted a paper 17 days ago

The Prompt Report: A Systematic Survey of Prompting Techniques

upvoted a paper 24 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

View all activity

Organizations

upvoted a paper 12 days ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 54

upvoted a paper 17 days ago

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 65

upvoted 5 papers 24 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 296

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 179

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 217

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 246

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 254

upvoted a paper about 2 months ago

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Paper • 2505.17894 • Published May 23 • 218

upvoted an article about 2 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5, 2024

• 277

upvoted 9 papers 5 months ago

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 34

The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer

Paper • 2502.15631 • Published Feb 21 • 9

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 197

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published Jan 30 • 20

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 152

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

Paper • 2502.12170 • Published Feb 13 • 12

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published Feb 17 • 54

Phantom: Subject-consistent video generation via cross-modal alignment

Paper • 2502.11079 • Published Feb 16 • 60

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published Feb 18 • 38