10 7 3

Liliang Ren

renll

http://renll.github.io/

AI & ML interests

None yet

Recent Activity

new activity 4 days ago

microsoft/Phi-4-mini-flash-reasoning:paradox model

updated a model 9 days ago

microsoft/Phi-4-mini-flash-reasoning

new activity 9 days ago

microsoft/Phi-4-mini-flash-reasoning:Fix typo in `configuration_phi4flash.py`

View all activity

Organizations

New activity in microsoft/Phi-4-mini-flash-reasoning 4 days ago

paradox model

#9 opened 5 days ago by

BhargavMupparisetty

updated a model 9 days ago

microsoft/Phi-4-mini-flash-reasoning

Text Generation • 4B • Updated 9 days ago • 21.9k • 220

New activity in microsoft/Phi-4-mini-flash-reasoning 9 days ago

Fix typo in `configuration_phi4flash.py`

#8 opened 10 days ago by

hmellor

New activity in microsoft/Phi-4-mini-flash-reasoning 10 days ago

Make `configuration_phi4flash.py` and `modeling_phi4flash.py` compatible with standard sliding window config

#7 opened 16 days ago by

hmellor

Make `config.json` compatible with standard sliding window config

👍 1

#6 opened 16 days ago by

hmellor

liked a model 15 days ago

kernels-community/vllm-flash-attn3

Updated 3 days ago • 16

authored 2 papers 22 days ago

PaTH Attention: Position Encoding via Accumulating Householder Transformations

Paper • 2505.16381 • Published May 22

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Paper • 2507.06607 • Published Jul 9 • 10

New activity in microsoft/Phi-4-mini-flash-reasoning about 1 month ago

Prompt word input is always overwritten by default input, always talking to oneself

#2 opened about 1 month ago by

xiaoboelse

Improve model card: Add paper abstract for Phi-4-mini-flash-reasoning

#4 opened about 1 month ago by

nielsr

Add training codebase link mentioned in paper abstract

👍 1

#3 opened about 1 month ago by

nielsr

upvoted a paper about 1 month ago

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Paper • 2507.06607 • Published Jul 9 • 10

commented a paper about 1 month ago

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Paper • 2507.06607 • Published Jul 9 • 10 •

liked a model about 1 month ago

microsoft/Phi-4-mini-flash-reasoning

Text Generation • 4B • Updated 9 days ago • 21.9k • 220

published a model about 1 month ago

microsoft/Phi-4-mini-flash-reasoning

Text Generation • 4B • Updated 9 days ago • 21.9k • 220

upvoted a paper 2 months ago

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Paper • 2506.09991 • Published Jun 11 • 56

authored a paper 4 months ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 48

upvoted a paper 4 months ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 48

liked a model 4 months ago

microsoft/Phi-4-mini-reasoning

Text Generation • 4B • Updated May 1 • 20.1k • 201

Liliang Ren

AI & ML interests

Recent Activity

Organizations

renll's activity

paradox model

Fix typo in `configuration_phi4flash.py`

Make `configuration_phi4flash.py` and `modeling_phi4flash.py` compatible with standard sliding window config

Make `config.json` compatible with standard sliding window config

Prompt word input is always overwritten by default input, always talking to oneself

Improve model card: Add paper abstract for Phi-4-mini-flash-reasoning

Add training codebase link mentioned in paper abstract