Zhiheng Lyu

cogito233

https://cogito233.github.io/

AI & ML interests

None yet

Recent Activity

liked a model 29 days ago

MiniMaxAI/MiniMax-M1-80k

Text Generation • 456B • Updated 10 days ago • 28.5k • 659

upvoted a paper 30 days ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published about 1 month ago • 253

liked a model about 1 month ago

MiniMaxAI/MiniMax-M1-40k

Text Generation • 456B • Updated 10 days ago • 16.5k • 169

liked a Space about 1 month ago

MiniMax M1 💬

Generate code snippets and web applications from text descriptions • 325

upvoted a paper about 1 month ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 24

upvoted a paper about 2 months ago

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26 • 18

authored a paper about 2 months ago

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26 • 18

upvoted a paper about 2 months ago

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

upvoted a paper 2 months ago

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published May 12 • 27

updated a dataset 3 months ago

cogito233/model_7B

Updated May 2 • 3

published a dataset 3 months ago

cogito233/model_7B

Updated May 2 • 3

updated a dataset 3 months ago

cogito233/model_3B

Updated May 2 • 4

published a dataset 3 months ago

cogito233/model_3B

Updated May 2 • 4

published a model 3 months ago

cogito233/qwen2.5-7b-dp-hard

Updated May 2

authored a paper 3 months ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1 • 44

upvoted a paper 3 months ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1 • 44

upvoted a paper 4 months ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published Mar 14 • 21

updated a dataset 5 months ago

TIGER-Lab/PixelWorld

Viewer • Updated Feb 17 • 104k • 887 • 4

updated a model 5 months ago

cogito233/mediawiki-images

Updated Feb 16

published a model 5 months ago