Anwar
abdoali5672
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability
upvoted a paper about 5 hours ago
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models
upvoted a paper about 5 hours ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
Organizations
None yet