Maharsh's picture

In a Training Loop 🔄

Maharsh

maharshpatelx

·

AI & ML interests

None yet

Recent Activity

liked a model 20 days ago

XiaomiMiMo/MiMo-V2.5-Pro

liked a model about 1 month ago

MiniMaxAI/MiniMax-M2.7

liked a model about 2 months ago

facebook/tribev2

View all activity

Organizations

upvoted an article 5 months ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

qgallouedec

•

Dec 4, 2025

• 69

upvoted a collection 5 months ago

Common Pile v0.1

All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6, 2025 • 40

upvoted 2 collections 6 months ago

Trinity

Collection of Arcee AI models in the Trinity family • 14 items • Updated Mar 25 • 30

Holo2

Holo2 - Cost-Efficient Models for Cross-Platform Computer-Use Agents • 4 items • Updated Feb 2 • 27

upvoted an article 7 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

driaforall

•

Oct 9, 2025

• 33

upvoted a collection 7 months ago

GTA1

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 5

upvoted an article 7 months ago

Article

GRPO for GUI Grounding Done Right

HelloKKMe

•

Jun 11, 2025

• 37

upvoted 2 collections 8 months ago

Qwen3-VL

37 items • Updated Dec 31, 2025 • 721

ScaleCUA

6 items • Updated Mar 2 • 18

upvoted an article 8 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

A-Mahla, merve, sergiopaniego, reach-vb, lewtun

•

Sep 23, 2025

• 138

upvoted a collection 8 months ago

Holo1.5

Holo1.5 - Open Foundation Models for Computer Use Agents • 5 items • Updated Sep 15, 2025 • 35

upvoted a paper 9 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 127

upvoted a collection 9 months ago

The Well

A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24, 2025 • 51

upvoted a paper 10 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

upvoted a collection 10 months ago

GUI Datasets

Datasets from the graphical user interfaces domain (screenshots). • 20 items • Updated Dec 3, 2024 • 8

upvoted an article 10 months ago

Article

Creating custom kernels for the AMD MI300

ror, seungrokj

•

Jul 9, 2025

• 54

upvoted 2 collections 11 months ago

ERNIE 4.5

collection of ERNIE 4.5 models. • 27 items • Updated Nov 11, 2025 • 189

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 216

upvoted a collection about 1 year ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 736

upvoted an article about 1 year ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

open-r1

•

Jan 31, 2025

• 51