Prithiv Sakthi's picture

Prithiv Sakthi

prithivMLmods

·

https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality @strangerzonehf @strangerguardhf

Recent Activity

updated a Space about 8 hours ago

prithivMLmods/Multimodal-OCR

updated a collection about 9 hours ago

07/11 ~ Visual Understanding

updated a collection about 9 hours ago

07/11 ~ Visual Understanding

View all activity

Organizations

upvoted 3 papers 2 days ago

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

Paper • 2507.05920 • Published 4 days ago • 11

MedGemma Technical Report

Paper • 2507.05201 • Published 4 days ago • 10

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published 3 days ago • 64

upvoted 3 articles 3 days ago

Article

KV Cache from scratch in nanoVLM

By

and 4 others •

Jun 4

• 84

Article

Transformers backend integration in SGLang

By

and 4 others •

19 days ago

• 44

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

By

and 1 other •

11 days ago

• 87

upvoted a collection 4 days ago

Tar

Unifying Visual Understanding and Generation via Text-Aligned Representations • 5 items • Updated 10 days ago • 14

upvoted 2 papers 5 days ago

Fast and Simplex: 2-Simplicial Attention in Triton

Paper • 2507.02754 • Published 8 days ago • 23

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Paper • 2507.02321 • Published 9 days ago • 38

upvoted 3 papers 7 days ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published 9 days ago • 45

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published 9 days ago • 90

HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation

Paper • 2506.21546 • Published 15 days ago • 2

upvoted 2 papers 8 days ago

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

Paper • 2507.01925 • Published 9 days ago • 30

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published 9 days ago • 119

upvoted 3 papers 9 days ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published 11 days ago • 64

Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published 17 days ago • 38

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published 10 days ago • 179

upvoted an article 10 days ago

Article

Bringing Fusion Down to Earth: ML for Stellarator Optimization

By

•

10 days ago

• 64

upvoted 2 collections 10 days ago

Gemma 3n

4 items • Updated 2 days ago • 168

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 9 items • Updated 15 days ago • 148