Sushant Gautam's picture

23 7 3

Sushant Gautam

SushantGautam

·

https://www.sushant.info.np/

AI & ML interests

multimodal, deep learning

Recent Activity

published a model 14 days ago

SushantGautam/Kvasir-VQA-x1-lora_250812-1155

published a model 14 days ago

SushantGautam/Kvasir-VQA-x1-pali3b-lora_250812-1132

updated a model 14 days ago

SushantGautam/Kvasir-VQA-x1-pali3b-lora

View all activity

Organizations

upvoted 2 papers 2 months ago

Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models

Paper • 2505.16647 • Published May 22 • 1

Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy

Paper • 2506.09958 • Published Jun 11 • 1

upvoted an article 9 months ago

Article

Generative Agent Simulations of 1,000 People

By

•

Nov 19, 2024

• 10

upvoted a paper 9 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 134

upvoted a paper 11 months ago

Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study

Paper • 2409.17580 • Published Sep 26, 2024 • 9

upvoted 2 papers 12 months ago

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Paper • 2409.01437 • Published Sep 2, 2024 • 72

SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset

Paper • 2405.07354 • Published May 12, 2024 • 2