
Snehasish Barman

sbarman25


AI & ML interests

Machine Learning for Health, AI, Distributed Systems

Recent Activity

updated a collection 2 days ago

Models

upvoted an article 3 days ago

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

updated a collection 6 months ago

Datasets


Organizations

None yet

sbarman25's collections 17

Training & Architectures

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 76
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

Paper • 2307.08691 • Published Jul 17, 2023 • 9
Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 160
Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 51

Med AI Papers

Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries

Paper • 2310.13132 • Published Oct 19, 2023 • 8
Do We Still Need Clinical Language Models?

Paper • 2302.08091 • Published Feb 16, 2023 • 3
Advancing Multimodal Medical Capabilities of Gemini

Paper • 2405.03162 • Published May 6, 2024 • 2
The impact of using an AI chatbot to respond to patient messages

Paper • 2310.17703 • Published Oct 26, 2023 • 5

Models

Idefics1: https://huggingface.co/posts/HugoLaurencon/224808006741857

numind/NuNER-v0.1

Token Classification • Updated Apr 30, 2024 • 7.13k • 63
llmware/dragon-mistral-7b-v0

Text Generation • 7B • Updated Dec 14, 2023 • 882 • 113
fireworks-ai/firefunction-v1

Text Generation • 47B • Updated Mar 6, 2024 • 10 • 124
teknium/OpenHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Feb 19, 2024 • 209k • 864

Evals & Monitoring

G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment

Paper • 2303.16634 • Published Mar 29, 2023 • 3
miracl/miracl-corpus

Viewer • Updated Jan 5, 2023 • 77.2M • 3.23k • 46
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 36
How is ChatGPT's behavior changing over time?

Paper • 2307.09009 • Published Jul 18, 2023 • 24

Agentic

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 226
gaia-benchmark/GAIA

Updated Feb 13 • 7.46k • 402
osunlp/Mind2Web

Viewer • Updated Jul 19, 2023 • 253 • 546 • 106
AppAgent: Multimodal Agents as Smartphone Users

Paper • 2312.13771 • Published Dec 21, 2023 • 55

CV / Text-to-Image / Image-to-Image / Diffusion

https://huggingface.co/collections/merve/

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Paper • 2208.12242 • Published Aug 25, 2022 • 12
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Paper • 2308.06721 • Published Aug 13, 2023 • 31
h94/IP-Adapter-FaceID

Text-to-Image • Updated Apr 16, 2024 • 275k • 1.74k
PALP: Prompt Aligned Personalization of Text-to-Image Models

Paper • 2401.06105 • Published Jan 11, 2024 • 50

Hardware-aware Models

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

Tool Usage (w/VLMs)

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

Paper • 2303.11381 • Published Mar 20, 2023 • 2
Efficient Tool Use with Chain-of-Abstraction Reasoning

Paper • 2401.17464 • Published Jan 30, 2024 • 21
Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24 • 60k • 4.15k • 493

Audio Stuff

kyutai/moshiko-pytorch-bf16

8B • Updated Sep 18, 2024 • 162k • 185
openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 3.23M • 2.53k

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4, 2024 • 66
Learning To Teach Large Language Models Logical Reasoning

Paper • 2310.09158 • Published Oct 13, 2023 • 1
ChipNeMo: Domain-Adapted LLMs for Chip Design

Paper • 2311.00176 • Published Oct 31, 2023 • 9
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Paper • 2308.09583 • Published Aug 18, 2023 • 7

Datasets

✔️https://huggingface.co/librarian-bots ✔️https://lilacai-lilac.hf.space/ https://huggingface.co/common-pile/collections

HuggingFaceH4/no_robots

Viewer • Updated Apr 18, 2024 • 10k • 1.46k • 490
HuggingFaceH4/ultrachat_200k

Viewer • Updated Oct 16, 2024 • 515k • 23.2k • 559
allenai/ultrafeedback_binarized_cleaned

Viewer • Updated Dec 1, 2023 • 186k • 527 • 70
HuggingFaceH4/ultrafeedback_binarized

Viewer • Updated Oct 16, 2024 • 187k • 8.33k • 298

Safety / Alignment / Policies / SMI

🔖Cheatsheet: http://tinyurl.com/35vvs6d9 🔖Foundation Model Cheatsheet: https://fmcheatsheet.org/

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 70
Sparks of Artificial General Intelligence: Early experiments with GPT-4

Paper • 2303.12712 • Published Mar 22, 2023 • 4
Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11, 2024 • 29
Measuring Implicit Bias in Explicitly Unbiased Large Language Models

Paper • 2402.04105 • Published Feb 6, 2024 • 1

Spaces

AI2 WildBench Leaderboard (V2) 🦁

Running • 223 • Display and explore model leaderboards and chat history
Prompt Collective 🗣

Runtime error • 61
— Inference Api — 📟

Running • 27 • Redirect to text generation
GAIA Leaderboard 🦾

Running on CPU Upgrade • 498 • Submit and evaluate models on GAIA benchmark

Vulnerabilities

https://llm-attacks.org/

Scalable Extraction of Training Data from (Production) Language Models

Paper • 2311.17035 • Published Nov 28, 2023 • 3
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Paper • 2401.05566 • Published Jan 10, 2024 • 30
Exploiting Novel GPT-4 APIs

Paper • 2312.14302 • Published Dec 21, 2023 • 14
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Paper • 2404.13208 • Published Apr 19, 2024 • 40

Others

Masked Autoencoders Are Scalable Vision Learners

Paper • 2111.06377 • Published Nov 11, 2021 • 3
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 58
distil-whisper/distil-large-v2

Automatic Speech Recognition • 0.8B • Updated Mar 6 • 9.83k • 512
Seven Failure Points When Engineering a Retrieval Augmented Generation System

Paper • 2401.05856 • Published Jan 11, 2024 • 2

Text-to-nD++

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Paper • 2401.04092 • Published Jan 8, 2024 • 22
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Paper • 2403.13248 • Published Mar 20, 2024 • 78

Vision Language Models

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 570k • 1.22k
vidore/colpali_train_set

Viewer • Updated Jun 20 • 119k • 2.63k • 82
vidore/colpali

Visual Document Retrieval • Updated Jun 27 • 8.53k • 449
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Paper • 2408.08459 • Published Aug 15, 2024 • 46
