Snehasish Barman
sbarman25
AI & ML interests
Machine Learning for Health, AI, Distributed Systems
Recent Activity
Updated a collection 2 days ago: Models
Upvoted an article 3 days ago: Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face
Updated a collection 6 months ago: Datasets
Organizations
None yet
Med AI Papers
- Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries
  Paper • 2310.13132 • Published • 8
- Do We Still Need Clinical Language Models?
  Paper • 2302.08091 • Published • 3
- Advancing Multimodal Medical Capabilities of Gemini
  Paper • 2405.03162 • Published • 2
- The impact of using an AI chatbot to respond to patient messages
  Paper • 2310.17703 • Published • 5
Models
Idefics1: https://huggingface.co/posts/HugoLaurencon/224808006741857
Evals & Monitoring
- G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
  Paper • 2303.16634 • Published • 3
- miracl/miracl-corpus
  Viewer • Updated • 77.2M • 3.23k • 46
- Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
  Paper • 2306.05685 • Published • 36
- How is ChatGPT's behavior changing over time?
  Paper • 2307.09009 • Published • 24
Agentic
CV / Text-to-Image / Image-to-Image / Diffusion
https://huggingface.co/collections/merve/
- DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
  Paper • 2208.12242 • Published • 12
- IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
  Paper • 2308.06721 • Published • 31
- h94/IP-Adapter-FaceID
  Text-to-Image • Updated • 275k • 1.74k
- PALP: Prompt Aligned Personalization of Text-to-Image Models
  Paper • 2401.06105 • Published • 50
Hardware-aware Models
Tool Usage (w/VLMs)
Audio Stuff
LLM Related
💫 Glossary
https://osanseviero.github.io/hackerllama/blog/posts/hitchhiker_guide/
- Understanding LLMs: A Comprehensive Overview from Training to Inference
  Paper • 2401.02038 • Published • 66
- Learning To Teach Large Language Models Logical Reasoning
  Paper • 2310.09158 • Published • 1
- ChipNeMo: Domain-Adapted LLMs for Chip Design
  Paper • 2311.00176 • Published • 9
- WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
  Paper • 2308.09583 • Published • 7
Datasets
✔️https://huggingface.co/librarian-bots
✔️https://lilacai-lilac.hf.space/
https://huggingface.co/common-pile/collections
Safety / Alignment / Policies / SMI
🔖Cheatsheet: http://tinyurl.com/35vvs6d9
🔖Foundation Model Cheatsheet: https://fmcheatsheet.org/
- TrustLLM: Trustworthiness in Large Language Models
  Paper • 2401.05561 • Published • 70
- Sparks of Artificial General Intelligence: Early experiments with GPT-4
  Paper • 2303.12712 • Published • 4
- Secrets of RLHF in Large Language Models Part II: Reward Modeling
  Paper • 2401.06080 • Published • 29
- Measuring Implicit Bias in Explicitly Unbiased Large Language Models
  Paper • 2402.04105 • Published • 1
Spaces
- AI2 WildBench Leaderboard (V2)
  🦁 Display and explore model leaderboards and chat history
  Running • 223
- Prompt Collective
  🗣
  Runtime error • 61
- — Inference Api —
  📟 Redirect to text generation
  Running • 27
- GAIA Leaderboard
  🦾 Submit and evaluate models on GAIA benchmark
  Running on CPU Upgrade • 498
Vulnerabilities
https://llm-attacks.org/
- Scalable Extraction of Training Data from (Production) Language Models
  Paper • 2311.17035 • Published • 3
- Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
  Paper • 2401.05566 • Published • 30
- Exploiting Novel GPT-4 APIs
  Paper • 2312.14302 • Published • 14
- The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
  Paper • 2404.13208 • Published • 40
Others
- Masked Autoencoders Are Scalable Vision Learners
  Paper • 2111.06377 • Published • 3
- Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
  Paper • 2311.00430 • Published • 58
- distil-whisper/distil-large-v2
  Automatic Speech Recognition • 0.8B • Updated • 9.83k • 512
- Seven Failure Points When Engineering a Retrieval Augmented Generation System
  Paper • 2401.05856 • Published • 2
Text-to-nD++
Vision Language Models
- Qwen/Qwen2-VL-7B-Instruct
  Image-Text-to-Text • 8B • Updated • 570k • 1.22k
- vidore/colpali_train_set
  Viewer • Updated • 119k • 2.63k • 82
- vidore/colpali
  Visual Document Retrieval • Updated • 8.53k • 449
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations
  Paper • 2408.08459 • Published • 46
Training & Architectures