Nick Doiron

monsoon-nlp

AI & ML interests

biology and multilingual models

Recent Activity

liked a dataset about 20 hours ago
Datadog/BOOM
liked a model 5 days ago
nari-labs/Dia-1.6B

Organizations

BigScience Workshop, Spaces-explorers, BigCode, Blog-explorers, Scary Snake, Hugging Face Discord Community, Hugging Face MCP Course

monsoon-nlp's activity

reacted to seawolf2357's post with πŸ‘€ 8 days ago
Samsung Hacking Incident: Samsung Electronics' Official Hugging Face Account Compromised
Samsung Electronics' official Hugging Face account has been hacked. Approximately 17 hours ago, two new large language models (LLMs) were registered under the account. These models are:

https://huggingface.co/Samsung/MuTokenZero2-32B
https://huggingface.co/Samsung/MythoMax-L2-13B

The model descriptions contain absurd and false claims, such as being trained on "1 million W200 GPUs", hardware that does not exist.
Community members on Hugging Face who have noticed the issue are posting warnings that Samsung Electronics' account has been compromised.
There is concern about secondary damage if users, unaware of the hack, download these models on the strength of Samsung's reputation.
Samsung Electronics appears to be unaware of the situation: it has not yet taken any visible countermeasure, such as resetting the account password.
Source: https://discord.gg/openfreeai
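
The practical takeaway is to check a repo's provenance before downloading weights. Below is a minimal sketch using huggingface_hub's public metadata API; the repo ids are the ones named in the post and may since have been taken down, in which case the calls raise RepositoryNotFoundError.

```python
from huggingface_hub import HfApi

api = HfApi()

# Repo ids are the ones named in the post above.
for repo_id in ["Samsung/MuTokenZero2-32B", "Samsung/MythoMax-L2-13B"]:
    info = api.model_info(repo_id)
    print(f"{repo_id}: created {info.created_at}, last modified {info.last_modified}")
    # Brand-new repos with out-of-character commits under a long-established
    # org account are exactly the anomaly described in this post.
    for commit in api.list_repo_commits(repo_id)[:3]:
        print(f"  {commit.created_at}  {commit.title}  by {commit.authors}")
```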
updated a Space about 1 month ago
published a Space about 1 month ago
New activity in monsoon-nlp/code-refusal-for-abliteration about 1 month ago

Integrate more sources

#1 opened 6 months ago by monsoon-nlp
updated a dataset about 1 month ago
upvoted an article about 2 months ago

Welcome Llama 4 Maverick & Scout on Hugging Face!

By burtenshaw and 6 others
reacted to merterbak's post with πŸ”₯ about 2 months ago
Meta has unveiled its Llama 4 πŸ¦™ family of models, featuring native multimodality and a mixture-of-experts architecture. Two models are available now:
ModelsπŸ€—: meta-llama/llama-4-67f0c30d9fe03840bc9d0164
Blog Post: https://ai.meta.com/blog/llama-4-multimodal-intelligence/
HF's Blog Post: https://huggingface.co/blog/llama4-release

- 🧠 Native Multimodality - Process text and images in a unified architecture
- πŸ” Mixture-of-Experts - First Llama models using MoE for incredible efficiency
- πŸ“ Super Long Context - Up to 10M tokens
- 🌐 Multilingual Power - Trained on 200 languages with 10x more multilingual tokens than Llama 3 (including over 100 languages with over 1 billion tokens each)
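
For context on why the "active" and "total" parameter counts below differ so much, here is a back-of-envelope sketch. It assumes top-1 expert routing over identical experts plus a shared trunk, which is a simplification for illustration, not Meta's published architecture details.

```python
# Back-of-envelope split of MoE "total" vs "active" parameters.

def moe_split(total_b: float, active_b: float, n_experts: int, top_k: int = 1):
    """Solve shared + n_experts * e = total and shared + top_k * e = active."""
    expert_b = (total_b - active_b) / (n_experts - top_k)
    shared_b = active_b - top_k * expert_b
    return shared_b, expert_b

for name, total, active, experts in [("Scout", 109, 17, 16), ("Maverick", 400, 17, 128)]:
    shared, expert = moe_split(total, active, experts)
    print(f"{name}: ~{shared:.1f}B shared + {experts} experts of ~{expert:.1f}B each; "
          f"one expert per token keeps the active set at {active}B")
```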

πŸ”Ή Llama 4 Scout
- 17B active parameters (109B total)
- 16-expert architecture
- 10M context window
- Fits on a single H100 GPU
- Beats Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1

πŸ”Ή Llama 4 Maverick
- 17B active parameters (400B total)
- 128-expert architecture
- Fits on a single DGX H100 node (8x H100)
- 1M context window
- Outperforms GPT-4o and Gemini 2.0 Flash
- Elo score of 1417 on LMArena, currently the second-best model on the arena

πŸ”Ή Llama 4 Behemoth (Coming Soon)
- 288B active parameters (2T total)
- 16-expert architecture
- Teacher model for Scout and Maverick
- Outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on STEM benchmarks
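
To try the released checkpoints, a minimal loading sketch follows. It assumes a recent transformers (4.51 added Llama 4 support), access to the gated meta-llama repos, and the Scout repo id from the collection linked above; in bf16 the 109B weights still need to be sharded across several GPUs.

```python
import torch
from transformers import pipeline

# Repo id is taken from the meta-llama collection linked above; the repo is
# gated, so accept the license on the model page and run `huggingface-cli login` first.
pipe = pipeline(
    "text-generation",
    model="meta-llama/Llama-4-Scout-17B-16E-Instruct",
    torch_dtype=torch.bfloat16,  # bf16 halves memory vs fp32 for the 109B weights
    device_map="auto",           # shard layers across whatever GPUs are visible
)

messages = [{"role": "user", "content": "Explain mixture-of-experts in one sentence."}]
out = pipe(messages, max_new_tokens=64)
print(out[0]["generated_text"][-1]["content"])  # last message is the model's reply
```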