Mistral AI Game Jam


AI & ML interests

None defined yet.

Recent Activity

Mistral-AI-Game-Jam's activity

MrDragonFox
posted an update 6 days ago
Made a small emotion-tagged test dataset for all the TTS tuners out there.

MrDragonFox/Elise

3 hours total, MIT-licensed, single-speaker voice.

The dataset is a copy of an existing one, with emotional tags added over 1200 samples. It should be good enough to test whether emotional tags stick in your fine-tune.
MikeDoes
posted an update 8 days ago
🚀 We are quite excited to announce the Ai4Privacy Python library! 🎉

Run `pip install ai4privacy` to anonymize short English text with OpenPII Masking 500k labels.

📊 Day 5/7 of PII Masking 1M announcements complete! ⏰
MikeDoes
posted an update 9 days ago
MikeDoes
posted an update 12 days ago
📊 99%+ PII Masking Precision in English Straight to Your Browser! 🚀

ai4privacy/general-english-anonymiser-openpii-500k

Hard Facts:
🖥️ Runs in-browser: blazing fast, no server latency
Open-source, MIT-licensed (even for commercial use)
📈 Full metrics on Hugging Face dataset and model pages

Day 3 out of 7 of PII-Masking-1M Announcements Complete!
*Accuracies reported from the new OpenPII-500k dataset

#DataPrivacy #AI #OpenSource
MikeDoes
posted an update 14 days ago
#PII Masking Tech that does not **** around!

We are happy to release the OpenPII English Anonymiser, the most powerful open-source tool for redacting sensitive info from English text.

A ModernBERT fine-tune on 5.7 million+ PII examples, it's clocking 99%+ accuracy across emails, dates, social numbers, and more!

Why it's a big deal:
✅ Top-tier precision: 100% for passport numbers, 99.96% for emails*.
✅ Totally free: MIT license for personal or commercial use.
✅ No secrets: Full metrics shared on Hugging Face.

#AI #OpenSource #DataSecurity @huggingface

Day 2 out of 7 of PII-Masking-1M Announcements Complete!

*Accuracies reported from the new OpenPII-500k dataset

ai4privacy/llama-ai4privacy-english-anonymiser-openpii
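
To make the redaction step concrete, here is a minimal sketch of how labeled spans could be turned into masked text. It is not the Ai4Privacy API: the `mask_spans` helper, the label names, and the hard-coded spans are all hypothetical stand-ins for whatever a PII model would actually predict.

```python
from typing import List, Tuple

# (start, end, label) with end exclusive; a hypothetical span format.
PiiSpan = Tuple[int, int, str]


def mask_spans(text: str, spans: List[PiiSpan]) -> str:
    """Replace each labeled character span with a [LABEL] placeholder."""
    pieces = []
    cursor = 0
    for start, end, label in sorted(spans):
        if start < cursor:
            continue  # skip spans that overlap one already masked
        pieces.append(text[cursor:start])
        pieces.append(f"[{label}]")
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


def find_span(text: str, needle: str, label: str) -> PiiSpan:
    """Locate a substring; stands in for a model prediction in this sketch."""
    start = text.index(needle)
    return (start, start + len(needle), label)


text = "Contact Jane at jane@example.com before 2024-05-01."
spans = [
    find_span(text, "Jane", "GIVENNAME"),       # hypothetical label names
    find_span(text, "jane@example.com", "EMAIL"),
    find_span(text, "2024-05-01", "DATE"),
]
masked = mask_spans(text, spans)
print(masked)
```

In a real pipeline the spans would come from a token-classification model rather than `find_span`.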
MikeDoes
posted an update 16 days ago
🚀 Ai4Privacy Team is excited to unveil PII-Masking-1M, our most significant release yet! 🎉

This publication series 📦 includes datasets 📊, models 🤖, and applications ⚙️ to advance PII masking with AI systems 🛡️

Starting on Monday with daily posts at 7 PM CET ⏰
Tonic
posted an update 26 days ago
🙋🏻‍♂️ Hey there folks,

Did you know that you can use ModernBERT to detect model hallucinations?

Check out the demo: Tonic/hallucination-test

See here for the medical context demo: MultiTransformer/tonic-discharge-guard

Check out the model from KRLabs: KRLabsOrg/lettucedect-large-modernbert-en-v1

and the library they kindly open-sourced for it: https://github.com/KRLabsOrg/LettuceDetect

👆🏻 If you like this topic, please contribute code upstream 🚀

Tonic
posted an update 28 days ago
Powered by KRLabsOrg/lettucedect-large-modernbert-en-v1 from KRLabsOrg.

Detect hallucinations in answers based on context and questions using ModernBERT with 8192-token context support!

### Model Details
- **Model Name**: [lettucedect-large-modernbert-en-v1](KRLabsOrg/lettucedect-large-modernbert-en-v1)
- **Organization**: [KRLabsOrg](KRLabsOrg)
- **GitHub**: [https://github.com/KRLabsOrg/LettuceDetect](https://github.com/KRLabsOrg/LettuceDetect)
- **Architecture**: ModernBERT (Large) with extended context support up to 8192 tokens
- **Task**: Token Classification / Hallucination Detection
- **Training Dataset**: [RagTruth](wandb/RAGTruth-processed)
- **Language**: English
- **Capabilities**: Detects hallucinated spans in answers, provides confidence scores, and calculates average confidence across detected spans.

LettuceDetect excels at processing long documents to determine if an answer aligns with the provided context, making it a powerful tool for ensuring factual accuracy.
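
As a rough illustration of the "average confidence across detected spans" capability listed above, here is a small sketch. The `HallucinatedSpan` type and the example values are hypothetical and do not reflect the LettuceDetect output format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class HallucinatedSpan:
    """Hypothetical container for one detected span in an answer."""
    start: int         # character offset where the span begins
    end: int           # character offset where the span ends (exclusive)
    confidence: float  # detector confidence in [0, 1]


def average_confidence(spans: List[HallucinatedSpan]) -> float:
    """Mean confidence over detected spans; 0.0 when nothing was flagged."""
    if not spans:
        return 0.0
    return sum(span.confidence for span in spans) / len(spans)


# Made-up detections, purely for illustration.
detected = [
    HallucinatedSpan(start=10, end=24, confidence=0.75),
    HallucinatedSpan(start=40, end=52, confidence=0.25),
]
avg = average_confidence(detected)
print(avg)
```

Returning 0.0 for an empty list is a design choice of this sketch; a caller may prefer `None` to distinguish "no spans" from "low confidence".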
ngxson
posted an update about 1 month ago
A comprehensive matrix of which format you should use.

Read more on my blog post: https://huggingface.co/blog/ngxson/common-ai-model-formats

| Hardware      | GGUF      | PyTorch             | Safetensors            | ONNX |
|---------------|-----------|---------------------|------------------------|------|
| CPU           | ✅ (best) | 🟡                  | 🟡                     | ✅   |
| GPU           | ✅        | ✅                  | ✅                     | ✅   |
| Mobile        | ✅        | 🟡 (via executorch) | ❌                     | ✅   |
| Apple silicon | ✅        | 🟡                  | ✅ (via MLX framework) | ✅   |
Tonic
posted an update about 2 months ago
🙋🏻‍♂️ Hey there folks,

Goedel's Theorem Prover is now being demoed on Hugging Face: Tonic/Math

Give it a try!
Tonic
posted an update 2 months ago
🙋🏻‍♂️ Hey there folks,

Our team made a game during the @mistral-game-jam and we're trying to win the community award!

Try our game out and drop us a ❤️ like, which is basically a vote for us!

Mistral-AI-Game-Jam/TextToSurvive

Hope you like it!
ngxson
posted an update 2 months ago
Tonic
posted an update 3 months ago
🙋🏻‍♂️ Hey there folks,

Facebook AI just released JASCO models that make music stems.

You can try it out here: Tonic/audiocraft

Hope you like it
ngxson
posted an update 3 months ago
Check out my collection of pre-made GGUF LoRA adapters!

This allows you to use both the normal and abliterated versions of popular models like Llama, Qwen, etc., without doubling VRAM usage.

ngxson/gguf_lora_collection
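
A back-of-the-envelope sketch of why an adapter avoids doubling memory, assuming the adapters follow the standard LoRA form (a low-rank update B·A added to a frozen weight matrix W). The layer size and rank below are hypothetical, not taken from any model in the collection.

```python
def full_matrix_params(d_out: int, d_in: int) -> int:
    """Parameters in one dense weight matrix W of shape (d_out, d_in)."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters in a LoRA pair: B is (d_out, rank), A is (rank, d_in)."""
    return rank * (d_out + d_in)


# Hypothetical layer size and rank, chosen only for illustration.
d_out, d_in, rank = 4096, 4096, 16

second_copy = full_matrix_params(d_out, d_in)  # cost of loading a second full model
adapter = lora_params(d_out, d_in, rank)       # cost of loading just the adapter

print(f"second full copy: {second_copy:,} params per matrix")
print(f"LoRA adapter:     {adapter:,} params per matrix")
print(f"adapter is {second_copy // adapter}x smaller")
```

Under these assumptions the base weights are loaded once and shared, and only the small adapter is added on top for the second variant.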
Tonic
posted an update 3 months ago
🙋🏻‍♂️ Hey there folks,

Open LLM Europe just released the Lucie 7B-Instruct model, a bilingual instruct model trained on open data!

You can check out my unofficial demo here while we wait for the official inference API from the group: Tonic/Lucie-7B

Hope you like it 🚀
ngxson
posted an update 3 months ago
Tonic
posted an update 3 months ago
Microsoft just released Phi-4, check it out here: Tonic/Phi-4

Hope you like it :-)