Alessandro Ercolani

giux78

AI & ML interests

NLP, Reinforcement Learning, Semantics, Computational Neuroscience

Recent Activity

liked a Space 2 days ago
evalitahf/evalita_llm_leaderboard
reacted to tomaarsen's post with 🔥 7 days ago
‼️Sentence Transformers v4.0 is out! You can now train and finetune reranker models with multi-GPU training, bf16 support, loss logging, callbacks & much more. I also prove that finetuning on your domain helps much more than you might think. 1️⃣ Reranker Training Refactor Reranker models can now be trained using an extensive trainer with a lot of powerful features: - MultiGPU Training (Data Parallelism (DP) and Distributed Data Parallelism (DDP)) - bf16 training support; loss logging - Evaluation datasets + evaluation loss - Improved callback support + an excellent Weights & Biases integration - Gradient checkpointing, gradient accumulation - Model card generation - Resuming from a training checkpoint without performance loss - Hyperparameter Optimization and much more! Read my detailed blogpost to learn about the components that make up this new training approach: https://huggingface.co/blog/train-reranker Notably, the release is fully backwards compatible: all deprecations are soft, meaning that they still work but emit a warning informing you how to upgrade. 2️⃣ New Reranker Losses - 11 new losses: - 2 traditional losses: BinaryCrossEntropy and CrossEntropy - 2 distillation losses: MSE and MarginMSE - 2 in-batch negatives losses: MNRL (a.k.a. InfoNCE) and CMNRL - 5 learning to rank losses: Lambda, p-ListMLE, ListNet, RankNet, ListMLE 3️⃣ New Reranker Documentation - New Training Overview, Loss Overview, API Reference docs - 5 new, 1 refactored training examples docs pages - 13 new, 6 refactored training scripts - Migration guides (2.x -> 3.x, 3.x -> 4.x) 4️⃣ Blogpost Alongside the release, I've written a blogpost where I finetune ModernBERT on a generic question-answer dataset. My finetunes easily outperform all general-purpose reranker models, even models 4x as big. Finetuning on your domain is definitely worth it: https://huggingface.co/blog/train-reranker See the full release notes here: https://github.com/UKPLab/sentence-transformers/releases/v4.0.1
View all activity

Organizations

Rocket AI's profile picture Spaces-explorers's profile picture Blog-explorers's profile picture FairMind's profile picture Business Operating System's profile picture mii-community's profile picture Social Post Explorers's profile picture mii-llm's profile picture Coloss's profile picture

giux78's activity

New activity in mii-llm/pinocchio-ita-leaderboard 5 months ago

The leaderboard is down...

2
#1 opened 5 months ago by
zhiminy
New activity in mii-llm/pinocchio 7 months ago
New activity in mii-llm/open_ita_llm_leaderboard 11 months ago

Update app.py

#13 opened 11 months ago by
giux78

Update app.py

1
#12 opened 11 months ago by
giux78

Update leaderboard_general.csv

#10 opened 11 months ago by
giux78

Problem with the viewer

1
#10 opened 11 months ago by
giux78
New activity in meta-llama/Meta-Llama-3-8B 12 months ago

Access Problems

61
#45 opened 12 months ago by
VityaVitalich
New activity in gorilla-llm/APIBench 12 months ago

Dataset is not loading

1
#2 opened about 1 year ago by
vinbloke
New activity in giux78/gemma-2b-sft-ita 12 months ago

Information on the model

4
#1 opened 12 months ago by
anakin87
New activity in mii-llm/open_ita_llm_leaderboard about 1 year ago

Upload app.py

#8 opened about 1 year ago by
giux78

What is `m_mmul` benchmark?

3
#7 opened about 1 year ago by
zhiminy
New activity in mii-community/UsenetArchiveIT-conversations about 1 year ago
New activity in mii-llm/open_ita_llm_leaderboard about 1 year ago

Upload app.py

#3 opened about 1 year ago by
giux78

Upload 2 files

#2 opened about 1 year ago by
giux78
New activity in alexandrainst/m_mmlu about 1 year ago

Data corrupter

1
#4 opened about 1 year ago by
giux78

Data corrupted

1
#3 opened about 1 year ago by
giux78