Softcatalà

non-profit

https://www.softcatala.org/

Activity Feed

AI & ML interests

Language technologies for Catalan language

Recent Activity

jordimas new activity about 8 hours ago

softcatala/catalan-dictionary:[bot] Conversion to Parquet

jmontane updated a dataset about 8 hours ago

softcatala/catalan-dictionary

jordimas new activity about 15 hours ago

softcatala/catalan-dictionary:Add Catalan description

View all activity

jordimas

in softcatala/catalan-dictionary about 8 hours ago

[bot] Conversion to Parquet

#3 opened about 2 years ago by

parquet-converter

jmontane

updated a dataset about 8 hours ago

softcatala/catalan-dictionary

Viewer • Updated about 8 hours ago • 1.22M • 777 • 3

jordimas

in softcatala/catalan-dictionary about 15 hours ago

Add Catalan description

#4 opened about 15 hours ago by

xavivars

ccoreilly

updated a dataset 4 days ago

softcatala/catalan-youtube-speech

Updated 4 days ago • 78 • 3

jordimas

updated a dataset 11 days ago

softcatala/wikimedia-common-audio-catalan

Viewer • Updated 11 days ago • 512 • 112

jordimas

published a dataset 11 days ago

softcatala/wikimedia-common-audio-catalan

Viewer • Updated 11 days ago • 512 • 112

jordimas

updated 4 datasets 13 days ago

updated a Space 13 days ago

README

📈

jordimas

published a Space 13 days ago

README

📈

jordimas

updated a dataset 19 days ago

softcatala/optimot-linguistic-data

Viewer • Updated 19 days ago • 4.01k • 103

jordimas

in softcatala/optimot-linguistic-data 19 days ago

[bot] Conversion to Parquet

#1 opened 21 days ago by

parquet-converter

jordimas

published a dataset 21 days ago

softcatala/optimot-linguistic-data

Viewer • Updated 19 days ago • 4.01k • 103

albertvillanova

posted an update 24 days ago

Post

3607

🎉 KTO is now part of the stable TRL API

As of Promote KTO to stable API, KTOTrainer and KTOConfig have graduated from trl.experimental to the stable trl API. https://github.com/huggingface/trl/pull/6175

This one closes out a long road. Over the past 6+ months, the "Align KTO with DPO" effort landed ~90 PRs methodically bringing KTO up to the standard we hold for stable trainers, one carefully-scoped change at a time:
- Feature parity with DPO: full VLM support (incl. multi-image), sync_ref_model, PEFT + Liger, ZeRO-3 + PEFT dtype fix, pad_to_multiple_of, activation offloading, IterableDataset and dict eval_dataset, remove_unused_columns, and reference-logprob precomputation at init.
- Consistency with DPO: aligned method order and signatures, tokenization, _prepare_dataset, PEFT handling, ref-model preparation for distributed training, and config layout — plus a new DataCollatorForKTO and output format. Metrics moved into _compute_loss and simplified to direct averages via the shared _metrics attribute.
- Removing legacy baggage: dropped encoder-decoder support, BOS/EOS handling, null_ref_context, generate_during_eval, model_init, preprocess_logits_for_metrics, model/ref adapter names, and several dead config knobs.
- Coverage: a full test suite mirroring DPO, text collator tests, VLM tests, and slow tests.
- The promotion itself: the experimental → stable move (#6175) and shim cleanup (#6287), handled so downstream users get a clean deprecation path.

Honestly, this has been one of the more complex tasks I've taken on since joining the team, not because any single change was hard, but because it demanded sustained consistency across a ~2,000-line trainer, with every branch, comment, and edge case kept in lockstep with DPO.

Huge thanks to everyone who reviewed along the way (especially @qgallouedec ), the incremental review cadence is exactly what kept this maintainable.

KTO now sits on equal footing with our other flagship trainers. 🚀

2 replies

ccoreilly

updated a Space 3 months ago

Síntesi en català

👁

Generate Catalan speech from text

albertvillanova

posted an update 5 months ago

Post

3041

🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)

We’re excited to see what the community builds on top of this.

If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗

The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0