Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published Sep 12 • 43
WebInstruct Embeddings Models Collection A collection of SoTA embedding models fine-tuned on the WebInstruct dataset to learn to pair instructions with their responses • 3 items • Updated Sep 4 • 11
Article Selective fine-tuning of Language Models with Spectrum By anakin87 • Sep 3 • 30
Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated 3 days ago • 46
Mixture-of-preference-reward-modeling Collection The mixture of preference datasets used for reward modeling. • 2 items • Updated Apr 29 • 2
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 23
Data-Efficient Multimodal Fusion on a Single GPU Paper • 2312.10144 • Published Dec 15, 2023 • 6
Preference Datasets for DPO Collection This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated 15 days ago • 34
An Emulator for Fine-Tuning Large Language Models using Small Language Models Paper • 2310.12962 • Published Oct 19, 2023 • 14
SD-XL Training Suite Collection All the steps to train your own SD-XL custom model • 7 items • Updated Oct 3 • 21
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Paper • 2307.06949 • Published Jul 13, 2023 • 50