Trey Saddler's picture

4 13

Trey Saddler

tosaddler

·

https://treysaddler.com

AI & ML interests

None yet

Recent Activity

reacted to reach-vb's post with ❤️ about 1 month ago

Massive week for Open AI/ ML: Mistral Pixtral & Instruct Large - ~123B, 128K context, multilingual, json + function calling & open weights https://huggingface.co/mistralai/Pixtral-Large-Instruct-2411 https://huggingface.co/mistralai/Mistral-Large-Instruct-2411 Allen AI Tülu 70B & 8B - competive with claude 3.5 haiku, beats all major open models like llama 3.1 70B, qwen 2.5 and nemotron https://huggingface.co/collections/allenai/tulu-3-models-673b8e0dc3512e30e7dc54f5 https://huggingface.co/collections/allenai/tulu-3-datasets-673b8df14442393f7213f372 Llava o1 - vlm capable of spontaneous, systematic reasoning, similar to GPT-o1, 11B model outperforms gemini-1.5-pro, gpt-4o-mini, and llama-3.2-90B-vision https://huggingface.co/Xkev/Llama-3.2V-11B-cot Black Forest Labs Flux.1 tools - four new state of the art model checkpoints & 2 adapters for fill, depth, canny & redux, open weights https://huggingface.co/collections/reach-vb/black-forest-labs-flux1-6743847bde9997dd26609817 Jina AI Jina CLIP v2 - general purpose multilingual and multimodal (text & image) embedding model, 900M params, 512 x 512 resolution, matroyoshka representations (1024 to 64) https://huggingface.co/jinaai/jina-clip-v2 Apple AIM v2 & CoreML MobileCLIP - large scale vision encoders outperform CLIP and SigLIP. CoreML optimised MobileCLIP models https://huggingface.co/collections/apple/aimv2-6720fe1558d94c7805f7688c https://huggingface.co/apple/coreml-mobileclip A lot more got released like, OpenScholar (https://huggingface.co/collections/OpenScholar/openscholar-v1-67376a89f6a80f448da411a6), smoltalk (https://huggingface.co/datasets/HuggingFaceTB/smoltalk), Hymba (https://huggingface.co/collections/nvidia/hymba-673c35516c12c4b98b5e845f), Open ASR Leaderboard (https://huggingface.co/spaces/hf-audio/open_asr_leaderboard) and much more.. Can't wait for the next week! 🤗

View all activity

Organizations

tosaddler's activity

reacted to reach-vb's post with ❤️ about 1 month ago

Post

3165

Massive week for Open AI/ ML:

Mistral Pixtral & Instruct Large - ~123B, 128K context, multilingual, json + function calling & open weights
mistralai/Pixtral-Large-Instruct-2411
mistralai/Mistral-Large-Instruct-2411

Allen AI Tülu 70B & 8B - competive with claude 3.5 haiku, beats all major open models like llama 3.1 70B, qwen 2.5 and nemotron
allenai/tulu-3-models-673b8e0dc3512e30e7dc54f5
allenai/tulu-3-datasets-673b8df14442393f7213f372

Llava o1 - vlm capable of spontaneous, systematic reasoning, similar to GPT-o1, 11B model outperforms gemini-1.5-pro, gpt-4o-mini, and llama-3.2-90B-vision
Xkev/Llama-3.2V-11B-cot

Black Forest Labs Flux.1 tools - four new state of the art model checkpoints & 2 adapters for fill, depth, canny & redux, open weights
reach-vb/black-forest-labs-flux1-6743847bde9997dd26609817

Jina AI Jina CLIP v2 - general purpose multilingual and multimodal (text & image) embedding model, 900M params, 512 x 512 resolution, matroyoshka representations (1024 to 64)
jinaai/jina-clip-v2

Apple AIM v2 & CoreML MobileCLIP - large scale vision encoders outperform CLIP and SigLIP. CoreML optimised MobileCLIP models
apple/aimv2-6720fe1558d94c7805f7688c
apple/coreml-mobileclip

A lot more got released like, OpenScholar ( OpenScholar/openscholar-v1-67376a89f6a80f448da411a6), smoltalk ( HuggingFaceTB/smoltalk), Hymba ( nvidia/hymba-673c35516c12c4b98b5e845f), Open ASR Leaderboard ( hf-audio/open_asr_leaderboard) and much more..

Can't wait for the next week! 🤗

reacted to asoria's post with 👍 3 months ago

Post

2459

📝 I wrote a tutorial on how to get started with the fine-tuning process using Hugging Face tools, providing an end-to-end workflow.

The tutorial covers creating a new dataset using the new SQL Console 🛢 and fine-tuning a model with SFT, guided by the Notebook Creator App 📙.

👉 You can read the full article here:
https://huggingface.co/blog/asoria/easy-fine-tuning-with-hf
asoria/auto-notebook-creator

New activity in openlifescienceai/open_medical_llm_leaderboard 4 months ago

Model evaluation and submission stuck of LB.

#17 opened 7 months ago by

liked a model 5 months ago

dunzhang/stella_en_1.5B_v5

Sentence Similarity • Updated 15 days ago • 366k • 184

liked a dataset 5 months ago

PatronusAI/HaluBench

Viewer • Updated Jul 11 • 14.9k • 897 • 35

liked 2 models 5 months ago

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6 • 3.1M • • 1.32k

andrewt28/MolLM

Updated Mar 17 • 2

liked 2 models 6 months ago

facebook/multi-token-prediction

Updated Jun 18 • 349

nvidia/Nemotron-4-340B-Instruct

Updated Jun 24 • 125 • 663

New activity in openlifescienceai/open_medical_llm_leaderboard 7 months ago

🚩 Report: Legal issue(s)

#16 opened 7 months ago by

liked a Space 8 months ago

LLM Performance Leaderboard

liked 2 models 8 months ago

HuggingFaceM4/idefics2-8b-chatty

Image-Text-to-Text • Updated Jul 30 • 532 • 92

aaditya/Llama3-OpenBioLLM-8B

Text Generation • Updated Apr 28 • 10.1k • 160

updated a Space 8 months ago

README

liked a Space 8 months ago

Running on CPU Upgrade

Open Medical-LLM Leaderboard

New activity in openlifescienceai/open_medical_llm_leaderboard 8 months ago

It's broken

#9 opened 8 months ago by

liked a model 8 months ago

aaditya/Llama3-OpenBioLLM-70B

Text Generation • Updated May 9 • 14.1k • 359

updated a model 8 months ago

tosaddler/Llama3-OpenBioLLM-70B-GGUF

liked a model 8 months ago

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27 • 567k • 5.93k

liked a model over 1 year ago

bigcode/starcoder

Text Generation • Updated Oct 8 • 16.2k • • 2.83k