Rust Sun

vigos

AI & ML interests

None yet

vigos's activity

liked a model about 1 month ago

jackboyla/glirel-large-v0

Updated Nov 3 • 11

liked a dataset about 1 month ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1 • 1.05M • 13.1k • 404

liked 2 models about 1 month ago

nomic-ai/nomic-embed-vision-v1.5

Image Feature Extraction • Updated Jun 8 • 23.7k • 117

nvidia/NV-Embed-v2

Feature Extraction • Updated 25 days ago • 185k • 327

liked a Space about 1 month ago

💻 GLiNER-medium-v2.1, zero-shot NER

Running • 167

liked a model about 1 month ago

ali-vilab/In-Context-LoRA

Text-to-Image • Updated 9 days ago • 125k • 500

liked 2 models about 2 months ago

fishaudio/fish-agent-v0.1-3b

Audio-to-Audio • Updated Nov 1 • 813 • 234

microsoft/OmniParser

Image-Text-to-Text • Updated 24 days ago • 4.33k • 1.5k

liked a Space about 2 months ago

📙 Docs Scraper

Runtime error

liked 2 models 2 months ago

ostris/OpenFLUX.1

Text-to-Image • Updated Oct 3 • 9k • 589

meta-llama/Meta-Llama-Guard-2-8B

Text Generation • Updated May 13 • 18.8k • 287

upvoted an article 2 months ago

How to build a custom text classifier without days of human labeling

Article • Oct 17 • 55

reacted to reach-vb's post with 👍 2 months ago

Post • 5445

Multimodal Ichigo Llama 3.1 - Real Time Voice AI 🔥

> WhisperSpeech X Llama 3.1 8B
> Trained on 50K hours of speech (7 languages)
> Continually trained on 45hrs 10x A1000s
> MLS -> WhisperVQ tokens -> Llama 3.1
> Instruction tuned on 1.89M samples
> 70% speech, 20% transcription, 10% text
> Apache 2.0 licensed ⚡

Architecture:
> WhisperSpeech/ VQ for Semantic Tokens
> Llama 3.1 8B Instruct for Text backbone
> Early fusion (Chameleon)

I'm super bullish on HomeBrew/ Jan and early fusion, audio and text, multimodal models!

(P.S. Play with the demo on Hugging Face: jan-hq/Ichigo-llama3.1-s-instruct)

liked a model 3 months ago

rain1011/pyramid-flow-sd3

Text-to-Video • Updated Oct 30 • 795

liked a dataset 3 months ago

gorilla-llm/Berkeley-Function-Calling-Leaderboard

Preview • Updated 16 days ago • 718 • 51

liked a model 3 months ago

grounded-ai/phi3.5-hallucination-judge

Updated Sep 19 • 163 • 3

upvoted 3 collections 4 months ago

liked a model 6 months ago

foduucom/table-detection-and-extraction

Object Detection • Updated Aug 6, 2023 • 57.1k • 67