Rust Sun's picture
4 15

Rust Sun

vigos

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
jackboyla/glirel-large-v0
liked a dataset about 1 month ago
microsoft/orca-agentinstruct-1M-v1
liked a model about 1 month ago
nomic-ai/nomic-embed-vision-v1.5
View all activity

Organizations

Hugging Face Discord Community's profile picture

vigos's activity

liked a Space about 2 months ago
upvoted an article 2 months ago
view article
Article

How to build a custom text classifier without days of human labeling

By sdiazlor โ€ข
โ€ข 55
reacted to reach-vb's post with ๐Ÿ‘ 2 months ago
view post
Post
5445
Multimodal Ichigo Llama 3.1 - Real Time Voice AI ๐Ÿ”ฅ

> WhisperSpeech X Llama 3.1 8B
> Trained on 50K hours of speech (7 languages)
> Continually trained on 45hrs 10x A1000s
> MLS -> WhisperVQ tokens -> Llama 3.1
> Instruction tuned on 1.89M samples
> 70% speech, 20% transcription, 10% text
> Apache 2.0 licensed โšก

Architecture:
> WhisperSpeech/ VQ for Semantic Tokens
> Llama 3.1 8B Instruct for Text backbone
> Early fusion (Chameleon)

I'm super bullish on HomeBrew/ Jan and early fusion, audio and text, multimodal models!

(P.S. Play with the demo on Hugging Face: jan-hq/Ichigo-llama3.1-s-instruct)