2 26 36

Kyle Tuft

Chilangosta

AI & ML interests

None yet

Recent Activity

liked a Space 2 days ago

omni-research/Tarsier2-7b

upvoted a paper 2 days ago

HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution

upvoted a paper 2 days ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

View all activity

Organizations

None yet

Chilangosta's activity

liked a Space 2 days ago

Tarsier2 7b

💬

Chat about videos and images

upvoted 3 papers 2 days ago

HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution

Paper • 2501.10045 • Published 19 days ago • 9

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 14 days ago • 81

GSTAR: Gaussian Surface Tracking and Reconstruction

Paper • 2501.10283 • Published 18 days ago • 5

liked a model 6 days ago

RWKV-Red-Team/ARWKV-7B-Preview-0.1

Text Generation • Updated 6 days ago • 62 • 15

liked a Space 8 days ago

LlamaV-o1

👀

Ask questions about images and get detailed answers

liked a model 8 days ago

mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated

Image-Text-to-Text • Updated Dec 16, 2024 • 2.55k • 5

liked a Space 8 days ago

VideoLLaMA3

💬

Frontier Foundation Models for Video Understanding

updated a collection 10 days ago

Interest

Collection

2 items • Updated 10 days ago

upvoted a paper 10 days ago

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Paper • 2501.13928 • Published 12 days ago • 16

liked a model 11 days ago

HKUSTAudio/Llasa-3B

Text-to-Speech • Updated 3 days ago • 6.38k • 409

liked a Space 15 days ago

248

Llasa 3b Tts

🔥

Zero Shot voice cloning with llasa 3b (Unofficial Demo)

upvoted an article 15 days ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

•

15 days ago

• 54

liked a model 19 days ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 9 days ago • 270k • 910

upvoted an article 19 days ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

20 days ago

• 39

upvoted a paper 19 days ago

Multimodal LLMs Can Reason about Aesthetics in Zero-Shot

Paper • 2501.09012 • Published 20 days ago • 10

liked a model 21 days ago

MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • Updated 11 days ago • 2.09k • 227

upvoted a paper 21 days ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published 25 days ago • 43

liked 2 Spaces 24 days ago

1.13k

Finegrain Image Enhancer

🖼

Clarity AI Upscaler Reproduction

FLUX LoRa the Explorer

🏆

Create images with prompts using LoRA models