Bryan Chen

BryanBradfo

https://bryanbradfo.github.io/

AI & ML interests

Representation Learning, Computer Vision, Large Language Models, Vision Language Models, Frugal AI, Responsible AI, Graph, Optimization, Reinforcement Learning, Incremental Learning

Recent Activity

liked a Space about 8 hours ago

jbilcke-hf/ai-tube

liked a Space about 9 hours ago

BryanBradfo/KokAudio

liked a model about 10 hours ago

ByteDance/MegaTTS3

BryanBradfo's activity

liked a Space about 8 hours ago

AiTube

Explore AI-generated videos in 2025

liked a Space about 9 hours ago

VoiceBloom

Generate audio from text with customizable voice and speed

liked a model about 10 hours ago

ByteDance/MegaTTS3

Updated 6 days ago • 150

liked a Space about 10 hours ago

Hi3DGen

High-fidelity 3D Geometry Generation from images

updated a Space about 10 hours ago

VoiceBloom

Generate audio from text with customizable voice and speed

published a Space about 10 hours ago

VoiceBloom

Generate audio from text with customizable voice and speed

liked 4 Spaces about 11 hours ago

Huggy

Play with a stick-catching AI dog 🐶

HappyChat

Speech to speech in streaming with FastRTC

Hear Your Voice

Record and hear your voice echoed back

TranslateIsAllYouNeed

Translate text between multiple languages

upvoted 7 papers about 11 hours ago

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published 6 days ago • 31

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 5 days ago • 37

Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published 3 days ago • 52

Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL

Paper • 2503.23157 • Published 5 days ago • 4

Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models

Paper • 2503.22165 • Published 6 days ago • 17

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Paper • 2504.01016 • Published 1 day ago • 18

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 132

updated a Space about 14 hours ago

TranslateIsAllYouNeed

Translate text between multiple languages

updated a Space about 15 hours ago

Hear Your Voice

Record and hear your voice echoed back

upvoted an article about 15 hours ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

22 days ago • 363