A Multimodal Approach to Device-Directed Speech Detection with Large Language Models Paper • 2403.14438 • Published Mar 21 • 2
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation Paper • 2403.17694 • Published Mar 26 • 10
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations Paper • 2308.11466 • Published Aug 22, 2023 • 1