MM LLM Papers OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Paper • 2401.12202 • Published Jan 22 • 10 Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Paper • 2401.11708 • Published Jan 22 • 30
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Paper • 2401.12202 • Published Jan 22 • 10
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Paper • 2401.11708 • Published Jan 22 • 30
Interesting Papers to Read StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Paper • 2401.11053 • Published Jan 19 • 10
StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Paper • 2401.11053 • Published Jan 19 • 10
TheSeriousProgrammer/spoken_words_en_ml_commons_filtered_split Viewer • Updated Jan 27, 2023 • 355k • 121