Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey Paper • 2503.12605 • Published Mar 16 • 36
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published Mar 16 • 44
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 876
Article Welcome to Inference Providers on the Hub 🔥 By julien-c and 6 others • Jan 28 • 483
Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.27k
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 140
Tree of Thoughts: Deliberate Problem Solving with Large Language Models Paper • 2305.10601 • Published May 17, 2023 • 12
Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques Paper • 2310.08101 • Published Oct 12, 2023 • 2
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines Paper • 2410.21220 • Published Oct 28, 2024 • 10
LongReward: Improving Long-context Large Language Models with AI Feedback Paper • 2410.21252 • Published Oct 28, 2024 • 18
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction Paper • 2410.21169 • Published Oct 28, 2024 • 31
Improve Vision Language Model Chain-of-thought Reasoning Paper • 2410.16198 • Published Oct 21, 2024 • 27
How to Design Translation Prompts for ChatGPT: An Empirical Study Paper • 2304.02182 • Published Apr 5, 2023 • 1
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt Paper • 2403.17556 • Published Mar 26, 2024 • 1