Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages Paper • 2308.12038 • Published Aug 23, 2023 • 2
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs Paper • 2411.17265 • Published Nov 26, 2024
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published Jun 23, 2025 • 31
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models Paper • 2506.05176 • Published Jun 5, 2025 • 62
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs Paper • 2412.16855 • Published Dec 22, 2024 • 5
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published Dec 11, 2024 • 54
Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging Paper • 2410.15035 • Published Oct 19, 2024 • 1
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval Paper • 2407.19669 • Published Jul 29, 2024 • 24
Towards General Text Embeddings with Multi-stage Contrastive Learning Paper • 2308.03281 • Published Aug 7, 2023 • 2
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback Paper • 2312.00849 • Published Dec 1, 2023 • 12
Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants Paper • 2310.00653 • Published Oct 1, 2023 • 3
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding Paper • 2308.10529 • Published Aug 21, 2023 • 1