Abstract
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device, memory-constrained use cases. Evaluation on a broad range of benchmarks shows that our most capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks, notably becoming the first model to achieve human-expert performance on the well-studied exam benchmark MMLU and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of Gemini models in cross-modal reasoning and language understanding will enable a wide variety of use cases, and we discuss our approach toward deploying them responsibly to users.
Community
942 Authors!
Not me. You listed the wrong 'Yonghui Wu' in the author list.
Hi! Thanks for letting us know. The authorship has been removed from your account.
Absolute chad for not pretending to be a Google Author.
The 'Timothy Chung' in the author list is wrongly associated with my account as well
Hi! Thanks for the feedback. Authorship removed! :)
Just keep it.
Jokes aside, thanks for your honesty.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise (2023)
- MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI (2023)
- GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models (2023)
- Multitask Multimodal Prompted Training for Interactive Embodied Task Completion (2023)
- GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation (2023)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any paper on Hugging Face, check out this Space
Wrong 'Yi Luan' as the author, not me. This is the second time, should I change my name? LOL
Become such a high-profile researcher that you defame the old Yi Luan and become the real omega Yi Luan.
Next time it happens, just pretend to be the author and contact the press with this as proof 🤣