marcusinthesky
's Collections
Multimodal Embeddings
updated
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Paper
•
2403.19651
•
Published
•
22
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency
Determines Multimodal Model Performance
Paper
•
2404.04125
•
Published
•
28
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and
Training Strategies
Paper
•
2404.08197
•
Published
•
29
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper
•
2403.20327
•
Published
•
48
OpenGVLab/InternVL-14B-224px
Image Feature Extraction
•
Updated
•
1.32k
•
36
Alibaba-NLP/gte-large-en-v1.5
Sentence Similarity
•
Updated
•
2.45M
•
199
jinaai/jina-embeddings-v2-base-en
Feature Extraction
•
Updated
•
235k
•
•
713
castorini/repllama-v1.1-mrl-7b-lora-passage
Feature Extraction
•
Updated
•
15
•
5
McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp
Sentence Similarity
•
Updated
•
14.5k
•
4
BAAI/bge-visualized
Updated
•
46
royokong/e5-v
Image-Text-to-Text
•
Updated
•
5.35k
•
21
TIGER-Lab/VLM2Vec-Full
Text Generation
•
Updated
•
32.9k
•
21
openbmb/VisRAG-Ret
Feature Extraction
•
Updated
•
1.99k
•
61