Marcus Gawronsky's picture

Marcus Gawronsky

marcusinthesky

·

AI & ML interests

Representation Learning

Recent Activity

liked a dataset 2 days ago

vinai/RecGPT-datasets

liked a model 8 days ago

AIDC-AI/Ovis2.5-2B

liked a model 21 days ago

openai/gpt-oss-20b

View all activity

Organizations

upvoted 2 papers about 2 months ago

Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations

Paper • 2507.04886 • Published Jul 7 • 3

MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings

Paper • 2506.23115 • Published Jun 29 • 37

upvoted a paper 2 months ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published Jun 16 • 92

upvoted 3 papers 3 months ago

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

Paper • 2503.04812 • Published Mar 4 • 15

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 280

upvoted 3 papers 4 months ago

FG-CLIP: Fine-Grained Visual and Textual Alignment

Paper • 2505.05071 • Published May 8 • 18

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published Apr 22 • 55

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Paper • 2504.10483 • Published Apr 14 • 21

upvoted 3 papers 5 months ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 55

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19 • 49

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 169

upvoted a paper 6 months ago

mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data

Paper • 2502.08468 • Published Feb 12 • 15

upvoted 5 papers 8 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89

SPAR: Personalized Content-Based Recommendation via Long Engagement Attention

Paper • 2402.10555 • Published Feb 16, 2024 • 36

Item-Language Model for Conversational Recommendation

Paper • 2406.02844 • Published Jun 5, 2024 • 12

Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation

Paper • 2412.18176 • Published Dec 24, 2024 • 17

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 40

upvoted 2 papers 9 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 56