In a Training Loop 🔄

53 114 186

Dmitry Ryumin

DmitryRyumin

https://dmitryryumin.github.io

DmitryRyumin

AI & ML interests

Machine Learning and Applications, Multi-Modal Understanding

Recent Activity

liked a Space 10 days ago

huggingface/ai-deadlines

liked a Space 15 days ago

black-forest-labs/FLUX.2-dev

liked a model 19 days ago

ResembleAI/chatterbox-turbo

View all activity

Organizations

upvoted an article about 2 months ago

Article

Fine-Tuning MetaCLIP-2 for Image Classification on Downstream Tasks

Nov 15, 2025

•

upvoted a paper about 2 months ago

Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning

Paper • 2511.02818 • Published Nov 4, 2025 • 15

upvoted 9 papers 2 months ago

SelectMix: Enhancing Label Noise Robustness through Targeted Sample Mixing

Paper • 2509.11265 • Published Sep 14, 2025 • 1

Intra-Cluster Mixup: An Effective Data Augmentation Technique for Complementary-Label Learning

Paper • 2509.17971 • Published Sep 22, 2025 • 1

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 99

Token Activation Map to Visually Explain Multimodal LLMs

Paper • 2506.23270 • Published Jun 29, 2025 • 5

LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models

Paper • 2504.14032 • Published Apr 18, 2025 • 7

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27, 2025 • 29

E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Paper • 2510.22733 • Published Oct 26, 2025 • 31

Heavy Labels Out! Dataset Distillation with Label Space Lightening

Paper • 2408.08201 • Published Aug 15, 2024 • 21

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published Oct 22, 2025 • 60

upvoted 3 papers 3 months ago

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7, 2025 • 54

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

Paper • 2510.06590 • Published Oct 8, 2025 • 73

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 97

upvoted 3 collections 3 months ago

upvoted 2 papers 3 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 99

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published Sep 21, 2025 • 13

upvoted a collection 3 months ago

Qwen3-Omni

Collection

6 items • Updated 5 days ago • 177

Dmitry Ryumin

AI & ML interests

Recent Activity

Organizations

DmitryRyumin's activity

Fine-Tuning MetaCLIP-2 for Image Classification on Downstream Tasks