1 47 449

nDimensional

AI & ML interests

Computer Vision, Diffusers, Transformers, ML, NLP, Diffusion Models, Unsupervised Learning, JAX, Neural Networks

Recent Activity

liked a model about 16 hours ago

Wan-AI/Wan2.2-T2V-A14B

liked a model 4 days ago

tencent/Hunyuan3D-2

liked a dataset 5 days ago

allenai/wildjailbreak

View all activity

Organizations

None yet

upvoted a paper 29 days ago

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

Paper • 2507.01953 • Published about 1 month ago • 19

upvoted 2 papers about 1 month ago

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 61

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published Jun 24 • 26

upvoted 3 papers 3 months ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 72

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22 • 63

upvoted 3 papers 4 months ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20 • 73

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published Mar 20 • 76

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19 • 49

upvoted 3 papers 5 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 167

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 52

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146

upvoted 2 papers 6 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 121

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 116

upvoted 2 papers 7 months ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 76

upvoted 2 papers 9 months ago

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 24

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 52

upvoted a paper 10 months ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 55

upvoted a paper 11 months ago

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2, 2024 • 97

nDimensional

AI & ML interests

Recent Activity

Organizations

nDimensional's activity