Ross Wightman's picture

Ross Wightman

rwightman

·

AI & ML interests

Computer vision, transfer learning, semi/self supervised learning, robotics.

Recent Activity

new activity 7 days ago

timm/vit_little_patch16_reg4_gap_256.sbb_in1k:Loss exploding to nan

liked a model 21 days ago

openai/gpt-oss-20b

liked a model 21 days ago

openai/gpt-oss-120b

View all activity

Organizations

upvoted a collection 21 days ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 19 days ago • 318

upvoted an article 21 days ago

Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

By

•

Jun 11, 2024

• 64

upvoted a collection 25 days ago

MetaCLIP

MetaCLIP & MetaCLIP2 OpenCLIP and timm models. All models are dual timm + OpenCLIP (or just timm for specific vit encoders). • 24 items • Updated 24 days ago • 2

upvoted 3 collections about 1 month ago

Perception Encoder

OpenCLIP (PE Core image + text) and timm PE Core, Spatial, Lang (ViT only) weights. NOTE: These weights do not work with original modeling code. • 19 items • Updated 25 days ago • 5

Meta CLIP 1/2

Scaling CLIP data with transparent training distribution from an end-to-end pipeline. • 11 items • Updated about 8 hours ago • 4

Perception Encoder

17 items • Updated Jul 11 • 66

upvoted a paper about 2 months ago

RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations

Paper • 2412.19628 • Published Dec 27, 2024 • 2

upvoted a collection about 2 months ago

RecNeXt

37 items • Updated 25 days ago • 2

upvoted a collection 2 months ago

Gemma 3n

4 items • Updated Jul 10 • 215

upvoted a collection 3 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 291

upvoted a collection 4 months ago

OpenVision

27 items • Updated 10 days ago • 29

upvoted a paper 6 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146

upvoted a collection 6 months ago

SigLIP 2

OpenCLIP and timm SigLIP 2 models • 47 items • Updated 21 days ago • 23

upvoted an article 6 months ago

Article

SigLIP 2: A better multilingual vision language encoder

By

and 2 others •

Feb 21

• 179

upvoted 4 articles 7 months ago

Article

🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces

By

•

Feb 2

• 6

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 878

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

By

•

Jan 15

• 47

Article

Timm ❤️ Transformers: Use any timm model with transformers

By

and 4 others •

Jan 16

• 51

upvoted 2 papers 8 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 154

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 121