Together

company

Verified

https://together.ai

togethercompute

togethercomputer

Inference Provider

2,542,247 monthly requests

AI & ML interests

Foundation Models, Decentralized Computing, Open Source AI.

Recent Activity

Zhongzhu submitted a paper 22 days ago

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

YYF42 submitted a paper about 2 months ago

Introspective Diffusion Language Models

JamesSand authored a paper over 1 year ago

On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis

View all activity

Papers

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

View all Papers

Articles

Fine-tune Any LLM from the Hugging Face Hub with Together AI

KaiserWhoLearns

authored a paper about 1 month ago

What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models

Paper • 2506.06485 • Published Jun 6, 2025 • 5

KaiserWhoLearns

authored a paper about 2 months ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

KaiserWhoLearns

submitted a paper to Daily Papers about 2 months ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

submitted a paper to Daily Papers about 2 months ago

Introspective Diffusion Language Models

Paper • 2604.11035 • Published Apr 13 • 25

KaiserWhoLearns

authored a paper 3 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

KaiserWhoLearns

submitted a paper to Daily Papers 3 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

submitted a paper to Daily Papers 4 months ago

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

Paper • 2602.21196 • Published Feb 24 • 7

KaiserWhoLearns

authored a paper 4 months ago

FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights

Paper • 2602.02905 • Published Feb 2 • 5

authored a paper 10 months ago

Cartridges: Lightweight and general-purpose long context representations via self-study

Paper • 2506.06266 • Published Jun 6, 2025 • 8

posted an update 10 months ago

Post

387

🚀 Full-Quality Wan2.2 Video Generation on a single 24GB GPU — Powered by DFloat11

We just released the DFloat11 compressed Wan2.2 models. Now you can run full-quality Wan2.2 video generation on a single 24GB GPU, thanks to DFloat11 compression and CPU offloading.

🔗 Image-to-Video: DFloat11/Wan2.2-I2V-A14B-DF11
🔗 Text-to-Video: DFloat11/Wan2.2-T2V-A14B-DF11

authored a paper about 1 year ago

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Paper • 2505.07782 • Published May 12, 2025 • 19

authored a paper about 1 year ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published Apr 15, 2025 • 31

authored 2 papers over 1 year ago

Language Models Prefer What They Know: Relative Confidence Estimation via Confidence Preferences

Paper • 2502.01126 • Published Feb 3, 2025 • 4

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 126

authored a paper over 1 year ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

authored a paper over 1 year ago

On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis

Paper • 2501.04377 • Published Jan 8, 2025 • 14

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 59

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 59

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 59

authored a paper over 1 year ago

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Paper • 2306.11698 • Published Jun 20, 2023 • 13