DS3Lab

Recent Activity

biyuan

authored 11 papers 15 days ago

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

Paper • 2303.06865 • Published Mar 13, 2023 • 1

Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning

Paper • 2306.00088 • Published May 31, 2023 • 1

Holistic Evaluation of Language Models

Paper • 2211.09110 • Published Nov 16, 2022

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads

Paper • 2410.01805 • Published Oct 2, 2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild

Paper • 2410.05357 • Published Oct 7, 2024

Zero-Indexing Internet Search Augmented Generation for Large Language Models

Paper • 2411.19478 • Published Nov 29, 2024

HEXGEN-TEXT2SQL: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL Workflow

Paper • 2505.05286 • Published May 8, 2025 • 1

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Paper • 2505.24298 • Published May 30, 2025 • 27

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

Paper • 2506.07227 • Published Jun 8, 2025

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification

Paper • 2506.07235 • Published Jun 8, 2025

Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny

Paper • 2507.16331 • Published 18 days ago • 18

xzyao

authored a paper 4 months ago

DataPerf: Benchmarks for Data-Centric AI Development

Paper • 2207.10062 • Published Jul 20, 2022

xzyao

authored a paper 9 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 57

biyuan

authored a paper 10 months ago

Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining

Paper • 2410.08102 • Published Oct 10, 2024 • 20

zhangce

authored a paper about 1 year ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 60

juewang

authored a paper about 1 year ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 60

xzyao

authored 2 papers over 1 year ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 43

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

Paper • 2311.13028 • Published Nov 21, 2023 • 1

biyuan

authored a paper over 1 year ago

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11, 2024 • 55

xzyao

authored a paper over 1 year ago

DeltaZip: Multi-Tenant Language Model Serving via Delta Compression

Paper • 2312.05215 • Published Dec 8, 2023 • 1

Team members: 4
