Collections
Discover the best community collections!
Collections trending this week
- Sparse Backpropagation for MoE Training
  Paper • 2310.00811 • Published • 2
- The Forward-Forward Algorithm: Some Preliminary Investigations
  Paper • 2212.13345 • Published • 2
- Fine-Tuning Language Models with Just Forward Passes
  Paper • 2305.17333 • Published • 3
- Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation
  Paper • 2309.13192 • Published • 1

- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
  Paper • 2210.17432 • Published • 1
- TESS: Text-to-Text Self-Conditioned Simplex Diffusion
  Paper • 2305.08379 • Published • 3
- Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
  Paper • 2308.12219 • Published • 1
- CodeFusion: A Pre-trained Diffusion Model for Code Generation
  Paper • 2310.17680 • Published • 73

- A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
  Paper • 2302.06218 • Published • 1
- ZeRO++: Extremely Efficient Collective Communication for Giant Model Training
  Paper • 2306.10209 • Published • 2
- SE-MoE: A Scalable and Efficient Mixture-of-Experts Distributed Training and Inference System
  Paper • 2205.10034 • Published • 1
- A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training
  Paper • 2303.06318 • Published • 1