Cerebras

company

Verified

https://www.cerebras.net/

CerebrasSystems

Cerebras

AI & ML interests

None defined yet.

Recent Activity

lazarevich updated a dataset 3 days ago

cerebras/Synth-Long-SFT32K

lazarevich published a dataset 3 days ago

cerebras/Synth-Long-SFT32K

lazarevich updated a dataset 4 days ago

cerebras/Synth-Long-SFT32K

View all activity

cerebras's activity

lazarevich

updated a dataset 3 days ago

cerebras/Synth-Long-SFT32K

Viewer • Updated 9 days ago • 68k • 18 • 2

lazarevich

published a dataset 3 days ago

cerebras/Synth-Long-SFT32K

Viewer • Updated 9 days ago • 68k • 18 • 2

daniel-cerebras

updated a Space 5 months ago

Chain Of Thought

Generate detailed, step-by-step responses to queries using AI

rohand

updated 2 datasets 6 months ago

cerebras/HybridDialogue

Viewer • Updated Aug 19, 2024 • 19.9k • 57 • 2

cerebras/TAT-QA-Arithmetic-CoT

Viewer • Updated Aug 19, 2024 • 8.33k • 75 • 4

rohand

updated 3 models 7 months ago

cerebras/Dragon-DocChat-Context-Encoder

Updated Aug 16, 2024 • 8 • 2

cerebras/Dragon-DocChat-Query-Encoder

Updated Aug 16, 2024 • 3 • 1

cerebras/Llama3-DocChat-1.0-8B

Text Generation • Updated Aug 16, 2024 • 137 • 69

qanthony

authored a paper 7 months ago

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

abhaygupta

authored a paper 7 months ago

DAiSEE: Towards User Engagement Recognition in the Wild

Paper • 1609.01885 • Published Sep 7, 2016

qanthony

authored 4 papers 8 months ago

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 34

Zyda: A 1.3T Dataset for Open Language Modeling

Paper • 2406.01981 • Published Jun 4, 2024 • 3

Comparative Study of Large Language Model Architectures on Frontier

Paper • 2402.00691 • Published Feb 1, 2024

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 50

abhaygupta

authored 4 papers 10 months ago

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Paper • 2206.14098 • Published Jun 28, 2022

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Paper • 2303.10464 • Published Mar 18, 2023 • 1

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Paper • 2303.11525 • Published Mar 21, 2023 • 1

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7

YX-Cerebras

updated a model 10 months ago

cerebras/Cerebras-GPT-Intermediate

Text Generation • Updated Apr 23, 2024

aarticerebras

updated a model 12 months ago

cerebras/Cerebras-LLaVA-13B

Text Generation • Updated Mar 19, 2024 • 27 • 4