Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Jason Wolosonovich's picture

Jason Wolosonovich

wolosonovich

21world's profile picture

·

jmwoloso

AI & ML interests

None yet

Organizations

Collections 2

DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models

Paper • 2309.14509 • Published Sep 25, 2023 • 20
LLM Augmented LLMs: Expanding Capabilities through Composition

Paper • 2401.02412 • Published Jan 4, 2024 • 39
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 56
Tuning Language Models by Proxy

Paper • 2401.08565 • Published Jan 16, 2024 • 24

Equall/perplexity_evaluation

Viewer • Updated Feb 20, 2024 • 3.13k • 62 • 3
Equall/Saul-7B-Base

Text Generation • 7B • Updated Mar 10, 2024 • 127 • 30
Equall/Saul-7B-Instruct-v1

Text Generation • 7B • Updated Mar 10, 2024 • 2.01k • 98
SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6, 2024 • 89

DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models

Paper • 2309.14509 • Published Sep 25, 2023 • 20
LLM Augmented LLMs: Expanding Capabilities through Composition

Paper • 2401.02412 • Published Jan 4, 2024 • 39
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 56
Tuning Language Models by Proxy

Paper • 2401.08565 • Published Jan 16, 2024 • 24

Equall/perplexity_evaluation

Viewer • Updated Feb 20, 2024 • 3.13k • 62 • 3
Equall/Saul-7B-Base

Text Generation • 7B • Updated Mar 10, 2024 • 127 • 30
Equall/Saul-7B-Instruct-v1

Text Generation • 7B • Updated Mar 10, 2024 • 2.01k • 98
SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6, 2024 • 89

models 0

None public yet

datasets 0

None public yet

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs