Byron Gibson (bgibson)
AI & ML interests: None yet
Organizations: None yet
papers
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model — Paper • 2401.09417 • Published • 63
- MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts — Paper • 2401.04081 • Published • 73
- SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention — Paper • 2312.07987 • Published • 41
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models — Paper • 2401.06066 • Published • 55
llm-models

llm-local
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory — Paper • 2312.11514 • Published • 258
- Transformers to Core ML ⚡ — Space • Running
- enterprise-explorers/Llama-2-7b-chat-coreml — Text Generation • Updated • 2.62k • 138
- tiiuae/falcon-7b-instruct — Text Generation • 7B • Updated • 143k • 1k
llm-datasets
llm-analysis