The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 590
Improving Access to Justice for the Indian Population: A Benchmark for Evaluating Translation of Legal Text to Indian Languages
Paper • 2310.09765 • Published
joelniklaus/legal_case_document_summarization
Viewer • Updated • 7.97k • 3 • 12
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Paper • 2308.11462 • Published • 2
Collections including paper arxiv:2402.17764
-
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Paper • 2402.14905 • Published • 107
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 590
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 123
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 103
-
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 250
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study
Paper • 2404.14047 • Published • 43
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 590
-
meta-llama/Llama-2-7b-chat-hf
Text Generation • Updated • 709k • 3.85k
ibm-granite/granite-8b-code-instruct-4k
Text Generation • Updated • 3.49k • 102
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 590
🏆 Low-bit Quantized Open LLM Leaderboard
Space • 134 • Track, rank and evaluate open LLMs and chatbots
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 590
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 96
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Paper • 2404.02258 • Published • 103
TransformerFAM: Feedback attention is working memory
Paper • 2404.09173 • Published • 43
-
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
Paper • 1606.06160 • Published • 1
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 590
mobiuslabsgmbh/Llama-2-7b-chat-hf_1bitgs8_hqq
Text Generation • Updated • 23 • 74