Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mesolitica 's Collections
Audio Language Model
Malaysian Reasoning
Malaysian Finetuned Instruct LoRA
Malaysian Speech-to-Text
Malaysian Text-to-Speech
Malaysian Translation
Malaysian pretraining dataset
Malaysian instruction dataset
MaLLaM 🌙
Malaysian CausalLM
Malaysian LLM2Vec
Malaysian Seq2Seq
Malaysian MaskLM

MaLLaM 🌙

updated 7 days ago

Pretrain from scratch 4096 context length on 90B tokens Malaysian text, https://huggingface.co/papers/2401.14680

Upvote
15

  • mesolitica/mallam-1.1B-4096

    Text Generation • 1B • Updated Oct 7, 2024 • 154 • 9

  • mesolitica/mallam-3B-4096

    Text Generation • 3B • Updated Oct 7, 2024 • 62 • 1

  • mesolitica/mallam-5B-4096

    Text Generation • 5B • Updated Oct 13, 2024 • 62 • 2

  • mesolitica/mallam-1.1b-20k-instructions

    Text Generation • 1B • Updated Dec 19, 2023 • 27 • 1

  • mesolitica/mallam-1.1b-20k-instructions-v2

    Text Generation • 1B • Updated Jan 25, 2024 • 73

  • mesolitica/mallam-3b-20k-instructions

    Text Generation • 3B • Updated Dec 16, 2023 • 25

  • mesolitica/mallam-5b-20k-instructions

    Text Generation • 5B • Updated Dec 17, 2023 • 20 • 1

  • mesolitica/mallam-5b-20k-instructions-v2

    Text Generation • 5B • Updated Jan 25, 2024 • 113 • 1
Upvote
15
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs