mesolitica's Collections

Malaysian Seq2Seq

updated Jun 24

Trained on 17B tokens (81 GB of cleaned text). The models understand standard Malay, local Malay, local Mandarin, Manglish, and local Tamil.

  • mesolitica/nanot5-small-malaysian-cased

    0.1B • Updated Apr 24, 2024 • 6

  • mesolitica/nanot5-base-malaysian-cased

    0.2B • Updated Apr 15, 2024 • 18

  • mesolitica/nanot5-large-malaysian-cased

    0.8B • Updated Apr 18, 2024 • 5

  • mesolitica/t5-tiny-standard-bahasa-cased

    Feature Extraction • Updated Oct 6, 2022 • 6

  • mesolitica/t5-small-bahasa-cased

    Updated Oct 6, 2022 • 42

  • mesolitica/t5-super-tiny-bahasa-cased

    Updated Oct 6, 2022 • 3

  • mesolitica/t5-super-super-tiny-standard-bahasa-cased

    Feature Extraction • Updated Oct 6, 2022 • 6 • 1

  • mesolitica/t5-3x-super-tiny-standard-bahasa-cased

    Feature Extraction • Updated Oct 9, 2022 • 4