Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

kuleshov-group
/
caduceus-ph_seqlen-1k_d_model-256_n_layer-4_lr-8e-3

Fill-Mask
Transformers
Safetensors
caduceus
custom_code
Model card Files Files and versions Community
caduceus-ph_seqlen-1k_d_model-256_n_layer-4_lr-8e-3
Ctrl+K
Ctrl+K
  • 1 contributor
History: 7 commits
yairschiff's picture
yairschiff
Ensure weights are tied for BiMamba (if applicable) when loaded from_pretrained
65b2a48 verified 6 months ago
  • .gitattributes
    1.52 kB
    initial commit about 1 year ago
  • README.md
    5.18 kB
    Upload tokenizer about 1 year ago
  • config.json
    1.38 kB
    Upload CaduceusForMaskedLM about 1 year ago
  • configuration_caduceus.py
    1.96 kB
    Upload CaduceusForMaskedLM about 1 year ago
  • model.safetensors
    7.75 MB
    LFS
    Upload CaduceusForMaskedLM about 1 year ago
  • modeling_caduceus.py
    28.6 kB
    Ensure weights are tied for BiMamba (if applicable) when loaded from_pretrained 6 months ago
  • modeling_rcps.py
    9.98 kB
    Enable mambav2 compat 7 months ago
  • special_tokens_map.json
    173 Bytes
    Upload tokenizer about 1 year ago
  • tokenization_caduceus.py
    4.97 kB
    Upload tokenizer about 1 year ago
  • tokenizer_config.json
    1.48 kB
    Upload tokenizer about 1 year ago