djuna/G2-GSHT-32K

Text Generation
Transformers
Safetensors
gemma2
mergekit
Merge
conversational
text-generation-inference
  • 1 contributor
History: 2 commits
djuna
Increase context window using rope base freq
6f3f9ea verified 11 months ago
  • .gitattributes
    1.57 kB
    Duplicate from djuna/G2-GSHT 11 months ago
  • README.md
    1.09 kB
    Duplicate from djuna/G2-GSHT 11 months ago
  • config.json
    905 Bytes
    Increase context window using rope base freq 11 months ago
  • mergekit_config.yml
    372 Bytes
    Duplicate from djuna/G2-GSHT 11 months ago
  • model-00001-of-00005.safetensors
    4.96 GB
    xet
    Duplicate from djuna/G2-GSHT 11 months ago
  • model-00002-of-00005.safetensors
    4.98 GB
    xet
    Duplicate from djuna/G2-GSHT 11 months ago
  • model-00003-of-00005.safetensors
    4.93 GB
    xet
    Duplicate from djuna/G2-GSHT 11 months ago
  • model-00004-of-00005.safetensors
    4.98 GB
    xet
    Duplicate from djuna/G2-GSHT 11 months ago
  • model-00005-of-00005.safetensors
    470 MB
    xet
    Duplicate from djuna/G2-GSHT 11 months ago
  • model.safetensors.index.json
    37.3 kB
    Duplicate from djuna/G2-GSHT 11 months ago
  • special_tokens_map.json
    636 Bytes
    Duplicate from djuna/G2-GSHT 11 months ago
  • tokenizer.json
    17.5 MB
    xet
    Duplicate from djuna/G2-GSHT 11 months ago
  • tokenizer.model
    4.24 MB
    xet
    Duplicate from djuna/G2-GSHT 11 months ago
  • tokenizer_config.json
    41 kB
    Duplicate from djuna/G2-GSHT 11 months ago
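
The latest commit message says the context window was increased "using rope base freq". As a rough illustration of why that can help, the sketch below computes the standard RoPE wavelengths for two base values and shows that a larger base stretches the longest wavelength, so positions further apart remain distinguishable. The head dimension and both base values are illustrative assumptions; they are not read from this repository's config.json.

```python
import math

def rope_wavelengths(head_dim, base):
    """Wavelength, in token positions, of each RoPE frequency pair."""
    # Standard RoPE: inv_freq_i = base ** (-2i / head_dim), i = 0 .. head_dim/2 - 1
    inv_freq = [base ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]
    return [2 * math.pi / f for f in inv_freq]

HEAD_DIM = 128            # hypothetical head dimension
SMALL_BASE = 10_000.0     # hypothetical original base
LARGE_BASE = 160_000.0    # hypothetical increased base

small = rope_wavelengths(HEAD_DIM, SMALL_BASE)
large = rope_wavelengths(HEAD_DIM, LARGE_BASE)

# The slowest-rotating pair has the longest wavelength.
longest_small = max(small)
longest_large = max(large)

print(f"longest wavelength, base {SMALL_BASE:g}: {longest_small:,.0f} positions")
print(f"longest wavelength, base {LARGE_BASE:g}: {longest_large:,.0f} positions")
```

Whether a base change alone yields usable quality at 32K tokens depends on the model and is not something this listing confirms; the sketch only shows the geometric effect on the position encoding.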