| pipeline_tag: text-generation | |
| inference: false | |
| tags: | |
| - meta | |
| - llama | |
| - llama-2 | |
| - codellama | |
| - mlx | |
| # **CodeLlama** | |
| Code Llama is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 34 billion parameters. This model is designed for general code synthesis and understanding. This is the repository for the 7B base model, in `npz` format suitable for use in Apple's MLX framework. | |
| Weights have been converted to `float16` from the original `bfloat16` type, because `numpy` is not compatible with `bfloat16` out of the box. | |
| How to use with [MLX](https://github.com/ml-explore/mlx). | |
| ```bash | |
| # Install mlx, mlx-examples, huggingface-cli | |
| pip install mlx | |
| pip install huggingface_hub hf_transfer | |
| git clone https://github.com/ml-explore/mlx-examples.git | |
| # Download model | |
| export HF_HUB_ENABLE_HF_TRANSFER=1 | |
| huggingface-cli download --local-dir models --local-dir-use-symlinks False mlx-llama/CodeLlama-7b-mlx CodeLlama-7b-mlx | |
| # Run example | |
| python mlx-examples/llama/llama.py CodeLlama-7b-mlx CodeLlama-7b-mlx/tokenizer.model "My name is " | |
| ``` | |
| Please, refer to the [original model card](https://github.com/facebookresearch/codellama/blob/main/MODEL_CARD.md) for details on CodeLlama. | |