Spaces: Running on Zero

Commit: Switched model to OpenELM-3B-Instruct
README.md CHANGED

@@ -1,5 +1,5 @@
 ---
-title: Apple OpenELM-3B
+title: Apple OpenELM-3B-Instruct
 emoji: π
 colorFrom: green
 colorTo: red
@@ -15,7 +15,7 @@ suggested_hardware: t4-small
 
 OpenELM was introduced in [this paper](https://arxiv.org/abs/2404.14619v1).
 
-This Space demonstrates [OpenELM-3B](apple/OpenELM-3B) from Apple. Please, check the original model card for details.
+This Space demonstrates [OpenELM-3B-Instruct](https://huggingface.co/apple/OpenELM-3B-Instruct) from Apple. Please, check the original model card for details.
 You can see the other models of the OpenELM family [here](https://huggingface.co/apple/OpenELM)
 
 # The following Information was taken "as is" from original model card
app.py CHANGED

@@ -12,12 +12,12 @@ DEFAULT_MAX_NEW_TOKENS = 256
 MAX_INPUT_TOKEN_LENGTH = 512
 
 DESCRIPTION = """\
-# OpenELM-3B
+# OpenELM-3B-Instruct
 
-This Space demonstrates [OpenELM-3B](https://huggingface.co/apple/OpenELM-3B) by Apple. Please, check the original model card for details.
+This Space demonstrates [OpenELM-3B-Instruct](https://huggingface.co/apple/OpenELM-3B-Instruct) by Apple. Please, check the original model card for details.
 You can see the other models of the OpenELM family [here](https://huggingface.co/apple/OpenELM)
 The following Colab notebooks are available:
-* [OpenELM-3B (GPU)](https://gist.github.com/Norod/4f11bb36bea5c548d18f10f9d7ec09b0)
+* [OpenELM-3B-Instruct (GPU)](https://gist.github.com/Norod/4f11bb36bea5c548d18f10f9d7ec09b0)
 * [OpenELM-270M (CPU)](https://gist.github.com/Norod/5a311a8e0a774b5c35919913545b7af4)
 
 You might also be interested in checking out Apple's [CoreNet Github page](https://github.com/apple/corenet?tab=readme-ov-file).
@@ -33,8 +33,8 @@ LICENSE = """
 <p/>
 
 ---
-As a derivative work of [OpenELM-3B](https://huggingface.co/apple/OpenELM-3B) by Apple,
-this demo is governed by the original [license](https://huggingface.co/apple/OpenELM-3B/blob/main/LICENSE).
+As a derivative work of [OpenELM-3B-Instruct](https://huggingface.co/apple/OpenELM-3B-Instruct) by Apple,
+this demo is governed by the original [license](https://huggingface.co/apple/OpenELM-3B-Instruct/blob/main/LICENSE).
 """
 
 if not torch.cuda.is_available():
@@ -42,7 +42,7 @@ if not torch.cuda.is_available():
 
 
 if torch.cuda.is_available():
-    model_id = "apple/OpenELM-3B"
+    model_id = "apple/OpenELM-3B-Instruct"
     model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", trust_remote_code=True, low_cpu_mem_usage=True)
     tokenizer_id = "meta-llama/Llama-2-7b-hf"
     tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
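For context, the model/tokenizer setup touched by this commit can be sketched as below. This is a minimal, hedged reconstruction (the Space's actual app.py contains more than this, and the function name here is illustrative): OpenELM repositories do not ship their own tokenizer, which is why the Space pairs the model with the gated Llama-2 tokenizer. It assumes `transformers` and `torch` are installed and that access to `meta-llama/Llama-2-7b-hf` has been granted on the Hub.

```python
# Sketch of the setup shown in the diff above (illustrative, not the full app.py).
MODEL_ID = "apple/OpenELM-3B-Instruct"
TOKENIZER_ID = "meta-llama/Llama-2-7b-hf"  # OpenELM reuses the Llama-2 vocabulary


def load_model_and_tokenizer():
    # Imports are deferred so the sketch can be read/inspected without the
    # heavyweight dependencies installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(TOKENIZER_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        device_map="auto",        # place weights on GPU when one is available
        trust_remote_code=True,   # OpenELM's modeling code lives in its Hub repo
        low_cpu_mem_usage=True,
    )
    return model, tokenizer
```

Note that `trust_remote_code=True` is required because OpenELM's architecture is defined by Python files in the model repository rather than in `transformers` itself; only enable it for repositories you trust.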