wbag
Walmart-the-bag
AI & ML interests
Merging, finetuning, and pretraining LLMs.
Recent Activity
liked
a model
3 days ago
tencent/HunyuanVideo
new activity
29 days ago
PR-Puppets/PR-Puppet-Sora: 🚩 Report: Legal issue(s)
updated
a model
about 2 months ago
mlx-community/Qwen2.5.1-Coder-7B-Instruct-4bit
Organizations
Walmart-the-bag's activity
posted
an
update
2 months ago
reacted to
merve's
post
3 months ago
Post
2844
This is not a drill
HuggingChat is now multimodal with meta-llama/Llama-3.2-11B-Vision-Instruct!
This also comes with multimodal assistants. I have migrated my Marcus Aurelius advice assistant to Llama-Vision, and Marcus can see now!
Chat with Marcus: https://hf.co/chat/assistant/65bfed22022ba290531112f8
Start chatting with Llama-Vision 3.2 11B Instruct: https://huggingface.co/chat/models/meta-llama/Llama-3.2-11B-Vision-Instruct
reacted to
KingNish's
post
3 months ago
Post
3148
A super good and fast image inpainting demo is here.
It's super cool and realistic.
Demo by @OzzyGT (Must try):
OzzyGT/diffusers-fast-inpaint
reacted to
KingNish's
post
7 months ago
Post
4632
Microsoft Just Launched 3 Powerful Models
1. Phi 3 Medium (4k and 128k): 14B instruction-tuned models that outperformed big models like Command R+ (104B), GPT 3.5 Pro, and Gemini Pro, and are highly competitive with top models such as Mixtral 8x22B, Llama 3 70B, and GPT 4.
microsoft/Phi-3-medium-4k-instruct
DEMO: https://huggingface.co/spaces/Walmart-the-bag/Phi-3-Medium
2. Phi 3 Mini Vision 128k: A 4.5 billion-parameter, instruction-tuned vision model that has outperformed models such as Llava3 and Claude 3, and is providing stiff competition to Gemini 1.0 Pro Vision.
microsoft/Phi-3-vision-128k-instruct
3. Phi 3 Small (8k and 128k): Better than Llama 3 8B, Mixtral 8x7B, and GPT 3.5 Turbo.
microsoft/Phi-3-small-128k-instruct
posted
an
update
7 months ago
Post
1658
Phi-3-Medium just came out! So far it's decent (it fails a few riddles). Try it for yourself and let me know how it is.
Original Model: microsoft/Phi-3-medium-128k-instruct
Test it out: https://huggingface.co/spaces/Walmart-the-bag/Phi-3-medium *running on ZERO gpu*
posted
an
update
7 months ago
Post
2153
Mm what a good time for a new merge!
This is a merge of 6 models that were finetuned on Llama 3 8B. It has done pretty decently on some coding tasks for its parameter size. I looked through models because a lot of people cannot run 33B models (DeepSeek) for coding.
Original Model: Walmart-the-bag/Llama-3-LizardCoder-8B
GGUF: Walmart-the-bag/Llama-3-LizardCoder-8B-GGUF
posted
an
update
8 months ago
Post
1390
Juggernaut X V10 is pretty good. It's a few weeks old but not very popular. Try it out and let me know what you guys think. I think it is pretty good for daily use.
Original Model: RunDiffusion/Juggernaut-X-v10
Test it out: Walmart-the-bag/Juggernaut-X-v10
Author: https://huggingface.co/RunDiffusion
posted
an
update
8 months ago
Post
1058
Replete-AI/code_bagel
Make the ultimate coding finetune to compete with the likes of closed-source models using the code_bagel dataset!
Made by @rombodawg of Replete-AI, the code_bagel dataset contains over 800 million tokens of deduplicated and uncensored code from only reputable sources on Hugging Face. The code is formatted in the Alpaca instruct format for ease of use in training.