Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
12
2
Tomer Asida
tomeras1
Follow
akhaliq's profile picture
0xrizzler's profile picture
thomwolf's profile picture
6 followers
·
2 following
AI & ML interests
None yet
Recent Activity
liked
a model
15 days ago
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
liked
a model
about 1 month ago
nvidia/Llama-3_3-Nemotron-Super-49B-v1
new
activity
7 months ago
ai21labs/AI21-Jamba-Mini-1.5:
Run with Transformers got Error: Tensor on device meta is not on the expected device cuda:0
View all activity
Organizations
tomeras1
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
15 days ago
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
Text Generation
•
Updated
4 days ago
•
18.1k
•
•
263
liked
a model
about 1 month ago
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation
•
Updated
14 days ago
•
126k
•
•
267
New activity in
ai21labs/AI21-Jamba-Mini-1.5
7 months ago
Run with Transformers got Error: Tensor on device meta is not on the expected device cuda:0
3
#14 opened 7 months ago by
taozhang9527
updated
3 models
7 months ago
ai21labs/Jamba-tiny-dev
Updated
Oct 1, 2024
•
14.8k
•
12
ai21labs/AI21-Jamba-Large-1.5
Text Generation
•
Updated
Mar 6
•
2.67k
•
217
ai21labs/AI21-Jamba-Mini-1.5
Text Generation
•
Updated
Mar 6
•
5.6k
•
266
authored
a paper
8 months ago
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Paper
•
2408.12570
•
Published
Aug 22, 2024
•
34
New activity in
ai21labs/Jamba-v0.1
8 months ago
Update LoRA fine-tune example - more target_modules, lower LR, bf16
#49 opened 8 months ago by
michael-go
New activity in
ai21labs/Jamba-v0.1
12 months ago
Move to in-library checkpoint
#43 opened 12 months ago by
tomeras1
Move to in-library checkpoint
#42 opened 12 months ago by
tomeras1
authored
a paper
about 1 year ago
Jamba: A Hybrid Transformer-Mamba Language Model
Paper
•
2403.19887
•
Published
Mar 28, 2024
•
111
New activity in
ai21labs/Jamba-v0.1
about 1 year ago
Fix small typo in JambaDecoder
1
#24 opened about 1 year ago by
mber
Remove TGI tag
1
1
#8 opened about 1 year ago by
osanseviero
Fix bias logic to enable QLoRA finetuning
2
3
#5 opened about 1 year ago by
winglian
Update modeling_jamba.py - LoRA support in Mamba
#6 opened about 1 year ago by
tomeras1
Fix bias logic to enable QLoRA finetuning
2
3
#5 opened about 1 year ago by
winglian
Load more