12 2

Tomer Asida

tomeras1

AI & ML interests

None yet

Recent Activity

liked a model 2 months ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

liked a model 3 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1

new activity 8 months ago

ai21labs/AI21-Jamba-Mini-1.5:Run with Transformers got Error: Tensor on device meta is not on the expected device cuda:0

View all activity

Organizations

tomeras1's activity

liked a model 2 months ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

Text Generation • Updated about 1 month ago • 10.6k • • 308

liked a model 3 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1

Text Generation • Updated 8 days ago • 182k • • 295

New activity in ai21labs/AI21-Jamba-Mini-1.5 8 months ago

Run with Transformers got Error: Tensor on device meta is not on the expected device cuda:0

#14 opened 8 months ago by

taozhang9527

updated a model 8 months ago

ai21labs/Jamba-tiny-dev

Updated Oct 1, 2024 • 19.6k • 11

updated 2 models 9 months ago

ai21labs/AI21-Jamba-Large-1.5

Text Generation • Updated Mar 6 • 2.82k • 215

ai21labs/AI21-Jamba-Mini-1.5

Text Generation • Updated Mar 6 • 6.15k • 268

authored a paper 9 months ago

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Paper • 2408.12570 • Published Aug 22, 2024 • 34

New activity in ai21labs/Jamba-v0.1 10 months ago

Update LoRA fine-tune example - more target_modules, lower LR, bf16

#49 opened 10 months ago by

michael-go

New activity in ai21labs/Jamba-v0.1 about 1 year ago

Move to in-library checkpoint

#43 opened about 1 year ago by

tomeras1

Move to in-library checkpoint

#42 opened about 1 year ago by

tomeras1

authored a paper about 1 year ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 111

New activity in ai21labs/Jamba-v0.1 about 1 year ago

Fix small typo in JambaDecoder

#24 opened about 1 year ago by

mber

Remove TGI tag

➕ 1

#8 opened about 1 year ago by

osanseviero

Fix bias logic to enable QLoRA finetuning

👍 2

#5 opened about 1 year ago by

winglian

Update modeling_jamba.py - LoRA support in Mamba

#6 opened about 1 year ago by

tomeras1

Fix bias logic to enable QLoRA finetuning

👍 2

#5 opened about 1 year ago by

winglian