gigant
·
AI & ML interests
multimodal
Recent Activity
Organizations
gigant/led_tib
Text Generation
•
0.2B
•
Updated
•
22
gigant/SmolLM-mc4-500-rawrope
Text Generation
•
0.1B
•
Updated
•
6
gigant/SmolLM-mc4-500-ropescaled
Text Generation
•
0.1B
•
Updated
•
7
gigant/SmolLM-500-rawrope
Text Generation
•
0.1B
•
Updated
•
8
gigant/SmolLM-500-ropescaled
Text Generation
•
0.1B
•
Updated
•
6
gigant/SmolLM-135M-ft-500-steps
Text Generation
•
0.1B
•
Updated
•
9
gigant/SmolLM-135M-rescaled-ft-500-steps
Text Generation
•
0.1B
•
Updated
•
7
gigant/SmolLM-135M-unjetlagged-200-steps
Updated
gigant/SmolLM-135M-full-unjetlagged-2000-steps
Updated
gigant/SmolLM-135M-full-jetlagged-200-steps
Text Generation
•
0.1B
•
Updated
•
8
gigant/SmolLM-135M-full-unjetlagged-200-steps
Text Generation
•
0.1B
•
Updated
•
6
gigant/SmolLM-135M-jetlagged-200-steps
Updated
gigant/SmolLM-135M-unjetlagged
Updated
gigant/SmolLM-135M-scaled-rope-sw
Text Generation
•
0.1B
•
Updated
•
5
gigant/flan-t5fire-small
Text Generation
•
0.1B
•
Updated
•
4
gigant/graphlongt5-structural-dependency-0408
Text Generation
•
Updated
•
3
gigant/longt5-0322
Text Generation
•
Updated
•
3
gigant/graphlongt5-structural-0324
Text Generation
•
Updated
•
3
gigant/graphlongt5-dependency-0322
Text Generation
•
Updated
•
3
gigant/graphlongt5-globallocal-0322
Text Generation
•
Updated
•
3
gigant/graphlongt5-structural-0320
Text Generation
•
Updated
•
3
gigant/graphlongt5-dependency-0308
Text Generation
•
Updated
•
3
gigant/graphlongt5-globallocal-0308
Text Generation
•
Updated
•
3
gigant/longt5-0229
Text Generation
•
Updated
•
3
gigant/graphlongt5-globallocal-0228
Text Generation
•
Updated
•
3
gigant/graphlongt5-dependency-0228
Text Generation
•
Updated
•
3
gigant/longt5-global-3epoch
Text Generation
•
Updated
•
4
gigant/graph-t5-global-window-8k-longt5local
Text Generation
•
Updated
•
5
gigant/graph-t5-global-window-8k-tib
Text Generation
•
Updated
•
5
gigant/gt5-wip
Text Generation
•
Updated
•
5