-
meta-llama/Llama-2-7b-hf
Text Generation • Updated • 1.27M • 1.78k -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 181 -
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
Paper • 2401.04398 • Published • 20 -
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models
Paper • 2402.01118 • Published • 29
Alina
iblub
·
AI & ML interests
None yet
Organizations
None yet
Collections
2
models
21
iblub/idefics2-8b-mwm-finetuned-qlora_8bit_10e
Updated
iblub/idefics2-8b-mwm-finetuned-8bit
Updated
iblub/idefics2-8b-mwm-finetuned
Updated
iblub/idefics2-8b-docvqa-finetuned-tutorial
Updated
iblub/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
•
1
iblub/detr-finetuned-balloon
Object Detection
•
Updated
•
38
iblub/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
iblub/ppo-lunar-lander-week8
Reinforcement Learning
•
Updated
iblub/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
13
iblub/ppo-Pyramid
Reinforcement Learning
•
Updated
•
6
datasets
None public yet