Kevin King PRO
NeoCodes-dev
AI & ML interests
Deep RL, RL for LLMs
Recent Activity
updated
a collection
10 days ago
VLMs - Robotics
updated
a collection
11 days ago
DataSets
Organizations
ARC-AGI2
VLMs - Robotics
Embedding Models
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 87 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 145 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 39 • 1 -
Running1313
CrewAI Gradio Support Agent
👁Build support agent with CrewAI multi-agents and Gradio
Datasets - CryptoSage
VLMs
Agents
Classifier Models
LLMs
Datasets - MultiModal
Agent-Specific/Function-Calling Models
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 1.25k • 9 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 44.2k • 10 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 2.98k • 16 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.49k • 57
MMMs
Models - CryptoSage
Datasets - Reasoning
Spaces
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 64 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 45 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 80
DataSets
Datasets - Coding
Datasets - MultiModal
ARC-AGI2
Agent-Specific/Function-Calling Models
VLMs - Robotics
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 1.25k • 9 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 44.2k • 10 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 2.98k • 16 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.49k • 57
Embedding Models
MMMs
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 87 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 145 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 39 • 1 -
Running1313
CrewAI Gradio Support Agent
👁Build support agent with CrewAI multi-agents and Gradio
Models - CryptoSage
Datasets - CryptoSage
Datasets - Reasoning
VLMs
Spaces
Agents
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 64 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 45 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 80
Classifier Models
DataSets
LLMs