Article 🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI? By Kseniase • Dec 25, 2024 • 14
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper • 2504.13169 • Published Apr 17 • 39
Long Reasoning Collection Datasets with reasoning traces for math and code (Train + Eval) • 49 items • Updated Mar 21 • 1
Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 60