ssz

ssz1111

8 19 16

AI & ML interests

None yet

Recent Activity

updated a collection 27 days ago

SpokenWOZ

updated a collection 28 days ago

GATEAU

updated a collection 28 days ago

FaithLens

Organizations

upvoted 4 papers about 1 month ago

SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories

Paper • 2606.01311 • Published May 31 • 37

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published May 28 • 250

Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement

Paper • 2605.26952 • Published May 26 • 16

upvoted a paper 2 months ago

A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Paper • 2605.06200 • Published May 7 • 15

upvoted an article 6 months ago

Merge Large Language Models with mergekit

Article • mlabonne • Jan 9, 2024 • 156

upvoted a paper 6 months ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

upvoted a paper 7 months ago

FaithLens: Detecting and Explaining Faithfulness Hallucination

Paper • 2512.20182 • Published Dec 23, 2025 • 9

upvoted a paper 8 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 218

upvoted 2 papers 9 months ago

A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks

Paper • 2510.05608 • Published Oct 7, 2025 • 4

BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

Paper • 2510.05318 • Published Oct 6, 2025 • 22

upvoted 4 papers about 1 year ago

SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications

Paper • 2506.18951 • Published Jun 23, 2025 • 22

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning

Paper • 2505.16483 • Published May 22, 2025 • 10

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published May 17, 2025 • 40

upvoted 3 papers over 1 year ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6, 2025 • 23

Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement

Paper • 2410.15633 • Published Oct 21, 2024 • 7

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents

Paper • 2305.13040 • Published May 22, 2023 • 2

upvoted a paper about 2 years ago

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 30
