CMU-LTI

university

LTIatCMU

cmu-lti's activity

wlqmfl1999

authored 2 papers 3 days ago

Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition

Paper • 2403.16158 • Published Mar 24, 2024

Measuring Sycophancy of Language Models in Multi-turn Dialogues

Paper • 2505.23840 • Published 10 days ago • 1

seungone

authored a paper 4 days ago

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

Paper • 2506.01789 • Published 5 days ago • 12

seungone

authored a paper 9 days ago

Let's Predict Sentence by Sentence

Paper • 2505.22202 • Published 11 days ago • 17

aashiqmuhamed

authored 2 papers 12 days ago

SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs

Paper • 2504.08192 • Published Apr 11 • 4

CoRAG: Collaborative Retrieval-Augmented Generation

Paper • 2504.01883 • Published Apr 2 • 10

seungone

authored a paper 12 days ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published 18 days ago • 99

lwaekfjlk

authored a paper 12 days ago

Time-R1: Towards Comprehensive Temporal Reasoning in LLMs

Paper • 2505.13508 • Published 22 days ago • 14

seungone

authored a paper 12 days ago

FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS

Paper • 2505.16409 • Published 17 days ago • 2

aashiqmuhamed

authored a paper 12 days ago

Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs

Paper • 2505.20254 • Published 12 days ago • 5

Xuhui

updated a dataset 14 days ago

cmu-lti/interactive-swe

Viewer • Updated 14 days ago • 500 • 100

Xuhui

published a dataset 16 days ago

cmu-lti/interactive-swe

Viewer • Updated 14 days ago • 500 • 100

seungone

authored a paper 18 days ago

Reasoning Models Better Express Their Confidence

Paper • 2505.14489 • Published 18 days ago • 19

gneubig

authored a paper 23 days ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published 24 days ago • 25

seungone

authored a paper 23 days ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published 24 days ago • 25

ProKil

authored a paper 30 days ago

AutoLibra: Agent Metric Induction from Open-Ended Feedback

Paper • 2505.02820 • Published May 5 • 3

SiddharthY

updated a collection about 1 month ago

CAIRE

Evaluation tool to assess the cultural relevance of images for user-defined culture labels • 5 items • Updated about 1 month ago

SiddharthY

updated a dataset about 1 month ago

cmu-lti/caire-universal

Viewer • Updated about 1 month ago • 400 • 54

SiddharthY

published a dataset about 1 month ago

cmu-lti/caire-universal

Viewer • Updated about 1 month ago • 400 • 54
