CopeNLU

university

http://www.copenlu.com

AI & ML interests

natural language understanding, low-resource learning, fact checking, explainable AI

copenlu's activity

Lo

updated a dataset 6 days ago

copenlu/cmt-benchmark-nq

Viewer • Updated 6 days ago • 46.3k • 509

Lo

updated 2 datasets 13 days ago

copenlu/cmt-benchmark-druid

Viewer • Updated 13 days ago • 40.5k • 446

copenlu/cmt-benchmark-counterfact

Viewer • Updated 13 days ago • 24.3k • 533

kera7

updated a dataset 22 days ago

copenlu/mm-framing

Viewer • Updated 22 days ago • 633k • 161 • 3

Karolina

authored a paper 28 days ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published Apr 11 • 27

Karolina

authored a paper about 1 month ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 84

spaidartaigar

authored a paper about 1 month ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 84

namin0202

authored 6 papers about 2 months ago

Can Large Language Models Infer and Disagree Like Humans?

Paper • 2305.13788 • Published May 23, 2023

Stable Language Model Pre-training by Reducing Embedding Variability

Paper • 2409.07787 • Published Sep 12, 2024

Diffusion Models Through a Global Lens: Are They Culturally Inclusive?

Paper • 2502.08914 • Published Feb 13

When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts

Paper • 2503.16826 • Published Mar 21

Can LVLMs and Automatic Metrics Capture Underlying Preferences of Blind and Low-Vision Individuals for Navigational Aid?

Paper • 2502.14883 • Published Feb 15

Sightation Counts: Leveraging Sighted User Feedback in Building a BLV-aligned Dataset of Diagram Descriptions

Paper • 2503.13369 • Published Mar 17 • 7

Karolina

authored 2 papers 2 months ago

A Latent-Variable Model for Intrinsic Probing

Paper • 2201.08214 • Published Jan 20, 2022 • 1

Social Bias Probing: Fairness Benchmarking for Language Models

Paper • 2311.09090 • Published Nov 15, 2023 • 2

IAugenstein

authored a paper 3 months ago

Can Community Notes Replace Professional Fact-Checkers?

Paper • 2502.14132 • Published Feb 19 • 6

Nadav

authored a paper 3 months ago

Can Community Notes Replace Professional Fact-Checkers?

Paper • 2502.14132 • Published Feb 19 • 6

zainmujahid

authored a paper 3 months ago

Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts

Paper • 2502.13640 • Published Feb 19

IAugenstein

authored a paper 3 months ago

Unstructured Evidence Attribution for Long Context Query Focused Summarization

Paper • 2502.14409 • Published Feb 20 • 3

zainmujahid

authored a paper 3 months ago

Unstructured Evidence Attribution for Long Context Query Focused Summarization

Paper • 2502.14409 • Published Feb 20 • 3