BigScience Workshop

non-profit

https://bigscience.huggingface.co

bigscienceW

bigscience-workshop

AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

hyunwoongko authored a paper 1 day ago

Kanana: Compute-efficient Bilingual Language Models

nguyenvulebinh authored a paper 2 days ago

MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models

armanc authored a paper 8 days ago

TESS 2: A Large-Scale Generalist Diffusion Language Model

View all activity

bigscience's activity

IAugenstein

authored a paper 3 days ago

Can Community Notes Replace Professional Fact-Checkers?

Paper • 2502.14132 • Published 9 days ago • 5

IAugenstein

authored a paper 7 days ago

Unstructured Evidence Attribution for Long Context Query Focused Summarization

Paper • 2502.14409 • Published 8 days ago • 3

IAugenstein

authored 18 papers 8 days ago

SemEval 2017 Task 10: ScienceIE - Extracting Keyphrases and Relations from Scientific Publications

Paper • 1704.02853 • Published Apr 10, 2017 • 1

A Supervised Approach to Extractive Summarisation of Scientific Papers

Paper • 1706.03946 • Published Jun 13, 2017 • 1

MultiFC: A Real-World Multi-Domain Dataset for Evidence-Based Fact Checking of Claims

Paper • 1909.03242 • Published Sep 7, 2019 • 1

Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions

Paper • 2310.05779 • Published Oct 9, 2023 • 1

SubjQA: A Dataset for Subjectivity and Review Comprehension

Paper • 2004.14283 • Published Apr 29, 2020 • 1

PHD: Pixel-Based Language Modeling of Historical Documents

Paper • 2310.18343 • Published Oct 22, 2023 • 2

People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection

Paper • 2311.01270 • Published Nov 2, 2023 • 1

Factcheck-GPT: End-to-End Fine-Grained Document-Level Fact-Checking and Correction of LLM Output

Paper • 2311.09000 • Published Nov 15, 2023 • 1

Semi-Supervised Exaggeration Detection of Health Science Press Releases

Paper • 2108.13493 • Published Aug 30, 2021 • 1

A Latent-Variable Model for Intrinsic Probing

Paper • 2201.08214 • Published Jan 20, 2022 • 1

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

Paper • 2202.06671 • Published Feb 14, 2022 • 2

Modeling Information Change in Science Communication with Semantically Matched Paraphrases

Paper • 2210.13001 • Published Oct 24, 2022 • 1

Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language

Paper • 2406.17753 • Published Jun 25, 2024 • 1

From Internal Conflict to Contextual Adaptation of Language Models

Paper • 2407.17023 • Published Jul 24, 2024 • 1

FLARE: Faithful Logic-Aided Reasoning and Exploration

Paper • 2410.11900 • Published Oct 14, 2024 • 4

SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages

Paper • 2406.14425 • Published Jun 20, 2024 • 2

Adapting Neural Link Predictors for Data-Efficient Complex Query Answering

Paper • 2301.12313 • Published Jan 29, 2023 • 1

Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models

Paper • 2401.14440 • Published Jan 25, 2024 • 1