BigScience Workshop

non-profit

https://bigscience.huggingface.co

bigscience-workshop

AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

WojciechKusa authored a paper 8 days ago

Multilingual Refusal Alignment for Safer Large Language Models

WojciechKusa authored a paper 8 days ago

Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

christopher new activity 17 days ago

bigscience/mt0-large:why mt0-large is 1.3B while mt5-large is 780M?

View all activity

authored 2 papers 8 days ago

Multilingual Refusal Alignment for Safer Large Language Models

Paper • 2606.07535 • Published Apr 24

Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?

Paper • 2606.12250 • Published 10 days ago

in bigscience/mt0-large 17 days ago

why mt0-large is 1.3B while mt5-large is 780M?

#6 opened over 1 year ago by

in bigscience/bloom-560m 17 days ago

Geração de Texto

#63 opened 7 months ago by

alcidesmoreira1963

Adding Evaluation Results

#61 opened over 2 years ago by

leaderboard-pr-bot

in bigscience/T0 17 days ago

Hosted inference API: 500 Internal Server Error returned

#4 opened over 3 years ago by

in bigscience/bloom-1b1 17 days ago

Adding Evaluation Results

#41 opened over 2 years ago by

leaderboard-pr-bot

Adding Evaluation Results

#42 opened almost 2 years ago by

leaderboard-pr-bot

Add evaluation results on the mathemakitten--winobias_antistereotype_test config and test split of mathemakitten/winobias_antistereotype_test

#32 opened over 3 years ago by

System Requirements

#38 opened over 3 years ago by

Request: DOI

#43 opened over 1 year ago by

authored 2 papers 2 months ago

Scaling Low-Resource MT via Synthetic Data Generation with LLMs

Paper • 2505.14423 • Published May 20, 2025 • 2

Open Machine Translation for Esperanto

Paper • 2603.29345 • Published Mar 31

authored a paper 3 months ago

RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

Paper • 2603.09723 • Published Mar 10 • 7

in bigscience/bloom 4 months ago

pretokenizer Regex issues?

#278 opened almost 2 years ago by

Test PR

#286 opened 4 months ago by

Test discussion

#287 opened 4 months ago by

Test discussion

#288 opened 4 months ago by

authored a paper 4 months ago

References Improve LLM Alignment in Non-Verifiable Domains

Paper • 2602.16802 • Published Feb 18 • 2

authored a paper 4 months ago

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models

Paper • 2602.16609 • Published Feb 18 • 9