Spaces:

avanigupta
/

dataqa-env

Running

App Files Files Community

dataqa-env / dataqa_env /server /tasks.py

Commit History

remove ambiguous moderation rows, replace with clear-cut examples

fcce834

Running

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

replace ambiguous salary issue with date format fix

f1b7439

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

remove ambiguous LR fix — identify-only, any valid LR works

a1f98bf

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

fix moderation issue row collisions and verify all data

8560706

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

add content moderation task with real OpenAI Moderation data

b99e42b

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

add toxic/biased response issue to alignment task

c699b6f

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

replace ambiguous fixes with deterministic ones across all tasks

b08652c

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

make alignment issues subtler to challenge frontier models

96d698c

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

use real NVIDIA HelpSteer data for alignment task

4051320

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

improve alignment task: replace label swaps with real contamination

a9620ef

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

use real Stanford Alpaca data for alignment task

7479de3

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

add alignment data QA task: 12 issues in LLM instruction-tuning data

5cb467d

avanigupta Claude Opus 4.6 (1M context) commited on 17 days ago

expand datasets to include harder real-world scenarios

5d90461

avanigupta commited on 17 days ago

expand datasets

081eb22

avanigupta commited on 17 days ago

add fix stage+demo

c3002ad

avanigupta commited on 17 days ago

fixes v1: add per step reward

cd11aba

avanigupta commited on 17 days ago

init

4c1a85d

Varshith B commited on 17 days ago

Commit History

remove ambiguous moderation rows, replace with clear-cut examples fcce834 Running

replace ambiguous salary issue with date format fix f1b7439

remove ambiguous LR fix — identify-only, any valid LR works a1f98bf

fix moderation issue row collisions and verify all data 8560706

add content moderation task with real OpenAI Moderation data b99e42b

add toxic/biased response issue to alignment task c699b6f

replace ambiguous fixes with deterministic ones across all tasks b08652c

make alignment issues subtler to challenge frontier models 96d698c

use real NVIDIA HelpSteer data for alignment task 4051320

improve alignment task: replace label swaps with real contamination a9620ef

use real Stanford Alpaca data for alignment task 7479de3

add alignment data QA task: 12 issues in LLM instruction-tuning data 5cb467d

expand datasets to include harder real-world scenarios 5d90461

expand datasets 081eb22

add fix stage+demo c3002ad

fixes v1: add per step reward cd11aba

init 4c1a85d

remove ambiguous moderation rows, replace with clear-cut examples

fcce834

Running

replace ambiguous salary issue with date format fix

f1b7439

remove ambiguous LR fix — identify-only, any valid LR works

a1f98bf

fix moderation issue row collisions and verify all data

8560706

add content moderation task with real OpenAI Moderation data

b99e42b

add toxic/biased response issue to alignment task

c699b6f

replace ambiguous fixes with deterministic ones across all tasks

b08652c

make alignment issues subtler to challenge frontier models

96d698c

use real NVIDIA HelpSteer data for alignment task

4051320

improve alignment task: replace label swaps with real contamination

a9620ef

use real Stanford Alpaca data for alignment task

7479de3

add alignment data QA task: 12 issues in LLM instruction-tuning data

5cb467d

expand datasets to include harder real-world scenarios

5d90461

expand datasets

081eb22

add fix stage+demo

c3002ad

fixes v1: add per step reward

cd11aba

init

4c1a85d