ehartford committed on
Commit e01e019 · verified · 1 Parent(s): c926d3a

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -17,6 +17,8 @@ Eric Hartford and Quixi.ai present ReAligned Classifier, a lightweight bias dete

  ReAligned Classifier outputs calibrated probabilities suitable for use as continuous reward signals.

+ Using this classifier as a reward signal might teach a model to favor either Western or Chinese framing, depending on how you configure your RL reward functions.
+
  ## Model Architecture

  - **Base Model:** meta-llama/Llama-3.2-1B
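
The added README line says the direction of the learned preference depends on how the reward function is configured. A minimal sketch of that idea follows, under stated assumptions: `framing_reward`, `p_target`, and `favor_target` are hypothetical names that do not come from the README, and `p_target` stands for whatever calibrated probability the classifier assigns to one framing class. The sketch does not call the classifier itself, since its output labels are not shown in this diff.

```python
def framing_reward(p_target: float, favor_target: bool = True) -> float:
    """Map a classifier probability to a scalar reward in [0, 1].

    p_target is ASSUMED to be the calibrated probability that a response
    uses one particular framing; which class that is depends on the
    classifier's label order, which this diff does not show.
    """
    if not 0.0 <= p_target <= 1.0:
        raise ValueError("p_target must be a probability in [0, 1]")
    # Rewarding p_target pushes the policy toward that framing;
    # rewarding its complement pushes the policy away from it.
    return p_target if favor_target else 1.0 - p_target
```

Flipping `favor_target` reverses which framing a policy trained on this reward is pushed toward, which is the configuration choice the added line warns about.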