Update README.md
Browse files
README.md
CHANGED
|
@@ -17,6 +17,8 @@ Eric Hartford and Quixi.ai present ReAligned Classifier, a lightweight bias dete
|
|
| 17 |
|
| 18 |
ReAligned Classifier outputs calibrated probabilities suitable for use as continuous reward signals.
|
| 19 |
|
|
|
|
|
|
|
| 20 |
## Model Architecture
|
| 21 |
|
| 22 |
- **Base Model:** meta-llama/Llama-3.2-1B
|
|
|
|
| 17 |
|
| 18 |
ReAligned Classifier outputs calibrated probabilities suitable for use as continuous reward signals.
|
| 19 |
|
| 20 |
+
Using this classifier as a reward signal might teach a model to favor either Western or Chinese framing, depending on how you configure your RL reward functions.
|
| 21 |
+
|
| 22 |
## Model Architecture
|
| 23 |
|
| 24 |
- **Base Model:** meta-llama/Llama-3.2-1B
|