How to use ProbLLMs/DPO_Checkpoint with PEFT:
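from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base model that the adapter was trained on.
base_model = AutoModelForCausalLM.from_pretrained("deepseek-ai/deepseek-math-7b-rl")
# Attach the ProbLLMs/DPO_Checkpoint adapter weights on top of the base model.
model = PeftModel.from_pretrained(base_model, "ProbLLMs/DPO_Checkpoint")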
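
Once the adapter is attached, the model can be used for text generation like any other causal language model. The following is a minimal sketch, not part of the original card. It assumes that the tokenizer can be loaded from the base model repository and that plain-text prompts are acceptable. The helper names `load_dpo_model` and `generate_answer` and the default `max_new_tokens=256` are hypothetical choices made for illustration.

```python
def load_dpo_model(
    base_id="deepseek-ai/deepseek-math-7b-rl",
    adapter_id="ProbLLMs/DPO_Checkpoint",
):
    """Load the base model, attach the PEFT adapter, and return (model, tokenizer)."""
    # Heavy imports are kept local so that defining this helper does not
    # require peft/transformers to be importable.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the tokenizer is taken from the base model repository.
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()  # inference mode: disables dropout
    return model, tokenizer


def generate_answer(model, tokenizer, prompt, max_new_tokens=256):
    """Generate a continuation for `prompt` and return only the new text."""
    # Tokenize and move the tensors to the device the model lives on.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # The output starts with the prompt tokens; keep only what follows them.
    prompt_length = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0][prompt_length:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

To try it, call `model, tokenizer = load_dpo_model()` and then `generate_answer(model, tokenizer, "What is 12 * 7?")`. The first call downloads the 7B base model weights, so it needs sufficient disk space and memory.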