Research into RLAIF (Reinforcement Learning from AI feedback) with the goal of Constitutional AI and Sycophancy Resistance.
TitleOS PRO
TitleOS
AI & ML interests
I break the Xbox One/Series. Featured on OSGWiki. Former Xbox MVP. Previously InfoSec at Apple, then SRE at DreamBox Learning, now looking for a new opportunity. Artificial Intelligence LLM enthusiast, wannabe expert. They/Them. 🏳️🌈
Recent Activity
liked a model 42 minutes ago
nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF upvoted a collection 42 minutes ago
NVIDIA Nemotron v3 liked a model 42 minutes ago
nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16