Cached layer activations for steering vector experiments
Abdullah
amirali1985
AI & ML interests
Mechanistic interpretability, high dimensional geometry, persona role playing.
Recent Activity
updated a collection about 10 hours ago
Curveball Steering — Active Datasets updated a dataset about 10 hours ago
curveball-steering/conversations_sycophancy_llama3.2-1B-it_large published a dataset about 10 hours ago
curveball-steering/conversations_sycophancy_llama3.2-1B-it_large