Steve Wu PRO
wangzhang
AI & ML interests
Neural Network Interpretability, Refusal Direction Analysis, LLM Safety Mechanisms, Model Abliteration Techniques, Activation Engineering, AI Alignment Research, Mixture-of-Experts Architectures, Transformer Optimization
Recent Activity
liked a model 2 days ago
wangzhang/Devstral-Small-2-24B-Instruct-abliterated
new activity 3 days ago
wangzhang/Qwen3.5-122B-A10B-abliterated-GGUF: Prometheus open source?
updated a dataset 3 days ago
wangzhang/prometheus-datasets
Organizations
None yet