Bartosz Cywiński
bcywinski
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored
a paper
5 days ago
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation submitted
a paper
5 days ago
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation updated
a dataset 6 days ago
bcywinski/uyghurs-censored Organizations
None yet