Filippo Tonini
filo362
AI & ML interests
LLM safety in multi-agent environments
Recent Activity
upvoted a paper about 24 hours ago
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals
Organizations
None yet