Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Lukas Galke Poech's picture
11 23

Lukas Galke Poech

lgalke
EvilScript's profile picture giannor's profile picture namazifard's profile picture
·
https://lgalke.github.io
  • LukasGalke
  • lgalke
  • lukas-galke-8086b0155
  • lukasgalke.bsky.social

AI & ML interests

LLM interpretability, agentic/multi-agent safety

Recent Activity

authored a paper 3 days ago
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
upvoted a paper 4 days ago
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
authored a paper 8 days ago
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling
View all activity

Organizations

Danish Foundation Models's profile picture MLX Community's profile picture filter with espresso's profile picture RUNE Lab's profile picture Schneider-Kamp Lab's profile picture Machine Ecology Lab's profile picture Inversion Lab for AI Safety's profile picture AI Safety & Interpretability Lab's profile picture
lgalke 's papers 20
arxiv:2606.10747
arxiv:2606.09707
arxiv:2606.06286
arxiv:2605.31170
arxiv:2605.26045
arxiv:2605.07462
arxiv:2512.07407
arxiv:2603.12117
arxiv:2602.08818
arxiv:2512.04799
arxiv:2508.02271
arxiv:2505.14524
arxiv:2502.11895
arxiv:2502.06728
arxiv:2412.08528
arxiv:2411.05882
arxiv:2311.09707
arxiv:2302.12239
arxiv:2204.03954
arxiv:1902.06423
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs