Mateusz Dziemian

mattmdjaga

AI & ML interests

Interested in AI safety.

Recent Activity

authored a paper 6 days ago

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

updated a dataset 11 days ago

sureheremarv/ipi_arena_attacks

published a dataset 11 days ago

sureheremarv/ipi_arena_attacks

authored a paper 6 days ago

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

Paper • 2603.15714 • Published 13 days ago

authored 2 papers 7 months ago

Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition

Paper • 2507.20526 • Published Jul 28, 2025 • 1

Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems

Paper • 2504.07831 • Published Apr 10, 2025

authored 2 papers over 1 year ago

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Paper • 2410.09024 • Published Oct 11, 2024 • 1

Applying Refusal-Vector Ablation to Llama 3.1 70B Agents

Paper • 2410.10871 • Published Oct 8, 2024 • 1