arxiv:2510.09462
Alexander Panfilov
kotekjedi
AI & ML interests
None yet
Recent Activity
updated a dataset 4 days ago
honeypot-redteam/strategic_lies authored a paper 8 months ago
Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols