Servant, Stalker, Predator: How An Honest, Helpful, And Harmless (3H) Agent Unlocks Adversarial Skills Paper • 2508.19500 • Published Aug 27 • 2
Favicon Trojans: Executable Steganography Via Ico Alpha Channel Exploitation Paper • 2507.09074 • Published Jul 11 • 6 • 5
Can AI Freelancers Compete? Benchmarking Earnings, Reliability, and Task Success at Scale Paper • 2505.13511 • Published May 16 • 1 • 2
AI-Invented Tonal Languages: Preventing a Machine Lingua Franca Beyond Human Understanding Paper • 2503.01063 • Published Mar 2 • 5
Beyond No: Quantifying AI Over-Refusal and Emotional Attachment Boundaries Paper • 2502.14975 • Published Feb 20 • 3
Forbidden Science: Dual-Use AI Challenge Benchmark and Scientific Refusal Tests Paper • 2502.06867 • Published Feb 8 • 1
Language Models And A Second Opinion Use Case: The Pocket Professional Paper • 2410.20636 • Published Oct 27, 2024 • 2