Can open-source AI agents escape their sandbox containers? (initial results)
Last June, I presented my capstone project at the Technical Alignment Research Accelerator (TARA) Demo Day. The whole thing started with a paper from the UK AI Security Institute called Quantifying Frontier LLM Capabilities for Container Sandbox Escape. From this paper, they introduced SandboxEscapeBench, a benchmark to test whether AI