Willi Ballenthin
tags
#benchmark
bookmarks
AgentRE-Bench — LLM Reverse Engineering Benchmark
February 14, 2026