This environment simulates database reliability engineering workflows where an agent must debug, optimize, and submit a fixed SQL query against live SQLite databases across three task levels.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results