Pitfalls in Evaluating Interpretability Agents