Braintrust adds agent-run diffing and eval replays · BusellAI