Thanks; I didn't spot that they disabled tools in the harness. Also they don't provide an "out" to allow the models to express uncertainty so the instructions force a guess to be made.
As an aside though it's still funny that the two tools WITH search also disagreed.
As an aside though it's still funny that the two tools WITH search also disagreed.