there are many details in one image, and the chances of some player recognizing one of those details is an instance of the birthday problem?
That would be a valid model. But you are still right that it doesn’t apply: It would give the effect that a different geoguesser would get the picture right every test, while we are seeing consistent results from the top geoguessers.