Show HN: FluidAudio – Swift Speaker Diarization on CoreML

Show HN (score: 5)
Found: July 13, 2025
ID: 283

Description

Other
Show HN: FluidAudio – Swift Speaker Diarization on CoreML We needed a speaker diarization solution that could run every few seconds alongside transcription on iOS and macOS. But native Swift support was either limited or locked behind paid licenses. Since diarization is a common need in speech-to-text workflows, we decided to open source our work and give back to the community.

We initially tried sherpa-onnx, which works, but running both diarization and transcription models slowed down older devices. CPU-only inference just isn’t ideal for near real-time workloads, so we wanted the option to offload segmentation and speaker embedding to the GPU or ANE. Supporting M1 Macs in particular meant pushing more of the workload to the ANE.

Instead of shoehorning the ONNX model into CoreML with C++, we converted the original PyTorch models directly to CoreML. This approach required some monkey-patching in the PyTorch and pyannote code, but the initial benchmarks look promising.

We’d love feedback! We're currently working on adding VAD and integrating Parakeet for transcription, but still wrestling with CoreML model conversion.

More from Show

Show HN: Pi-hosts – Give the Pi coding agent access to your servers

Show HN: Pi-hosts – Give the Pi coding agent access to your servers I built that initially for an AI chat bot that allows teams to perform DevOps tasks straight out of Slack&#x2F;Teams (with proper permission control, obviously).<p>Useful to let developers perform mundane tasks, or help coordinate incident response.<p>I ended up using it myself on my own machine to manage Hetzner and AWS boxes. I thought that may be useful to others.<p>The default policy should be good enough, but if you don&#x27;t like living dangerously, you can set it to `paranoid`.<p>Install: `pi install npm:pi-hosts`<p>GitHub: <a href="https:&#x2F;&#x2F;github.com&#x2F;hunvreus&#x2F;pi-hosts" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;hunvreus&#x2F;pi-hosts</a>

Show HN: 49Agents – 2D Canvas IDE for Orchestrating Agents, Repos, Issues

Show HN: 49Agents – 2D Canvas IDE for Orchestrating Agents, Repos, Issues Beads tables (Steve Yegge&#x27;s) for issue tracking. Can view git trees, terminals, issue tables, notes, and files all on one screen. Can connect multiple machines via private network (like tailscale)

Show HN: ClusterdOS – Kubernetes without the platform team

Show HN: ClusterdOS – Kubernetes without the platform team

No other tools from this source yet.