This is an automated archive made by the Lemmit Bot.
The original was posted on /r/selfhosted by /u/MysteriousSophon on 2025-11-25 23:36:48+00:00.
Been running this for a few months for my own use, finally cleaned it up for open source.
Usually gets rid of 12-16minutes of audio ads from an hour long podcast episode
What it does:
Takes any podcast RSS feed, processes episodes to remove ads, and serves a modified feed you can add to your podcast app.
How it works:
- Whisper transcribes the audio
- Claude detects ad segments from the transcript
- FFmpeg cuts them out and stitches the audio back together
- Flask serves the modified RSS feed
Processing happens when you first play an episode (or your podcast app tries to download the episode). Usually takes a minute to run through the complete flow before service the podcast file.
Requirements:
Docker with NVIDIA GPU (for Whisper)
Anthropic API key
Limitations:
Not perfect - sometimes misses ads or clips content
First play of each episode has a delay
GPU required for reasonable transcription speed
GitHub: https://github.com/hemant6488/podcast-server
PRs welcome.