- Python 71.2%
- HTML 26.6%
- JavaScript 1.6%
- Dockerfile 0.6%
Give tracks a source-agnostic identity so the same song from different sources no longer replays in a loop. - Canonicalizer resolves (artist, title) to a MusicBrainz recording MBID (no API key; ~1 req/s, descriptive User-Agent, best-effort). Hits and confirmed misses are cached in SQLite; transient errors are not. - Track.key becomes mbid:<id> when resolved, else a normalized name:<artist>|<title> fallback — still source-agnostic. - Scheduler now owns the authoritative anti-repeat on the canonical key, canonicalizing the drawn track with a bounded retry; providers keep a cheap recent-locator filter to limit retries. - db: canonical_cache table, history.locator column with migration for existing databases, recent_locators(). - Canonicalization can be turned off via RADIEO_CANONICAL_ENABLED=0. Verified: MBID hit/cache/miss, cross-source key collapse, scheduler dodging a recent play, schema migration, and full stack (Navidrome + yt-dlp) with zero Python tracebacks and a valid 192 kbps MP3 stream. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> |
||
|---|---|---|
| cache | ||
| config | ||
| ingest | ||
| stream | ||
| .env.example | ||
| .gitignore | ||
| docker-compose.yml | ||
| README.md | ||
radieo
A personal music radio: an always-on HTTP audio stream, automatically fed from several sources and broadcast with Liquidsoap.
The goal is a hassle-free stream that always has something playing, where the next track is picked automatically. It is meant for personal use (a couple of simultaneous listeners), not for public broadcasting.
How it works
radieo is built as two layers, each running in its own Docker container and sharing a cache volume:
ingest(Python) — the brain. It decides what to play next, resolves and downloads tracks into a local cache, keeps a pre-filled queue, and exposes the next track over HTTP atGET /next. (currently it only serves the cache directory; the download providers come in later milestones — see roadmap)stream(Liquidsoap) — deliberately dumb. It pulls the next track from theingestdaemon, broadcasts the audio over HTTP, and never goes silent thanks to a local cache fallback.
Playback sources (planned): a Navidrome library via the OpenSubsonic API, arbitrary tracks fetched with yt-dlp (Bandcamp, SoundCloud, YouTube…), and listening suggestions from a ListenBrainz RSS feed.
Usage
Requirements: Docker with Compose v2.
# Drop some .mp3 files into the cache directory
cp /path/to/music/*.mp3 cache/
# Build and start the stream
docker compose up -d
# Listen (VLC, a browser, any audio player)
# http://localhost:8000/radio.mp3
Stop it with docker compose down.
The stream is MP3 at 192 kbps. Multiple clients can listen at the same time.
New files dropped into cache/ are picked up automatically (the playlist is
reloaded when the directory changes).
Configuration
Copy .env.example to .env and fill in your Navidrome details:
cp .env.example .env
# edit .env: RADIEO_NAVIDROME_URL / USER / PASSWORD / PLAYLIST
If the Navidrome variables are left empty, the source is simply disabled and
the stream plays whatever is already in cache/ (the milestone-1/2 behaviour).
For the yt-dlp source, list the URLs to draw from in config/urls.txt (copy
config/urls.txt.example). Each line is either a direct track URL or a
container URL (playlist, album, label, artist page) from which one track is
picked at random. The relative mix between sources is set by
RADIEO_WEIGHT_NAVIDROME / RADIEO_WEIGHT_YTDLP (a weight of 0 disables a
source); the file being absent also disables yt-dlp.
Current status
Milestone 5 — MBID canonicalizer: done.
- Two playback sources feed a weighted scheduler: a Navidrome/OpenSubsonic
playlist and a hand-maintained list of yt-dlp URLs (
config/urls.txt). Container URLs (playlist/album/label/artist) are expanded and one track is drawn at random. - Each track is canonicalized to a MusicBrainz recording MBID (no API key
needed; ~1 req/s, best-effort, results cached in SQLite). This gives a
source-agnostic identity, so the same song from two sources collapses to one;
when no confident match is found it falls back to a normalized
(artist, title)key. The scheduler uses this canonical key for anti-repeat, with the providers applying a cheap locator filter first. - Each source has its own fetcher (Subsonic stream / yt-dlp download); files are cached ahead of playback (prefetch buffer) and decoded by Liquidsoap.
- Play history and LRU retention are tracked in a SQLite database under
state/: only the N most recently played files are kept on disk (RADIEO_RETENTION_KEEP, default 20). Orphaned download temp files are swept on startup. GET /nextreturns the next track as an annotated Liquidsoap URI with real title/artist metadata (or an empty body when nothing is ready).stream(Liquidsoap v2.4.5) pulls viarequest.dynamicand falls back to the localcache/directory;mksafeguarantees silence rather than a crash.- HTTP stream served at
http://localhost:8000/radio.mp3(MP3, 192 kbps), multiple simultaneous listeners supported.
The ListenBrainz suggestion feed comes next. (Known cosmetic quirk: at startup
the fallback logs a few harmless ffmpeg "Invalid data" warnings while probing
non-audio files such as .gitkeep; to be quieted in the polish milestone.)
Roadmap
- ✅ Broadcasting skeleton — Liquidsoap serving the cache directory.
- ✅ Ingestion daemon — Python daemon exposing
GET /next; Liquidsoap switches to arequest.dynamicsource with the cache as fallback. - ✅ Navidrome provider — play from an OpenSubsonic playlist, with caching, LRU retention and play history.
- ✅ yt-dlp provider — fetch tracks from a maintained URL/artist list; weighted mixing between sources.
- ✅ Canonicalizer — MusicBrainz MBID lookup for source-agnostic de-duplication.
- ListenBrainz provider — parse the RSS suggestions feed and resolve each one to Navidrome or yt-dlp.
- Polish — crossfade, robustness, optional web player, config file.