No description

Python 71.2%
HTML 26.6%
JavaScript 1.6%
Dockerfile 0.6%

Find a file

Pierre-Olivier Mercier 7e0f08b863 Milestone 5: MusicBrainz MBID canonicalizer Give tracks a source-agnostic identity so the same song from different sources no longer replays in a loop. - Canonicalizer resolves (artist, title) to a MusicBrainz recording MBID (no API key; ~1 req/s, descriptive User-Agent, best-effort). Hits and confirmed misses are cached in SQLite; transient errors are not. - Track.key becomes mbid:<id> when resolved, else a normalized name:<artist>\|<title> fallback — still source-agnostic. - Scheduler now owns the authoritative anti-repeat on the canonical key, canonicalizing the drawn track with a bounded retry; providers keep a cheap recent-locator filter to limit retries. - db: canonical_cache table, history.locator column with migration for existing databases, recent_locators(). - Canonicalization can be turned off via RADIEO_CANONICAL_ENABLED=0. Verified: MBID hit/cache/miss, cross-source key collapse, scheduler dodging a recent play, schema migration, and full stack (Navidrome + yt-dlp) with zero Python tracebacks and a valid 192 kbps MP3 stream. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>		2026-07-02 18:46:30 +08:00
cache	Milestone 1: Liquidsoap broadcasting skeleton	2026-07-02 16:14:14 +08:00
config	Milestone 4: yt-dlp provider and weighted source scheduler	2026-07-02 17:58:24 +08:00
ingest	Milestone 5: MusicBrainz MBID canonicalizer	2026-07-02 18:46:30 +08:00
stream	Add smooth crossfade transition between tracks	2026-07-02 18:28:57 +08:00
.env.example	Milestone 5: MusicBrainz MBID canonicalizer	2026-07-02 18:46:30 +08:00
.gitignore	Milestone 4: yt-dlp provider and weighted source scheduler	2026-07-02 17:58:24 +08:00
docker-compose.yml	Milestone 5: MusicBrainz MBID canonicalizer	2026-07-02 18:46:30 +08:00
README.md	Milestone 5: MusicBrainz MBID canonicalizer	2026-07-02 18:46:30 +08:00

README.md

radieo

A personal music radio: an always-on HTTP audio stream, automatically fed from several sources and broadcast with Liquidsoap.

The goal is a hassle-free stream that always has something playing, where the next track is picked automatically. It is meant for personal use (a couple of simultaneous listeners), not for public broadcasting.

How it works

radieo is built as two layers, each running in its own Docker container and sharing a cache volume:

ingest (Python) — the brain. It decides what to play next, resolves and downloads tracks into a local cache, keeps a pre-filled queue, and exposes the next track over HTTP at GET /next. (currently it only serves the cache directory; the download providers come in later milestones — see roadmap)
stream (Liquidsoap) — deliberately dumb. It pulls the next track from the ingest daemon, broadcasts the audio over HTTP, and never goes silent thanks to a local cache fallback.

Playback sources (planned): a Navidrome library via the OpenSubsonic API, arbitrary tracks fetched with yt-dlp (Bandcamp, SoundCloud, YouTube…), and listening suggestions from a ListenBrainz RSS feed.

Usage

Requirements: Docker with Compose v2.

# Drop some .mp3 files into the cache directory
cp /path/to/music/*.mp3 cache/

# Build and start the stream
docker compose up -d

# Listen (VLC, a browser, any audio player)
#   http://localhost:8000/radio.mp3

Stop it with docker compose down.

The stream is MP3 at 192 kbps. Multiple clients can listen at the same time. New files dropped into cache/ are picked up automatically (the playlist is reloaded when the directory changes).

Configuration

Copy .env.example to .env and fill in your Navidrome details:

cp .env.example .env
# edit .env: RADIEO_NAVIDROME_URL / USER / PASSWORD / PLAYLIST

If the Navidrome variables are left empty, the source is simply disabled and the stream plays whatever is already in cache/ (the milestone-1/2 behaviour).

For the yt-dlp source, list the URLs to draw from in config/urls.txt (copy config/urls.txt.example). Each line is either a direct track URL or a container URL (playlist, album, label, artist page) from which one track is picked at random. The relative mix between sources is set by RADIEO_WEIGHT_NAVIDROME / RADIEO_WEIGHT_YTDLP (a weight of 0 disables a source); the file being absent also disables yt-dlp.

Current status

Milestone 5 — MBID canonicalizer: done.

Two playback sources feed a weighted scheduler: a Navidrome/OpenSubsonic playlist and a hand-maintained list of yt-dlp URLs (config/urls.txt). Container URLs (playlist/album/label/artist) are expanded and one track is drawn at random.
Each track is canonicalized to a MusicBrainz recording MBID (no API key needed; ~1 req/s, best-effort, results cached in SQLite). This gives a source-agnostic identity, so the same song from two sources collapses to one; when no confident match is found it falls back to a normalized (artist, title) key. The scheduler uses this canonical key for anti-repeat, with the providers applying a cheap locator filter first.
Each source has its own fetcher (Subsonic stream / yt-dlp download); files are cached ahead of playback (prefetch buffer) and decoded by Liquidsoap.
Play history and LRU retention are tracked in a SQLite database under state/: only the N most recently played files are kept on disk (RADIEO_RETENTION_KEEP, default 20). Orphaned download temp files are swept on startup.
GET /next returns the next track as an annotated Liquidsoap URI with real title/artist metadata (or an empty body when nothing is ready).
stream (Liquidsoap v2.4.5) pulls via request.dynamic and falls back to the local cache/ directory; mksafe guarantees silence rather than a crash.
HTTP stream served at http://localhost:8000/radio.mp3 (MP3, 192 kbps), multiple simultaneous listeners supported.

The ListenBrainz suggestion feed comes next. (Known cosmetic quirk: at startup the fallback logs a few harmless ffmpeg "Invalid data" warnings while probing non-audio files such as .gitkeep; to be quieted in the polish milestone.)

Roadmap

✅ Broadcasting skeleton — Liquidsoap serving the cache directory.
✅ Ingestion daemon — Python daemon exposing GET /next; Liquidsoap switches to a request.dynamic source with the cache as fallback.
✅ Navidrome provider — play from an OpenSubsonic playlist, with caching, LRU retention and play history.
✅ yt-dlp provider — fetch tracks from a maintained URL/artist list; weighted mixing between sources.
✅ Canonicalizer — MusicBrainz MBID lookup for source-agnostic de-duplication.
ListenBrainz provider — parse the RSS suggestions feed and resolve each one to Navidrome or yt-dlp.
Polish — crossfade, robustness, optional web player, config file.