PI Zero W Audio Book

Motivation and requirements

My dad is practically blind and at 80 years has trouble hearing and operating tiny or more complicated electronics controls. Touch screens, smart phones, keyboards, and small mp3 players are completely out of the picture. I have tried using small dummy MP3 player (Sencor) with 5 buttons (prev, next, play|pause, volume up/down) as an initial assessment whether audio book player is something he would be able to control. Even though he uesd it, he was struggling with controlling it and the small player with 2-3x overloaded button controlls was too much. Also it lacked a fundamental option of remote book update. So I've decided to build custom player with following requirements:

volume control is an analog knob (ideally it turns off all the way to the left)
keep the number of buttons to minimum (spaced far apart - resilient to random touch)
allow remote content change - wifi
open content (not locked to a publisher)
does not need to be battery operated
minimal level of state indicators
sufficient output volume to drive speakers/headphones

Install

Dependencie

Use venv for managing dependencies

python3 -mvenv env
activate env with `source env/bin/activate`
pip3 install gpiozero
pip3 install python-mpd2
pip3 install google-cloud-texttospeech

knihaui.py

User pi on Raspberry PI Zero has this repo checked out under knihaui folder.
There is also folder /data on the root writable by pi user.
/etc/rc.local is modified to disable video output, set PCM volume to 100, set IO pins and set permissions on /data
We have wifi_restart.sh and related service definition to automatically ping and restart wifi.
/etc/systemd/system/knihaui.service takes care of running the UI.
Service is enabled with systemctl enable knihaui.
MPD is installed and enabled on the system running on port 6600 and using /data for media directory.
Unused or extra components are disabled. We keep avahi for name discovery.
To prolong SD card lifetime download overlayfs and use as per instructions in readme.

newsgen.py

download project certificate from google cloud to env/newsgen-credentials.json` To run:
export GOOGLE_APPLICATION_CREDENTIALS=env/newsgen-credentials.json
source env/bin/activate
Running python3 newsgen.py creates /tmp/news.mp3 if successful

Listen to Example brief in Slovak here

Automate with crontab.

V0

V0 was the set of scripts to slice larger audio books into manageable small files suitable for dumb players. This also allowed to prepend "chapter X" voice at the start of each slice.

V1

V1 is the physical build with buttons that my dad is using right now.

Build hardware using Pi zero W
PY UI that drives the buttons and controlls MPD
Test remotre upgrade capability - SSH
Add support for internet radios (SRo and Radio Litera)
Add doc of system modification of raspbian to this doc

V2

HW: Add serial port output to external connector for improved troubleshooting
HW: Replace potentiometer with rotary encoder and set master volume directly using Alsa
HW: Add rocker switch with indicator to allow turn off/on and immediate powered-on indication
OS: Serial console
SW: rotary switch volume control
SW: user request to have information about the day available as another station
OS: read-only mount mode to prolong SD card lifetime

Audio book player for senior visually impaired.

Related tags

Overview

PI Zero W Audio Book

Motivation and requirements

Install

Dependencie

knihaui.py

newsgen.py

V0

V1

V2

Schematic

Photos

Owner

Andrej Hosna

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Inner ear models for Python

Users can transcribe their favorite piano recordings to MIDI files after installation

Audio pitch-shifting & re-sampling utility, based on the EMU SP-1200

Open Sound Strip, Sequence or Record in Audacity

Bot Music Pintar. Created by Rio

Music bot of # Owner

Converting UGG files from Rode Wireless Go II transmitters (unsompressed recordings) to WAV format

Python interface to the WebRTC Voice Activity Detector

Music player and music library manager for Linux, Windows, and macOS

pedalboard is a Python library for adding effects to audio.

Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21

Python implementation of the Short Term Objective Intelligibility measure

Sound-Equalizer- This is a Sound Equalizer GUI App Using Python's PyQt5

This Bot can extract audios and subtitles from video files

❤️ Hi There Im Cozmo Music Bot A next gen powerful telegram group Music bot for get your Songs and music @Venuja_Sadew

MusicBrainz Picard

A tool for retrieving audio in the past

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jarvis From Basic to Advance - make a voice assistant similar to JARVIS (in iron man movie)