Python > Scientific Audio

Scientific research in audio/music.

Collection 1.7k stars GitHub

Audio Related Packages

Feature extraction
Read-Write
Transformations - General DSP
Perceptual Models - Auditory Models
Data augmentation
Speech Processing
Environmental Sounds
Source Separation
Music Information Retrieval
Deep Learning
Symbolic Music - MIDI - Musicology
Realtime applications
Web Audio
Audio Dataset and Dataloaders

Tutorials

Books

Feature extraction

audiolazy 712 updated 4y ago

Realtime Audio Processing lib, general purpose.

aubio 3.7k updated 7mo ago

Feature extractor, written in C, Python interface.

audioFlux 3.3k updated 4mo ago

A library for audio and music analysis, feature extraction.

essentia 3.5k updated 4mo ago

Music-related low-level and high-level feature extractor, C++ based, includes Python bindings.

python_speech_features 2.4k updated 4y ago

Common speech features for ASR.

pyYAAFE 248 updated 5y ago

Python bindings for YAAFE feature extractor.

speechpy 886 updated 1y ago

Library for Speech Processing and Recognition, mostly feature extraction for now.

spafe 480 updated 1y ago

Python library for feature extraction from audio files.

Read-Write

audioread 537 updated 2mo ago

Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.

mutagen 1.9k updated 4mo ago

Reads and writes all kinds of audio metadata for various formats.

pyAV 3.1k updated 3mo ago

PyAV is a Pythonic binding for FFmpeg or Libav.

(Py)Soundfile 14 updated 6y ago

Library based on libsndfile, CFFI, and NumPy.

pySox 538 updated 1y ago

Wrapper for sox.

stempeg 105 updated 8mo ago

Read/write of STEMS multistream audio.

tinytag 806 updated 3mo ago

Reads music metadata of MP3, OGG, FLAC and Wave files.

Transformations - General DSP

acoustics 556 (archived)

Useful tools for acousticians.

AudioTK 252 (archived)

DSP filter toolbox (lots of filters).

AudioTSM 90 updated 8y ago

Real-time audio time-scale modification procedures.

Gammatone 228 (archived)

Gammatone filterbank implementation.

pyFFTW 412 updated 7mo ago

Wrapper for FFTW(3).

NSGT 105 updated 2y ago

Non-stationary Gabor transform, constant-Q.

matchering 2.5k updated 3mo ago

Automated reference audio mastering.

MDCT 54 updated 4y ago

MDCT transform.

pydub 9.7k updated 3mo ago

Manipulate audio with a simple and easy high-level interface.

pytftb 280 updated 1y ago

Implementation of the MATLAB Time-Frequency Toolbox.

pyroomacoustics 1.8k updated 3mo ago

Room Acoustics Simulation (RIR generator)

PyRubberband 215 updated 1y ago

Wrapper for rubberband to do pitch-shifting and time-stretching.

PyWavelets 2.4k updated 4mo ago

Discrete Wavelet Transform in Python.

Resampy 280 updated 1y ago

Sample rate conversion.

sound_field_analysis 106 updated 3y ago

Analyze, visualize and process sound field data recorded by spherical microphone arrays.

STFT 48 updated 1y ago

Standalone package for Short-Time Fourier Transform.
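For readers unfamiliar with the transform that several packages in this section provide, the following is a minimal sketch of a Short-Time Fourier Transform using only the Python standard library. It is an illustration under simplifying assumptions (a naive O(N^2) DFT, a Hann window, no padding), not the implementation used by the STFT package or any other library listed here. The function name `stft` and the parameters `frame_len` and `hop` are chosen for this example.

```python
import cmath
import math


def stft(signal, frame_len=8, hop=4):
    """Return a list of frames; each frame holds complex DFT bins 0..frame_len//2."""
    # Periodic Hann window to reduce spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    # Slide over the signal; trailing samples that do not fill a frame are dropped.
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        # Naive DFT of the windowed frame (non-negative frequencies only).
        bins = [
            sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len))
            for k in range(frame_len // 2 + 1)
        ]
        frames.append(bins)
    return frames


# Usage: a cosine with exactly 2 cycles per 8-sample frame.
tone = [math.cos(2 * math.pi * 2 * n / 8) for n in range(32)]
spectrogram = stft(tone)
peak_bin = max(range(len(spectrogram[0])), key=lambda k: abs(spectrogram[0][k]))
```

In practice an FFT-backed library would replace the inner DFT loop; the sketch only shows the framing, windowing and per-frame transform steps.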

Perceptual Models - Auditory Models

Sound Field Synthesis Toolbox 73 updated 6mo ago

Sound Field Synthesis Toolbox.

cochlea 119 updated 2y ago

Inner ear models.

Brian2 1.1k updated 3mo ago

Spiking neural networks simulator, includes cochlea model.

Loudness 40 updated 7y ago

Perceived loudness, includes Zwicker, Moore/Glasberg model.

pyloudnorm 763 updated 6mo ago

Audio loudness meter and normalization, implements ITU-R BS.1770-4.

Data augmentation

audiomentations 2.2k updated 6mo ago

Audio Data Augmentation.

muda 237 updated 5y ago

Musical Data Augmentation.

pydiogment 85 updated 3y ago

Audio Data Augmentation.

Speech Processing

aeneas 2.8k updated 2y ago

Forced aligner, based on MFCC+DTW, 35+ languages.

deepspeech 26.7k (archived)

Pretrained automatic speech recognition.

gentle 1.7k updated 1y ago

Forced-aligner built on Kaldi.

Parselmouth 1.2k updated 3mo ago

Python interface to the Praat phonetics and speech analysis, synthesis, and manipulation software.

persephone 159 updated 3y ago

Automatic phoneme transcription tool.

pyannote.audio 9.4k updated 3mo ago

Neural building blocks for speaker diarization.

pyAudioAnalysis 6.2k updated 11mo ago

Feature Extraction, Classification, Diarization.

py-webrtcvad 2.5k updated 2y ago

Interface to the WebRTC Voice Activity Detector.

pypesq 409 updated 11mo ago

Wrapper for the PESQ score calculation.

pystoi 360 updated 2y ago

Short Term Objective Intelligibility measure (STOI).

PyWorldVocoder 780 updated 1y ago

Wrapper for Morise's World Vocoder.

Montreal Forced Aligner 1.8k updated 4mo ago

Forced aligner, based on Kaldi (HMM), English (others can be trained).

SpeechRecognition 9.0k updated 3mo ago

Wrapper for several ASR engines and APIs, online and offline.
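The aeneas entry above mentions alignment based on MFCC+DTW. As a rough illustration of the dynamic time warping (DTW) part, here is a minimal sketch in pure Python. It assumes scalar sequences and an absolute-difference cost; a real aligner would compare MFCC feature vectors per frame and recover the warping path, so this is not how aeneas itself is implemented. The function name `dtw_distance` is hypothetical.

```python
from math import inf


def dtw_distance(a, b):
    """Return the minimal cumulative alignment cost between two 1-D sequences."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Allowed steps: advance in a, advance in b, or advance in both.
            cost[i][j] = local + min(cost[i - 1][j],
                                     cost[i][j - 1],
                                     cost[i - 1][j - 1])
    return cost[n][m]


# Usage: the second sequence is the first with one element held longer,
# so the warping absorbs the difference and the cost is zero.
same_shape = dtw_distance([1, 2, 3], [1, 2, 2, 3])
offset = dtw_distance([0, 0], [1, 1])
```

The table-filling approach costs O(n*m) time and memory, which is why practical aligners usually constrain the search to a band around the diagonal.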

Environmental Sounds

sed_eval 158 updated 2y ago

Evaluation toolbox for Sound Event Detection

Source Separation

commonfate 17 updated 6y ago

Common Fate Model and Transform.

NTFLib 48 updated 1y ago

Sparse Beta-Divergence Tensor Factorization.

NUSSL 644 (archived)

Holistic source separation framework including DSP methods and deep learning methods.

NIMFA 558 updated 5y ago

Several flavors of non-negative-matrix factorization.

Music Information Retrieval

Catchy 22 updated 9y ago

Corpus Analysis Tools for Computational Hook Discovery.

chord-detection 143 updated 3y ago

Algorithms for chord detection and key estimation.

Madmom 1.6k updated 3mo ago

MIR package with a strong focus on beat detection, onset detection and chord recognition.

mir_eval 693 updated 4mo ago

Common scores for various MIR tasks. Also includes bss_eval implementation.

msaf 548 updated 4mo ago

Music Structure Analysis Framework.

librosa 8.3k updated 3mo ago

General audio and music analysis.

Deep Learning

Kapre 946 updated 8mo ago

Keras Audio Preprocessors

TorchAudio 2.8k updated 3mo ago

PyTorch Audio Loaders

nnAudio 1.1k updated 7mo ago

Accelerated audio processing using 1D convolution networks in PyTorch.

Symbolic Music - MIDI - Musicology

Music21 2.4k updated 4mo ago

Toolkit for Computer-Aided Musicology.

Mido 1.6k updated 4mo ago

Realtime MIDI wrapper.

mingus 927 updated 2y ago

Advanced music theory and notation package with MIDI file and playback support.

Pretty-MIDI 1.0k updated 4mo ago

Utility functions for handling MIDI data in a nice/intuitive way.
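A common small task when working with MIDI data is converting between MIDI note numbers and frequencies. The sketch below uses the standard equal-temperament relation with A4 (note 69) tuned to 440 Hz. The function names `midi_to_hz` and `hz_to_midi` and the `a4_hz` parameter are chosen for this example and are not taken from any of the packages listed above.

```python
import math


def midi_to_hz(note, a4_hz=440.0):
    """Convert a MIDI note number to frequency in Hz (12-tone equal temperament)."""
    return a4_hz * 2.0 ** ((note - 69) / 12.0)


def hz_to_midi(freq_hz, a4_hz=440.0):
    """Convert a frequency in Hz to a (possibly fractional) MIDI note number."""
    return 69 + 12 * math.log2(freq_hz / a4_hz)


# Usage: A4 is note 69, and one octave up (note 81) doubles the frequency.
a4 = midi_to_hz(69)
a5 = midi_to_hz(81)
middle_c = midi_to_hz(60)
```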

Realtime applications

Jupylet 249 updated 2y ago

Subtractive, additive, FM, and sample-based sound synthesis.

PYO 1.4k updated 10mo ago

Realtime audio DSP engine.

python-sounddevice 1.2k updated 4mo ago

PortAudio wrapper providing realtime audio I/O with NumPy.

ReTiSAR 79 updated 2y ago

Binaural rendering of streamed or IR-based high-order spherical microphone array signals.

Web Audio

TimeSide (Beta) 394 updated 1y ago

High-level audio analysis, imaging, transcoding, streaming and labelling.

Audio Dataset and Dataloaders

beets 14.9k updated 3mo ago

Music library manager and MusicBrainz tagger.

musdb 194 updated 1y ago

Parse and process the MUSDB18 dataset.

medleydb 209 updated 2y ago

Parse medleydb audio + annotations.

Soundcloud API 111 updated 8mo ago

Wrapper for Soundcloud API.

Youtube-Downloader 139.9k updated 4mo ago

Download YouTube videos (and the audio).

audiomate 138 updated 3y ago

Loading different types of audio datasets.

mirdata 400 updated 5mo ago

Common loaders for Music Information Retrieval (MIR) datasets.

Tutorials

Whirlwind Tour Of Python 4.0k updated 2y ago

Fast-paced introduction to Python essentials, aimed at researchers and developers.

Introduction to Numpy and Scipy 3.2k updated 4mo ago

Highly recommended tutorial, covers large parts of the scientific Python ecosystem.

MIR Notebooks 1.3k updated 4mo ago

Collection of instructional IPython notebooks for music information retrieval (MIR).

Selected Topics in Audio Signal Processing 69 updated 4y ago

Exercises as IPython notebooks.

Live-coding a music synthesizer 18 updated 4y ago

Live-coding video showing how to use the SoundDevice library to reproduce realistic sounds.

Books

Python Data Science Handbook 47.1k updated 2y ago

By Jake VanderPlas. Excellent book with accompanying tutorial notebooks.
