The Sound Description Interchange Format (SDIF) is an established standard for the interchange of sound descriptions and analysis data. This project provides libraries, SDIF (in C) and Easdif (in C++), tools, and wrappers to read and write SDIF files.
pedalboard is a Python library for working with audio: reading, writing, rendering, adding effects, and more. It supports most popular audio file formats and a number of common audio effects out of the box, and also allows the use of VST3® and Audio Unit formats for loading third-party software instruments and effects.
The AFsp package is a library of routines for reading and writing audio files. The emphasis is on providing support for the type of audio file used by the speech processing research community. The routines have been designed to be easy to use, yet provide transparent support the reading of several audio file formats. A secondary purpose for distributing these routines is to encourage the use of a standard audio file format for the header information in the output files.
The following file formats are supported for reading.
NIST SPHERE audio files
Sun/NeXT audio files
DEC audio files
IRCAM SoundFiles
INRS-Telecom audio files
ESPS sampled data feature files
Headerless audio files
The audio file open routine automatically senses the file type and communicates it to the audio file reading routines. Formats are converted on the fly as the file is read, so the user manipulates floats and doesn’t need to worry about the underlying data format.
For writing, the routines produce a standard format file, though options are available to produce headerless files if desired. This standard format is a compatible with the Sun audio file format. There is provision for storing extra information in the extensible part of the header.
Several audio file utilities (for copying, comparing, and filtering audio files) are included in the package.
aubio is a tool designed for the extraction of annotations from audio signals. Its features include segmenting a sound file before each of its attacks, performing pitch detection, tapping the beat and producing midi streams from live audio.
Because these tasks are difficult, we thought it was important to gather them in a dedicated library. To increase the fun, we have made these algorithms work in a causal way, so as to be used in real time applications with as low delay as possible. Functions can be used offline in sound editors and software samplers, or online in audio effects and virtual instruments.
This library can read SF2 SoundFont files and render audio samples from them in real-time. It properly reads in a compliant SF2 file and can be used to obtain meta data such as preset names. It also has an audio rendering engine that can generate audio samples for key events that come from (say) a MIDI keyboard. This library is currently being used by my SoundFonts and SoundFontsPlus applications for SF2 file parsing and, in the latter app, as the sample generating engine.
Although most of the library code is generic C++17/23, there are a few bits that expect an Apple platform that has the AudioToolbox and Accelerate frameworks available. The goal is to be a simple library for reading SF2 files as well as a competent SF2 audio renderer whose output can be fed to any sort of audio processing chain, but it would probably take some effort to remove it from the Apple ecosystem.
Discord has many unwanted rate limits, especially in the audio area. This package does all the work and ensures that your music never stops playing due to ffmpeg or Discord, with as little effort as possible.
BassBoom is a music player made with C# using the fast mpg123 library as the native backend that handles the music playback and song information, including the playback device information.
This library is a viable library aimed for cross-platform music playing because we’ve selected mpg123 as the MP3 backend library for its ease of use and for its fast music playback. This library is frictionless as it aims for stability and cross-platform compatibility.
In addition to your regular music files, BassBoom also supports online MPEG radio stations that you can use to play your own favorite radio stations, as long as they don’t use AAC or any other codec that BassBoom doesn’t support.
Zoog is a Rust library that consists of functionality that can be used to determine the loudness of an Ogg Opus file and also to rewrite that file with new internal gain information as well as loudness-related comment tags. It also has functionality for purely manipulating comment tags of both Ogg Opus and Ogg Vorbis files.
Zoog currently contains two tools, opusgain and zoogcomment. opusgain can be used to:
set the output gain value located in the Opus binary header inside Opus files so that the file plays at the loudness of the original encoded audio, or of that consistent with the ReplayGain or EBU R 128 standards.
write the Opus comment tags used by some music players to decide what volume to play an Opus-encoded audio file at.
It is intended to solve the “Opus plays too quietly” problem.