DDSP: Differentiable Digital Signal Processing

Overview

Differentiable Digital Signal Procressing (DDSP) enables direct integration of classic signal processing elements with end-to-end learning, utilizing strong inductive biases without sacrificing the expressive power of neural networks. This approach enables high-fidelity audio synthesis without the need for large autoregressive models or adversarial losses, and permits interpretable manipulation of each separate model component. In all figures below, linear-frequency log-magnitude spectrograms are used to visualize the audio, which is synthesized with a sample rate of 16kHz.

https://gyazo.com/9fe07cdc21e632bf9401fa6764e85875

Links:

https://magenta.tensorflow.org/ddsp

Paper:

https://arxiv.org/abs/2001.04643

Sound Samples:

https://storage.googleapis.com/ddsp/index.html

What you can do:

- Timber Transfer (cello -> violin, vocal -> violin)

- Timber Interpolation

- Accoustic environment transfer (copy the reverbration of Suntory Hall to my room ambience)

- De-reverbration