Tacotron 2 - Art Research by Daito Manabe

Tacotron 2

Source: https://github.com/NVIDIA/tacotron2

Samples: https://google.github.io/tacotron/publications/tacotron2/index.html

Tacotron 2 is a neural network architecture for speech synthesis directly from text.
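As a rough illustration of what "directly from text" means: the model consumes a sequence of character IDs rather than hand-engineered linguistic features, predicts a mel spectrogram from it, and a separate vocoder turns that spectrogram into audio. The sketch below covers only the first step, the text-to-ID encoding. The symbol table and the `encode_text` function are hypothetical names made up for this note, not the API of the NVIDIA repository, which defines its own symbol set and text cleaners.

```python
from __future__ import annotations

# Hypothetical symbol table for illustration only; a real implementation
# defines its own symbols (padding, punctuation, letters, phonemes, ...).
SYMBOLS = ["_", " ", "!", "'", ",", ".", "?"] + list("abcdefghijklmnopqrstuvwxyz")
SYMBOL_TO_ID = {symbol: index for index, symbol in enumerate(SYMBOLS)}


def encode_text(text: str) -> list[int]:
    """Lowercase the text and map each known character to an integer ID.

    Characters missing from the symbol table are silently dropped.
    """
    return [SYMBOL_TO_ID[ch] for ch in text.lower() if ch in SYMBOL_TO_ID]


if __name__ == "__main__":
    ids = encode_text("Hello, world!")
    print(ids)  # one integer per character, ready for an embedding layer
```

In a full system these IDs would be fed to an embedding layer at the front of the network; everything after that (encoder, attention, decoder, vocoder) is left out here.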

Multispeaker Text-To-Speech Synthesis

Samples: https://google.github.io/tacotron/publications/speaker_adaptation/

A neural network-based system for text-to-speech (TTS) synthesis that can generate speech audio in the voices of many different speakers, including speakers unseen during training.

Flowtron (Flow based model)

https://nv-adlr.github.io/Flowtron

An autoregressive flow-based generative network for text-to-speech synthesis.

https://www.youtube.com/watch?v=bOf2S7OzFEg

#toolkit

#text2speech

#opensource