Percival: CNN + Wasserstein GAN + weighted Least Squares


Example of how generation of the warped spectral envelope progresses for the CMU Arctic BDL32 voice:



CMU Arctic SLT (Eng-Ame) (trained on 16kHz, demo data for Merlin and Percival)
8xConv2D 16x5x5 WGANwLS | 3xBLSTM512 | PML RESYNTH | ORI
b0488
b0489
b0490
b0491
b0492
b0493
b0494
b0495
b0496
b0497

CMU Arctic BDL (Eng-Ame) (trained on 32kHz)
8xConv2D 16x5x5 WGANwLS | 3xBLSTM512 | PML RESYNTH | ORI
b0490
b0491
b0492
b0493
b0494
b0495
b0496
b0497
b0498
b0499

Nick (Eng-Eng) (trained on 48kHz)
8xConv2D 16x5x5 WGANwLS | 3xBLSTM512 | PML RESYNTH | ORI
527
528
529
530
531
532
533
534
535
536

LS (Eng-Eng) (trained on 32kHz)
8xConv2D 16x5x5 WGANwLS | 3xBLSTM512 | PML RESYNTH | ORI
2981
2982
2983
2984
2985
2986
2987
2988
2989
2990