Papers - TODA Tomoki
-
Low-latency real-time non-parallel voice conversion based on cyclic variational autoencoder and multiband WaveRNN with data-driven linear prediction Reviewed
P.L. Tobing, T. Toda
Proc. 11th ISCA Speech Synthesis Workshop (SSW11) page: 142 - 147 2021.8
-
Full-band LPCNet: a real-time neural vocoder for 48 kHz audio with a CPU Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
IEEE Access Vol. 9 page: 94923 - 94933 2021.7
-
Crank: an open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder Reviewed
K. Kobayashi, W.-C. Huang, Y.-C. Wu, P.L. Tobing, T. Hayashi, T. Toda
Proc. IEEE ICASSP page: 5934 - 5938 2021.6
-
Any-to-one sequence-to-sequence voice conversion using self-supervised discrete speech representations Reviewed
W.-C. Huang, Y.-C. Wu, T. Hayashi, T. Toda
Proc. IEEE ICASSP page: 5944 - 5948 2021.6
-
Speech recognition by simply fine-tuning BERT Reviewed International coauthorship
W.-C. Huang, C.-H. Wu, S.-B. Luo, K.-Y. Chen, H.-M. Wang, T. Toda
Proc. IEEE ICASSP page: 7343 - 7347 2021.6
-
Non-autoregressive sequence-to-sequence voice conversion Reviewed
T. Hayashi, W.-C. Huang, K. Kobayashi, T. Toda
Proc. IEEE ICASSP page: 7068 - 7072 2021.6
-
High-intelligibility speech synthesis for dysarthric speakers with LPCNet-based TTS and CycleVAE-based VC Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
Proc. IEEE ICASSP page: 7058 - 7062 2021.6
-
Speech emotion recognition based on listener adaptive models Reviewed
A. Ando, R. Masumura, H. Sato, T. Moriya, T. Ashihara, Y. Ijima, T. Toda
Proc. IEEE ICASSP page: 6274 - 6278 2021.6
-
Noise level limited sub-modeling for diffusion probabilistic vocoders Reviewed
T. Okamoto, T. Toda, Y. Shiga, H. Kawai
Proc. IEEE ICASSP page: 6029 - 6033 2021.6
-
Speech emotion recognition based on listener-dependent emotion perception models Reviewed
A. Ando, T. Mori, S. Kobashikawa, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 10 ( e6 ) page: 1 - 11 2021.4
-
Quasi-periodic WaveNet: an autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network Reviewed
Y.-C. Wu, T. Hayashi, P.L. Tobing, K. Kobayashi, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 29 page: 1134 - 1148 2021.3
-
Pretraining techniques for sequence-to-sequence voice conversion Reviewed
W.-C. Huang, T. Hayashi, Y.-C. Wu, H. Kameoka, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 29 page: 745 - 755 2021.2
-
Quasi-periodic parallel WaveGAN: a non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network Reviewed
Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 29 page: 792 - 806 2021.2
-
Investigation of training data size for real-time neural vocoders on CPUs Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
Acoustical Science and Technology, Acoustical Letter Vol. 42 ( 1 ) page: 65 - 68 2021.1
-
Many-to-many voice transformer network Reviewed
H. Kameoka, W.-C. Huang, K. Tanaka, T. Kaneko, N. Hojo, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 29 page: 656 - 670 2021.1
-
Cross-lingual voice conversion using cyclic variational auto-encoder and a WaveNet vocoder Reviewed
H. Nakatani, P.L. Tobing, K. Takeda, T. Toda
Proc. APSIPA ASC page: 520 - 526 2020.12
-
Phoneme embeddings on predicting fundamental frequency pattern for electrolaryngeal speech Reviewed
M. Eshghi, K. Kobayashi, K. Tanaka, H. Kameoka, T. Toda
Proc. APSIPA ASC page: 572 - 577 2020.12
-
ASVspoof 2019: a large-scale public database of synthetic, converted and replayed speech Reviewed International coauthorship
X. Wang, J. Yamagishi, M. Todisco, H. Delgado, A. Nautsch, N. Evans, M. Sahidullah, V. Vestman, T. Kinnunen, K.A. Lee, L. Juvela, P. Alku, Y.-H. Peng, H.-T. Hwang, Y. Tsao, H.-M. Wang, S. Le Maguer, M. Becker, F. Henderson, R. Clark, Y. Zhang, Q. Wang, Y. Jia, K. Onuma, K. Mushika, T. Kaneda, Y. Jiang, L.-J. Liu, Y.-C. Wu, W.-C. Huang, T. Toda, K. Tanaka, H. Kameoka, I. Steiner, D. Matrouf, J.-F. Bonastre, A. Govender, S. Ronanki, J.-X. Zhang, Z.-H. Ling
Computer Speech and Language Vol. 64 ( Article 101114 ) page: 1 - 27 2020.11
-
Conformer-based sound event detection with semi-supervised learning and data augmentation Reviewed International coauthorship
K. Miyazaki, T. Komatsu, T. Hayashi, S. Watanabe, T. Toda, K. Takeda
Proc. DCASE 2020 Workshop page: 100 - 104 2020.11
-
An evaluation of voice conversion with neural network spectral mapping models and WaveNet vocoder Reviewed
P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 9 ( e26, ) page: 1 - 14 2020.11