Papers - TODA Tomoki
-
Many-to-many voice transformer network Reviewed
H. Kameoka, W.-C. Huang, K. Tanaka, T. Kaneko, N. Hojo, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 29 page: 656 - 670 2021.1
-
Investigation of training data size for real-time neural vocoders on CPUs Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
Acoustical Science and Technology, Acoustical Letter Vol. 42 ( 1 ) page: 65 - 68 2021.1
-
Phoneme embeddings on predicting fundamental frequency pattern for electrolaryngeal speech Reviewed
M. Eshghi, K. Kobayashi, K. Tanaka, H. Kameoka, T. Toda
Proc. APSIPA ASC page: 572 - 577 2020.12
-
Cross-lingual voice conversion using cyclic variational auto-encoder and a WaveNet vocoder Reviewed
H. Nakatani, P.L. Tobing, K. Takeda, T. Toda
Proc. APSIPA ASC page: 520 - 526 2020.12
-
ASVspoof 2019: a large-scale public database of synthetic, converted and replayed speech Reviewed International coauthorship
X. Wang, J. Yamagishi, M. Todisco, H. Delgado, A. Nautsch, N. Evans, M. Sahidullah, V. Vestman, T. Kinnunen, K.A. Lee, L. Juvela, P. Alku, Y.-H. Peng, H.-T. Hwang, Y. Tsao, H.-M. Wang, S. Le Maguer, M. Becker, F. Henderson, R. Clark, Y. Zhang, Q. Wang, Y. Jia, K. Onuma, K. Mushika, T. Kaneda, Y. Jiang, L.-J. Liu, Y.-C. Wu, W.-C. Huang, T. Toda, K. Tanaka, H. Kameoka, I. Steiner, D. Matrouf, J.-F. Bonastre, A. Govender, S. Ronanki, J.-X. Zhang, Z.-H. Ling
Computer Speech and Language Vol. 64 ( Article 101114 ) page: 1 - 27 2020.11
-
Conformer-based sound event detection with semi-supervised learning and data augmentation Reviewed International coauthorship
K. Miyazaki, T. Komatsu, T. Hayashi, S. Watanabe, T. Toda, K. Takeda
Proc. DCASE 2020 Workshop page: 100 - 104 2020.11
-
An evaluation of voice conversion with neural network spectral mapping models and WaveNet vocoder Reviewed
P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 9 ( e26, ) page: 1 - 14 2020.11
-
Quasi-periodic parallel WaveGAN vocoder: a non-autoregressive pitch-dependent dilated convolution model for parametric speech generation Reviewed
Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai, T. Toda
Proc. INTERSPEECH page: 3535 - 3539 2020.10
-
The NU voice conversion system for the Voice Conversion Challenge 2020: on the effectiveness of sequence-to-sequence models and autoregressive neural vocoders Reviewed
W.-C. Huang, P.L. Tobing, Y.-C. Wu, K. Kobayashi, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 165 - 169 2020.10
-
The sequence-to-sequence baseline for the Voice Conversion Challenge 2020: cascading ASR and TTS Reviewed International coauthorship
W.-C. Huang, T. Hayashi, S. Watanabe, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 160 - 164 2020.10
-
Baseline system of Voice Conversion Challenge 2020 with cyclic variational autoencoder and parallel WaveGAN Reviewed
P.L. Tobing, Y.-C. Wu, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 155 - 159 2020.10
-
Predictions of subjective ratings and spoofing assessments of Voice Conversion Challenge 2020 submissions Reviewed International coauthorship
R.K. Das, T. Kinnunen, W.-C. Huang, Z. Ling, J. Yamagishi, Z. Yi, X. Tian, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 99 - 120 2020.10
-
Voice Conversion Challenge 2020 -- intra-lingual semi-parallel and cross-lingual voice conversion -- Reviewed International coauthorship
Z. Yi, W.-C. Huang, X. Tian, J. Yamagishi, R.K. Das, T. Kinnunen, Z. Ling, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 80 - 98 2020.10
-
Cyclic spectral modeling for unsupervised unit discovery into voice conversion with excitation and waveform modeling Reviewed
P.L. Tobing, T. Hayashi, Y.-C. Wu, K. Kobayashi, T. Toda
Proc. INTERSPEECH page: 4861 - 4865 2020.10
-
Voice transformer network: sequence-to-sequence voice conversion using transformer with text-to-speech pretraining Reviewed
W.-C. Huang, T. Hayashi, Y.-C. Wu, H. Kameoka, T. Toda
Proc. INTERSPEECH page: 4676 - 4680 2020.10
-
Intelligibility enhancement based on speech waveform modification using hearing impairment simulator Reviewed
S. Hikosaka, S. Seki, T. Hayashi, K. Kobayashi, K. Takeda, H. Banno, T. Toda
Proc. INTERSPEECH page: 4059 - 4063 2020.10
-
Semi-supervised self-produced speech enhancement and suppression based on joint source modeling of air- and body-conducted signals using variational autoencoder Reviewed
S. Seki, M. Takada, T. Toda
Proc. INTERSPEECH page: 4039 - 4043 2020.10
-
A cyclical post-filtering approach to mismatch refinement of neural vocoder for text-to-speech systems Reviewed
Y.-C. Wu, P.L. Tobing, K. Yasuhara, N. Matsunaga, Y. Ohtani, T. Toda
Proc. INTERSPEECH page: 3540 - 3544 2020.10
-
Implementation of low-latency electrolaryngeal speech enhancement based on multi-task CLDNN Reviewed
K. Kobayashi, T. Toda
Proc. EUSIPCO page: 396 - 400 2020.8
-
Semi-supervised enhancement and suppression of self-produced speech using correspondence between air- and body-conducted signals Reviewed
M. Takada, S. Seki, P.L. Tobing, T. Toda
Proc. EUSIPCO page: 456 - 460 2020.8