Papers - Tomoki Toda
-
Speech recognition by simply fine-tuning BERT [Peer-reviewed] [International co-authorship]
W.-C. Huang, C.-H. Wu, S.-B. Luo, K.-Y. Chen, H.-M. Wang, T. Toda
Proc. IEEE ICASSP, pp. 7343-7347, June 2021
-
Non-autoregressive sequence-to-sequence voice conversion [Peer-reviewed]
T. Hayashi, W.-C. Huang, K. Kobayashi, T. Toda
Proc. IEEE ICASSP, pp. 7068-7072, June 2021
-
High-intelligibility speech synthesis for dysarthric speakers with LPCNet-based TTS and CycleVAE-based VC [Peer-reviewed]
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
Proc. IEEE ICASSP, pp. 7058-7062, June 2021
-
Speech emotion recognition based on listener adaptive models [Peer-reviewed]
A. Ando, R. Masumura, H. Sato, T. Moriya, T. Ashihara, Y. Ijima, T. Toda
Proc. IEEE ICASSP, pp. 6274-6278, June 2021
-
Noise level limited sub-modeling for diffusion probabilistic vocoders [Peer-reviewed]
T. Okamoto, T. Toda, Y. Shiga, H. Kawai
Proc. IEEE ICASSP, pp. 6029-6033, June 2021
-
Any-to-one sequence-to-sequence voice conversion using self-supervised discrete speech representations [Peer-reviewed]
W.-C. Huang, Y.-C. Wu, T. Hayashi, T. Toda
Proc. IEEE ICASSP, pp. 5944-5948, June 2021
-
Speech emotion recognition based on listener-dependent emotion perception models [Peer-reviewed] [Open Access]
A. Ando, T. Mori, S. Kobashikawa, T. Toda
APSIPA Transactions on Signal and Information Processing, vol. 10 (e6), pp. 1-11, April 2021
-
Quasi-periodic WaveNet: an autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network [Peer-reviewed]
Y.-C. Wu, T. Hayashi, P.L. Tobing, K. Kobayashi, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 1134-1148, March 2021
-
Pretraining techniques for sequence-to-sequence voice conversion [Peer-reviewed]
W.-C. Huang, T. Hayashi, Y.-C. Wu, H. Kameoka, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 745-755, February 2021
-
Quasi-periodic parallel WaveGAN: a non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network [Peer-reviewed]
Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 792-806, February 2021
-
Many-to-many voice transformer network [Peer-reviewed]
H. Kameoka, W.-C. Huang, K. Tanaka, T. Kaneko, N. Hojo, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 656-670, January 2021
-
Investigation of training data size for real-time neural vocoders on CPUs [Peer-reviewed] [Open Access]
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
Acoustical Science and Technology, Acoustical Letter, vol. 42 (1), pp. 65-68, January 2021
-
Cross-lingual voice conversion using cyclic variational auto-encoder and a WaveNet vocoder [Peer-reviewed]
H. Nakatani, P.L. Tobing, K. Takeda, T. Toda
Proc. APSIPA ASC, pp. 520-526, December 2020
-
Phoneme embeddings on predicting fundamental frequency pattern for electrolaryngeal speech [Peer-reviewed]
M. Eshghi, K. Kobayashi, K. Tanaka, H. Kameoka, T. Toda
Proc. APSIPA ASC, pp. 572-577, December 2020
-
An evaluation of voice conversion with neural network spectral mapping models and WaveNet vocoder [Peer-reviewed] [Open Access]
P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
APSIPA Transactions on Signal and Information Processing, vol. 9 (e26), pp. 1-14, November 2020
-
ASVspoof 2019: a large-scale public database of synthetic, converted and replayed speech [Peer-reviewed] [International co-authorship]
X. Wang, J. Yamagishi, M. Todisco, H. Delgado, A. Nautsch, N. Evans, M. Sahidullah, V. Vestman, T. Kinnunen, K.A. Lee, L. Juvela, P. Alku, Y.-H. Peng, H.-T. Hwang, Y. Tsao, H.-M. Wang, S. Le Maguer, M. Becker, F. Henderson, R. Clark, Y. Zhang, Q. Wang, Y. Jia, K. Onuma, K. Mushika, T. Kaneda, Y. Jiang, L.-J. Liu, Y.-C. Wu, W.-C. Huang, T. Toda, K. Tanaka, H. Kameoka, I. Steiner, D. Matrouf, J.-F. Bonastre, A. Govender, S. Ronanki, J.-X. Zhang, Z.-H. Ling
Computer Speech and Language, vol. 64 (Article 101114), pp. 1-27, November 2020
-
Conformer-based sound event detection with semi-supervised learning and data augmentation [Peer-reviewed] [International co-authorship]
K. Miyazaki, T. Komatsu, T. Hayashi, S. Watanabe, T. Toda, K. Takeda
Proc. DCASE 2020 Workshop, pp. 100-104, November 2020
-
Quasi-periodic parallel WaveGAN vocoder: a non-autoregressive pitch-dependent dilated convolution model for parametric speech generation [Peer-reviewed]
Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai, T. Toda
Proc. INTERSPEECH, pp. 3535-3539, October 2020
-
The NU voice conversion system for the Voice Conversion Challenge 2020: on the effectiveness of sequence-to-sequence models and autoregressive neural vocoders [Peer-reviewed]
W.-C. Huang, P.L. Tobing, Y.-C. Wu, K. Kobayashi, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, pp. 165-169, October 2020
-
The sequence-to-sequence baseline for the Voice Conversion Challenge 2020: cascading ASR and TTS [Peer-reviewed] [International co-authorship]
W.-C. Huang, T. Hayashi, S. Watanabe, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, pp. 160-164, October 2020