Papers - Tomoki Toda (戸田 智基)
-
Low-latency real-time non-parallel voice conversion based on cyclic variational autoencoder and multiband WaveRNN with data-driven linear prediction (peer-reviewed)
P.L. Tobing, T. Toda
Proc. 11th ISCA Speech Synthesis Workshop (SSW11), pp. 142-147, August 2021
-
Full-band LPCNet: a real-time neural vocoder for 48 kHz audio with a CPU (peer-reviewed)
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
IEEE Access, vol. 9, pp. 94923-94933, July 2021
-
Crank: an open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder (peer-reviewed)
K. Kobayashi, W.-C. Huang, Y.-C. Wu, P.L. Tobing, T. Hayashi, T. Toda
Proc. IEEE ICASSP, pp. 5934-5938, June 2021
-
Any-to-one sequence-to-sequence voice conversion using self-supervised discrete speech representations (peer-reviewed)
W.-C. Huang, Y.-C. Wu, T. Hayashi, T. Toda
Proc. IEEE ICASSP, pp. 5944-5948, June 2021
-
Speech recognition by simply fine-tuning BERT (peer-reviewed, international co-authorship)
W.-C. Huang, C.-H. Wu, S.-B. Luo, K.-Y. Chen, H.-M. Wang, T. Toda
Proc. IEEE ICASSP, pp. 7343-7347, June 2021
-
Non-autoregressive sequence-to-sequence voice conversion (peer-reviewed)
T. Hayashi, W.-C. Huang, K. Kobayashi, T. Toda
Proc. IEEE ICASSP, pp. 7068-7072, June 2021
-
High-intelligibility speech synthesis for dysarthric speakers with LPCNet-based TTS and CycleVAE-based VC (peer-reviewed)
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
Proc. IEEE ICASSP, pp. 7058-7062, June 2021
-
Speech emotion recognition based on listener adaptive models (peer-reviewed)
A. Ando, R. Masumura, H. Sato, T. Moriya, T. Ashihara, Y. Ijima, T. Toda
Proc. IEEE ICASSP, pp. 6274-6278, June 2021
-
Noise level limited sub-modeling for diffusion probabilistic vocoders (peer-reviewed)
T. Okamoto, T. Toda, Y. Shiga, H. Kawai
Proc. IEEE ICASSP, pp. 6029-6033, June 2021
-
Speech emotion recognition based on listener-dependent emotion perception models (peer-reviewed)
A. Ando, T. Mori, S. Kobashikawa, T. Toda
APSIPA Transactions on Signal and Information Processing, vol. 10, e6, pp. 1-11, April 2021
-
Quasi-periodic WaveNet: an autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network (peer-reviewed)
Y.-C. Wu, T. Hayashi, P.L. Tobing, K. Kobayashi, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 1134-1148, March 2021
-
Pretraining techniques for sequence-to-sequence voice conversion (peer-reviewed)
W.-C. Huang, T. Hayashi, Y.-C. Wu, H. Kameoka, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 745-755, February 2021
-
Quasi-periodic parallel WaveGAN: a non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network (peer-reviewed)
Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 792-806, February 2021
-
Investigation of training data size for real-time neural vocoders on CPUs (peer-reviewed)
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
Acoustical Science and Technology, Acoustical Letter, vol. 42, no. 1, pp. 65-68, January 2021
-
Many-to-many voice transformer network (peer-reviewed)
H. Kameoka, W.-C. Huang, K. Tanaka, T. Kaneko, N. Hojo, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 656-670, January 2021
-
Cross-lingual voice conversion using cyclic variational auto-encoder and a WaveNet vocoder (peer-reviewed)
H. Nakatani, P.L. Tobing, K. Takeda, T. Toda
Proc. APSIPA ASC, pp. 520-526, December 2020
-
Phoneme embeddings on predicting fundamental frequency pattern for electrolaryngeal speech (peer-reviewed)
M. Eshghi, K. Kobayashi, K. Tanaka, H. Kameoka, T. Toda
Proc. APSIPA ASC, pp. 572-577, December 2020
-
ASVspoof 2019: a large-scale public database of synthetic, converted and replayed speech (peer-reviewed, international co-authorship)
X. Wang, J. Yamagishi, M. Todisco, H. Delgado, A. Nautsch, N. Evans, M. Sahidullah, V. Vestman, T. Kinnunen, K.A. Lee, L. Juvela, P. Alku, Y.-H. Peng, H.-T. Hwang, Y. Tsao, H.-M. Wang, S. Le Maguer, M. Becker, F. Henderson, R. Clark, Y. Zhang, Q. Wang, Y. Jia, K. Onuma, K. Mushika, T. Kaneda, Y. Jiang, L.-J. Liu, Y.-C. Wu, W.-C. Huang, T. Toda, K. Tanaka, H. Kameoka, I. Steiner, D. Matrouf, J.-F. Bonastre, A. Govender, S. Ronanki, J.-X. Zhang, Z.-H. Ling
Computer Speech and Language, vol. 64, Article 101114, pp. 1-27, November 2020
-
Conformer-based sound event detection with semi-supervised learning and data augmentation (peer-reviewed, international co-authorship)
K. Miyazaki, T. Komatsu, T. Hayashi, S. Watanabe, T. Toda, K. Takeda
Proc. DCASE 2020 Workshop, pp. 100-104, November 2020
-
An evaluation of voice conversion with neural network spectral mapping models and WaveNet vocoder (peer-reviewed)
P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
APSIPA Transactions on Signal and Information Processing, vol. 9, e26, pp. 1-14, November 2020