Papers - TODA Tomoki
-
Quasi-periodic parallel WaveGAN vocoder: a non-autoregressive pitch-dependent dilated convolution model for parametric speech generation Reviewed
Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai, T. Toda
Proc. INTERSPEECH page: 3535 - 3539 2020.10
-
The NU voice conversion system for the Voice Conversion Challenge 2020: on the effectiveness of sequence-to-sequence models and autoregressive neural vocoders Reviewed
W.-C. Huang, P.L. Tobing, Y.-C. Wu, K. Kobayashi, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 165 - 169 2020.10
-
The sequence-to-sequence baseline for the Voice Conversion Challenge 2020: cascading ASR and TTS Reviewed International coauthorship
W.-C. Huang, T. Hayashi, S. Watanabe, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 160 - 164 2020.10
-
Baseline system of Voice Conversion Challenge 2020 with cyclic variational autoencoder and parallel WaveGAN Reviewed
P.L. Tobing, Y.-C. Wu, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 155 - 159 2020.10
-
Predictions of subjective ratings and spoofing assessments of Voice Conversion Challenge 2020 submissions Reviewed International coauthorship
R.K. Das, T. Kinnunen, W.-C. Huang, Z. Ling, J. Yamagishi, Z. Yi, X. Tian, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 99 - 120 2020.10
-
Voice Conversion Challenge 2020 -- intra-lingual semi-parallel and cross-lingual voice conversion -- Reviewed International coauthorship
Z. Yi, W.-C. Huang, X. Tian, J. Yamagishi, R.K. Das, T. Kinnunen, Z. Ling, T. Toda
Proc. Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 page: 80 - 98 2020.10
-
Cyclic spectral modeling for unsupervised unit discovery into voice conversion with excitation and waveform modeling Reviewed
P.L. Tobing, T. Hayashi, Y.-C. Wu, K. Kobayashi, T. Toda
Proc. INTERSPEECH page: 4861 - 4865 2020.10
-
Voice transformer network: sequence-to-sequence voice conversion using transformer with text-to-speech pretraining Reviewed
W.-C. Huang, T. Hayashi, Y.-C. Wu, H. Kameoka, T. Toda
Proc. INTERSPEECH page: 4676 - 4680 2020.10
-
Intelligibility enhancement based on speech waveform modification using hearing impairment simulator Reviewed
S. Hikosaka, S. Seki, T. Hayashi, K. Kobayashi, K. Takeda, H. Banno, T. Toda
Proc. INTERSPEECH page: 4059 - 4063 2020.10
-
Semi-supervised self-produced speech enhancement and suppression based on joint source modeling of air- and body-conducted signals using variational autoencoder Reviewed
S. Seki, M. Takada, T. Toda
Proc. INTERSPEECH page: 4039 - 4043 2020.10
-
A cyclical post-filtering approach to mismatch refinement of neural vocoder for text-to-speech systems Reviewed
Y.-C. Wu, P.L. Tobing, K. Yasuhara, N. Matsunaga, Y. Ohtani, T. Toda
Proc. INTERSPEECH page: 3540 - 3544 2020.10
-
Implementation of low-latency electrolaryngeal speech enhancement based on multi-task CLDNN Reviewed
K. Kobayashi, T. Toda
Proc. EUSIPCO page: 396 - 400 2020.8
-
Semi-supervised enhancement and suppression of self-produced speech using correspondence between air- and body-conducted signals Reviewed
M. Takada, S. Seki, P.L. Tobing, T. Toda
Proc. EUSIPCO page: 456 - 460 2020.8
-
Weakly-supervised sound event detection with self-attention Reviewed International coauthorship
K. Miyazaki, T. Komatsu, T. Hayashi, S. Watanabe, T. Toda, K. Takeda
Proc. IEEE ICASSP page: 66 - 70 2020.5
-
ESPNET-TTS: Uunified, reproducible, and integratable open source end-to-end text-to-speech toolkit Reviewed International coauthorship
T. Hayashi, R. Yamamoto, K. Inoue, T. Yoshimura, S. Watanabe, T. Toda, K. Takeda, Y. Zhang, X. Tan
Proc. IEEE ICASSP page: 7654 - 7658 2020.5
-
Efficient shallow WaveNet vocoder using multiple samples output based on Laplacian distribution and linear prediction Reviewed
P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
Proc. IEEE ICASSP page: 7204 - 7208 2020.5
-
Transformer-based text-to-speech with weighted forced attention Reviewed
T. Okamoto, T. Toda, Y. Shiga, H. Kawai
Proc. IEEE ICASSP page: 6729 - 6733 2020.5
-
Non-parallel voice conversion system with WaveNet vocoder and collapsed speech suppression Reviewed
Y.-C. Wu, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda
IEEE Access Vol. 8 ( 1 ) page: 62094 - 62106 2020.4
-
LMS経由で手書きレポートを返却するWebサービス「かみレポ」の開発・評価 Reviewed
大平 茂輝, 清谷 峻也, 伊藤 瑠哉, 岡本 康佑, 谷川 右京, 出口 大輔, 戸田 智基
情報処理学会論文誌:教育とコンピュータ Vol. 6 ( 1 ) page: 52 - 68 2020.2
-
Customer satisfaction estimation in contact center calls based on a hierarchical multi-task model Reviewed
A. Ando, R. Masumura, H. Kamiyama, S. Kobashikawa, Y. Aono, T. Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing Vol. 28 ( 1 ) page: 715 - 728 2020.1