Papers - TODA Tomoki
-
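Limitations of MOS evaluation of speech and new possibilities for large-scale comparative evaluation Invited Reviewed
Y. Yasuda, T. Toda
The Journal of the Acoustical Society of Japan Vol. 80 ( 7 ) page: 393 - 400 2024.7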
-
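Objective evaluation of synthetic speech and the VoiceMOS Challenge Invited Reviewed International coauthorship
E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi
The Journal of the Acoustical Society of Japan Vol. 80 ( 7 ) page: 381 - 392 2024.7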
-
A review on subjective and objective evaluation of synthetic speech Invited Reviewed International coauthorship
E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi
Acoustical Science and Technology Vol. 45 ( 4 ) page: 161 - 183 2024.7
-
Mandarin speech reconstruction from tongue motion ultrasound images based on generative adversarial networks Reviewed International coauthorship
F. Li, F. Shen, D. Ma, S. Zhang, J. Zhou, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
Proc. IEEE EMBC page: 4 pages 2024.7
-
Unequally spaced sound field interpolation for rotation-robust beamforming Reviewed
S. Luan, Y. Wakabayashi, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 32 page: 3185 - 3199 2024.6
-
Pretraining and adaptation techniques for electrolaryngeal speech recognition Reviewed
L.P. Violeta, D. Ma, W.-C. Huang, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 32 page: 2777 - 2789 2024.5
-
Audio difference learning for audio captioning Reviewed
T. Komatsu, Y. Fujita, K. Takeda, T. Toda
Proc. IEEE ICASSP page: 1456 - 1460 2024.4
-
ConvNeXt-TTS and ConvNeXt-VC: ConvNeXt-based fast end-to-end sequence-to-sequence text-to-speech and voice conversion Reviewed
T. Okamoto, Y. Ohtani, T. Toda, H. Kawai
Proc. IEEE ICASSP page: 12456 - 12460 2024.4
-
MF-AED-AEC: speech emotion recognition by leveraging multimodal fusion, ASR error detection, and ASR error correction Reviewed International coauthorship
J. He, X. Shi, X. Li, T. Toda
Proc. IEEE ICASSP page: 11066 - 11070 2024.4
-
Electrolaryngeal speech intelligibility enhancement through robust linguistic encoders Reviewed
L.P. Violeta, W.-C. Huang, D. Ma, R. Yamamoto, K. Kobayashi, T. Toda
Proc. IEEE ICASSP page: 10961 - 10965 2024.4
-
FIRNET: fundamental frequency controllable fast neural vocoder with trainable finite impulse response filter Reviewed
Y. Ohtani, T. Okamoto, T. Toda, H. Kawai
Proc. IEEE ICASSP page: 10871 - 10875 2024.4
-
An investigation of fundamental frequency pattern prediction for Japanese electrolaryngeal speech enhancement based on frame-wise phoneme representations Reviewed
M. Eshghi, T. Toda
IEEE Access Vol. 12 page: 50137 - 50153 2024.4
-
Dual-channel target speaker extraction based on conditional variational autoencoder and directional information Reviewed
R. Wang, L. Li, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 32 page: 12 pages 2024.3
-
Fast neural speech waveform generative models with fully-connected layer-based upsampling Reviewed
H. Yamashita, T. Okamoto, R. Takashima, Y. Ohtani, T. Takiguchi, T. Toda, H. Kawai
IEEE Access Vol. 12 page: 31409 - 31421 2024.2
-
Preservation of preoperative voice in laryngectomees using a voice recording app: the Save the Voice project Reviewed
N. Nishio, T. Toda, K. Kobayashi, S. Mitani, M. Ameya, N. Mukoyama, H. Kimura, T. Tokura, T. Tsuboi, Y. Fujimoto, M. Sone
Koutou (The Larynx Japan) Vol. 35 ( 2 ) page: 142 - 147 2023.12
-
The Singing Voice Conversion Challenge 2023 Reviewed International coauthorship
W.-C. Huang, L.P. Violeta, S. Liu, J. Shi, T. Toda
Proc. IEEE ASRU page: 8 pages 2023.12
-
ED-CEC: improving rare word recognition using ASR post-processing based on error detection and context-aware error correction Reviewed
J. He, Z. Yang, T. Toda
Proc. IEEE ASRU page: 6 pages 2023.12
-
Improving severity preservation of healthy-to-pathological voice conversion with global style tokens Reviewed International coauthorship
B. Halpern, W.-C. Huang, L.P. Violeta, R. van Son, T. Toda
Proc. IEEE ASRU page: 7 pages 2023.12
-
A comparative study of voice conversion models with large-scale speech and singing data: the T13 systems for the Singing Voice Conversion Challenge 2023 Reviewed
R. Yamamoto, R. Yoneyama, L.P. Violeta, W.-C. Huang, T. Toda
Proc. IEEE ASRU page: 6 pages 2023.12
-
The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains Reviewed International coauthorship
E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi
Proc. IEEE ASRU page: 7 pages 2023.12