Papers - TODA Tomoki
-
Audio difference learning for audio captioning Reviewed
T. Komatsu, Y. Fujita, K. Takeda, T. Toda
Proc. IEEE ICASSP page: 1456 - 1460 2024.4
-
ConvNeXt-TTS and ConvNeXt-VC: ConvNeXt-based fast end-to-end sequence-to-sequence text-to-speech and voice conversion Reviewed
T. Okamoto, Y. Ohtani, T. Toda, H. Kawai
Proc. IEEE ICASSP page: 12456 - 12460 2024.4
-
MF-AED-AEC: speech emotion recognition by leveraging multimodal fusion, ASR error detection, and ASR error correction Reviewed International coauthorship
J. He, X. Shi, X. Li, T. Toda
Proc. IEEE ICASSP page: 11066 - 11070 2024.4
-
Electrolaryngeal speech intelligibility enhancement through robust linguistic encoders Reviewed
L.P. Violeta, W.-C. Huang, D. Ma, R. Yamamoto, K. Kobayashi, T. Toda
Proc. IEEE ICASSP page: 10961 - 10965 2024.4
-
FIRNET: fundamental frequency controllable fast neural vocoder with trainable finite impulse response filter Reviewed
Y. Ohtani, T. Okamoto, T. Toda, H. Kawai
Proc. IEEE ICASSP page: 10871 - 10875 2024.4
-
Dual-channel target speaker extraction based on conditional variational autoencoder and directional information Reviewed
R. Wang, L. Li, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 32 page: 12 pages 2024.3
-
Fast neural speech waveform generative models with fully-connected layer-based upsampling Reviewed
H. Yamashita, T. Okamoto, R. Takashima, Y. Ohtani, T. Takiguchi, T. Toda, H. Kawai
IEEE Access Vol. 12 page: 31409 - 31421 2024.2
-
喉頭摘出者における音声収録アプリを用いた術前音声の保存 ―Save the Voice プロジェクト― Reviewed
西尾 直樹, 戸田 智基, 小林 和弘, 三谷 壮平, 飴矢 美里, 向山 宣昭, 木村 宏之, 徳倉 達也, 坪井 崇, 藤本 保志, 曾根 三千彦
喉頭 Vol. 35 ( 2 ) page: 142 - 147 2023.12
-
The Singing Voice Conversion Challenge 2023 Reviewed International coauthorship
W.-C. Huang, L.P. Violeta, S. Liu, J. Shi, T. Toda
Proc. IEEE ASRU page: 8 pages 2023.12
-
ED-CEC: improving rare word recognition using ASR post-processing based on error detection and context-aware error correction Reviewed
J. He, Z. Yang, T. Toda
Proc. IEEE ASRU page: 6 pages 2023.12
-
Improving severity preservation of healthy-to-pathological voice conversion with global style tokens Reviewed International coauthorship
B. Halpern, W.-C. Huang, L.P. Violeta, R. van Son, T. Toda
Proc. IEEE ASRU page: 7 pages 2023.12
-
A comparative study of voice conversion models with large-scale speech and singing data: the T13 systems for the Singing Voice Conversion Challenge 2023 Reviewed
R. Yamamoto, R. Yoneyama, L.P. Violeta, W.-C. Huang, T. Toda
Proc. IEEE ASRU page: 6 pages 2023.12
-
The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains Reviewed International coauthorship
E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi
Proc. IEEE ASRU page: 7 pages 2023.12
-
WaveNeXt: ConvNeXt-based fast neural vocoder without iSTFT layer Reviewed
T. Okamoto, H. Yamashita, Y. Ohtani, T. Toda, H. Kawai
Proc. IEEE ASRU page: 8 pages 2023.12
-
Sequence-to-sequence network training methods for automatic guitar transcription with tokenized outputs Reviewed
S. Kim, K. Takeda, T. Toda
Proc. ISMIR page: 524 - 531 2023.11
-
Evaluating methods for ground-truth-free foreign accent conversion Reviewed
W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 1136 - 1141 2023.11
-
An analysis of personalized speech recognition system development for the deaf and hard-of-hearing Reviewed
L.P. Violeta, T. Toda
Proc. APSIPA ASC page: 1851 - 1856 2023.11
-
Semi-supervised multimodal emotion recognition with consensus decision-making and label correction Reviewed International coauthorship
J. Tian, D. Hu, X. Shi, J. He, X. Li, Y. Gao, T. Toda, X. Xu, X. Hu
Proc. MRAC page: 67 - 73 2023.10
-
Differentiable representation of warping based on Lie group theory Reviewed
A. Miyashita, T. Toda
Proc. IEEE WASPAA page: 5 pages 2023.10
-
Directional target speaker extraction under noisy underdetermined conditions through conditional variational autoencoder with global style tokens Reviewed
R. Wang, T. Toda
Proc. IEEE WASPAA page: 5 pages 2023.10