Papers - TODA Tomoki
-
End-to-end Mandarin speech reconstruction based on ultrasound tongue images using deep learning Reviewed International coauthorship
F. Li, F. Shen, D. Ma, J. Zhou, S. Zhang, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
IEEE Transactions on Neural Systems and Rehabilitation Engineering Vol. 33 page: 140 - 149 2024.12
-
Two-stage framework for robust speech emotion recognition using target speaker extraction in human speech noise conditions Reviewed
J. Mi, X. Shi, D. Ma, J. He, T. Fujimura, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Improved architecture for high-resolution piano transcription to efficiently capture acoustic characteristics of music signals Reviewed
J. Mi, S. Kim, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Multi-modal video summarization based on two-stage fusion of audio, visual, and recognized text information Reviewed
Z. Yang, J. He, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Multi-task learning approaches for music similarity representation learning based on individual instrument sounds Reviewed
T. Imamura, Y. Hashizume, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
A study on multimodal fusion and layer adapter in emotion recognition Reviewed International coauthorship
X. Shi, Y. Gao, J. He, J. Mi, X. Li, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Reference-free automatic speech severity evaluation using acoustic unit language modelling Reviewed
B. Halpern, T. Toda
Proc. SpandLDeteriorate Workshop of ACM Multimedia Asia (Workshop on Multi-Biological Sensing Data for Speech and Language Deterioration Prediction) page: 5 pages 2024.12
-
The VoiceMOS Challenge 2024: beyond speech quality prediction Reviewed International coauthorship
W.-C. Huang, S.-W. Fu, E. Cooper, R. Zezario, T. Toda, H.-M. Wang, J. Yamagishi, Y. Tsao
Proc. IEEE SLT page: 813 - 820 2024.12
-
Multi-speaker text-to-speech training with speaker anonymized data Reviewed International coauthorship
W.-C. Huang, Y.-C. Wu, T. Toda
IEEE Signal Processing Letters Vol. 31 page: 2995 - 2999 2024.10
-
2DP-2MRC: 2-dimensional pointer-based machine reading comprehension method for multimodal moment retrieval Reviewed
J. He, T. Toda
Proc. INTERSPEECH page: 5073 - 5077 2024.9
-
CtrSVDD: a benchmark dataset and baseline analysis for controlled singing voice deepfake detection Reviewed International coauthorship
Y. Zang, J. Shi, Y. Zhang, R. Yamamoto, J. Han, Y. Tang, S. Xu, W. Zhao, J. Guo, T. Toda, Z. Duan
Proc. INTERSPEECH page: 4783 - 4787 2024.9
-
Exploring the robustness of text-to-speech synthesis based on diffusion probabilistic models to heavily noisy transcriptions Reviewed
J. Feng, Y. Yasuda, T. Toda
Proc. INTERSPEECH page: 4408 - 4412 2024.9
-
QHM-GAN: neural vocoder based on quasi-harmonic modeling Reviewed
S. Chen, T. Toda
Proc. INTERSPEECH page: 3889 - 3893 2024.9
-
Multimodal fusion of music theory-inspired and self-supervised representations for improved emotion recognition Reviewed International coauthorship
X. Shi, X. Li, T. Toda
Proc. INTERSPEECH page: 3724 - 3728 2024.9
-
Quantifying the effect of speech pathology on automatic and human speaker verification Reviewed International coauthorship
B. Halpern, T. Tienkamp, W.-C. Huang, L.P. Violeta, T. Rebernik, S. de Visscher, M.J.H. Witjes, M. Wieling, D. Abur, T. Toda
Proc. INTERSPEECH page: 3015 - 3019 2024.9
-
Embedding learning for preference-based speech quality assessment Reviewed
C.-H. Hu, Y. Yasuda, T. Toda
Proc. INTERSPEECH page: 2685 - 2689 2024.9
-
Challenge of singing voice synthesis using only text-to-speech corpus with FIRNet source-filter neural vocoder Reviewed
T. Okamoto, Y. Ohtani, S. Shimizu, T. Toda, H. Kawai
Proc. INTERSPEECH page: 1870 - 1874 2024.9
-
Unsupervised training of neural network-based virtual microphone estimator Reviewed
J. Wang, T. Toda
Proc. EUSIPCO page: 256 - 260 2024.8
-
Discriminative neighborhood smoothing for generative anomalous sound detection Reviewed
T. Fujimura, K. Imoto, T. Toda
Proc. EUSIPCO page: 156 - 160 2024.8
-
Robust sequence-to-sequence voice conversion for electrolaryngeal speech enhancement in noisy and reverberant conditions Reviewed
D. Ma, Y. Choi, F. Li, C. Xie, K. Kobayashi, T. Toda
Proc. IEEE EMBC page: 4 pages 2024.7