Papers - TODA Tomoki
-
A voice conversion system from electrolarynx speech to preoperative patient’s speech for total laryngectomy Reviewed Open Access
N. Nishio, K. Kobayashi, D. Ma, S. Mitani, M. Sone, T. Toda
OTO Open Vol. 10 ( 1 ) page: 5 pages 2026.2
-
Severity-controllable pathological text-to-speech synthesis for clinical applications Reviewed Open Access
B.M. Halpern, W.-C. Huang, L.P. Violeta, T. Toda
IEEE Transactions on Neural Systems and Rehabilitation Engineering Vol. 34 page: 573 - 582 2026.1
-
A comprehensive study on the effectiveness of ASR representations for noise-robust speech emotion recognition Reviewed International coauthorship
X. Shi, J. He, X. Li, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 34 page: 707 - 722 2026.1
-
PARCO: phoneme-augmented robust contextual ASR via contrastive entity disambiguation Reviewed
J. He, N. Sawada, K. Miyazaki, T. Toda
Proc. IEEE ASRU page: 7 pages 2025.12
-
Voice factor control using FIR-based fast neural vocoder for speech generation applications Reviewed
Y. Ohtani, T. Okamoto, T. Toda, H. Kawai
Proc. IEEE ASRU page: 4 pages 2025.12
-
The AudioMOS Challenge 2025 Reviewed International coauthorship
W.-C. Huang, H. Wang, C. Liu, Y.-C. Wu, A. Tjandra, W.-N. Hsu, E. Cooper, Y. Qin, T. Toda
Proc. IEEE ASRU page: 8 pages 2025.12
-
Layer-wise analysis for quality of multilingual synthesized speech Reviewed
E. Cooper, T. Okamoto, Y. Ohtani, T. Toda, H. Kawai
Proc. IEEE ASRU page: 7 pages 2025.12
-
Sequence-to-sequence voice conversion with weighted guided attention Reviewed Open Access
H. Yamashita, T. Okamoto, R. Takashima, Y. Ohtani, T. Takiguchi, T. Toda, H. Kawai
IEEE Access Vol. 13 page: 216583 - 216595 2025.12
-
Audio difference learning framework for audio captioning Reviewed
T. Komatsu, K. Takeda, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e34 ) page: 1 - 18 2025.11
-
Study on automatic generation of lecture videos based on content analysis of lecture slides Reviewed
K. Mizukami, D. Deguchi, T. Toda, H. Murase, H. Kyutoku, T. Minematsu
Proc. CELDA page: 4 pages 2025.11
-
XPPG-PCA: reference-free automatic speech severity evaluation with principal components Reviewed International coauthorship Open Access
B.M. Halpern, T.B. Tienkamp, T. Rebernik, R.J.J.H. van Son, S.A.H.J. de Visscher, M.J.H. Witjes, D. Abur, T. Toda
IEEE Journal of Selected Topics in Signal Processing Vol. 19 ( 5 ) page: 783 - 795 2025.10
-
Wavehax: aliasing-free neural waveform synthesis based on 2D convolution and harmonic prior for reliable complex spectrogram estimation Reviewed
R. Yoneyama, A. Miyashita, R. Yamamoto, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 4454 - 4470 2025.10
-
Handling domain shifts for anomalous sound detection: a review of DCASE-related work Reviewed International coauthorship
K. Wilkinghoff, T. Fujimura, K. Imoto, J. Le Roux, Z.-H. Tan, T. Toda
Proc. DCASE Workshop page: 20 - 24 2025.10
-
Speaker privacy and security in the big data era: protection and defense against deepfake Invited International coauthorship
L. Chen, K.A. Lee, Z.-H. Ling, X. Wang, R.K. Das, T. Toda, H. Li
Proc. APSIPA ASC 2025.10
-
Neural semi-fragile watermarking for proactive deepfake speech detection Reviewed
D. Yoon, T. Toda
Proc. APSIPA ASC page: 2396 - 2401 2025.10
-
Disfluency disentanglement enhancement in spoken-text-style transfer for spontaneous speech synthesis Reviewed
Y. Nakata, D. Yoshioka, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 2254 - 2259 2025.10
-
Investigation of the effectiveness of converted speech auditory feedback in low-latency real-time voice conversion Reviewed
K. Niwa, K. Kobayashi, T. Toda
Proc. APSIPA ASC page: 905 - 910 2025.10
-
Designing a music difficulty measure for controllable automatic piano rearrangement Reviewed
H. Miyaji, K. Sawada, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 834 - 839 2025.10
-
Estimating speaker'ss seating position from monaural speech in a simulated vehicle interior sound field Reviewed
M. Kaneko, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 625 - 629 2025.10
-
An evaluation of supervised virtual microphone estimators in reverberant sound fields Reviewed
K. Hattori, W.-C. Huang, K. Takeda, T. Toda
Proc. APSIPA ASC page: 517 - 522 2025.10