Papers - TODA Tomoki
-
A preliminary study on sectional voice anonymization and detection Reviewed International coauthorship
S. Tang, Z. Liu, L. Chen, K. Lee, T. Toda, Z.-H. Ling
Proc. APSIPA ASC page: 318 - 323 2025.10
-
Hierarchical symbolic music generation with variational autoencoder-based bar-wise feature sequences Reviewed
K. Sawada, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 299 - 304 2025.10
-
Adjusting bias in anomaly scores via variance minimization for domain-generalized discriminative anomalous sound detection Reviewed
M. Matsumoto, T. Fujimura, W.-C. Huang, T. Toda
Proc. DCASE Workshop page: 25 - 29 2025.10
-
ASDKit: a toolkit for comprehensive evaluation of anomalous sound detection methods Reviewed International coauthorship
T. Fujimura, K. Wilkinghoff, K. Imoto, T. Toda
Proc. DCASE Workshop page: 40 - 44 2025.10
-
Discriminative anomalous sound detection using pseudo labels, target signal enhancement, and ensemble feature extractors Reviewed
T. Fujimura, I. Kuroyanagi, T. Toda
Proc. DCASE Workshop page: 180 - 184 2025.10
-
Music similarity representation learning focusing on individual instruments with source separation and human preference Reviewed
T. Imamura, Y. Hashizume, W.-C. Huang, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 4, e305 ) page: 1 - 29 2025.10
-
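Reacquisition of one's own voice in laryngectomees: Save the Voice Project Reviewed
N. Nishio, K. Kobayashi, T. Toda
The Journal of the Japan Broncho-esophagological Society (in Japanese) Vol. 76 ( 5 ) page: 255 - 263 2025.10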
-
VAE-SiFiGAN: source-filter HiFi-GAN based on variational autoencoder representations with enhanced pitch controllability Reviewed Open Access
K. Ogita, R. Yoneyama, W.-C. Huang, T. Toda
Proc. EUSIPCO page: 531 - 535 2025.9
-
QHARMA-GAN: quasi-harmonic neural vocoder based on autoregressive moving average model Reviewed Open Access
S. Chen, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 3703 - 3719 2025.9
-
Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment Reviewed Open Access
Y. Yasuda, T. Toda
Computer Speech and Language Vol. 96 ( Article 101888 ) page: 1 - 16 2025.9
-
Serenade: a singing style conversion framework based on audio infilling Reviewed Open Access
L.P. Violeta, W.-C. Huang, T. Toda
Proc. EUSIPCO page: 411 - 415 2025.9
-
M4SER: multimodal, multirepresentation, multitask, and multistrategy learning for speech emotion recognition Reviewed International coauthorship
J. He, X. Shi, C.-H. Hu, J. Mi, X. Li, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 4055 - 4070 2025.9
-
Text- and speech-style control for lecture speech generation focusing on disfluency Reviewed
D. Yoshioka, Y. Nakata, Y. Yasuda, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e26 ) page: 1 - 31 2025.9
-
Continual subjective evaluation method of speech by merging sort-based preference tests towards ever-expanding corpus of human ratings Reviewed Open Access
Y. Yasuda, J. Yamagishi, T. Toda
Proc. SSW page: 14 - 20 2025.8
-
Eigenvoice synthesis based on model editing for speaker generation Reviewed Open Access
M. Murata, K. Miyazaki, T. Koriyama, T. Toda
Proc. INTERSPEECH page: 5523 - 5527 2025.8
-
Unifying listener scoring scales: comparison learning framework for speech quality assessment and continuous speech emotion recognition Reviewed Open Access
C.-H. Hu, Y. Yasuda, A. Yoshimoto, T. Toda
Proc. INTERSPEECH page: 5428 - 5432 2025.8
-
SHEET: a multi-purpose open-source speech human evaluation estimation toolkit Reviewed Open Access
W.-C. Huang, E. Cooper, T. Toda
Proc. INTERSPEECH page: 2355 - 2359 2025.8
-
CMT-LLM: context-aware multi-talker ASR utilizing large language models Reviewed Open Access
J. He, N. Sawada, K. Miyazaki, T. Toda
Proc. INTERSPEECH page: 2575 - 2579 2025.8
-
GIA-MIC: multimodal emotion recognition with gated interactive attention and modality-invariant learning constraints Reviewed Open Access
J. He, J. Mi, T. Toda
Proc. INTERSPEECH page: 2695 - 2699 2025.8
-
Relationship between objective and subjective perceptual measures of speech in individuals with head and neck cancer Reviewed International coauthorship Open Access
B. Halpern, T. Tienkamp, T. Rebernik, R. van Son, M. Wieling, D. Abur, T. Toda
Proc. INTERSPEECH page: 3733 - 3737 2025.8