Papers - TODA Tomoki
-
Noise and reverberation-controllable voice conversion Reviewed
Y. Choi, C. Xie, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 2430 - 2443 2025.6
-
PMF-CEC: phoneme-augmented multimodal fusion for context-aware ASR error correction with error-specific selective decoding Reviewed
J. He, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 2402 - 2417 2025.6
-
Improving anomalous sound detection through pseudo-anomalous set selection and pseudo-label utilization under unlabeled conditions Reviewed Open Access
I. Kuroyanagi, T. Fujimura, K. Takeda, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e13 ) page: 1 - 28 2025.6
-
Analysis and extension of noisy-target training for unsupervised target signal enhancement Reviewed Open Access
T. Fujimura, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e12 ) page: 1 - 27 2025.6
-
An investigation of noisy-to-noisy voice conversion performance in various noisy conditions Reviewed Open Access
C. Xie, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e10 ) page: 1 - 30 2025.6
-
Sequence-to-sequence voice conversion-based techniques for electrolaryngeal speech enhancement in noisy and reverberant conditions Reviewed International coauthorship Open Access
D. Ma, Y. Choi, T. Fujimura, F. Li, C. Xie, K. Kobayashi, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e8 ) page: 1 - 40 2025.5
-
Fast neural vocoder with fundamental frequency control using finite impulse response filters Reviewed
Y. Ohtani, T. Okamoto, T. Toda, H. Kawai
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 1893 - 1906 2025.4
-
Predicting fundamental frequency patterns in electrolaryngeal speech using automated phoneme extraction Reviewed Open Access
M. Eshghi, T. Toda
IEEE Access Vol. 13 page: 73831 - 73847 2025.4
-
Generalized sound field interpolation for freely spaced microphone arrays in rotation-robust beamforming Reviewed Open Access
S. Luan, Y. Wakabayashi, T. Toda
Applied Acoustics Vol. 236 ( Article 110706 ) page: 1 - 15 2025.4
-
Mora-level prosody prediction for text-to-speech using Japanese BERT without accentual labels Reviewed
T. Ogura, T. Okamoto, Y. Ohtani, E. Cooper, T. Toda, H. Kawai
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Investigating factors related to the naturalness of synthesized unison singing Reviewed
K. Nishizawa, R. Yamamoto, W.-C. Huang, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Improvements of discriminative feature space training for anomalous sound detection in unlabeled conditions Reviewed
T. Fujimura, I. Kuroyanagi, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Investigation of perceptual music similarity focusing on each instrumental part Reviewed
Y. Hashizume, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Mandarin speech reconstruction from surface electromyography based on generative adversarial networks Reviewed International coauthorship
F. Li, F. Shen, D. Ma, J. Zhou, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
Medicine in Novel Technology and Devices Vol. 26 ( Article 100359 ) page: 1 - 7 2025.3
-
E2EPref: an end-to-end preference-based framework for speech quality assessment to alleviate bias in direct assessment scores Reviewed Open Access
C.-H. Hu, Y. Yasuda, T. Toda
Computer Speech and Language Vol. 93 ( Article 101799 ) page: 1 - 17 2025.3
-
Serial-OE: Anomalous sound detection based on serial method with outlier exposure capable of using small amounts of anomalous data for training Reviewed
I. Kuroyanagi, T. Hayashi, K. Takeda, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e1 ) page: 1 - 32 2025.1
-
Nonparallel spoken-text-style transfer for linguistic expression control in speech generation Reviewed
D. Yoshioka, Y. Yasuda, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 333 - 346 2025.1
-
Sequence-wise speech waveform modeling via gradient descent optimization of quasi-harmonic parameters Reviewed
S. Chen, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 319 - 332 2025.1
-
Target speaker extraction under noisy underdetermined conditions using conditional variational autoencoder, global style token, and neural postfilter Reviewed
R. Wang, T. Fujimura, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e2 ) page: 1 - 26 2025.1
-
SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge Reviewed International coauthorship
Y. Zhang, Y. Zang, J. Shi, R. Yamamoto, T. Toda, Z. Duan
Proc. IEEE SLT page: 792 - 797 2024.12