Papers - TODA Tomoki
-
Predicting fundamental frequency patterns in electrolaryngeal speech using automated phoneme extraction Reviewed
M. Eshghi, T. Toda
IEEE Access Vol. 13 page: 73831 - 73847 2025.4
-
Generalized sound field interpolation for freely spaced microphone arrays in rotation-robust beamforming Reviewed Open Access
S. Luan, Y. Wakabayashi, T. Toda
Applied Acoustics Vol. 236 ( Article 110706 ) page: 1 - 15 2025.4
-
Mora-level prosody prediction for text-to-speech using Japanese BERT without accentual labels Reviewed
T. Ogura, T. Okamoto, Y. Ohtani, E. Cooper, T. Toda, H. Kawai
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Investigating factors related to the naturalness of synthesized unison singing Reviewed
K. Nishizawa, R. Yamamoto, W.-C. Huang, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Improvements of discriminative feature space training for anomalous sound detection in unlabeled conditions Reviewed
T. Fujimura, I. Kuroyanagi, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Investigation of perceptual music similarity focusing on each instrumental part Reviewed
Y. Hashizume, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Mandarin speech reconstruction from surface electromyography based on generative adversarial networks Reviewed International coauthorship
F. Li, F. Shen, D. Ma, J. Zhou, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
Medicine in Novel Technology and Devices Vol. 26 ( Article 100359 ) page: 1 - 7 2025.3
-
E2EPref: an end-to-end preference-based framework for speech quality assessment to alleviate bias in direct assessment scores Reviewed Open Access
C.-H. Hu, Y. Yasuda, T. Toda
Computer Speech and Language Vol. 93 ( Article 101799 ) page: 1 - 17 2025.3
-
Serial-OE: Anomalous sound detection based on serial method with outlier exposure capable of using small amounts of anomalous data for training Reviewed
I. Kuroyanagi, T. Hayashi, K. Takeda, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e1 ) page: 1 - 32 2025.1
-
Nonparallel spoken-text-style transfer for linguistic expression control in speech generation Reviewed
D. Yoshioka, Y. Yasuda, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 333 - 346 2025.1
-
Sequence-wise speech waveform modeling via gradient descent optimization of quasi-harmonic parameters Reviewed
S. Chen, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 319 - 332 2025.1
-
Target speaker extraction under noisy underdetermined conditions using conditional variational autoencoder, global style token, and neural postfilter Reviewed
R. Wang, T. Fujimura, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e2 ) page: 1 - 26 2025.1
-
SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge Reviewed International coauthorship
Y. Zhang, Y. Zang, J. Shi, R. Yamamoto, T. Toda, Z. Duan
Proc. IEEE SLT page: 792 - 797 2024.12
-
Multi-modal video summarization based on two-stage fusion of audio, visual, and recognized text information Reviewed
Z. Yang, J. He, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Multi-task learning approaches for music similarity representation learning based on individual instrument sounds Reviewed
T. Imamura, Y. Hashizume, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
A study on multimodal fusion and layer adapter in emotion recognition Reviewed International coauthorship
X. Shi, Y. Gao, J. He, J. Mi, X. Li, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Reference-free automatic speech severity evaluation using acoustic unit language modelling Reviewed
B. Halpern, T. Toda
Proc. SpandLDeteriorate Workshop of ACM Multimedia Asia (Workshop on Multi-Biological Sensing Data for Speech and Language Deterioration Prediction) page: 5 pages 2024.12
-
The VoiceMOS Challenge 2024: beyond speech quality prediction Reviewed International coauthorship
W.-C. Huang, S.-W. Fu, E. Cooper, R. Zezario, T. Toda, H.-M. Wang, J. Yamagishi, Y. Tsao
Proc. IEEE SLT page: 813 - 820 2024.12
-
Improved architecture for high-resolution piano transcription to efficiently capture acoustic characteristics of music signals Reviewed
J. Mi, S. Kim, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
End-to-end Mandarin speech reconstruction based on ultrasound tongue images using deep learning Reviewed International coauthorship
F. Li, F. Shen, D. Ma, J. Zhou, S. Zhang, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
IEEE Transactions on Neural Systems and Rehabilitation Engineering Vol. 33 page: 140 - 149 2024.12