Papers - TODA Tomoki
-
Investigating factors related to the naturalness of synthesized unison singing Reviewed
K. Nishizawa, R. Yamamoto, W.-C. Huang, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Improvements of discriminative feature space training for anomalous sound detection in unlabeled conditions Reviewed
T. Fujimura, I. Kuroyanagi, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Investigation of perceptual music similarity focusing on each instrumental part Reviewed
Y. Hashizume, T. Toda
Proc. IEEE ICASSP page: 1 - 5 2025.4
-
Mandarin speech reconstruction from surface electromyography based on generative adversarial networks Reviewed International coauthorship Open Access
F. Li, F. Shen, D. Ma, J. Zhou, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
Medicine in Novel Technology and Devices Vol. 26 ( Article 100359 ) page: 1 - 7 2025.3
-
E2EPref: an end-to-end preference-based framework for speech quality assessment to alleviate bias in direct assessment scores Reviewed Open Access
C.-H. Hu, Y. Yasuda, T. Toda
Computer Speech and Language Vol. 93 ( Article 101799 ) page: 1 - 17 2025.3
-
Nonparallel spoken-text-style transfer for linguistic expression control in speech generation Reviewed Open Access
D. Yoshioka, Y. Yasuda, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 333 - 346 2025.1
-
Sequence-wise speech waveform modeling via gradient descent optimization of quasi-harmonic parameters Reviewed
S. Chen, T. Toda
IEEE Transactions on Audio, Speech and Language Processing Vol. 33 page: 319 - 332 2025.1
-
Target speaker extraction under noisy underdetermined conditions using conditional variational autoencoder, global style token, and neural postfilter Reviewed Open Access
R. Wang, T. Fujimura, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e2 ) page: 1 - 26 2025.1
-
Serial-OE: Anomalous sound detection based on serial method with outlier exposure capable of using small amounts of anomalous data for training Reviewed Open Access
I. Kuroyanagi, T. Hayashi, K. Takeda, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 14 ( 1, e1 ) page: 1 - 32 2025.1
-
SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge Reviewed International coauthorship
Y. Zhang, Y. Zang, J. Shi, R. Yamamoto, T. Toda, Z. Duan
Proc. IEEE SLT page: 792 - 797 2024.12
-
End-to-end Mandarin speech reconstruction based on ultrasound tongue images using deep learning Reviewed International coauthorship Open Access
F. Li, F. Shen, D. Ma, J. Zhou, S. Zhang, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
IEEE Transactions on Neural Systems and Rehabilitation Engineering Vol. 33 page: 140 - 149 2024.12
-
Two-stage framework for robust speech emotion recognition using target speaker extraction in human speech noise conditions Reviewed
J. Mi, X. Shi, D. Ma, J. He, T. Fujimura, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Improved architecture for high-resolution piano transcription to efficiently capture acoustic characteristics of music signals Reviewed
J. Mi, S. Kim, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Multi-modal video summarization based on two-stage fusion of audio, visual, and recognized text information Reviewed
Z. Yang, J. He, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Multi-task learning approaches for music similarity representation learning based on individual instrument sounds Reviewed
T. Imamura, Y. Hashizume, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
A study on multimodal fusion and layer adapter in emotion recognition Reviewed International coauthorship
X. Shi, Y. Gao, J. He, J. Mi, X. Li, T. Toda
Proc. APSIPA ASC page: 6 pages 2024.12
-
Reference-free automatic speech severity evaluation using acoustic unit language modelling Reviewed
B. Halpern, T. Toda
Proc. SpandLDeteriorate Workshop of ACM Multimedia Asia (Workshop on Multi-Biological Sensing Data for Speech and Language Deterioration Prediction) page: 5 pages 2024.12
-
The VoiceMOS Challenge 2024: beyond speech quality prediction Reviewed International coauthorship
W.-C. Huang, S.-W. Fu, E. Cooper, R. Zezario, T. Toda, H.-M. Wang, J. Yamagishi, Y. Tsao
Proc. IEEE SLT page: 813 - 820 2024.12
-
Multi-speaker text-to-speech training with speaker anonymized data Reviewed International coauthorship
W.-C. Huang, Y.-C. Wu, T. Toda
IEEE Signal Processing Letters Vol. 31 page: 2995 - 2999 2024.10
-
Challenge of singing voice synthesis using only text-to-speech corpus with FIRNet source-filter neural vocoder Reviewed
T. Okamoto, Y. Ohtani, S. Shimizu, T. Toda, H. Kawai
Proc. INTERSPEECH page: 1870 - 1874 2024.9