論文 - 戸田 智基
-
Predicting fundamental frequency patterns in electrolaryngeal speech using automated phoneme extraction 査読有り Open Access
M. Eshghi, T. Toda
IEEE Access 13 巻 頁: 73831 - 73847 2025年4月
-
Generalized sound field interpolation for freely spaced microphone arrays in rotation-robust beamforming 査読有り Open Access
S. Luan, Y. Wakabayashi, T. Toda
Applied Acoustics 236 巻 ( Article 110706 ) 頁: 1 - 15 2025年4月
-
Mora-level prosody prediction for text-to-speech using Japanese BERT without accentual labels 査読有り
T. Ogura, T. Okamoto, Y. Ohtani, E. Cooper, T. Toda, H. Kawai
Proc. IEEE ICASSP 頁: 1 - 5 2025年4月
-
Investigating factors related to the naturalness of synthesized unison singing 査読有り
K. Nishizawa, R. Yamamoto, W.-C. Huang, T. Toda
Proc. IEEE ICASSP 頁: 1 - 5 2025年4月
-
Improvements of discriminative feature space training for anomalous sound detection in unlabeled conditions 査読有り
T. Fujimura, I. Kuroyanagi, T. Toda
Proc. IEEE ICASSP 頁: 1 - 5 2025年4月
-
Investigation of perceptual music similarity focusing on each instrumental part 査読有り
Y. Hashizume, T. Toda
Proc. IEEE ICASSP 頁: 1 - 5 2025年4月
-
Mandarin speech reconstruction from surface electromyography based on generative adversarial networks 査読有り 国際共著
F. Li, F. Shen, D. Ma, J. Zhou, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
Medicine in Novel Technology and Devices 26 巻 ( Article 100359 ) 頁: 1 - 7 2025年3月
-
E2EPref: an end-to-end preference-based framework for speech quality assessment to alleviate bias in direct assessment scores 査読有り Open Access
C.-H. Hu, Y. Yasuda, T. Toda
Computer Speech and Language 93 巻 ( Article 101799 ) 頁: 1 - 17 2025年3月
-
Serial-OE: Anomalous sound detection based on serial method with outlier exposure capable of using small amounts of anomalous data for training 査読有り
I. Kuroyanagi, T. Hayashi, K. Takeda, T. Toda
APSIPA Transactions on Signal and Information Processing 14 巻 ( 1, e1 ) 頁: 1 - 32 2025年1月
-
Nonparallel spoken-text-style transfer for linguistic expression control in speech generation 査読有り
D. Yoshioka, Y. Yasuda, T. Toda
IEEE Transactions on Audio, Speech and Language Processing 33 巻 頁: 333 - 346 2025年1月
-
Sequence-wise speech waveform modeling via gradient descent optimization of quasi-harmonic parameters 査読有り
S. Chen, T. Toda
IEEE Transactions on Audio, Speech and Language Processing 33 巻 頁: 319 - 332 2025年1月
-
Target speaker extraction under noisy underdetermined conditions using conditional variational autoencoder, global style token, and neural postfilter 査読有り
R. Wang, T. Fujimura, T. Toda
APSIPA Transactions on Signal and Information Processing 14 巻 ( 1, e2 ) 頁: 1 - 26 2025年1月
-
SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge 査読有り 国際共著
Y. Zhang, Y. Zang, J. Shi, R. Yamamoto, T. Toda, Z. Duan
Proc. IEEE SLT 頁: 792 - 797 2024年12月
-
End-to-end Mandarin speech reconstruction based on ultrasound tongue images using deep learning 査読有り 国際共著
F. Li, F. Shen, D. Ma, J. Zhou, S. Zhang, L. Wang, F. Fan, T. Liu, X. Chen, T. Toda, H. Niu
IEEE Transactions on Neural Systems and Rehabilitation Engineering 33 巻 頁: 140 - 149 2024年12月
-
Two-stage framework for robust speech emotion recognition using target speaker extraction in human speech noise conditions 査読有り
J. Mi, X. Shi, D. Ma, J. He, T. Fujimura, T. Toda
Proc. APSIPA ASC 頁: 6 pages 2024年12月
-
Improved architecture for high-resolution piano transcription to efficiently capture acoustic characteristics of music signals 査読有り
J. Mi, S. Kim, T. Toda
Proc. APSIPA ASC 頁: 6 pages 2024年12月
-
Multi-modal video summarization based on two-stage fusion of audio, visual, and recognized text information 査読有り
Z. Yang, J. He, T. Toda
Proc. APSIPA ASC 頁: 6 pages 2024年12月
-
Multi-task learning approaches for music similarity representation learning based on individual instrument sounds 査読有り
T. Imamura, Y. Hashizume, T. Toda
Proc. APSIPA ASC 頁: 6 pages 2024年12月
-
A study on multimodal fusion and layer adapter in emotion recognition 査読有り 国際共著
X. Shi, Y. Gao, J. He, J. Mi, X. Li, T. Toda
Proc. APSIPA ASC 頁: 6 pages 2024年12月
-
Reference-free automatic speech severity evaluation using acoustic unit language modelling 査読有り
B. Halpern, T. Toda
Proc. SpandLDeteriorate Workshop of ACM Multimedia Asia (Workshop on Multi-Biological Sensing Data for Speech and Language Deterioration Prediction) 頁: 5 pages 2024年12月