Papers - TODA Tomoki
-
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder Reviewed
Y. Yasuda, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Representation of vocal tract length transformation based on group theory Reviewed
A. Miyashita, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Harmonic-Net: fundamental frequency and speech rate controllable fast neural vocoder Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, H. Kawai
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 31 page: 1902 - 1915 2023.5
-
Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion Reviewed
D. Ma, L.P. Violeta, K. Kobayashi, T. Toda
Proc. IEEE SLT page: 949 - 954 2023.1
-
Music similarity calculation of individual instrumental sounds using metric learning Reviewed
Y. Hashizume, L. Li, T. Toda
Proc. APSIPA ASC page: 33 - 38 2022.11
-
Sequence-wise optimization for quasi-harmonic speech waveform modeling Reviewed
S. Chen, T. Toda
Proc. APSIPA ASC page: 1658 - 1663 2022.11
-
Direction-aware target speaker extraction with a dual-channel system based on conditional variational autoencoders under underdetermined conditions Reviewed
R. Wang, L. Li, T. Toda
Proc. APSIPA ASC page: 347 - 353 2022.11
-
Interpretable control for emotional text-to-speech system toward development of sympathetic educational-support robots Reviewed
J. Feng, T. Yoshikawa, T. Toda
Proc. APSIPA ASC page: 342 - 346 2022.11
-
A comparative study of self-supervised speech representation based voice conversion Reviewed International coauthorship
W.-C. Huang, S.-W. Yang, T. Hayashi, T. Toda
IEEE Journal of Selected Topics in Signal Processing Vol. 16 ( 6 ) page: 1308 - 1318 2022.10
-
Investigation of Japanese Png BERT language model in text-to-speech synthesis for pitch accent language Reviewed
Y. Yasuda, T. Toda
IEEE Journal of Selected Topics in Signal Processing Vol. 16 ( 6 ) page: 1319 - 1328 2022.10
-
A cyclical approach to synthetic and natural speech mismatch refinement of neural post-filter for low-cost text-to-speech system Reviewed
Y.-C. Wu, P.L. Tobing, K. Yasuhara, N. Matsunaga, Y. Ohtani, T. Toda
APSIPA Transactions on Signal and Information Processing Vol. 11 ( e30 ) page: 1 - 32 2022.9
-
Investigating self-supervised pretraining frameworks for pathological speech recognition Reviewed
L.P. Violeta, W.-C. Huang, T. Toda
Proc. INTERSPEECH page: 41 - 45 2022.9
-
Unified source-filter GAN with harmonic-plus-noise source excitation generation Reviewed
R. Yoneyama, Y.-C. Wu, T. Toda
Proc. INTERSPEECH page: 848 - 852 2022.9
-
The VoiceMOS Challenge 2022 Reviewed International coauthorship
W.-C. Huang, E. Cooper, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi
Proc. INTERSPEECH page: 4536 - 4540 2022.9
-
Spoken-text-style transfer with conditional variational autoencoder and content word storage Reviewed
D. Yoshioka, Y. Yaduda, N. Matsunaga, Y. Ohtani, T. Toda
Proc. INTERSPEECH page: 4576 - 4580 2022.9
-
An evaluation of three-stage voice conversion framework for noisy and reverberant conditions Reviewed
Y. Choi, C. Xie, T. Toda
Proc. INTERSPEECH page: 4910 - 4914 2022.9
-
Improvement of anomalous sound detection method considering the distribution of embedding Invited Reviewed
I. Kuroyanagi, T. Hayashi, K. Takeda, T. Toda
Proc. ICA page: 5 pages 2022.9
-
Noisy-to-noisy voice conversion with pre-training strategy Invited Reviewed
C. Xie, T. Toda
Proc. ICA page: 5 pages 2022.9
-
Modified sound field interpolation method for rotation-robust beamforming with unequally spaced circular microphone array Reviewed
S. Luan, Y. Wakabayashi, T. Toda
Proc. EUSIPCO page: 344 - 348 2022.8
-
Note-level automatic guitar transcription using attention mechanism Reviewed
S. Kim, T. Hayashi, T. Toda
Proc. EUSIPCO page: 229 - 233 2022.8