Papers - TODA Tomoki
-
Preference-based training framework for automatic speech quality assessment using deep neural network Reviewed
C.-H. Hu, Y. Yasuda, T. Toda
Proc. INTERSPEECH page: 546 - 550 2023.8
-
Analysis of mean opinion scores in subjective evaluation of synthetic speech based on tail probabilities Reviewed
Y. Yasuda, T. Toda
Proc. INTERSPEECH page: 5491 - 5495 2023.8
-
Reverberation-controllable voice conversion using reverberation time estimator Reviewed
Y. Choi, C. Xie, T. Toda
Proc. INTERSPEECH page: 2103 - 2107 2023.8
-
E2E-S2S-VC: end-to-end sequence-to-sequence voice conversion Reviewed
T. Okamoto, H. Yamashita, T. Toda, H. Kawai
Proc. INTERSPEECH page: 2043 - 2047 2023.8
-
Emotion awareness in multi-utterance turn for improving emotion prediction in multi-speaker conversation Reviewed International coauthorship
X. Shi, X. Li, T. Toda
Proc. INTERSPEECH page: 765 - 769 2023.8
-
Representation of vocal tract length transformation based on group theory Reviewed
A. Miyashita, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Analysis of noisy-target training for DNN-based speech enhancement Reviewed
T. Fujimura, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Intermediate fine-tuning using imperfect synthetic speech for improving electrolaryngeal speech recognition Reviewed
L.P. Violeta, D. Ma, W.-C. Huang, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Source-Filter HiFiGAN: fast and pitch controllable high-fidelity neural vocoder Reviewed International coauthorship
R. Yoneyama, Y.-C. Wu, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
NNSVS: a neural network based singing voice synthesis toolkit Reviewed
R. Yamamoto, R. Yoneyama, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Low-latency electrolaryngeal speech enhancement based on FastSpeech2-based voice conversion and self-supervised speech representation Reviewed
K. Kobayashi, T. Hayashi, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder Reviewed
Y. Yasuda, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Harmonic-Net: fundamental frequency and speech rate controllable fast neural vocoder Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, H. Kawai
IEEE/ACM Transactions on Audio, Speech, and Language Processing Vol. 31 page: 1902 - 1915 2023.5
-
Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion Reviewed
D. Ma, L.P. Violeta, K. Kobayashi, T. Toda
Proc. IEEE SLT page: 949 - 954 2023.1
-
Music similarity calculation of individual instrumental sounds using metric learning Reviewed
Y. Hashizume, L. Li, T. Toda
Proc. APSIPA ASC page: 33 - 38 2022.11
-
Direction-aware target speaker extraction with a dual-channel system based on conditional variational autoencoders under underdetermined conditions Reviewed
R. Wang, L. Li, T. Toda
Proc. APSIPA ASC page: 347 - 353 2022.11
-
Interpretable control for emotional text-to-speech system toward development of sympathetic educational-support robots Reviewed
J. Feng, T. Yoshikawa, T. Toda
Proc. APSIPA ASC page: 342 - 346 2022.11
-
Sequence-wise optimization for quasi-harmonic speech waveform modeling Reviewed
S. Chen, T. Toda
Proc. APSIPA ASC page: 1658 - 1663 2022.11
-
Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language Reviewed
Y. Yasuda, T. Toda
IEEE Journal of Selected Topics in Signal Processing Vol. 16 ( 6 ) page: 1319 - 1328 2022.10
-
A comparative study of self-supervised speech representation based voice conversion Reviewed International coauthorship
W.-C. Huang, S.-W. Yang, T. Hayashi, T. Toda
IEEE Journal of Selected Topics in Signal Processing Vol. 16 ( 6 ) page: 1308 - 1318 2022.10