Papers - TODA Tomoki
-
WaveNeXt: ConvNeXt-based fast neural vocoder without iSTFT layer Reviewed
T. Okamoto, H. Yamashita, Y. Ohtani, T. Toda, H. Kawai
Proc. IEEE ASRU page: 8 pages 2023.12
-
Sequence-to-sequence network training methods for automatic guitar transcription with tokenized outputs Reviewed
S. Kim, K. Takeda, T. Toda
Proc. ISMIR page: 524 - 531 2023.11
-
Evaluating methods for ground-truth-free foreign accent conversion Reviewed
W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 1136 - 1141 2023.11
-
An analysis of personalized speech recognition system development for the deaf and hard-of-hearing Reviewed
L.P. Violeta, T. Toda
Proc. APSIPA ASC page: 1851 - 1856 2023.11
-
Semi-supervised multimodal emotion recognition with consensus decision-making and label correction Reviewed International coauthorship
J. Tian, D. Hu, X. Shi, J. He, X. Li, Y. Gao, T. Toda, X. Xu, X. Hu
Proc. MRAC page: 67 - 73 2023.10
-
Differentiable representation of warping based on Lie group theory Reviewed
A. Miyashita, T. Toda
Proc. IEEE WASPAA page: 5 pages 2023.10
-
Directional target speaker extraction under noisy underdetermined conditions through conditional variational autoencoder with global style tokens Reviewed
R. Wang, T. Toda
Proc. IEEE WASPAA page: 5 pages 2023.10
-
Sound field interpolation with unsupervised calibration for freely spaced circular microphone array in rotation-robust beamforming Reviewed
S. Luan, Y. Wakabayashi, T. Toda
Proc.EUSIPCO page: 21 - 25 2023.9
-
High-fidelity and pitch-controllable neural vocoder based on unified source-filter networks Reviewed
R. Yoneyama, Y.-C. Wu, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 31 page: 3717 - 3729 2023.9
-
Noisy-to-noisy voice conversion under variations of noisy condition Reviewed
C. Xie, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 31 page: 3871 - 3882 2023.9
-
Preference-based training framework for automatic speech quality assessment using deep neural network Reviewed
C.-H. Hu, Y. Yasuda, T. Toda
Proc. INTERSPEECH page: 546 - 550 2023.8
-
Analysis of mean opinion scores in subjective evaluation of synthetic speech based on tail probabilities Reviewed
Y. Yasuda, T. Toda
Proc. INTERSPEECH page: 5491 - 5495 2023.8
-
Reverberation-controllable voice conversion using reverberation time estimator Reviewed
Y. Choi, C. Xie, T. Toda
Proc. INTERSPEECH page: 2103 - 2107 2023.8
-
E2E-S2S-VC: end-to-end sequence-to-sequence voice conversion Reviewed
T. Okamoto, H. Yamashita, T. Toda, H. Kawai
Proc. INTERSPEECH page: 2043 - 2047 2023.8
-
Emotion awareness in multi-utterance turn for improving emotion prediction in multi-speaker conversation Reviewed International coauthorship
X. Shi, X. Li, T. Toda
Proc. INTERSPEECH page: 765 - 769 2023.8
-
Analysis of Noisy-target Training for DNN-based speech enhancement Reviewed
T. Fujimura, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Intermediate fine-tuning using imperfect synthetic speech for improving electrolaryngeal speech recognition Reviewed
L.P. Violeta, D. Ma, W.-C. Huang, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Source-Filter HiFiGAN: fast and pitch controllable high-fidelity neural vocoder Reviewed International coauthorship
R. Yoneyama, Y.-C. Wu, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
NNSVS: a neural network based singing voice synthesis toolkit Reviewed
R. Yamamoto, R. Yoneyama, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6
-
Low-latency electrolaryngeal speech enhancement based on FastSpeech2-based voice conversion and self-supervised speech representation Reviewed
K. Kobayashi, T. Hayashi, T. Toda
Proc. IEEE ICASSP page: 5 pages 2023.6