Papers - TODA Tomoki
-
Time alignment using lip images for frame-based electrolaryngeal voice conversion Reviewed International coauthorship
Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, H.-M. Wang
Proc. APSIPA ASC page: 1234 - 1238 2021.12
-
Multi-stream HiFi-GAN with data-driven waveform decomposition Reviewed
T. Okamoto, T. Toda, H. Kawai
Proc. IEEE ASRU page: 610 - 617 2021.12
-
On prosody modeling for ASR+TTS based voice conversion Reviewed International coauthorship
W.-C. Huang, T. Hayashi, X. Li, S. Watanabe, T. Toda
Proc. IEEE ASRU page: 642 - 649 2021.12
-
Mandarin electrolaryngeal speech voice conversion with sequence-to-sequence modeling Reviewed International coauthorship
M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tsai, Y. Tsao, T. Toda, J.-S. R. Jang, H.-M. Wang
Proc. IEEE ASRU page: 650 - 657 2021.12
-
HASA-Net: a non-intrusive hearing-aid speech assessment network Reviewed International coauthorship
H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, Y. Tsao
Proc. IEEE ASRU page: 907 - 913 2021.12
-
Mandarin electro-laryngeal speech enhancement based on statistical voice conversion and manual tone control Reviewed International coauthorship
Z. Qian, H. Niu, L. Wang, K. Kobayashi, S. Zhang, T. Toda
Proc. APSIPA ASC page: 546 - 552 2021.12
-
Noisy-to-noisy voice conversion framework with denoising model Reviewed
C. Xie, Y.-C. Wu, P.L. Tobing, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 814 - 820 2021.12
-
Investigation of text-to-speech-based synthetic parallel data for sequence-to-sequence non-parallel voice conversion Reviewed
D. Ma, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 870 - 877 2021.12
-
An ensemble approach to anomalous sound detection based on conformer-based autoencoder and binary classifier incorporated with metric learning Reviewed
I. Kuroyanagi, T. Hayashi, Y. Adachi, T. Yoshimura, K. Takeda, T. Toda
Proc. DCASE 2021 Workshop page: 110 - 114 2021.11
-
Singing fundamental frequency contour generation using generalized command response model and score-conditional variational autoencoder Reviewed
S. Seki, H. Taga, T. Toda
Proc. IEEE MLSP page: 1 - 6 2021.10
-
Anomalous sound detection using a binary classification model and class centroids Reviewed
I. Kuroyanagi, T. Hayashi, K. Takeda, T. Toda
Proc. EUSIPCO page: 1995 - 1999 2021.8
-
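Operation of learning support services and their extension to online classes centered on on-demand delivery: a case study at Nagoya University
T. Toda, S. Ohira, A. Goto, D. Deguchi, K. Mori
The Journal of the Institute of Electronics, Information and Communication Engineers (電子情報通信学会誌) Vol. 104 ( 8 ) page: 862 - 866 2021.8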
-
Relational data selection for data augmentation of speaker-dependent multi-band MelGAN vocoder Reviewed International coauthorship
Y.-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda
Proc. INTERSPEECH page: 3630 - 3634 2021.8
-
High-fidelity and low-latency universal neural vocoder based on multiband WaveRNN with data-driven linear prediction for discrete waveform modeling Reviewed
P.L. Tobing, T. Toda
Proc. INTERSPEECH page: 2217 - 2221 2021.8
-
Unified source-filter GAN: unified source-filter network based on factorization of quasi-periodic parallel WaveGAN Reviewed
R. Yoneyama, Y.-C. Wu, T. Toda
Proc. INTERSPEECH page: 2187 - 2191 2021.8
-
A preliminary study of a two-stage paradigm for preserving speaker identity in dysarthric voice conversion Reviewed International coauthorship
W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, T. Toda
Proc. INTERSPEECH page: 1329 - 1333 2021.8
-
Low-latency real-time non-parallel voice conversion based on cyclic variational autoencoder and multiband WaveRNN with data-driven linear prediction Reviewed
P.L. Tobing, T. Toda
Proc. 11th ISCA Speech Synthesis Workshop (SSW11) page: 142 - 147 2021.8
-
Full-band LPCNet: a real-time neural vocoder for 48 kHz audio with a CPU Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga, H. Kawai
IEEE Access Vol. 9 page: 94923 - 94933 2021.7
-
Crank: an open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder Reviewed
K. Kobayashi, W.-C. Huang, Y.-C. Wu, P.L. Tobing, T. Hayashi, T. Toda
Proc. IEEE ICASSP page: 5934 - 5938 2021.6