Papers - TODA Tomoki
-
Comparison of real-time multi-speaker neural vocoders on CPUs Reviewed
K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, H. Kawai
Acoustical Science and Technology, Acoustical Letter Vol. 43 ( 2 ) page: 121 - 124 2022.3
-
Neural speech-rate conversion with multispeaker WaveNet vocoder Reviewed
T. Okamoto, K. Matsubara, T. Toda, Y. Shiga, H. Kawai
Speech Communication Vol. 138 page: 1 - 12 2022.3
-
S3PRL-VC: open-source voice conversion framework with self-supervised speech representations Reviewed International coauthorship
W.-C. Huang, S.-W. Yang, T. Hayashi, H.-Y. Lee, S. Watanabe, T. Toda
Proc. AAAI-22 Workshop, W35: Self-Supervised Learning for Audio and Speech Processing page: 5 pages 2022.2
-
Time alignment using lip images for frame-based electrolaryngeal voice conversion Reviewed International coauthorship
Y.-S. Liou, W.-C. Huang, M.-C. Yen, S.-W. Tsai, Y.-H. Peng, T. Toda, Y. Tsao, H.-M. Wang
Proc. APSIPA ASC page: 1234 - 1238 2021.12
-
Multi-stream HiFi-GAN with data-driven waveform decomposition Reviewed
T. Okamoto, T. Toda, H. Kawai
Proc. IEEE ASRU page: 610 - 617 2021.12
-
On prosody modeling for ASR+TTS based voice conversion Reviewed International coauthorship
W.-C. Huang, T. Hayashi, X. Li, S. Watanabe, T. Toda
Proc. IEEE ASRU page: 642 - 649 2021.12
-
Mandarin electrolaryngeal speech voice conversion with sequence-to-sequence modeling Reviewed International coauthorship
M.-C. Yen, W.-C. Huang, K. Kobayashi, Y.-H. Peng, S.-W. Tasi, Y. Tsao, T. Toda, J.-S. R. Jang, H.-M. Wang
Proc. IEEE ASRU page: 650 - 657 2021.12
-
HASA-Net: a non-intrusive hearing-aid speech assessment network Reviewed International coauthorship
H.-T. Chiang, Y.-C. Wu, C. Yu, T. Toda, H.-M. Wang, Y.-C. Hu, Y. Tsao
Proc. IEEE ASRU page: 907 - 913 2021.12
-
Mandarin electro-laryngeal speech enhancement based on statistical voice conversion and manual tone control Reviewed International coauthorship
Z. Qian, H. Niu, L. Wang, K. Kobayashi, S. Zhang, T. Toda
Proc. APSIPA ASC page: 546 - 552 2021.12
-
Noisy-to-noisy voice conversion framework with denoising model Reviewed
C. Xie, Y.-C. Wu, P.L. Tobing, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 814 - 820 2021.12
-
Investigation of text-to-speech-based synthetic parallel data for sequence-to-sequence non-parallel voice conversion Reviewed
D. Ma, W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 870 - 877 2021.12
-
An ensemble approach to anomalous sound detection based on conformer-based autoencoder and binary classifier incorporated with metric learning Reviewed
I. Kuroyanagi, T. Hayashi, Y. Adachi, T. Yoshimura, K. Takeda, T. Toda
Proc. DCASE 2021 Workshop page: 110 - 114 2021.11
-
Singing fundamental frequency contour generation using generalized command response model and score-conditional variational autoencoder Reviewed
S. Seki, H. Taga, T. Toda
Proc. IEEE MLSP page: 1 - 6 2021.10
-
Singing fundamental frequency contour generation using generalized command response model and score-conditional variational autoencoder Reviewed
S. Seki, H. Taga, T. Toda
Proc. IEEE MLSP page: 6 pages 2021.10
-
Anomalous sound detection using a binary classification model and class centroids Reviewed
I. Kuroyanagi, T. Hayashi, K. Takeda, T. Toda
Proc. EUSIPCO page: 1995 - 1999 2021.8
-
学習支援サービスの運用とオンデマンド型を中心としたオンライン授業への展開――名古屋大学における事例――
戸田 智基, 大平 茂輝, 後藤 明史, 出口 大輔, 森 健策
電子情報通信学会誌 Vol. 104 ( 8 ) page: 862 - 866 2021.8
-
Relational data selection for data augmentation of speaker-dependent multi-band MelGAN vocoder Reviewed International coauthorship
Y.-C. Wu, C.-H. Hu, H.-S. Lee, Y.-H. Peng, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda
Proc. INTERSPEECH page: 3630 - 3634 2021.8
-
High-fidelity and low-latency universal neural vocoder based on multiband WaveRNN with data-driven linear prediction for discrete waveform modeling Reviewed
P.L. Tobing, T. Toda
Proc. INTERSPEECH page: 2217 - 2221 2021.8
-
Unified source-filter GAN: unified source-filter network based on factorization of quasi-periodic parallel WaveGAN Reviewed
R. Yoneyama, Y.-C. Wu, T. Toda
Proc. INTERSPEECH page: 2187 - 2191 2021.8
-
A preliminary study of a two-stage paradigm for preserving speaker identity in dysarthric voice conversion Reviewed International coauthorship
W.-C. Huang, K. Kobayashi, Y.-H. Peng, C.-F. Liu, Y. Tsao, H.-M. Wang, T. Toda
Proc. INTERSPEECH page: 1329 - 1333 2021.8