Papers - TODA Tomoki
-
FIRNET: fundamental frequency controllable fast neural vocoder with trainable finite impulse response filter Reviewed
Y. Ohtani, T. Okamoto, T. Toda, H. Kawai
Proc. IEEE ICASSP page: 10871 - 10875 2024.4
-
An investigation of fundamental frequency pattern prediction for Japanese eelectrolaryngeal speech enhancement based on frame-wise phoneme representations Reviewed Open Access
M. Eshghi, T. Toda
IEEE Access Vol. 12 page: 50137 - 50153 2024.4
-
Dual-channel target speaker extraction based on conditional variational autoencoder and directional information Reviewed Open Access
R. Wang, L. Li, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 32 page: 12 pages 2024.3
-
Fast neural speech waveform generative models with fully-connected layer-based upsampling Reviewed Open Access
H. Yamashita, T. Okamoto, R. Takashima, Y. Ohtani, T. Takiguchi, T. Toda, H. Kawai
IEEE Access Vol. 12 page: 31409 - 31421 2024.2
-
喉頭摘出者における音声収録アプリを用いた術前音声の保存 ―Save the Voice プロジェクト― Reviewed Open Access
西尾 直樹, 戸田 智基, 小林 和弘, 三谷 壮平, 飴矢 美里, 向山 宣昭, 木村 宏之, 徳倉 達也, 坪井 崇, 藤本 保志, 曾根 三千彦
喉頭 Vol. 35 ( 2 ) page: 142 - 147 2023.12
-
The Singing Voice Conversion Challenge 2023 Reviewed International coauthorship
W.-C. Huang, L.P. Violeta, S. Liu, J. Shi, T. Toda
Proc. IEEE ASRU page: 8 pages 2023.12
-
ED-CEC: improving rare word recognition using ASR post-processing based on error detection and context-aware error correction Reviewed
J. He, Z. Yang, T. Toda
Proc. IEEE ASRU page: 6 pages 2023.12
-
Improving severity preservation of healthy-to-pathological voice conversion with global style tokens Reviewed International coauthorship
B. Halpern, W.-C. Huang, L.P. Violeta, R. van Son, T. Toda
Proc. IEEE ASRU page: 7 pages 2023.12
-
A comparative study of voice conversion models with large-scale speech and singing data: the T13 systems for the Singing Voice Conversion Challenge 2023 Reviewed
R. Yamamoto, R. Yoneyama, L.P. Violeta, W.-C. Huang, T. Toda
Proc. IEEE ASRU page: 6 pages 2023.12
-
The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains Reviewed International coauthorship
E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi
Proc. IEEE ASRU page: 7 pages 2023.12
-
WaveNeXt: ConvNeXt-based fast neural vocoder without iSTFT layer Reviewed
T. Okamoto, H. Yamashita, Y. Ohtani, T. Toda, H. Kawai
Proc. IEEE ASRU page: 8 pages 2023.12
-
Sequence-to-sequence network training methods for automatic guitar transcription with tokenized outputs Reviewed
S. Kim, K. Takeda, T. Toda
Proc. ISMIR page: 524 - 531 2023.11
-
An analysis of personalized speech recognition system development for the deaf and hard-of-hearing Reviewed
L.P. Violeta, T. Toda
Proc. APSIPA ASC page: 1851 - 1856 2023.11
-
Evaluating methods for ground-truth-free foreign accent conversion Reviewed
W.-C. Huang, T. Toda
Proc. APSIPA ASC page: 1136 - 1141 2023.11
-
Semi-supervised multimodal emotion recognition with consensus decision-making and label correction Reviewed International coauthorship
J. Tian, D. Hu, X. Shi, J. He, X. Li, Y. Gao, T. Toda, X. Xu, X. Hu
Proc. MRAC page: 67 - 73 2023.10
-
Differentiable representation of warping based on Lie group theory Reviewed
A. Miyashita, T. Toda
Proc. IEEE WASPAA page: 5 pages 2023.10
-
Directional target speaker extraction under noisy underdetermined conditions through conditional variational autoencoder with global style tokens Reviewed
R. Wang, T. Toda
Proc. IEEE WASPAA page: 5 pages 2023.10
-
Sound field interpolation with unsupervised calibration for freely spaced circular microphone array in rotation-robust beamforming Reviewed
S. Luan, Y. Wakabayashi, T. Toda
Proc.EUSIPCO page: 21 - 25 2023.9
-
Noisy-to-noisy voice conversion under variations of noisy condition Reviewed
C. Xie, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 31 page: 3871 - 3882 2023.9
-
High-fidelity and pitch-controllable neural vocoder based on unified source-filter networks Reviewed
R. Yoneyama, Y.-C. Wu, T. Toda
IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 31 page: 3717 - 3729 2023.9