Faculty Profiles - TODA Tomoki

Papers - TODA Tomoki

Division display >> ／ All the affair displays 1 - 244 of about 244

Audio difference learning for audio captioning Reviewed

T. Komatsu, Y. Fujita, K. Takeda, T. Toda

Proc. IEEE ICASSP page： 1456 - 1460 2024.4

ConvNeXt-TTS and ConvNeXt-VC: ConvNeXt-based fast end-to-end sequence-to-sequence text-to-speech and voice conversion Reviewed

T. Okamoto, Y. Ohtani, T. Toda, H. Kawai

Proc. IEEE ICASSP page： 12456 - 12460 2024.4

MF-AED-AEC: speech emotion recognition by leveraging multimodal fusion, ASR error detection, and ASR error correction Reviewed International coauthorship

J. He, X. Shi, X. Li, T. Toda

Proc. IEEE ICASSP page： 11066 - 11070 2024.4

Electrolaryngeal speech intelligibility enhancement through robust linguistic encoders Reviewed

L.P. Violeta, W.-C. Huang, D. Ma, R. Yamamoto, K. Kobayashi, T. Toda

Proc. IEEE ICASSP page： 10961 - 10965 2024.4

FIRNET: fundamental frequency controllable fast neural vocoder with trainable finite impulse response filter Reviewed

Y. Ohtani, T. Okamoto, T. Toda, H. Kawai

Proc. IEEE ICASSP page： 10871 - 10875 2024.4

Dual-channel target speaker extraction based on conditional variational autoencoder and directional information Reviewed

R. Wang, L. Li, T. Toda

IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 32 page： 12 pages 2024.3

Fast neural speech waveform generative models with fully-connected layer-based upsampling Reviewed

H. Yamashita, T. Okamoto, R. Takashima, Y. Ohtani, T. Takiguchi, T. Toda, H. Kawai

IEEE Access Vol. 12 page： 31409 - 31421 2024.2

喉頭摘出者における音声収録アプリを用いた術前音声の保存 ―Save the Voice プロジェクト― Reviewed

西尾直樹, 戸田智基, 小林和弘, 三谷壮平, 飴矢美里, 向山宣昭, 木村宏之, 徳倉達也, 坪井崇, 藤本保志, 曾根三千彦

喉頭 Vol. 35 ( 2 ) page： 142 - 147 2023.12

The Singing Voice Conversion Challenge 2023 Reviewed International coauthorship

W.-C. Huang, L.P. Violeta, S. Liu, J. Shi, T. Toda

Proc. IEEE ASRU page： 8 pages 2023.12

ED-CEC: improving rare word recognition using ASR post-processing based on error detection and context-aware error correction Reviewed

J. He, Z. Yang, T. Toda

Proc. IEEE ASRU page： 6 pages 2023.12

Improving severity preservation of healthy-to-pathological voice conversion with global style tokens Reviewed International coauthorship

B. Halpern, W.-C. Huang, L.P. Violeta, R. van Son, T. Toda

Proc. IEEE ASRU page： 7 pages 2023.12

A comparative study of voice conversion models with large-scale speech and singing data: the T13 systems for the Singing Voice Conversion Challenge 2023 Reviewed

R. Yamamoto, R. Yoneyama, L.P. Violeta, W.-C. Huang, T. Toda

Proc. IEEE ASRU page： 6 pages 2023.12

The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains Reviewed International coauthorship

E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi

Proc. IEEE ASRU page： 7 pages 2023.12

WaveNeXt: ConvNeXt-based fast neural vocoder without iSTFT layer Reviewed

T. Okamoto, H. Yamashita, Y. Ohtani, T. Toda, H. Kawai

Proc. IEEE ASRU page： 8 pages 2023.12

Sequence-to-sequence network training methods for automatic guitar transcription with tokenized outputs Reviewed

S. Kim, K. Takeda, T. Toda

Proc. ISMIR page： 524 - 531 2023.11

Evaluating methods for ground-truth-free foreign accent conversion Reviewed

W.-C. Huang, T. Toda

Proc. APSIPA ASC page： 1136 - 1141 2023.11

An analysis of personalized speech recognition system development for the deaf and hard-of-hearing Reviewed

L.P. Violeta, T. Toda

Proc. APSIPA ASC page： 1851 - 1856 2023.11

Semi-supervised multimodal emotion recognition with consensus decision-making and label correction Reviewed International coauthorship

J. Tian, D. Hu, X. Shi, J. He, X. Li, Y. Gao, T. Toda, X. Xu, X. Hu

Proc. MRAC page： 67 - 73 2023.10

Differentiable representation of warping based on Lie group theory Reviewed

A. Miyashita, T. Toda

Proc. IEEE WASPAA page： 5 pages 2023.10

Directional target speaker extraction under noisy underdetermined conditions through conditional variational autoencoder with global style tokens Reviewed

R. Wang, T. Toda

Proc. IEEE WASPAA page： 5 pages 2023.10

Sound field interpolation with unsupervised calibration for freely spaced circular microphone array in rotation-robust beamforming Reviewed

S. Luan, Y. Wakabayashi, T. Toda

Proc.EUSIPCO page： 21 - 25 2023.9

Noisy-to-noisy voice conversion under variations of noisy condition Reviewed

C. Xie, T. Toda

IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 31 page： 3871 - 3882 2023.9

High-fidelity and pitch-controllable neural vocoder based on unified source-filter networks Reviewed

R. Yoneyama, Y.-C. Wu, T. Toda

IEEE/ACM Transactions on Audio, Speech and Language Processing Vol. 31 page： 3717 - 3729 2023.9

Preference-based training framework for automatic speech quality assessment using deep neural network Reviewed

C.-H. Hu, Y. Yasuda, T. Toda

Proc. INTERSPEECH page： 546 - 550 2023.8

Analysis of mean opinion scores in subjective evaluation of synthetic speech based on tail probabilities Reviewed

Y. Yasuda, T. Toda

Proc. INTERSPEECH page： 5491 - 5495 2023.8

Reverberation-controllable voice conversion using reverberation time estimator Reviewed

Y. Choi, C. Xie, T. Toda

Proc. INTERSPEECH page： 2103 - 2107 2023.8

E2E-S2S-VC: end-to-end sequence-to-sequence voice conversion Reviewed

T. Okamoto, H. Yamashita, T. Toda, H. Kawai

Proc. INTERSPEECH page： 2043 - 2047 2023.8

Emotion awareness in multi-utterance turn for improving emotion prediction in multi-speaker conversation Reviewed International coauthorship

X. Shi, X. Li, T. Toda

Proc. INTERSPEECH page： 765 - 769 2023.8

Representation of vocal tract length transformation based on group theory Reviewed

A. Miyashita, T. Toda