研究者詳細 - 戸田　智基

論文 - 戸田　智基

分割表示 >> ／全件表示 244 件中 1 - 244 件目

Audio difference learning for audio captioning 査読有り

T. Komatsu, Y. Fujita, K. Takeda, T. Toda

Proc. IEEE ICASSP 頁： 1456 - 1460 2024年4月

ConvNeXt-TTS and ConvNeXt-VC: ConvNeXt-based fast end-to-end sequence-to-sequence text-to-speech and voice conversion 査読有り

T. Okamoto, Y. Ohtani, T. Toda, H. Kawai

Proc. IEEE ICASSP 頁： 12456 - 12460 2024年4月

MF-AED-AEC: speech emotion recognition by leveraging multimodal fusion, ASR error detection, and ASR error correction 査読有り国際共著

J. He, X. Shi, X. Li, T. Toda

Proc. IEEE ICASSP 頁： 11066 - 11070 2024年4月

Electrolaryngeal speech intelligibility enhancement through robust linguistic encoders 査読有り

L.P. Violeta, W.-C. Huang, D. Ma, R. Yamamoto, K. Kobayashi, T. Toda

Proc. IEEE ICASSP 頁： 10961 - 10965 2024年4月

FIRNET: fundamental frequency controllable fast neural vocoder with trainable finite impulse response filter 査読有り

Y. Ohtani, T. Okamoto, T. Toda, H. Kawai

Proc. IEEE ICASSP 頁： 10871 - 10875 2024年4月

Dual-channel target speaker extraction based on conditional variational autoencoder and directional information 査読有り

R. Wang, L. Li, T. Toda

IEEE/ACM Transactions on Audio, Speech and Language Processing 32 巻頁： 12 pages 2024年3月

Fast neural speech waveform generative models with fully-connected layer-based upsampling 査読有り

H. Yamashita, T. Okamoto, R. Takashima, Y. Ohtani, T. Takiguchi, T. Toda, H. Kawai

IEEE Access 12 巻頁： 31409 - 31421 2024年2月

喉頭摘出者における音声収録アプリを用いた術前音声の保存 ―Save the Voice プロジェクト― 査読有り

西尾直樹, 戸田智基, 小林和弘, 三谷壮平, 飴矢美里, 向山宣昭, 木村宏之, 徳倉達也, 坪井崇, 藤本保志, 曾根三千彦

喉頭 35 巻 ( 2 ) 頁： 142 - 147 2023年12月

The Singing Voice Conversion Challenge 2023 査読有り国際共著

W.-C. Huang, L.P. Violeta, S. Liu, J. Shi, T. Toda

Proc. IEEE ASRU 頁： 8 pages 2023年12月

ED-CEC: improving rare word recognition using ASR post-processing based on error detection and context-aware error correction 査読有り

J. He, Z. Yang, T. Toda

Proc. IEEE ASRU 頁： 6 pages 2023年12月

Improving severity preservation of healthy-to-pathological voice conversion with global style tokens 査読有り国際共著

B. Halpern, W.-C. Huang, L.P. Violeta, R. van Son, T. Toda

Proc. IEEE ASRU 頁： 7 pages 2023年12月

A comparative study of voice conversion models with large-scale speech and singing data: the T13 systems for the Singing Voice Conversion Challenge 2023 査読有り

R. Yamamoto, R. Yoneyama, L.P. Violeta, W.-C. Huang, T. Toda

Proc. IEEE ASRU 頁： 6 pages 2023年12月

The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains 査読有り国際共著

E. Cooper, W.-C. Huang, Y. Tsao, H.-M. Wang, T. Toda, J. Yamagishi

Proc. IEEE ASRU 頁： 7 pages 2023年12月

WaveNeXt: ConvNeXt-based fast neural vocoder without iSTFT layer 査読有り

T. Okamoto, H. Yamashita, Y. Ohtani, T. Toda, H. Kawai

Proc. IEEE ASRU 頁： 8 pages 2023年12月

Sequence-to-sequence network training methods for automatic guitar transcription with tokenized outputs 査読有り

S. Kim, K. Takeda, T. Toda

Proc. ISMIR 頁： 524 - 531 2023年11月

Evaluating methods for ground-truth-free foreign accent conversion 査読有り

W.-C. Huang, T. Toda

Proc. APSIPA ASC 頁： 1136 - 1141 2023年11月

An analysis of personalized speech recognition system development for the deaf and hard-of-hearing 査読有り

L.P. Violeta, T. Toda

Proc. APSIPA ASC 頁： 1851 - 1856 2023年11月

Semi-supervised multimodal emotion recognition with consensus decision-making and label correction 査読有り国際共著

J. Tian, D. Hu, X. Shi, J. He, X. Li, Y. Gao, T. Toda, X. Xu, X. Hu

Proc. MRAC 頁： 67 - 73 2023年10月

Differentiable representation of warping based on Lie group theory 査読有り

A. Miyashita, T. Toda

Proc. IEEE WASPAA 頁： 5 pages 2023年10月

Directional target speaker extraction under noisy underdetermined conditions through conditional variational autoencoder with global style tokens 査読有り

R. Wang, T. Toda

Proc. IEEE WASPAA 頁： 5 pages 2023年10月

Sound field interpolation with unsupervised calibration for freely spaced circular microphone array in rotation-robust beamforming 査読有り

S. Luan, Y. Wakabayashi, T. Toda

Proc.EUSIPCO 頁： 21 - 25 2023年9月

Noisy-to-noisy voice conversion under variations of noisy condition 査読有り

C. Xie, T. Toda

IEEE/ACM Transactions on Audio, Speech and Language Processing 31 巻頁： 3871 - 3882 2023年9月

High-fidelity and pitch-controllable neural vocoder based on unified source-filter networks 査読有り

R. Yoneyama, Y.-C. Wu, T. Toda

IEEE/ACM Transactions on Audio, Speech and Language Processing 31 巻頁： 3717 - 3729 2023年9月

Preference-based training framework for automatic speech quality assessment using deep neural network 査読有り

C.-H. Hu, Y. Yasuda, T. Toda

Proc. INTERSPEECH 頁： 546 - 550 2023年8月

Analysis of mean opinion scores in subjective evaluation of synthetic speech based on tail probabilities 査読有り

Y. Yasuda, T. Toda

Proc. INTERSPEECH 頁： 5491 - 5495 2023年8月

Reverberation-controllable voice conversion using reverberation time estimator 査読有り

Y. Choi, C. Xie, T. Toda

Proc. INTERSPEECH 頁： 2103 - 2107 2023年8月

E2E-S2S-VC: end-to-end sequence-to-sequence voice conversion 査読有り

T. Okamoto, H. Yamashita, T. Toda, H. Kawai

Proc. INTERSPEECH 頁： 2043 - 2047 2023年8月

Emotion awareness in multi-utterance turn for improving emotion prediction in multi-speaker conversation 査読有り国際共著

X. Shi, X. Li, T. Toda

Proc. INTERSPEECH 頁： 765 - 769 2023年8月

Representation of vocal tract length transformation based on group theory 査読有り

A. Miyashita, T. Toda