TAKEDA, Kazuya
Institutes of Innovation for Future Society Mobility Research Course Professor
Graduate School of Information Science Professor
Graduate School of Informatics Professor
Graduate School
Graduate School of Information Science
Graduate School of Informatics
Undergraduate School
School of Engineering
  1. Ph.D. (Engineering) ( 1993.9   Nagoya University ) 

Research Interests 6

  1. Signal Processing for Acoustic, Speech, Spoken Language and Behavior

  2. Human Machine Interaction

  3. group behavior of sports

  4. Intelligent vehicle

  5. Large signal corpus

  6. Driver model

Research Areas 3

  1. Others / Others  / Intelligent Informatics

  2. Others / Others  / Information and Communication Engineering

  3. Informatics / Intelligent informatics  / human behavior signal information processing

Current Research Project and SDGs 4

  1. Modeling and Prediction of Driving Behavior

  2. Information Processing of Speech and Acoustic Signals

  3. Human/Machine Interaction

  4. Information processing of sport behavior signals

Research History 12

  1. Nagoya University   Director


  2. Nagoya University   Institute of Innovation for Future Society Mobility Research Course

    2019.4 - 2020.3

  3. Nagoya University   Sch. of Informatics, dept. of Intelligent Systems   Professor


  4. Nagoya University   Leading Graduate School Promotion Department   Professor


  5. Nagoya University   Graduate School of Informatics   Professor


  6. Nagoya University   Green Mobility Collaborative Research Center

    2011.7 - 2017.3

  7. Nagoya University   Innovative Research Center for Preventive Medical Engineering


  8. Nagoya University   Graduate School of Information Science, Department of Media Science   Professor


  9. Associate Professor, School/ Graduate School of Engineering, Nagoya University

    1995.4 - 2003.3



  10. Nagoya University   School of Engineering / Graduate School of Engineering   Associate Professor


  11. Supervisor, KDD Research and Development Laboratories

    1990.2 - 1995.3



  12. Researcher, Speech Information Processing Laboratory, Advanced Telecommunication Research Laboratories

    1986.7 - 1990.2



Education 2

  1. Nagoya University   Graduate School, Division of Engineering

    1983.4 - 1985.3


    Country: Japan

  2. Nagoya University   Faculty of Engineering   Department of Electronics and Electric Engineering

    1979.4 - 1983.3


    Country: Japan

Professional Memberships 40

  1. Acoustical Society of Japan   (served as Director, Vice President, Chapter Chair, etc.)


  2. IEEE   (Served as Society BoG (3), Chapter Chairs (2))


  3. The Institute of Electronics, Information and Communication Engineers   member/ Senior member


  4. Information Processing Society of Japan   member


  5. Society of Automotive Engineers of Japan   member


  6. Robotics Society of Japan   member


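  7. The Institute of Electronics, Information and Communication Engineers   Technical Committee member, Speech Technical Group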

    2009.4 - 2013.3

  8. Acoustical Society of Japan   Tokai Chapter, Councilor


  9. The Institute of Electronics, Information and Communication Engineers   Editor-in-Chief, Transactions special issue

    2007.1 - 2008.8

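  10. The Institute of Electronics, Information and Communication Engineers   Editor-in-Chief, ISS Transactions (D), Japanese edition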

    2008.8 - 2010.5

  11. IEEE   Member/ Senior Member


  12. The Acoustical Society of Japan   member


  13. Information Processing Society Japan   Chair for SIG Spoken Language Processing

    2006.4 - 2009.3

  14. Information Processing Society of Japan   SIG Spoken Language Processing, liaison committee member

    2008.4 - 2010.5

  15. Information Processing Society of Japan   Editorial committee member, Journal special issue on Spoken Document Processing

    2007.9 - 2009.2

  16. Information Processing Society of Japan   Editorial committee member, Journal special issue

    2007.5 - 2008.8

  17. Information Processing Society of Japan   Delegate

    2007.4 - 2009.3

  18. Institute of Electrical and Electronics Engineers   Secretary, Signal Processing Society Japan Chapter

    2007.1 - 2008.12

  19. Acoustical Society of Japan   Executive board member

    2006.5 - 2009.4

  20. The Institute of Electronics, Information and Communication Engineers   Board member of editorial committee

    2006.5 - 2008.4

  21. Institute of Electrical and Electronics Engineers   Standing Committee Chair for student activities, Nagoya Section

    2005.1 - 2006.12

  22. Information Processing Society Japan   Board member for SIG Spoken Language Processing

    2004.4 - 2007.3

  23. The Institute of Electronics, Information and Communication Engineers   Associate Editor

    2004.4 - 2007.3

  24. Information Processing Society of Japan   SIG Spoken Language Processing, Secretary

    2004.4 - 2006.6

  25. Information Processing Society of Japan   SIG Spoken Language Processing, Speech Recognition Consortium, Executive Secretary

    2001.1 - 2003.10

  26. The Institute of Electronics, Information and Communication Engineers   Secretary, Speech Technical Group

    2000.5 - 2002.4

  27. GSK (Language Resources Consortium)   Steering committee member

    1999.5 - 2001.4

  28. The Institute of Image Information and Television Engineers   member, Technical Committee on Multimedia Information Processing

    1999.5 - 2001.4

  29. Acoustical Society of Japan   member of editorial committee

    1999.4 - 2004.3

  30. Acoustical Society of Japan   Tokai Chapter, General Affairs Secretary

    1998.5 - 2001.4

  31. Acoustical Society of Japan   Database Technical Committee, member

    1996.5 - 2001.4

  32. Information Processing Society of Japan   SIG Spoken Language Processing, liaison committee member

    1996.4 - 2000.3

  33. Information Processing Society Japan   Committee member for SIG Spoken Language Processing

    1996.4 - 1999.3

  34. Information Processing Society of Japan   SIG Spoken Language Processing, Large-Scale Continuous Speech Database WG, Secretary

    1996.4 - 1998.3

  35. Acoustical Society of Japan   Reviewer


  36. The Institute of Electronics, Information and Communication Engineers   board member of Speech Technical Group

    1994.5 - 1996.4

  37. The Institute of Electronics, Information and Communication Engineers   Reviewer


  38. IEEE

  39. Acoustical Society of Japan

  40. Information Processing Society of Japan

Committee Memberships 39

  1. IEEE Intelligent Transportation Systems (ITS) Society   member of the Board of Governors (BoG)  

    2014.1 - 2022.12   


    Committee type:Academic society

  2. IEEE Signal Processing Society   Tokyo Joint Chapter Chair  

    2019.1 - 2020.12   

  3. IEEE Intelligent Transportation Systems (ITS) Society   Nagoya Chapter Chair  

    2021.1 - 2022.12   

  4. ITU-T FG Distraction   Vice Chair  

    2011.8 - 2013.3   

  5. Acoustical Society of Japan   Director, Vice President (2017.5-2019.4)  

    2017.5 - 2021.4   

    Committee type:Academic society

  6. Asia Pacific Signal Information Processing Association   member of BoG (Board of Governors)  

    2019.1 - 2022.12   

    Committee type:Academic society

  7. Grants-in-Aid for Scientific Research Committee, Expert Committee   Expert member  

    2017.12 - 2018.11   

    Committee type:Government

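  8. Development Review Committee, Development and Demonstration of Simulation Technology for Detailed Estimation of Traffic Accident Reduction Effects   Committee member  

    2017.6 - 2018.3   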

    Committee type:Government

  9. Nagoya City Board of Education, SSH Steering and Guidance Committee   Committee member  

    Committee type:Municipal

  10. IEEE Intelligent Transportation Society   member of Board of Governors (BoG)  

    2013.1 - 2021.12   


    Committee type:Academic society

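  11. Nagoya Industrial Science Research Institute, Next-Generation Vehicle Regional Industry-Academia-Government Forum, Industry-Academia Collaboration Project Study Group   Committee member  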

    2011.8 - 2015.3   

  12. Nagoya City Board of Education, Research Council for Promoting Attractive High Schools   Vice Chair  

    2011.7 - 2016.3   

  13. Japan Science and Technology Agency, Adaptable and Seamless Technology Transfer Program (A-STEP) Expert Committee   Expert member  

    2011.5 - 2017.3   

  14. International Advisory Board, SAFER project, Chalmers Technological University   member  


  15. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012)   Organizing Committee member, Special Session co-chair  

    2010.3 - 2012.5   

  16. First International Conference on Future Active Safety Technology fast-ZERO'11   Organizing Committee member  

    2009.9 - 2011.9   

  17. The 5th Biennial Workshop on Digital Signal Processing for In-Vehicle Systems   Workshop Co-Chair  

    2009.9 - 2011.9   

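  18. Nagoya Industrial Science Research Institute, CHC Planning and Steering Committee   Committee member  


  19. Nagoya Industrial Science Research Institute   Part-time researcher  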


  20. Aichi Prefecture Pollution Review Committee   Committee member  

    2008.11 - 2017.3   

  21. ISCA Interspeech 2010   Organizing Committee member, Industrial Exhibition co-chair  

    2008.9 - 2010.9   

  22. IEEE International Conference on Vehicular Electronics System, (ICVES 2009)   Technical Program Committee, co-chair  

    2008.7 - 2009.11   

  23. International Conference on Pattern Recognition 2010   Track Co-Chair (Editor)  

    2008.6 - 2010.6   

  24. IEEE Intelligent Vehicle Symposium 2010, IV'10   TPC member (Associate Editor)  

    2008.6 - 2010.6   

  25. Biennial on DSP for in-Vehicle and Mobile Systems   Workshop co-chair  

    2008.6 - 2009.6   

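  26. National Institute of Informatics, Speech Database Promotion Committee   Committee member  


  27. National Institute of Information and Communications Technology, External Evaluation Committee   Committee member  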

    2008.4 - 2011.3   

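  28. Japan Science and Technology Agency, Evaluation Working Group of the Research Evaluation Subcommittee, Council for Science and Technology   Committee member  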

    2007.9 - 2009.3   

  29. Biennial on DSP for in-Vehicle and Mobile Systems   Workshop Co-Chair  

    2006.4 - 2007.6   

  30. 2007 IEEE Automatic Speech Recognition and Understanding Workshop   Demo Session Chair  

    2005.12 - 2007.12   

  31. The 9th ACM International Conference on Multimodal Interfaces (ICMI' 07)   Technical Program Co-Chair  

    2005.4 - 2007.10   

  32. Biennial on DSP for in-Vehicle and Mobile Systems   Co-Chair  

    2004.4 - 2005.9   

  33. Workshop on Real World Corpora in Mobile Environment (RWCinME)   Secretary General  

    2004.4 - 2005.4   

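  34. Ministry of Internal Affairs and Communications, Expert Evaluation Committee for the Strategic Information and Communications R&D Promotion Programme   Expert evaluator  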


  35. ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition (SSPR)   International Scientific Committee member  

    2002.4 - 2003.4   

  36.   Organizing Committee Member  

    2000.7 - 2001.4   

  37. 2004 International Congress on Acoustics (ICA2004)   Program Committee Secretary  

    1998.10 - 2004.10   

  38. Second IEEE Workshop on Interactive Voice Technology for Telecommunications Applications (IVTTA 94)   Publication Committee Chair  

    1993.6 - 1994.9   

  39. Japan Key Technology Center   Results Management and Evaluation Committee member  

    1991.10 - 1992.5   

Awards 16

  1. Best Poster Award

    2022.11   ACM SIGSPATIAL 2022   Estimating counterfactual treatment outcomes over time in complex multi-vehicle simulation

    Keisuke Fujii, Koh Takeuchi, Atsushi Kuribayashi, Naoya Takeishi, Yoshinobu Kawahara, Kazuya Takeda


    Award type:Award from international society, conference, symposium, etc.  Country:United States

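  2. Outstanding Presentation Award

    2022.10   49th Annual Meeting of the Japanese Society of Sport Psychology   Exploring optimal cooperative behavior and its underlying cognitive and decision-making mechanisms using deep reinforcement learning

    Kazushi Tsutsui, Kazuya Takeda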

    Award type:Award from Japanese society, conference, symposium, etc.  Country:Japan

  3. Honorable Mention Award

    2021.8   13th International ACM Conference on Automotive User Interfaces   Automatic Generation of Road Trip Summary Video for Reminiscence and Entertainment using Dashcam Video

    Kana Bito, Itiro Siio, Yoshio Ishiguro, Kazuya Takeda


    Award type:Award from international society, conference, symposium, etc. 

  4. IEEE ITS Society Outstanding Research Award

    2020.9   IEEE ITS Society   Contributions to Data centric Driving Behavior Modeling

    Kazuya Takeda


    Award type:International academic award (Japan or overseas)  Country:Japan

  5. 12th Journal of Robotics and Mechatronics Best Paper Award

    2020.1   Fuji Technology Press  

    Award type:Honored in official journal of a scientific society, scientific journal  Country:Japan

  6. Best Paper Award at 19th International Conference on Intelligent Transportation Systems (ITSC2016) IEEE ITS Society November 2016

    2016.11   IEEE ITS Society   Compressing Continuous Point Cloud Data using Image Compression Methods

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima and Kazuya Takeda


    Award type:Award from international society, conference, symposium, etc.  Country:Japan

  7. FY2009 Information Processing Society of Japan, SIG ITS Outstanding Paper

    2009.12   Information Processing Society of Japan  



  8. ICVES 2009 Best Conference Paper Award(IEEE International Conference on Vehicular Electronics and Safety)

    2009.11   Technical Program Committee of IEEE International Conference on Vehicular Electronics and Safety  

  9. FY2009 Acoustical Society of Japan Autumn Meeting Poster Award

    2009.9   Acoustical Society of Japan  



  10. Best Paper Award on IEEE 2008 International Workshop on Multimedia Signal Processing

    2008.10   IEEE 2008 International Workshop on Multimedia Signal Processing  

  11. Workshop on Informatics (WiNF2007) Outstanding Paper Award

    2007.9   Workshop on Informatics (WiNF2007)  



  12. Acoustical Society of Japan 2008 Spring Meeting Poster Award

    2007.9   Acoustical Society of Japan  



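  13. Information Processing Society of Japan, SIG Music and Computer, "Summer Symposium 2007" Best Presentation Award

    2007.8   Information Processing Society of Japan, SIG Music and Computer   A study of probabilistic models characterizing the melody and dynamic variation of the singing voice

    Yasunori Ohishi, Masataka Goto, Katunobu Itou, Kazuya Takeda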

  14. The Institute of Electronics, Information and Communication Engineers Best Paper Award


  15. Acoustical Society of Japan Technical Development Award


  16. Awaya Prize Young Researcher Award


Papers 679

  1. Open-Vocabulary Predictive World Models from Sensor Observations

    Karlsson, R; Asfandiyarov, R; Carballo, A; Fujii, K; Ohtani, K; Takeda, K

    SENSORS   Vol. 24 ( 14 )   2024.7

  2. A Survey on Testbench-Based Vehicle-in-the-Loop Simulation Testing for Autonomous Vehicles: Architecture, Principle, and Equipment

    Cheng, JJ; Wang, Z; Zhao, XM; Xu, ZG; Ding, M; Takeda, K

    ADVANCED INTELLIGENT SYSTEMS   Vol. 6 ( 6 )   2024.6

  3. LiDAR Point Cloud Augmentation for Adverse Conditions Using Conditional Generative Model

    Zhang, YX; Ding, M; Yang, HT; Niu, YJ; Ge, MN; Ohtani, K; Zhang, C; Takeda, K

    REMOTE SENSING   Vol. 16 ( 12 )   2024.6

  4. R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut

    Niu, YJ; Ding, M; Ge, MN; Karlsson, R; Zhang, YX; Carballo, A; Takeda, K

    SENSORS   Vol. 24 ( 9 )   2024.5

  5. Decentralized policy learning with partial observation and mechanical constraints for multiperson modeling

    Fujii, K; Takeishi, N; Kawahara, Y; Takeda, K

    NEURAL NETWORKS   Vol. 171   page: 40 - 52   2024.3

  6. Estimating Counterfactual Treatment Outcomes Over Time in Complex Multiagent Scenarios

    Fujii, K; Takeuchi, K; Kuribayashi, A; Takeishi, N; Kawahara, Y; Takeda, K


  7. DRUformer: Enhancing Driving Scene Important Object Detection With Driving Scene Relationship Understanding

    Niu, YJ; Ding, M; Fujii, K; Ohtani, K; Carballo, A; Takeda, K

    IEEE ACCESS   Vol. 12   page: 67589 - 67599   2024

  8. L-DIG: A GAN-Based Method for LiDAR Point Cloud Processing under Snow Driving Conditions

    Zhang, YX; Ding, M; Yang, HT; Niu, YJ; Feng, Y; Ohtani, K; Takeda, K

    SENSORS   Vol. 23 ( 21 )   2023.11

  9. Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation

    Yang, HT; Carballo, A; Zhang, YX; Takeda, K

    SENSORS   Vol. 23 ( 20 )   2023.10

  10. Estimation of control area in badminton doubles with pose information from top and back view drone videos Reviewed

    Ding Ning, Takeda Kazuya, Jin Wenhui, Bei Yingjiu, Fujii Keisuke


  11. Pitching strategy evaluation via stratified analysis using propensity score Reviewed

    Nakahara Hiroshi, Takeda Kazuya, Fujii Keisuke


  12. Localization System for Vehicle Navigation Based on GNSS/IMU Using Time-Series Optimization with Road Gradient Constraint

    Takanose Aoki, Kondo Kaito, Hoda Yuta, Meguro Junichi, Takeda Kazuya

    JOURNAL OF ROBOTICS AND MECHATRONICS   Vol. 35 ( 2 ) page: 387 - 397   2023.4

  13. Multi-Agent Deep-Learning Based Comparative Analysis of Team Sport Trajectories Reviewed

    Zhang Ziyi, Bunker Rory, Takeda Kazuya, Fujii Keisuke

    IEEE ACCESS   Vol. 11   page: 43305 - 43315   2023

  14. Expert-driven Rule-based Refinement of Semantic Segmentation Maps for Autonomous Vehicles

    Manibardo Eric L., Lana Ibai, Del Ser Javier, Carballo Alexander, Takeda Kazuya


  15. Synthesizing Realistic Snow Effects in Driving Images Using GANs and Real Data with Semantic Guidance

    Yang Hanting, Ding Ming, Carballo Alexander, Zhang Yuxiao, Ohtani Kento, Niu Yingjie, Ge Maoning, Feng Yan, Takeda Kazuya


  16. Open-world driving scene segmentation via multi-stage and multi-modality fusion of vision-language embedding Reviewed

    Niu Yingjie, Ding Ming, Zhang Yuxiao, Ge Maoning, Yang Hanting, Takeda Kazuya


  17. LiDAR Point Cloud Translation Between Snow and Clear Conditions Using Depth Images and GANs Reviewed

    Zhang Yuxiao, Ding Ming, Yang Hanting, Niu Yingjie, Feng Yan, Ge Maoning, Carballo Alexander, Takeda Kazuya


  18. Action Valuation of On- and Off-Ball Soccer Players Based on Multi-Agent Deep Reinforcement Learning

    Nakahara, H; Tsutsui, K; Takeda, K; Fujii, K

    IEEE ACCESS   Vol. 11   page: 131237 - 131244   2023

  19. Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from a Monocular Camera and IMUs

    Tanaka, R; Suzuki, T; Takeda, K; Fujii, K


  20. Personalized Causal Factor Generalization for Subjective Risky Scene Understanding with Vision Transformer

    Bao, N; Carballo, A; Tsukada, M; Takeda, K


  21. Recognition Assistance Interface for Human-Automation Cooperation in Pedestrian Risk Prediction

    Kuribayashi, A; Takeuchi, E; Carballo, A; Ishiguro, Y; Takeda, K


  22. Learning to Predict Navigational Patterns from Partial Observations Reviewed

    Robin Karlsson, Alexander Carballo, Francisco Lepe-Salazar, Keisuke Fujii, Kento Ohtani, Kazuya Takeda

    IEEE Robotics and Automation Letters   Vol. 8 ( 9 ) page: 5592 - 5599   2023.9


    Publishing type:Research paper (scientific journal)  

    Human beings cooperatively navigate rule-constrained environments by adhering to mutually known navigational patterns, which may be represented as directional pathways or road lanes. Inferring these navigational patterns from incompletely observed environments is required for intelligent mobile robots operating in unmapped locations. However, algorithmically defining these navigational patterns is nontrivial. This paper presents the first self-supervised learning (SSL) method for learning to infer navigational patterns in real-world environments from partial observations only. We explain how geometric data augmentation, predictive world modeling, and an information-theoretic regularizer enable our model to predict an unbiased local directional soft lane probability (DSLP) field in the limit of infinite data. We demonstrate how to infer global navigational patterns by fitting a maximum likelihood graph to the DSLP field. Experiments show that our SSL model outperforms two SOTA supervised lane graph prediction models on the nuScenes dataset. We propose our SSL method as a scalable and interpretable continual learning paradigm for navigation by perception.

    DOI: 10.1109/LRA.2023.3291924



  23. Personalized lane changes using subjective risk-sensitive framework

    Naren Bao, Alexander Carballo, Kazuya Takeda

    Towards Human-Vehicle Harmonization     page: 211 - 222   2023.3


    Publishing type:Part of collection (book)  

    Most of the current researches on autonomous vehicles' control assume that all vehicles should have the same patterns of driving implemented, resulting in conservative or average systems. However, these results may not be acceptable to drivers who prefer a more aggressive style of driving, while extremely cautious drivers may consider the standard outputs to be too aggressive. In this chapter, we introduce risk-sensitive control (RSC), an inverse optimal control algorithm that estimates risk-sensitive driving features and incorporate them into a receding-horizon controller. RSC uses a meta-learning algorithm to update the parameters of the cost function, continuously improving the controller online as more and more driving data is gathered from the user for subjective risk feedback. The estimator takes into account the individual differences in subjective risk analysis, in terms of driving features and surrounding vehicle locations, by adjusting the cost function and constraints. We test this approach using five-lane change scenarios, some safe and some risky, with 30 real drivers in a CARLA simulation environment. Based on both quantitative and qualitative evaluations, our experimental results demonstrate that the proposed framework can generate users' preferred driving commands during lane changes, that is, commands associated with lower subjective risk, outperforming conventional, model-based predictive control methods in terms of replicating the user's own driving behavior.

    DOI: 10.1515/9783110981223-016


  24. Framework for Generation and Removal of Multiple Types of Adverse Weather from Driving Scene Images Reviewed

    Hanting Yang, Alexander Carballo, Yuxiao Zhang, Kazuya Takeda

    SENSORS   Vol. 23 ( 3 ) page: 1548   2023.2


    Language:English   Publishing type:Research paper (scientific journal)   Publisher:MDPI  

    Weather variation in the distribution of image data can cause a decline in the performance of existing visual algorithms during evaluation. Adding additional samples of target domain to training data or using pre-trained image restoration methods such as de-hazing, de-raining, and de-snowing, to improve the quality of input images are two promising solutions. In this work, we propose Multiple Weather Translation GAN (MWTG), a CycleGAN-based, dual-purpose framework that simultaneously learns weather generation and its removal from image data. MWTG consists of four GANs constrained using cycle consistency that carry out domain translation tasks between hazy, rainy, snowy, and clear weather, using an asymmetric approach. To increase network capacity, we employ a spatial feature transform (SFT) layer to fuse the features extracted from the weather layer, which contains high-level domain information from the previous generators. Further, we collect an unpaired, real-world driving dataset recorded under various weather conditions called Realistic Driving Scenes under Bad Weather (RDSBW). We qualitatively and quantitatively evaluate MWTG using the RDSBW and the variation of Cityscapes that synthesize weather effects, eg., FoggyCityscape. Our experimental results suggest that MWTG can generate realistic weather in clear images and also accurately remove noise from weather images. Furthermore, the SOTA pedestrian detector ASCP is shown to achieve an impressive gain in detection precision after image restoration using the proposed MWTG method.

    DOI: 10.3390/s23031548


  25. Estimating the effect of hitting strategies in baseball using counterfactual virtual simulation with deep learning Reviewed

    Hiroshi Nakahara, Kazuya Takeda, Keisuke Fujii

    International Journal of Computer Science in Sport   Vol. 22 ( 1 ) page: 1 - 12   2023.1


    Authorship:Corresponding author   Language:English  

    DOI: 10.2478/ijcss-2023-0001

  26. Perception and sensing for autonomous vehicles under adverse weather conditions: A survey Reviewed

    Yuxiao Zhang, Alexander Carballo, Hanting Yang, Kazuya Takeda

    ISPRS Journal of Photogrammetry and Remote Sensing   Vol. 196   page: 146 - 177   2023.1


    Authorship:Last author, Corresponding author   Language:English  

  27. RSG-GCN: Predicting Semantic Relationships in Urban Traffic Scene With Map Geometric Prior

    Yafu Tian, Alexander Carballo, Ruifeng Li, Kazuya Takeda



    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    Automated identification of the relationships between traffic actors and surrounding objects, in order to describe their behavior and predict their intentions, has become the focus of increasing attention in the field of autonomous driving. Therefore, in this work, we propose a Road Scene Graphs-Graph Convolutional Network (RSG-GCN) as a novel, graph-based model for predicting the topological graph structure of a given traffic scene. The status of the actors and HD map information are integrated as prior knowledge, allowing the edges linking the actor nodes to capture potential semantic relationships, such as "vehicle approaching pedestrian" and "pedestrian waiting at intersection". To train this model, we created our own RSG dataset, as well as a relational dataset and benchmark derived from nuScenes. Our extensive range of experiments demonstrate that our model can more accurately predict semantic relationships and behavior in a given traffic scene than other popular traffic scene prediction models. In particular, regarding the use of HD map prior knowledge, we found that the resulting increase in accuracy significantly outweighs performance loss caused by the increase in graph size. The downstream applications of RSG include traffic scene retrieval and synthetic traffic scene generation, which are briefly described.

    DOI: 10.1109/OJITS.2023.3260624


  28. Real-Time Graph-Based Optimization for GNSS-Doppler Integrated RTK-GNSS/IμDR Positioning System in Urban Area

    Aoki Takanose, Eijiro Takeuchi, Alexander Carballo, Junichi Meguro, Kazuya Takeda

    IEEE Intelligent Vehicles Symposium, Proceedings   Vol. 2023-June   2023


    Publishing type:Research paper (international conference proceedings)  

    Autonomous driving of vehicles and robots requires highly accurate position information, and RTK-GNSS is expected to be utilized for this purpose. In this paper, we propose a robust and real-time operation method by introducing graph optimization into the integrated RTK-GNSS/IMU method. The proposed method is an extension of a method using vehicle trajectories that can estimate positions with lane-level accuracy even in urban areas. The position is estimated by removing GNSS multipaths from the shape of a vehicle trajectory of several hundred meters and averaging the remaining GNSS results. This method does not take into account the errors in the vehicle trajectory and cannot fully benefit from the high accuracy positioning solution of RTK-GNSS. To solve this problem, we introduce graph optimization to the base method, which treats the error state as a probabilistic model. However, general graph optimization methods have problems with processing time and outlier elimination. The proposed method solves these problems by restricting the time series data to be optimized and using a two-step optimization structure. Evaluations show that the proposed method is effective because it satisfies the requirements for real-time operation and improves accuracy compared to conventional methods.

    DOI: 10.1109/IV55152.2023.10186672



  29. Uncertainty Aware Task Allocation for Human-Automation Cooperative Recognition in Autonomous Driving Systems

    Atsushi Kuribayashi, Eijiro Takeuchi, Alexander Carballo, Yoshio Ishiguro, Kazuya Takeda

    IEEE Intelligent Vehicles Symposium, Proceedings   Vol. 2023-June   2023


    Publishing type:Research paper (international conference proceedings)  

    Cooperative recognition, a method to achieve human-automation cooperation in the recognition phase of the autonomous driving system, has been proposed to address the challenges in the conventional control phase cooperation, e.g., taking over vehicle control. In cooperative recognition, the operator intervenes in recognition tasks that are difficult for the automated system alone to improve driving efficiency and safety. The challenge is the integration of both human and automated systems while both participants have different characteristics, processing capabilities, and uncertainty in the decisions (recognition results). The objectives of this study are task allocation (i.e., when and for which targets the operator should intervene) taking into account the intervention efficiency and human state. And also combine the human intervention and recognition result of the automated systems to solve the uncertainties in both participants. We formulated this problem with a Partially Observable Markov Decision Process (POMDP). The simulator experiment indicated that the recognition result of the automated system and the operator's intervention were stochastically combined. The intervention requests to the operator adapted to the operator state and could be reduced while maintaining driving efficiency and minimizing risk omissions.

    DOI: 10.1109/IV55152.2023.10186725



  30. RSG-Search: Semantic Traffic Scene Retrieval Using Graph-Based Scene Representation

    Yafu Tian, Alexander Carballo, Ruifeng Li, Kazuya Takeda

    IEEE Intelligent Vehicles Symposium, Proceedings   Vol. 2023-June   2023


    Publishing type:Research paper (international conference proceedings)  

    Browsing specific traffic scene in large-scale dataset is an increasing demand from researchers, self-driving community and insurance companies. It is easy to search scenes with specific tags such as "rain", "snow", or "on highway". However, searching specific scene configurations, like "two vehicles waiting for a person crossing the road", is still an open problem. In this paper, we provide RSG-search, a scene-graph based traffic scene retrieval method, based on our previous research on traffic scene-graph generation. By previously translating open datasets to scene graphs, we can ignore irrelevant details, and efficiently search specific scene configuration among thousands of traffic scenes. Experiment results shows that our graph searching method is able to retrieve results for a given query with high accuracy. Our method simplifies the task of scene retrieval, opening opportunities for new applications.

    DOI: 10.1109/IV55152.2023.10186641



  31. Predictive World Models from Real-World Partial Observations.

    Robin Karlsson, Alexander Carballo, Keisuke Fujii, Kento Ohtani, Kazuya Takeda

    MOST     page: 152 - 166   2023


    Publishing type:Research paper (international conference proceedings)  

    DOI: 10.1109/MOST57249.2023.00024

    Other Link: https://dblp.uni-trier.de/db/conf/most/most2023.html#KarlssonC0OT23

  32. ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment Reviewed

    Robin Karlsson, Tomoki Hayashi, Keisuke Fujii, Alexander Carballo, Kento Ohtani, Kazuya Takeda

    The 33rd British Machine Vision Conference (BMVC 2022)     2022.11


    Authorship:Last author, Corresponding author   Language:English  

    DOI: 10.48550/arXiv.2111.12460

  33. Methods of Gently Notifying Pedestrians of Approaching Objects when Listening to Music Reviewed

    Yuki Sakashita, Yoshio Ishiguro, Kento Ohtani, Kazuya Takeda, Takanori Nishino

    UIST '22: The 35th Annual ACM Symposium on User Interface Software and Technology   ( 39 ) page: 1 - 4   2022.10


    Authorship:Corresponding author   Language:English  

    DOI: 10.1145/3526114.3558728

  34. Improvement of anomalous sound detection method considering the distribution of embedding Reviewed

    Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda

    the 24th International Congress on Acoustics     page: 5   2022.10



  35. Automatic Generation System for Trip Summary Videos Using Dashcam Footage Reviewed

    Kana Bito, Yoshio Ishiguro, Itiro Siio, Kazuya Takeda

    Computer Software   Vol. 39 ( 4 ) page: 4_144 - 4_157   2022.10

    Authorship:Last author, Corresponding author   Language:Japanese  

    DOI: 10.11309/jssst.39.4_144

  36. Automatic Fault Detection in Race Walking From a Smartphone Camera via Fine-Tuning Pose Estimation Reviewed

    Tomohiro Suzuki, Kazuya Takeda, Keisuke Fujii

    2022 IEEE 11th Global Conference on Consumer Electronics     2022.10


    Authorship:Corresponding author   Language:English  

    DOI: 10.1109/GCCE56475.2022.10014142

  37. Evaluation of creating scoring opportunities for teammates in soccer via trajectory prediction Reviewed

    Masakiyo Teranishi, Kazushi Tsutsui, Kazuya Takeda, Keisuke Fujii

    9th Workshop on Machine Learning and Data Mining for Sports Analytics 2022 (MLSA'22) co-located with the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD'22)     2022.9


    Authorship:Corresponding author   Language:English  

    DOI: 10.48550/arXiv.2206.01899


    Tetsuya Nagashima, Ming Ding, Keisuke Fujii, Kazuya Takeda

    33rd Congress of the International Council of the Aeronautical Sciences     2022.9


    Authorship:Last author, Corresponding author   Language:English  

  39. Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure Reviewed

    Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda

    the 30th European Signal Processing Conference (EUSIPCO)     2022.8



    DOI: 10.48550/arXiv.2206.05929

  40. Emergence of Collaborative Hunting via Multi-Agent Deep Reinforcement Learning Reviewed

    Kazushi Tsutsui, Kazuya Takeda, Keisuke Fujii

    International Workshop on Human Behavior Understanding (HBU'22) in conjunction with International Conference on Pattern Recognition (ICPR'22)     2022.8


    Authorship:Corresponding author   Language:English  

  41. Evaluating a third base coach’s decision-making via game theory and machine learning Reviewed

    Hiroshi Nakahara, Kazuya Takeda, Keisuke Fujii

    MathSport International, 2022     2022.7


    Authorship:Corresponding author   Language:English  

  42. Automatic screen-play classification in basketball via semi-supervised learning Reviewed

    Ziyi Zhang, Kazuya Takeda, Keisuke Fujii

    MathSport International 2022     2022.7


    Authorship:Corresponding author   Language:English  

  43. GNSS/IMU Performance Improvement Based on Acceleration Error Estimation Using Height Variation Reviewed

    Aoki Takanose, Kaito Kondo, Yuta Hoda, Junichi Meguro, Kazuya Takeda

    15th International Symposium on Advanced Vehicle Control (AVEC’22)   Vol. Mo1B-01   2022.5


    Authorship:Last author, Corresponding author   Language:English  

  44. SecretSign: A Method of Finding a Specific Vehicle Privately and Quickly using Flashing Lights Reviewed

    Yusuke Sakai, Hiromi Morita, Yoshio Ishiguro, Takanori Nishino, Kazuya Takeda

    IEEE Intelligent Transportation Systems Magazine   Vol. 14 ( 1 ) page: 216 - 227   2022.1


    Language:Japanese   Publishing type:Research paper (scientific journal)   Publisher:Institute of Electrical and Electronics Engineers (IEEE)  

    DOI: 10.1109/MITS.2020.2970187

    Web of Science


  45. Data-Driven Risk-Sensitive Control for Personalized Lane Change Maneuvers

    Naren Bao, Linda Capito, Dongfang Yang, Alexander Carballo, Chiyomi Miyajima, Kazuya Takeda

    IEEE ACCESS   Vol. 10   page: 36397 - 36415   2022


    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    Most current research in the field of autonomous vehicle control assumes that all vehicles will follow the same patterns of automated driving behavior, resulting in systems with "conservative" or "average" driving styles. These systems may not be acceptable to drivers who prefer a more aggressive style of driving, however, while extremely cautious drivers may consider the standard outputs to be too aggressive. To address this problem, in this paper, we introduce Risk Sensitive Control (RSC), an inverse optimal control algorithm that estimates risk-sensitive driving features and incorporates them into a receding-horizon controller. RSC uses a meta-learning algorithm to update the parameters of the cost function, continuously improving the controller online as more and more driving data is gathered from the user and subjective risk feedback. An estimator takes into account individual differences in subjective risk analysis, in terms of driving features and surrounding vehicle locations, by adjusting the cost function and its constraints. We test this approach using five lane change scenarios, some safe and some risky, with thirty real drivers in a CARLA simulation environment. Our quantitative and qualitative evaluations demonstrate that the proposed framework is able to generate a user's preferred driving maneuvers during lane changes, i.e., control commands the user associates with lower subjective risk, outperforming conventional, model-based predictive control methods in terms of replicating the user's own driving behavior.

    DOI: 10.1109/ACCESS.2022.3163267

    Web of Science

  46. Real-to-Synthetic: Generating Simulator Friendly Traffic Scenes from Graph Representation

    Yafu Tian, Alexander Carballo, Ruifeng Li, Kazuya Takeda

    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV)     page: 1615 - 1622   2022


    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    Reproducing real-world traffic scenes in the simulator is fundamental to training self-driving systems. Creating a simulation scenario is a complex task, generally done manually: the ego-vehicle and other entities are placed and their trajectories defined, trying to recreate some situation found in real traffic. To reduce the manual burden, here we propose the Real-to-Synthetic toolset. This toolset provides synthetic traffic scenes in OpenDRIVE format, which can be directly simulated in many simulators such as SUMO or CARLA. Also, we provide a scene generator which generates near-realistic scenes with minimal user effort. To maintain the similarity between the real-world scene and the generated one, here we introduce the concept of the "Road Scene Graph" (RSG). In this graph, nodes represent entities while edges stand for pairwise relationships. These relationships can be maintained in the scene generation process while the actors are generated according to the distribution sampled from real-world data. Experiments showed that by using the "Road Scene Graph", our scene generator offers a much more convenient way to configure traffic scenes than manually defining every actor's initial status and trajectories.

    DOI: 10.1109/IV51971.2022.9827441

    Web of Science

  47. Deepware: An Open-Source Toolkit for Developing and Evaluating Learning-Based and Model-Based Autonomous Driving Models

    Shunya Seiya, Alexander Carballo, Eijiro Takeuchi, Kazuya Takeda

    IEEE ACCESS   Vol. 10   page: 105734 - 105743   2022


    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    In recent decades, many learning-based autonomous driving systems have been proposed, and researchers have also created toolkits for developing these systems. These toolkits allow developers to train their models easily, and then test them using simulators. Existing toolkits for learning-based autonomous driving systems have some limitations however, which include inability to reuse modules or to perform accurate comparisons with model-based systems, as well as a lack of support for middle-to-middle models. As a solution, in this paper we introduce Deepware, an end-to-end toolkit for developing and evaluating learning-based autonomous driving models. Deepware includes the tools needed for collecting and evaluating datasets, training models, and evaluating models on simulators or in real-world environments using actual vehicles. Unlike existing toolkits, we used ROS as our platform, which is a set of software frameworks for robot software development widely used in autonomous driving systems as middleware, which allows cooperation with model-based systems. This approach also allows system modules to be shared when building models. In addition, it allows the comparison of learning-based and model-based methods under the same conditions. Moreover, by extracting features from model-based systems, our toolkit can also support middle-to-middle models. The proposed Deepware toolkit and dataset are available at: https://github.com/shunchan0677/deepware.

    DOI: 10.1109/ACCESS.2022.3212152

    Web of Science

  48. Disentangled Bad Weather Removal GAN for Pedestrian Detection

    Yang Hanting, Carballo Alexander, Takeda Kazuya



  49. Driving Risk and Intervention: Subjective Risk Lane Change Dataset

    Naren Bao, Alexander Carballo, Kazuya Takeda

    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV)     page: 556 - 562   2022


    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    When developing truly driverless mobility for the future, one key index used to measure the matureness of a particular self-driving technology is the driver intervention rate. One method which has proven to be effective for decreasing intervention rates is the use of personalized driving models that can mimic the driving style and preferences of a targeted user, so that autonomous driving feels safer and more natural to them. To create such models, quantitative data should be collected from users in order to determine the style of driving that a particular user, or type of user, prefers. In this paper, we introduce the Subjective Risk Lane Change (SRLC) Dataset, which includes ego vehicle driving behavior data, surrounding vehicle location information, and the subjective risk scores of users, collected during both safe and risky lane change scenarios encountered in CARLA simulators, as well as demographic information for our 30 participants. Furthermore, user intervention data for all of our participants was collected from Personalized Model Predictive Controllers during the generated lane change maneuvers. As far as the authors are able to determine, no other public dataset provides driving behavior signal and intervention timing information collected during driver interventions. Our dataset can be used to gain insights into a variety of personal driving styles, allowing the improvement of adaptive autonomous driving systems, and leading to safer and more widely accepted driverless technology.

    DOI: 10.1109/IV51971.2022.9827358

    Web of Science

  50. Occlusion-Aware Motion Planning With Visibility Maximization via Active Lateral Position Adjustment

    Narksri Patiphon, Darweesh Hatem, Takeuchi Eijiro, Ninomiya Yoshiki, Takeda Kazuya

    IEEE ACCESS   Vol. 10   page: 57759 - 57782   2022


  51. Deep Reinforcement Learning in a Racket Sport for Player Evaluation With Technical and Tactical Contexts

    Ning Ding, Kazuya Takeda, Keisuke Fujii

    IEEE Access   Vol. 10   page: 54764 - 54772   2022


    Language:Japanese   Publishing type:Research paper (scientific journal)  

    DOI: 10.1109/ACCESS.2022.3175314

    Web of Science

  52. An enhanced driver's risk perception modeling based on gate recurrent unit network

    Ping Peng, Ding Weiping, Liu Yongkang, Takeda Kazuya

    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV)     page: 1234 - 1240   2022


  53. Auditory and visual warning information generation of the risk object in driving scenes based on weakly supervised learning

    Niu Yinjie, Ding Ming, Zhang Yuxiao, Ohtani Kento, Takeda Kazuya

    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV)     page: 1572 - 1577   2022


  54. Cooperative play classification in team sports via semi-supervised learning Reviewed

    Ziyi Zhang, Kazuya Takeda, Keisuke Fujii

    International Journal of Computer Science in Sport   Vol. 21 ( 1 ) page: 111 - 121   2022


    Language:English   Publishing type:Research paper (scientific journal)  

    DOI: 10.2478/ijcss-2022-0006

  55. Estimating counterfactual treatment outcomes over time in multi-vehicle simulation Reviewed

    Keisuke Fujii, Koh Takeuchi, Atsushi Kuribayashi, Naoya Takeishi, Yoshinobu Kawahara, Kazuya Takeda

    SIGSPATIAL/GIS   ( 7 ) page: 7 - 4   2022


    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:ACM  

    DOI: 10.1145/3557915.3560941

    Other Link: https://dblp.uni-trier.de/db/conf/gis/gis2022.html#0001TKTKT22

  56. Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment

    Robin Karlsson, Tomoki Hayashi, Keisuke Fujii, Alexander Carballo, Kento Ohtani, Kazuya Takeda

    BMVC     page: 699 - 699   2022


    Publishing type:Research paper (international conference proceedings)  

    Other Link: https://dblp.uni-trier.de/rec/conf/bmvc/2022

  57. Autonomous Driving in Adverse Weather Conditions: A Survey Reviewed

    Y. Zhang, A. Carballo, H. Yang, K. Takeda

    arXiv preprint arXiv:2112.08936     2021.12


    Authorship:Last author   Language:English  

  58. ViCE: Self-Supervised Visual Concept Embeddings as Contextual and Pixel Appearance Invariant Semantic Representations Reviewed

    Robin Karlsson, Tomoki Hayashi, Keisuke Fujii, Alexander Carballo, Kento Ohtani, Kazuya Takeda

    arXiv preprint arXiv:2111.12460     2021.11


    Authorship:Last author   Language:English  


    Ibuki Kuroyanagi, Tomoki Hayashi, Yusuke Adachi, Takenori Yoshimura, Kazuya Takeda, Tomoki Toda

    Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021)     page: 110 - 114   2021.11



  60. End-to-End Learning-based Driving System with Branches by Emphasizing Target Direction Reviewed

    Shunya Seiya, Kento Ohtani, Alexander Carballo, Eijiro Takeuchi, Kazuya Takeda

      Vol. 52 ( 6 )   2021.11


    Authorship:Last author   Language:Japanese  

  61. FollowSelect: Path-based Menu Interaction for Intuitive Navigation Reviewed

      Vol. 62 ( 10 ) page: 1669 - 1680   2021.10


    Authorship:Last author   Language:Japanese  

    DOI: 10.20729/00213195

  62. Flexible prediction of opponent motion with internal representation in interception behavior Reviewed

    Kazushi Tsutsui, Keisuke Fujii, Kazutoshi Kudo, Kazuya Takeda

    Biological Cybernetics   Vol. 115 ( 5 ) page: 473 - 485   2021.10


    Authorship:Last author   Language:Japanese   Publishing type:Research paper (scientific journal)   Publisher:Springer Science and Business Media LLC  

    Skilled interception behavior often relies on accurate predictions of external objects because of a large delay in our sensorimotor systems. To deal with the sensorimotor delay, the brain predicts future states of the target based on the current state available, but it is still debated whether internal representations acquired from prior experience are used as well. Here we estimated the predictive manner by analyzing the response behavior of a pursuer to a sudden directional change of the evasive target, providing strong evidence that prediction of target motion by the pursuer was incompatible with a linear extrapolation based solely on the current state of the target. Moreover, using neural network models, we validated that nonlinear extrapolation as estimated was computationally feasible and useful even against unknown opponents. These results support the use of internal representations in predicting target motion, suggesting the usefulness and versatility of predicting external object motion through internal representations.

    DOI: 10.1007/s00422-021-00891-9

    Web of Science

    Other Link: https://link.springer.com/article/10.1007/s00422-021-00891-9/fulltext.html

  63. Supervised sequential pattern mining of event sequences in sport to identify important patterns of play: an application to rugby union Reviewed

    Rory Bunker, Keisuke Fujii, Hiroyuki Hanada, Ichiro Takeuchi

    PLOS One     2021.9


  64. Anomalous Sound Detection Using a Binary Classification Model and Class Centroids Reviewed

    Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda

    European Signal Processing Conference 2021 (EUSIPCO 2021)     2021.8


    Authorship:Corresponding author   Language:English  

  65. Prediction of Personalized Driving Behaviors via Driver-Adaptive Deep Generative Models Reviewed

    Naren Bao, Alexander Carballo, Kazuya Takeda

    2021 IEEE Intelligent Vehicles Symposium (IV)     page: 616 - 621   2021.7


    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    DOI: 10.1109/iv48863.2021.9575671


    Ibuki Kuroyanagi, Tomoki Hayashi, Yusuke Adachi, Takenori Yoshimura, Kazuya Takeda, Tomoki Toda

    Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021)     2021.7



  67. Leveraging state-of-the-art ASR techniques to audio captioning Reviewed

    Chaitanya Narisetty, Tomoki Hayashi, Ryunosuke Ishizaki, Shinji Watanabe, Kazuya Takeda

    Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021)     page: 160 - 164   2021.7


    Authorship:Last author   Language:English  

  68. Deadlock-free planner for occluded intersections using estimated visibility of hidden vehicles Reviewed

    Patiphon Narksri, Eijiro Takeuchi, Yoshiki Ninomiya, Kazuya Takeda

    Electronics (Switzerland)   Vol. 10 ( 4 ) page: 1 - 28   2021.2


    Language:Japanese   Publishing type:Research paper (scientific journal)  

    A common approach used for planning blind intersection crossings is to assume that hypothetical vehicles are approaching the intersection at a constant speed from the occluded areas. Such an assumption can result in a deadlock problem, causing the ego vehicle to remain stopped at an intersection indefinitely due to insufficient visibility. To solve this problem and facilitate safe, deadlock-free intersection crossing, we propose a blind intersection planner that utilizes both the ego vehicle and the approaching vehicle’s visibility. The planner uses a particle filter and our proposed visibility-dependent behavior model of approaching vehicles for predicting hidden vehicles. The behavior model is designed based on an analysis of actual driving data from multiple drivers crossing blind intersections. The proposed planner was tested in a simulation and found to be effective for allowing deadlock-free crossings at intersections where a baseline planner became stuck in a deadlock. The effects of perception accuracy and sensor position on output motion were also investigated. It was found that the proposed planner delayed crossing motion when the perception was imperfect. Furthermore, our results showed that the planner decelerated less while crossing the intersection with the front-mounted sensor configuration compared to the roof-mounted configuration due to the improved visibility. The minimum speed difference between the two sensor configurations was 1.82 m/s at an intersection with relatively poor visibility and 1.50 m/s at an intersection with good visibility.

    DOI: 10.3390/electronics10040411

    Web of Science


  69. Motion Analysis and Performance Improved Method for 3D LiDAR Sensor Data Compression Reviewed

    Chenxi Tu, Eijiro Takeuchi, Alexander Carballo, Chiyomi Miyajima, Kazuya Takeda

    IEEE Transactions on Intelligent Transportation Systems   Vol. 22 ( 1 ) page: 243 - 256   2021.1


    Language:Japanese   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    Continuous point cloud data is being used more and more widely in practical applications such as mapping, localization and object detection in autonomous driving systems, but due to the huge volume of data involved, sharing and storing this data is currently expensive and difficult. One possible solution is the development of more efficient methods of compressing the data. Other researchers have proposed converting 3D point cloud data into 2D images, or using tree structures to store the data. In a previous study targeting streaming point cloud data, we proposed an MPEG-like compression method which utilizes simultaneous localization and mapping (SLAM) results to simulate LiDAR's operating process. In this paper, instead of imitating MPEG, we propose a new strategy for more efficient reference frame distribution and more natural frame prediction, and use a different algorithm to encode the residual, greatly improving the algorithm's performance and its stability in different scenarios. We also discuss how various parameters affect compression performance. Using our proposed method, streaming point cloud data collected by LiDAR sensors can be compressed to 1/50th of its original size, with only 2 cm of Root Mean Square Error for each detected point. We evaluate our proposed method by comparing its performance with several other existing point cloud compression methods in three different driving scenarios, demonstrating that our proposed method outperforms them.

    DOI: 10.1109/TITS.2019.2956066

    Web of Science


  70. RSG-Net: Towards Rich Semantic Relationship Prediction for Intelligent Vehicle in Complex Environment Reviewed

    Yafu Tian, Alexander Carballo, Ruifeng Li, Kazuya Takeda

    2021 IEEE Intelligent Vehicles Symposium (IV)     page: 546 - 552   2021


  71. Automatic Generation of Road Trip Summary Video for Reminiscence and Entertainment using Dashcam Video Reviewed

    Kana Bito, Itiro Siio, Yoshio Ishiguro, Kazuya Takeda

    13th International Conference on Automotive User Interfaces and Interactive Vehicular Applications     page: 181 - 190   2021



    DOI: 10.1145/3409118.3475151

    Web of Science

  72. Learning a Model for Inferring a Spatial Road Lane Network Graph using Self-Supervision Reviewed

    Robin Karlsson, David Robert Wong, Simon Thompson, Kazuya Takeda

    The 24th IEEE International Conference on Intelligent Transportation Systems (ITSC 2021)     page: 812 - 819   2021


  73. A recognition phase Intervention Interface to Improve Naturalness of Autonomous Driving for Distracted Drivers Reviewed

    Atsushi Kuribayashi, Eijiro Takeuchi, Alexander Carballo, Yoshio Ishiguro, Kazuya Takeda

    2021 IEEE International Intelligent Transportation Systems Conference (ITSC)     page: 1737 - 1744   2021


    Language:Japanese   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    DOI: 10.1109/ITSC48978.2021.9564557

    Web of Science

  74. Visibility Estimation in Complex, Real-World Driving Environments Using High Definition Maps Reviewed

    Patiphon Narksri, Hatem Darweesh, Eijiro Takeuchi, Yoshiki Ninomiya, Kazuya Takeda

    The 24th IEEE International Conference on Intelligent Transportation Systems (ITSC 2021)     page: 2847 - 2854   2021


  75. OpenPlanner 2.0: The Portable Open Source Planner for Autonomous Driving Applications Reviewed

    Hatem Darweesh, Eijiro Takeuchi, Kazuya Takeda

    IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)     page: 313 - 318   2021


  76. Characterization of Multiple 3D LiDARs for Localization and Mapping Performance using the NDT Algorithm Reviewed

    Alexander Carballo, Abraham Monrroy, David Wong, Patiphon Narksri, Jacob Lambert, Yuki Kitsukawa, Eijiro Takeuchi, Shinpei Kato, Kazuya Takeda



    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    In this work, we present a detailed comparison of ten different 3D LiDAR sensors for the tasks of mapping and vehicle localization, using as common reference the Normal Distributions Transform (NDT) algorithm implemented in the self-driving open source platform Autoware. LiDAR data used in this study is a subset of our LiDAR Benchmarking and Reference (LIBRE) dataset, captured independently from each sensor, from a vehicle driven on public urban roads multiple times, at different times of the day. In this study, we analyze the performance and characteristics of each LiDAR for the tasks of (1) 3D mapping, including an assessment of map quality based on mean map entropy, and (2) 6-DOF localization using a ground truth reference map.

    DOI: 10.1109/IVWorkshops54471.2021.9669244

    Web of Science

  77. Eagleye: A Lane-Level Localization Using Low-Cost GNSS/IMU Reviewed

    Aoki Takanose, Yuki Kitsukawa, Junichi Meguro, Eijiro Takeuchi, Alexander Carballo, Kazuya Takeda

    2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)     page: 319 - 326   2021


    Language:Japanese   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    DOI: 10.1109/IVWorkshops54471.2021.9669209

    Web of Science

  78. A Comparison of Methods for Sharing Recognition Information and Interventions to Assist Recognition in Autonomous Driving System Reviewed

    Atsushi Kuribayashi, Eijiro Takeuchi, Alexander Carballo, Yoshio Ishiguro, Kazuya Takeda

    2021 IEEE Intelligent Vehicles Symposium (IV)     page: 622 - 629   2021


    Language:Japanese   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    DOI: 10.1109/IV48863.2021.9575707

    Web of Science

  79. How to monitor multiple autonomous vehicles remotely with few observers: An active management method Reviewed

    Ming Ding, Eijiro Takeuchi, Yoshio Ishiguro, Yoshiki Ninomiya, Nobuo Kawaguchi and Kazuya Takeda

    2021 IEEE Intelligent Vehicles Symposium (IV)     page: 1168 - 1173   2021


  80. Anomalous Sound Detection Using a Binary Classification Model and Class Centroids

    Kuroyanagi Ibuki, Hayashi Tomoki, Takeda Kazuya, Toda Tomoki

    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021)     page: 1995 - 1999   2021



    Web of Science

  81. A Method for Location Initialization of Handheld Devices using Autonomous Driving Vehicles for Interactive Systems

    Ishiguro Yoshio, Takeda Kazuya




    DOI: 10.1145/3473682.3480273

    Web of Science

  82. Cross-Lingual Voice Conversion using a Cyclic Variational Auto-encoder and a WaveNet Vocoder Reviewed

    Hikaru Nakatani, Patrick Lumban Tobing, Kazuya Takeda, Tomoki Toda

    2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings     page: 520 - 526   2020.12


    Publishing type:Research paper (international conference proceedings)  

    We propose a novel, cross-lingual voice conversion (VC) method using a cyclic variational auto-encoder (CycleVAE). Voice conversion is the transformation of the voice of one speaker into the voice of another speaker, while cross-lingual VC performs voice conversion between speakers who speak different languages. When using VC methods based on parallel learning, it is necessary to prepare accented speech uttered by the source or target speaker, using the pronunciation system of the speaker's mother tongue. On the other hand, VC methods which use a non-parallel learning approach can utilize the natural speech data of both the source and target speakers, produced in their own native languages. It then becomes necessary, however, to deal with the issues of time-alignment and language mismatches. To address these issues, we apply CycleVAE to cross-lingual VC as a sophisticated, non-parallel method of VC. We also apply the WaveNet vocoder in the waveform generation process of CycleVAE-VC to improve overall conversion quality. Our objective and subjective experimental results when performing cross-lingual VC from a native English speaker to a native Japanese speaker confirm that the proposed method achieves a higher level of naturalness and speaker similarity than a conventional RNN-based parallel VC method using accented speech.


    Other Link: https://dblp.uni-trier.de/conf/apsipa/2020

  83. Generation of Origami Folding Animations from 3D Point Cloud Using Latent Space Interpolation Reviewed

    Chiaki Nakagaito, Takanori Nishino, Kazuya Takeda

    SIGGRAPH Asia 2020 Posters. SA 2020     page: 30 - 2   2020.12


    Publishing type:Research paper (international conference proceedings)  

    DOI: 10.1145/3415264.3425450


    Other Link: https://dblp.uni-trier.de/db/conf/siggrapha/siggrapha2020posters.html#NakagaitoNT20

  84. Road Scene Graph: A Semantic Graph-Based Scene Representation Dataset for Intelligent Vehicles Reviewed

    Yafu Tian, Alexander Carballo, Ruifeng Li, Kazuya Takeda

    CoRR   Vol. abs/2011.13588   2020.11


    Publishing type:Research paper (scientific journal)  

    Rich semantic information extraction plays a vital role on next-generation
    intelligent vehicles. Currently there is great amount of research focusing on
    fundamental applications such as 6D pose detection, road scene semantic
    segmentation, etc. And this provides us a great opportunity to think about how
    shall these data be organized and exploited.
    In this paper we propose road scene graph, a special scene-graph for
    intelligent vehicles. Different to classical data representation, this graph
    provides not only object proposals but also their pair-wise relationships. By
    organizing them in a topological graph, these data are explainable,
    fully-connected, and could be easily processed by GCNs (Graph Convolutional
    Networks). Here we apply scene graph on roads using our Road Scene Graph
    dataset, including the basic graph prediction model. This work also includes
    experimental evaluations using the proposed model.


    Other Link: http://arxiv.org/pdf/2011.13588v1


    Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda

    Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE Workshop)     page: 100 - 104   2020.11


    Authorship:Last author, Corresponding author   Language:English   Publishing type:Research paper (scientific journal)  

  86. Point Grid Map-Based Mid-To-Mid Driving without Object Detection Reviewed

    Shunya Seiya, Alexander Carballo, Eijiro Takeuchi, and Kazuya Takeda

    2020 IEEE Intelligent Vehicles Symposium(IV)     2020.10


    Authorship:Last author, Corresponding author   Language:English   Publishing type:Research paper (scientific journal)  

    DOI: 10.1109/IV47402.2020.9304809

  87. Trajectory prediction with imitation learning reflecting defensive evaluation in team sports Reviewed

    Masakiyo Teranishi, Keisuke Fujii, Kazuya Takeda

    2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020     page: 124 - 125   2020.10


    Publishing type:Research paper (international conference proceedings)  

    In team sports, qualitative video analysis is mainly used for decision making on team tactics because of the complex interactions involved. Among quantitative analyses, most studies in team trajectory generation have been evaluated only on prediction errors and did not take tactical evaluations (e.g., good defense) into consideration. In this paper, we propose a trajectory prediction method incorporating defensive evaluation (i.e., how well the defenders protect the goal) into a multi-agent imitation learning model. Although the proposed method had similar prediction performance to the existing method, our method generated an improved trajectory in terms of defensive evaluation.

    DOI: 10.1109/GCCE50665.2020.9291841


    Other Link: https://dblp.uni-trier.de/db/conf/gcce/gcce2020.html#TeranishiFT20

  88. Extracting Human-Like Driving Behaviors from Expert Driver Data Using Deep Learning Reviewed

    Kyle Sama, Yoichi Morales, Hailong Liu, Naoki Akai, Alexander Carballo, Eijiro Takeuchi, Kazuya Takeda

    IEEE Transactions on Vehicular Technology   Vol. 69 ( 9 ) page: 9315 - 9329   2020.9


    Language:Japanese   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    © 1967-2012 IEEE. This paper introduces a method to extract driving behaviors from a human expert driver, which are then applied to an autonomous agent to reproduce proactive driving behaviors. Deep learning techniques were used to extract latent features from the collected data. The extracted features were clustered into behaviors and used to create velocity profiles, allowing an autonomous driving agent to drive in a human-like manner. By using proactive driving behaviors, the agent could limit potential sources of discomfort such as jerk and uncomfortable velocities. Additionally, we proposed a method to compare trajectories in which not only geometric similarity is considered, but also velocity, acceleration and jerk. Experimental results in a simulator implemented in ROS show that the autonomous agent built with the driving behaviors was capable of driving similarly to expert human drivers.

    DOI: 10.1109/TVT.2020.2980197

    Web of Science


  89. Policy learning with partial observation and mechanical constraints for multi-person modeling Reviewed

    Keisuke Fujii, Naoya Takeishi, Yoshinobu Kawahara, Kazuya Takeda

    CoRR   Vol. abs/2007.03155   2020.7

    Publishing type:Research paper (scientific journal)  

    Extracting the rules of real-world biological multi-agent behaviors is a current challenge in various scientific and engineering fields. Biological agents generally have limited observation and mechanical constraints; however, most of the conventional data-driven models ignore such assumptions, resulting in lack of biological plausibility and model interpretability for behavioral analyses in biological and cognitive science. Here we propose sequential generative models with partial observation and mechanical constraints, which can visualize whose information the agents utilize and can generate biologically plausible actions. We formulate this as a decentralized multi-agent imitation learning problem, leveraging binary partial observation models with a Gumbel-Softmax reparameterization and policy models based on hierarchical variational recurrent neural networks with physical and biomechanical constraints. We investigate the empirical performances using real-world multi-person motion datasets from basketball and soccer games.


    Other Link: http://arxiv.org/pdf/2007.03155v1

  90. Personalized Subjective Driving Risk: Analysis and Prediction Reviewed

    Bao Naren, Carballo Alexander, Miyajima Chiyomi, Takeuchi Eijiro, Takeda Kazuya

    Journal of Robotics and Mechatronics   Vol. 32 ( 3 ) page: 503-519   2020.6

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  91. Convolution augmented transformer for semi-supervised sound event detection Reviewed

    Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda

    Proc. Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)     page: 100 - 104   2020.6

    Authorship:Last author, Corresponding author   Language:English   Publishing type:Research paper (scientific journal)  

  92. Weakly-Supervised Sound Event Detection with Self-Attention Reviewed

    Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda

    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings   Vol. 2020-May   page: 66 - 70   2020.5

    Publishing type:Research paper (international conference proceedings)  

    © 2020 IEEE. In this paper, we propose a novel sound event detection (SED) method that incorporates the self-attention mechanism of the Transformer for a weakly-supervised learning scenario. The proposed method utilizes the Transformer encoder, which consists of multiple self-attention modules, allowing it to take both local and global context information of the input feature sequence into account. Furthermore, inspired by the great success of BERT in the natural language processing field, the proposed method introduces a special tag token into the input sequence for weak label prediction, which enables the aggregation of the whole sequence information. To demonstrate the performance of the proposed method, we conduct an experimental evaluation using the DCASE2019 Task4 dataset. The experimental results demonstrate that the proposed method outperforms the DCASE2019 Task4 baseline method, which is based on the convolutional recurrent neural network, and that the self-attention mechanism works effectively for SED.

    DOI: 10.1109/ICASSP40776.2020.9053609


    Other Link: https://dblp.uni-trier.de/db/conf/icassp/icassp2020.html#MiyazakiKH0TT20

  93. Characterization of Multiple 3D LiDARs for Localization and Mapping using Normal Distributions Transform Reviewed

    Alexander Carballo, Abraham Monrroy, David Wong, Patiphon Narksri, Jacob Lambert, Yuki Kitsukawa, Eijiro Takeuchi, Shinpei Kato, Kazuya Takeda

    arXiv preprint arXiv:2004.01374     2020.4

    Language:English   Publishing type:Research paper (scientific journal)  

  94. LIBRE: The Multiple 3D LiDAR Dataset Reviewed

    Alexander Carballo, Jacob Lambert, Abraham Monrroy, David Wong, Patiphon Narksri, Yuki Kitsukawa, Eijiro Takeuchi, Shinpei Kato, Kazuya Takeda

    arXiv preprint arXiv:2003.06129     2020.3

    Language:English   Publishing type:Research paper (scientific journal)  

  95. A Survey of Autonomous Driving: Common Practices and Emerging Technologies Reviewed

    Ekim Yurtsever, Jacob Lambert, Alexander Carballo, Kazuya Takeda

    IEEE Access   Vol. 8   page: 58443 - 58469   2020

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    © 2013 IEEE. Automated driving systems (ADSs) promise a safe, comfortable and efficient driving experience. However, fatalities involving vehicles equipped with ADSs are on the rise. The full potential of ADSs cannot be realized unless the robustness of state-of-the-art is improved further. This paper discusses unsolved problems and surveys the technical aspect of automated driving. Studies regarding present challenges, high-level system architectures, emerging methodologies and core functions including localization, mapping, perception, planning, and human machine interfaces, were thoroughly reviewed. Furthermore, many state-of-the-art algorithms were implemented and compared on our own platform in a real-world driving setting. The paper concludes with an overview of available datasets and tools for ADS development.

    DOI: 10.1109/ACCESS.2020.2983149

    Web of Science



    Other Link: http://arxiv.org/pdf/1906.05113v3

  96. Performance Analysis of 10 Models of 3D LiDARs for Automated Driving Reviewed

    Jacob Lambert, Alexander Carballo, Abraham Monrroy Cano, Patiphon Narksri, David Wong, Eijiro Takeuchi, Kazuya Takeda

    IEEE Access   Vol. 8   page: 131699 - 131722   2020

    Language:English   Publishing type:Research paper (scientific journal)  

    © 2013 IEEE. Automated vehicle technology has recently become reliant on 3D LiDAR sensing for perception tasks such as mapping, localization and object detection. This has led to a rapid growth in the LiDAR manufacturing industry with several competing makers releasing new sensors regularly. With this increased variety of LiDARs, each with different properties such as number of laser emitters, resolution, field-of-view, and price tags, a more in-depth comparison of their characteristics and performance is required. This work compares 10 commonly used 3D LiDARs, establishing several metrics to assess their performance. Various outstanding issues with specific LiDARs were qualitatively identified. The accuracy and precision of individual LiDAR beams and accumulated point clouds are evaluated in a controlled environment at distances from 5 to 180 meters. Reflective targets were used to characterize intensity patterns and quantify the impact of surface reflectivity on accuracy and precision. A vehicle and pedestrian mannequin were also used as additional targets of interest. A thorough assessment of these LiDARs is given with their potential applicability for automated driving tasks. The data collected in these experiments and analysis tools are all shared openly.

    DOI: 10.1109/ACCESS.2020.3009680

    Web of Science



    Yoshimura Takenori, Hayashi Tomoki, Takeda Kazuya, Watanabe Shinji


  98. Intelligibility enhancement based on speech waveform modification using hearing impairment Reviewed

    Shu Hikosaka, Shogo Seki, Tomoki Hayashi, Kazuhiro Kobayashi, Kazuya Takeda, Hideki Banno, Tomoki Toda

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   Vol. 2020-October   page: 4059 - 4063   2020

    Publishing type:Research paper (international conference proceedings)  

    Copyright © 2020 ISCA. In this paper, we propose a speech waveform modification method which incorporates a hearing impairment simulator, to improve speech intelligibility for the hearing-impaired. The settings of hearing aid devices usually need to be manually adjusted to suit the needs of each user, which creates a significant burden. To address this issue, the proposed method creates a spectral shaping filter, using a hearing impairment simulator capable of estimating speech signals as perceived by a specific hearing-impaired person. We conduct objective and subjective evaluations through simulations using the hearing impairment simulator. Our experimental results demonstrate that: 1) the proposed spectral shaping filter can significantly improve both speech intelligibility and quality, 2) the filter can be combined with a well-known speech intelligibility enhancement technique based on power compensation using dynamic range compression (DRC), and 3) speech intelligibility can be further improved by controlling the trade-off between filtering and DRC-based power compensation.

    DOI: 10.21437/Interspeech.2020-2062


    Other Link: https://dblp.uni-trier.de/db/conf/interspeech/interspeech2020.html#HikosakaSHKTBT20

  99. Point Grid Map-Based Mid-To-Mid Driving without Object Detection Reviewed

    Shunya Seiya, Alexander Carballo, Eijiro Takeuchi, Kazuya Takeda

    IEEE Intelligent Vehicles Symposium, Proceedings     page: 2044 - 2051   2020

    Publishing type:Research paper (international conference proceedings)  

    © 2020 IEEE. Teaching autonomous vehicles to imitate human driving in complex, urban traffic scenarios is a difficult task. 'End-to-end' autonomous driving systems, based on 'imitation learning', are a promising approach. A model learns the relationships between sensing input and vehicle control signal outputs. These methods can successfully achieve driving in simple scenarios such as lane keeping. In contrast, 'mid-to-mid' autonomous driving methods are now being proposed. In such a framework, the model learns the relationships between pre-processed feature maps from the model-based system as input and the future position of the ego vehicle as the output. Mid-to-mid driving methods can direct vehicles more robustly than end-to-end driving methods in some complex driving environments. However, mid-to-mid driving methods use the results of the object detection module to create the feature map. If object detection fails, or detection performance is poor due to changes in the driving environment, prediction performance may also be degraded. Our proposed method uses a prediction module that outputs point grid maps directly, without the use of an object detection module, which are then incorporated into the feature map. Point grid maps represent the locations of surrounding vehicles and obstacles directly, based on LiDAR point cloud data. Since the results of object detection are not used by the prediction module, detection performance does not affect prediction performance. In this study we conduct two experiments, an off-line evaluation using a Lyft dataset, and an on-line evaluation using the CARLA simulator. The results show that our model can achieve the same level of ego-vehicle position prediction performance as a model using annotated object location information.

    DOI: 10.1109/IV47402.2020.9304809


    Other Link: https://dblp.uni-trier.de/db/conf/ivs/ivs2020.html#SeiyaCTT20

  100. Nagoya University at TRECVID 2014: The instance search task Reviewed

    Cai Zhi Zhu, Yinqiang Zheng, Ichiro Ide, Shinichi Satoh, Kazuya Takeda

    2014 TREC Video Retrieval Evaluation, TRECVID 2014     2020

    Publishing type:Research paper (international conference proceedings)  

    © 2020 2014 TREC Video Retrieval Evaluation. All rights reserved. This paper presents our recent progress on a video object retrieval system that participated in the Instance Search (INS) task of TRECVID 2014. The system is a further extension of our previous Bag-of-Words (BOW) framework, with this year's emphasis on pursuing a practical spatial re-ranking method scalable to large video databases. We take the asymmetrical dissimilarity-based system, which performed best in the INS2013 task, as the baseline, and re-rank with an improved spatial verification method. Experiments carried out on TRECVID INS2013 and INS2014 consistently show that our re-ranking algorithm further improves the baseline system at a rather fast speed.


  101. Intervention Force-based Imitation Learning for Autonomous Navigation in Dynamic Environments Reviewed

    Tomoya Yokoyama, Shunya Seiya, Eijiro Takeuchi, Kazuya Takeda

        page: 1679 - 1688   2020

    Publishing type:Research paper (international conference proceedings)  

    Other Link: https://dblp.uni-trier.de/conf/apsipa/2020

  102. Attention-Based Speech Recognition Using Gaze Information

    Osamu Segawa, Tomoki Hayashi, Kazuya Takeda

    2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)     page: 465-470   2019.12

    Language:English   Publishing type:Research paper (scientific journal)  

  103. ITS+DM Hackathon (ITSC 2017): Lane Departure Prediction with Naturalistic Driving Data Reviewed

    Andrey Alekseenko, Hien Q. Dang, Gaurav Bansal, Javier Sanchez-Medina, Chiyomi Miyajima, Takatsugu Hirayama, Kazuya Takeda, Ichiro Ide

    IEEE Intelligent Transportation Systems Magazine   Vol. 11 ( 4 ) page: 78 - 93   2019.12

    Language:English   Publishing type:Research paper (scientific journal)  

    © 2009-2012 IEEE. On October 16th, 2017, in Yokohama, Japan, from 8:00 to 18:00, the first Intelligent Transportation Systems plus Data Mining challenge was organized under the umbrella of the 2017 IEEE Intelligent Transportation Systems Conference, the flagship conference of the IEEE Intelligent Transportation Systems Society. This activity was organized thanks to a three-way collaboration between the ITS Society, Nagoya University, and the IEEE ITSC 2017 organizers. The twenty-three contestants, coming from eleven different countries, faced a classic Naturalistic Driving problem: Lane Departure detection. This paper presents the three best solutions produced. The solutions submitted by most of the participants were very diverse and interesting, but overall, the top ones concurred in the use of ensemble learning after a very interesting feature engineering phase. This hackathon formulation was complex in several ways. It was complex in terms of class imbalance, the challenge time duration and the fact that the provided dataset included only numerical measurements coming from the inertial unit in the testing car. That restriction made it difficult to expect outstanding results: the best one was only slightly over 3% above baseline. However, the organizers thought that such complexities pushed participants to show their repertoire as data scientists, taking into consideration for example computer power load of the different algorithms tested, and overall yielding more interesting approaches to share with the community. Additionally, the most interesting lessons learned were shared, from both an organizational and technical point of view.

    DOI: 10.1109/MITS.2018.2880264

    Web of Science


  104. Motion Analysis and Performance Improved Method for 3D LiDAR Sensor Data Compression

    Chenxi Tu, Eijiro Takeuchi, Alexander Carballo, Chiyomi Miyajima, Kazuya Takeda

    IEEE Transactions on ITS     2019.11

    Language:English   Publishing type:Research paper (scientific journal)  

  105. Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder Reviewed

    Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda

    IEEE Access     2019.11

    Language:English   Publishing type:Research paper (scientific journal)  

  106. Estimating the Probabilities of Surrounding Vehicles' Intentions and Trajectories using a Behavior Planner

    Hatem Darweesh, Eijiro Takeuchi, Kazuya Takeda

    International Journal of Automotive Engineering   Vol. 10 ( 4 ) page: 299-308   2019.11

    Language:English   Publishing type:Research paper (scientific journal)  

  107. ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit Reviewed

    Tomoki Hayashi, Ryuichi Yamamoto, Katsuki Inoue, Takenori Yoshimura, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Yu Zhang, Xu Tan

    arXiv preprint arXiv:1910.10909     2019.10

    Language:English   Publishing type:Research paper (scientific journal)  

  108. Effects on user perception of a 'modified' speed experience through in-vehicle virtual reality Reviewed

    Yusuke Sakai, Toshimitsu Watanabe, Yoshio Ishiguro, Takanori Nishino, Kazuya Takeda

    Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications: Adjunct Proceedings     page: 166-170   2019.9

    Language:English   Publishing type:Research paper (scientific journal)  

  109. LeadingDisplay: a versatile, robotic display for infotainment in autonomous vehicles Reviewed

    Yoshio Ishiguro, Kazuya Takeda

    Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications: Adjunct Proceedings     page: 405-409   2019.9

    Language:English   Publishing type:Research paper (scientific journal)  

  110. Improving target selection accuracy for vehicle touch screens Reviewed

    Kosuke Ito, Kento Ohtani, Yoshio Ishiguro, Takanori Nishino, Kazuya Takeda

    Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications: Adjunct Proceedings     page: 176-180   2019.9

    Language:English   Publishing type:Research paper (scientific journal)  

  111. Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis Reviewed

    Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Shubham Toshniwal, Karen Livescu

    Proc. Interspeech 2019     page: 4430-4434   2019.9

    Language:English   Publishing type:Research paper (scientific journal)  

  112. Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds Reviewed

    Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda

    Proc. Interspeech 2019     page: 684-688   2019.9

    Language:English   Publishing type:Research paper (scientific journal)  

  113. Effects on the Perception of Speed and Normality When Virtual Reality Scenes Reviewed

    Yusuke Sakai, Toshimitsu Watanabe, Yoshio Ishiguro, Takanori Nishino, Kazuya Takeda

    11th International ACM Conference on Automotive User Interfaces and Interactive Vehicular Applications (2019 Automotive User Interfaces)     2019.9

    Language:English   Publishing type:Research paper (international conference proceedings)  

  114. Real-time Streaming Point Cloud Compression for 3D LiDAR Sensor Using U-net

    Chenxi Tu, Eijiro Takeuchi, Alexander Carballo, Kazuya Takeda

    IEEE Access     2019.8

    Language:English   Publishing type:Research paper (scientific journal)  

  115. Overview of the five key research groups within the Behaviour Signal Processing Laboratory (Takeda Laboratory) at Nagoya University

    Kazuya Takeda

    Impact   Vol. 2019 ( 5 ) page: 32-35   2019.6

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  116. A Survey of Autonomous Driving: Common Practices and Emerging Technologies Reviewed

    Ekim Yurtsever, Jacob Lambert, Alexander Carballo, Kazuya Takeda

    arXiv preprint arXiv:1906.05113     2019.6

    Language:English   Publishing type:Research paper (scientific journal)  

  117. Risky action recognition in lane change video clips using deep spatiotemporal networks with segmentation mask transfer Reviewed

    Ekim Yurtsever, Yongkang Liu, Jacob Lambert, Chiyomi Miyajima, Eijiro Takeuchi, Kazuya Takeda, John HL Hansen

    arXiv preprint arXiv:1906.02859     2019.6

    Language:English   Publishing type:Research paper (scientific journal)  

  118. A Predictive Reward Function for Human-Like Driving Based on a Transition Model of Surrounding Environment Reviewed

    Daiki Hayashi, Yunfei Xu, Takashi Bando, Kazuya Takeda

    2019 International Conference on Robotics and Automation (ICRA)     page: 7618-7624   2019.5

    Language:English   Publishing type:Research paper (scientific journal)  

  119. Point cloud compression for 3D LiDAR sensor using recurrent neural network with residual blocks Reviewed

    Chenxi Tu, Eijiro Takeuchi, Alexander Carballo, Kazuya Takeda

    2019 International Conference on Robotics and Automation (ICRA)     page: 3274-3280   2019.5

    Language:English   Publishing type:Research paper (scientific journal)  


    Komatsu Tatsuya, Hayashi Tomoki, Kondo Reishi, Toda Tomoki, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  121. Point Cloud Compression for 3D LiDAR Sensor using Recurrent Neural Network with Residual Blocks

    Tu Chenxi, Takeuchi Eijiro, Carballo Alexander, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  122. A Predictive Reward Function for Human-like Driving based on a Transition Model of Surrounding Environment

    Hayashi Daiki, Xu Yunfei, Bando Takashi, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  123. An Empirical Study of Adaptive Training of Daily Activity Classifier and Chat Application Designed for the Elderly to Go Out Reviewed

    Akira Tamamori, Yoshio Ishiguro, Kei Hiroi, Nobuo Kawaguchi, Kazuya Takeda

    IPSJ Transactions on Consumer Device & Systems (CDS)   Vol. 9 ( 2 ) page: 33 - 46   2019.5

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  124. Environmental sound processing and its applications Reviewed

    Koichi Miyazaki, Tomoki Toda, Tomoki Hayashi, Kazuya Takeda

    IEEJ Transactions on Electrical and Electronic Engineering   Vol. 14 ( 3 ) page: 340 - 351   2019.3

    Language:English   Publishing type:Research paper (scientific journal)  

    © 2019 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc. As part of the effort to develop techniques for understanding environments using sound, many studies in the field of computational auditory scene analysis have focused on using computers to perform functions carried out naturally by the human auditory system. Thanks to recent progress in machine-learning techniques, these environmental sound-processing techniques have significantly improved and a widening variety of applications has resulted in considerable interest in this field. In this review, we introduce the fundamental techniques of environmental sound processing, as well as recent advances in front-end and back-end processing and potential applications for these techniques. Prospects for further progress in the field of environmental sound processing and the challenges still to be overcome are also discussed.

    DOI: 10.1002/tee.22868

    Web of Science


  125. A traffic flow simulation framework for learning driver heterogeneity from naturalistic driving data using autoencoders Reviewed

    Ekim Yurtsever, Chiyomi Miyajima, Kazuya Takeda

    International Journal of Automotive Engineering   Vol. 10 ( 1 ) page: 86-93   2019.3

    Language:English   Publishing type:Research paper (scientific journal)  

  126. Real-Time Streaming Point Cloud Compression for 3D LiDAR Sensor Using U-Net

    Tu Chenxi, Takeuchi Eijiro, Carballo Alexander, Takeda Kazuya

    IEEE Access   Vol. 7   page: 113616-113625   2019

    Language:English   Publishing type:Research paper (scientific journal)  

    DOI: 10.1109/ACCESS.2019.2935253

    Web of Science

  127. Impact of driver behavior on fuel consumption: Classification, evaluation and prediction using machine learning Reviewed

    Peng Ping, Wenhu Qin, Yang Xu, Chiyomi Miyajima, Kazuya Takeda

    IEEE Access   Vol. 7   page: 78515 - 78532   2019

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    © 2013 IEEE. Driving behavior has a large impact on vehicle fuel consumption. Dedicated study on the relationship between the driving behavior and fuel consumption can contribute to decreasing the energy cost of transportation and the development of the behavior assessment technology for the ADAS system. Therefore, it is vital to evaluate this relationship in order to develop more ecological driving assistance systems and improve the vehicle fuel economy. However, modeling driving behavior under the dynamic driving conditions is complex, making a quantitative analysis of the relationship between the driving behavior and the fuel consumption difficult. In this paper, we introduce two kinds of machine learning methods for evaluating the fuel efficiency of driving behavior using the naturalistic driving data. In the first stage, we use an unsupervised spectral clustering algorithm to study the macroscopic relationship between driving behavior and fuel consumption, using the data collected during the natural driving process. In the second stage, the dynamic information from the driving environment and natural driving data is integrated to generate a model of the relationship between various driving behaviors and the corresponding fuel consumption features. The dynamic environment factors are coded into a processable, digital form using a deep learning-based object detection method so that the environmental data can be linked with the vehicle's operating signal data to provide the training data for the deep learning network. The training data are labeled according to its fuel consumption feature distribution, which is obtained from the road segment data and historical driving data. This deep learning-based model can then be used as a predictor of the fuel consumption associated with different driving behaviors. Our results show that the proposed method can effectively identify the relationship between the driving behavior and the fuel consumption on both macro and micro levels, allowing for end-to-end fuel consumption feature prediction, which can then be applied in the advanced driving assistance systems.

    DOI: 10.1109/ACCESS.2019.2920489

    Web of Science


  128. Training Engineers in Autonomous Driving Technologies using Autoware

    Carballo Alexander, Wong David, Ninomiya Yoshiki, Kato Shinpei, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  129. Risky Action Recognition in Lane Change Video Clips using Deep Spatiotemporal Networks with Segmentation Mask Transfer

    Yurtsever Ekim, Liu Yongkang, Lambert Jacob, Miyajima Chiyomi, Takeuchi Eijiro, Takeda Kazuya, Hansen John H. L.


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  130. Personalized Safety-focused Control by Minimizing Subjective Risk

    Bao Naren, Yang Dongfang, Carballo Alexander, Ozguner Umit, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  131. Crossing Blind Intersections from a Full Stop Using Estimated Visibility of Approaching Vehicles

    Narksri Patiphon, Takeuchi Eijiro, Ninomiya Yoshiki, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science


    Segawa Osamu, Hayashi Tomoki, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  133. Optimizing Learned Object Detection on Point Clouds from 3D Lidars Through Range and Sparsity Information

    Lambert Jacob, Takeuchi Eijiro, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  134. Daily activity recognition based on recurrent neural network using multi-modal signals Reviewed

    Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda

    APSIPA Transactions on Signal and Information Processing   Vol. 7 ( e21 ) page: 1 - 11   2018.12

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:CAMBRIDGE UNIV PRESS  

    Copyright © 2018 The Authors. Our aim is to develop a smartphone-based life-logging system. Human activity recognition (HAR) is one of the core techniques needed to realize it. Recent studies have reported the effectiveness of the feed-forward neural network (FF-NN) and the recurrent neural network (RNN) as classifiers for the HAR task. However, there are still unresolved problems in those studies: (1) a life-logging system using only a smartphone as the recording device has not been developed, (2) only indoor activities have been utilized for evaluation, and (3) RNNs have been insufficiently investigated and evaluated. In this study, we address these unresolved problems as follows: (1) we build a prototype system for life-logging and conduct a data recording experiment on this system to include both indoor and outdoor activities. The experimental results of HAR on this new dataset showed that the RNN-based classifier was still effective. (2) From the results of a HAR experiment, it was demonstrated that a multi-layered Simple Recurrent Unit with a non-linear transform at the bottom layer and a highway connection was the most effective. (3) We could grasp the reason for the improvement of the RNN over the FF-NN by observing the posterior probabilities over test data.

    DOI: 10.1017/ATSIP.2018.25

    Web of Science



    Hayashi Tomoki, Watanabe Shinji, Zhang Yu, Toda Tomoki, Hori Takaaki, Astudillo Ramon, Takeda Kazuya

    2018 IEEE Workshop on Spoken Language Technology (SLT 2018)     page: 426-433   2018.12

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  136. Back-translation-style data augmentation for end-to-end ASR

    Tomoki Hayashi, Shinji Watanabe, Yu Zhang, Tomoki Toda, Takaaki Hori, Ramon Astudillo, Kazuya Takeda

    2018 IEEE Spoken Language Technology Workshop (SLT)     page: 426-433   2018.12

    Language:English   Publishing type:Research paper (scientific journal)  

  137. End-to-End Navigation with Branch Turning Support using Convolutional Neural Network

    Seiya Shunya, Carballo Alexander, Takeuchi Eijiro, Miyajima Chiyomi, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  138. ITS+ DM Hackathon (ITSC 2017): Lane departure prediction with naturalistic driving data

    Andrey Alekseenko, Hien Q Dang, Gaurav Bansal, Javier J Sánchez-Medina, Chiyomi Miyajima, Takatsugu Hirayama, Kazuya Takeda, Ichiro Ide

    IEEE Intelligent Transportation Systems Magazine     page: 00   2018.11

    Language:English   Publishing type:Research paper (scientific journal)  

  139. Driving Feature Extraction and Behavior Classification Using an Autoencoder to Reproduce the Velocity Styles of Experts Reviewed

    Sama Kyle, Morales Yoichi, Akai Naoki, Liu Hailong, Takeuchi Eijiro, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  140. SecretSign: A Method of Finding an Off-line Target Object without Revealing the Target to Observers

    Sakai Yusuke, Morita Hiromi, Ishiguro Yoshio, Nishino Takanori, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  141. High Density Ground Maps using Low Boundary Height Estimation for Autonomous Vehicles Reviewed

    Carballo Alexander, Takeuchi Eijiro, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  142. SecretSign: A Method of Finding an Off-Line Target Object without Revealing the Target to Observers

    Yusuke Sakai, Hiromi Morita, Yoshio Ishiguro, Takanori Nishino, Kazuya Takeda

    2018 21st International Conference on Intelligent Transportation Systems (ITSC)     page: 3651-3656   2018.11

    Language:English   Publishing type:Research paper (scientific journal)  

  143. Generalized multichannel variational autoencoder for underdetermined source separation

    Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda

    arXiv preprint arXiv:1810.00223     page: 00   2018.9

    Language:English   Publishing type:Research paper (scientific journal)  

  144. Connectionist Temporal Classification-based Sound Event Encoder for Converting Sound Events into Onomatopoeic Representations

    Miyazaki Koichi, Hayashi Tomoki, Toda Tomoki, Takeda Kazuya

    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)     page: 852-856   2018.9

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  145. Anomalous Sound Event Detection Based on WaveNet

    Hayashi Tomoki, Komatsu Tatsuya, Kondo Reishi, Toda Tomoki, Takeda Kazuya

    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)     page: 2494-2498   2018.9

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  146. End-to-End Autonomous Mobile Robot Navigation with Model-Based System Support

    Carballo Alexander, Seiya Shunya, Lambert Jacob, Darweesh Hatem, Narksri Patiphon, Morales Luis Yoichi, Akai Naoki, Takeuchi Eijiro, Takeda Kazuya

    JOURNAL OF ROBOTICS AND MECHATRONICS   Vol. 30 ( 4 ) page: 563-583   2018.8

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  147. Tsukuba Challenge 2017 Dynamic Object Tracks Dataset for Pedestrian Behavior Analysis

    Lambert Jacob, Liang Leslie, Morales Luis Yoichi, Akai Naoki, Carballo Alexander, Takeuchi Eijiro, Narksri Patiphon, Seiya Shunya, Takeda Kazuya

    JOURNAL OF ROBOTICS AND MECHATRONICS   Vol. 30 ( 4 ) page: 598-612   2018.8

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  148. Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization Reviewed

    Shogo Seki, Tomoki Toda, Kazuya Takeda


    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG  

    This paper proposes a semi-supervised source separation method for stereophonic music signals containing multiple recorded or processed signals, where synthesized music is focused on the stereophonic music. As the synthesized music signals are often generated as linear combinations of many individual source signals and their respective mixing gains, phase or phase difference information between inter-channel signals, which represent spatial characteristics of recording environments, cannot be utilized as acoustic clues for source separation. Non-negative Tensor Factorization (NTF) is an effective technique which can be used to resolve this problem by decomposing amplitude spectrograms of stereo channel music signals into basis vectors and activations of individual music source signals, along with their corresponding mixing gains. However, it is difficult to achieve sufficient separation performance using this method alone, as the acoustic clues available for separation are limited. To address this issue, this paper proposes a Cepstral Distance Regularization (CDR) method for NTF-based stereo channel separation, which involves making the cepstrum of the separated source signals follow Gaussian Mixture Models (GMMs) of the corresponding music source signal. These GMMs are trained in advance using available samples. Experimental evaluations separating three and four sound sources are conducted to investigate the effectiveness of the proposed method in both supervised and semi-supervised separation frameworks, and performance is also compared with that of a conventional NTF method. Experimental results demonstrate that the proposed method yields significant improvements within both separation frameworks, and that cepstral distance regularization provides better separation parameters.

    DOI: 10.1587/transfun.E101.A.1057

    Web of Science


  149. Integrating driving behavior and traffic context through signal symbolization for data reduction and risky lane change detection

    Ekim Yurtever, Suguru Yamazaki, Chiyomi Miyajima, Kazuya Takeda, Masataka Mori, Kentarou Hitomi, Masumi Egawa

    IEEE Transactions on Intelligent Vehicles     page: 00   2018.6

    Language:English   Publishing type:Research paper (scientific journal)  


  150. Retrieving a Driving Model Based on Clustered Intersection Data

    Sama Kyle, Morales Yoichi, Akai Naoki, Takeuchi Eijiro, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

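  151. Music source enhancement using a convolutional denoising autoencoder and log-frequency-domain spectral features Reviewed

    Kento Ohtani, Kenta Niwa, Takanori Nishino, Kazuya Takeda

    IEICE Transactions on Information and Systems (Japanese Edition)   Vol. J101-D ( 3 ) page: 00   2018.3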

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  152. ITSS Technical Activities Spotlight: Getting to Know the Naturalistic Driving Data Analytics Technical Committee [Technical Activities]

    Pujitha Gunaratne, Kazuya Takeda

    IEEE Intelligent Transportation Systems Magazine     page: 167-167   2018.1

    Language:English   Publishing type:Research paper (scientific journal)  

  153. Daily activity recognition with large-scaled real-life recording datasets based on deep neural network using multi-modal signals Reviewed

    Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Tomoki Toda, Kazuya Takeda

    IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences   Vol. E101A ( 1 ) page: 199 - 210   2018.1

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:Institute of Electronics, Information and Communication Engineers (IEICE)  

    © 2018 The Institute of Electronics, Information and Communication Engineers. In this study, toward the development of smartphone-based monitoring system for life logging, we collect over 1,400 hours of data by recording including both the outdoor and indoor daily activities of 19 subjects, under practical conditions with a smartphone and a small camera. We then construct a huge human activity database which consists of an environmental sound signal, triaxial acceleration signals and manually annotated activity tags. Using our constructed database, we evaluate the activity recognition performance of deep neural networks (DNNs), which have achieved great performance in various fields, and apply DNN-based adaptation techniques to improve the performance with only a small amount of subject-specific training data. We experimentally demonstrate that: 1) the use of multi-modal signal, including environmental sound and triaxial acceleration signals with a DNN is effective for the improvement of activity recognition performance, 2) the DNN can discriminate specified activities from a mixture of ambiguous activities, and 3) DNN-based adaptation methods are effective even if only a small amount of subject-specific training data is available.

    DOI: 10.1587/transfun.E101.A.199

    Web of Science


  154. Modeling Driver Risk Perception on City Roads Using Deep Learning Reviewed

    Peng Ping, Yuan Sheng, Wenhu Qin, Chiyomi Miyajima, Kazuya Takeda

    IEEE Access   Vol. 6   page: 68850 - 68866   2018

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    © 2013 IEEE. Research on how risk is perceived by drivers is vital to driving behavior research and driving safety. As risk can be divided into subjective and objective risk, in this paper, we focus on modeling subjective risk perception by drivers using a deep learning method. Different drivers often perceive different levels of subjective risk under the same driving conditions. In addition, different driving conditions or driving events will have different effects on drivers. Based on these two risk perception features, in this paper, we first design an experiment on a city road with two lanes to assess the level of subjective risk perceived by drivers belonging to different groups. We then use a deep learning network-based method to abstract features of the driving environment. These environmental features are integrated with driver risk perception data and this information is used as training and testing data for the learning network. Finally, a long-short-term memory-based method is adopted to model the subjective risk perception of individual drivers based on traffic conditions and vehicle operation data from the driver's vehicle. Our results show that the proposed method can effectively model the subjective risk perception behavior of drivers, allowing for end-to-end risk perception prediction in future driving assistance systems.

    DOI: 10.1109/ACCESS.2018.2879887

    Web of Science


  155. Recognizing emotions from speech using a physical model Reviewed

    Norihide Kitaoka, Shuhei Segawa, Ryota Nishimura, Kazuya Takeda

    Acoustical Science and Technology   Vol. 39 ( 2 ) page: 167 - 170   2018

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:Acoustical Society of Japan  

    DOI: 10.1250/ast.39.167

    Web of Science


  156. Learning How to Drive in Blind Intersections from Human Data Reviewed

    Kyle Sama, Yoichi Morales, Naoki Akai, Eijiro Takeuchi, Kazuya Takeda

    Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018     page: 317 - 324   2018

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    © 2018 IEEE. In this paper we present a method to learn how to drive in different types of blind intersections using expert driving data. We cluster different intersections based on the velocity of how drivers approach them, and train a linear SVM classifier for each class of intersection. Through clustering we found that there were three different classes of intersections in typical residential areas in Japan. We used inverse reinforcement learning (IRL) to build a driving model for each type of intersection. The models were trained from 308 trajectories traversed by 5 different drivers. The models and policies were implemented and evaluated in a ROS simulator where the agent is provided a global path, and upon it reaching an intersection, it selects the appropriate trained policy. By doing this, the simulated autonomous vehicle can perform proactive safe driving behaviors when approaching blind intersections.

    DOI: 10.1109/SMC.2018.00064

    Web of Science


  157. Multi-head decoder for end-to-end speech recognition Reviewed

    Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   Vol. 2018-September   page: 801 - 805   2018

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:ISCA  

    © 2018 International Speech Communication Association. All rights reserved. This paper presents a new network architecture called multi-head decoder for end-to-end speech recognition as an extension of a multi-head attention model. In the multi-head attention model, multiple attentions are calculated, and then, they are integrated into a single attention. On the other hand, instead of the integration in the attention level, our proposed method uses multiple decoders for each attention and integrates their outputs to generate a final output. Furthermore, in order to make each head to capture the different modalities, different attention functions are used for each head, leading to the improvement of the recognition performance with an ensemble effect. To evaluate the effectiveness of our proposed method, we conduct an experimental evaluation using Corpus of Spontaneous Japanese. Experimental results demonstrate that our proposed method outperforms the conventional methods such as location-based and multi-head attention models, and that it can capture different speech/linguistic contexts within the attention-based encoder-decoder framework.

    DOI: 10.21437/Interspeech.2018-1655

    Web of Science



    Other Link: http://arxiv.org/pdf/1804.08050v2

  158. Investigation of effectiveness on recurrent neural network for daily activity recognition using multi-modal signals

    A. Tamamori, T. Hayashi, T. Toda, K. Takeda

    Proc. APSIPA     page: 7   2017.12

    Language:English   Publishing type:Research paper (scientific journal)  

    7 pages, Kuala Lumpur, Malaysia, Dec. 2017 (Invited Talk in Special Session).

  159. An investigation of multi-speaker training for WaveNet vocoder

    T. Hayashi, A. Tamamori, K. Kobayashi, K. Takeda, T. Toda

    Proc. ASRU     page: 712-718   2017.12

    Language:English   Publishing type:Research paper (scientific journal)  

  160. An Investigation of Recurrent Neural Network for Daily Activity Recognition using Multi-modal Signals

    Tamamori Akira, Hayashi Tomoki, Toda Tomoki, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  161. Involvement of poly-rC binding proteins in posttranscriptional regulation of Sortilin, the cytokine trafficking mediator Reviewed

    Toshiki Yabe-Wada, Shintaro Matsuba, Kazuya Takeda, Akira Nakamura, Caroline C. Philpott, Nobuyuki Onai

    CYTOKINE   Vol. 100   page: 145 - 146   2017.12

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD  

    Web of Science

  162. Duration-controlled LSTM for polyphonic sound event detection

    T. Hayashi, S. Watanabe, T. Toda, T. Hori, J. Le Roux, K. Takeda

    IEEE/ACM Transactions on Audio, Speech, and Language Processing   Vol. 25 ( 11 ) page: 2059-2070   2017.11

    Language:English   Publishing type:Research paper (scientific journal)  

  163. Duration-Controlled LSTM for Polyphonic Sound Event Detection Reviewed

    Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda

    IEEE/ACM Transactions on Audio Speech and Language Processing   Vol. 25 ( 11 ) page: 2059 - 2070   2017.11

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    © 2014 IEEE. This paper presents a new hybrid approach called duration-controlled long short-term memory (LSTM) for polyphonic sound event detection (SED). It builds upon a state-of-the-art SED method that performs frame-by-frame detection using a bidirectional LSTM recurrent neural network (BLSTM), and incorporates a duration-controlled modeling technique based on a hidden semi-Markov model. The proposed approach makes it possible to model the duration of each sound event precisely and to perform sequence-by-sequence detection without having to resort to thresholding, as in conventional frame-by-frame methods. Furthermore, to effectively reduce sound event insertion errors, which often occur under noisy conditions, we also introduce a binary-mask-based postprocessing that relies on a sound activity detection network to identify segments with any sound event activity, an approach inspired by the well-known benefits of voice activity detection in speech recognition systems. We conduct an experiment using the DCASE2016 task 2 dataset to compare our proposed method with typical conventional methods, such as nonnegative matrix factorization and standard BLSTM. Our proposed method outperforms the conventional methods both in an event-based evaluation, achieving a 75.3% F1 score and a 44.2% error rate, and in a segment-based evaluation, achieving an 81.1% F1 score, and a 32.9% error rate, outperforming the best results reported in the DCASE2016 task 2 Challenge.

    DOI: 10.1109/TASLP.2017.2740002

    Web of Science


  164. A Single-Dimensional Interface for Arranging Multiple Audio Sources in Three-Dimensional Space

    Kento Ohtani, Kenta Niwa, Kazuya Takeda

    IEICE Transactions on Information and Systems   Vol. E100-D ( 10 ) page: 2635-2643   2017.10

    Language:English   Publishing type:Research paper (scientific journal)  

  165. A single-dimensional interface for arranging multiple audio sources in three-dimensional space Reviewed

    Kento Ohtani, Kenta Niwa, Kazuya Takeda

    IEICE Transactions on Information and Systems   Vol. E100D ( 10 ) page: 2635 - 2643   2017.10

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG  

    Copyright © 2017 The Institute of Electronics, Information and Communication Engineers. A single-dimensional interface which enables users to obtain diverse localizations of audio sources is proposed. In many conventional interfaces for arranging audio sources, there are multiple arrangement parameters, some of which allow users to control positions of audio sources. However, it is difficult for users who are unfamiliar with these systems to optimize the arrangement parameters since the number of possible settings is huge. We propose a simple, single-dimensional interface for adjusting arrangement parameters, allowing users to sample several diverse audio source arrangements and easily find their preferred auditory localizations. To select subsets of arrangement parameters from all of the possible choices, auditory-localization space vectors (ASVs) are defined to represent the auditory localization of each arrangement parameter. By selecting subsets of ASVs which are approximately orthogonal, we can choose arrangement parameters which will produce diverse auditory localizations. Experimental evaluations were conducted using music composed of three audio sources. Subjective evaluations confirmed that novice users can obtain diverse localizations using the proposed interface.

    DOI: 10.1587/transinf.2017EDP7028

    Web of Science


  166. Prediction Method for Continuous Point Cloud Data Compression Using SLAM Information

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima, and Kazuya Takeda

    fast-zero   2017.9

    Language:English   Publishing type:Research paper (scientific journal)  

  167. Estimating Risk Levels Perceived by Individuals for Lane Change Scenes

    Naren Bao, Chiyomi Miyajima, Eijiro Takeuchi, Kazuya Takeda, Shinichiro Honda, Toshiya Yoshitani, and Masayoshi Ito

    The fourth International Symposium on Future Active Safety Technology Toward zero traffic accidents     page: 00   2017.9

    Language:English   Publishing type:Research paper (scientific journal)  

  168. Missing component restoration for masked speech signals based on time-domain spectrogram factorization

    Shogo Seki, Hirokazu Kameoka, Tomoki Toda, Kazuya Takeda

    MLSP2017     page: 6   2017.9

    Language:English   Publishing type:Research paper (scientific journal)  

  169. Evaluation of Deep Learning-Based Driving Signal Generation Methods for Vehicle Control

    Shunya Seiya, Daiki Hayashi, Eijiro Takeuchi, Chiyomi Miyajima and Kazuya Takeda

    Fast-ZERO, 2017     page: 00   2017.9

    Language:English   Publishing type:Research paper (scientific journal)  

  170. Estimation of driver's risk feeling toward driving environment using neural network

    Yuan Sheng, Yiyang Li, Chiyomi Miyajima, Eijiro Takeuchi, Kazuya Takeda, Shinichiro Honda, Toshiya Yoshitani, and Masayoshi Ito

    FAST-zero '17     page: 00   2017.9

    Language:English   Publishing type:Research paper (scientific journal)  


  171. Prediction method for continuous point cloud data compression using SLAM information

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima, and Kazuya Takeda

    4th International Symposium on Future Active Safety Technology toward zero traffic accidents (FAST-zero '17)     page: 00   2017.9

    Language:English   Publishing type:Research paper (scientific journal)  


  172. Speaker-dependent WaveNet vocoder

    Akira Tamamori, Tomoki Hayashi, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda

    INTERSPEECH 2017     page: 1118-1122   2017.8

    Language:English   Publishing type:Research paper (scientific journal)  

  173. Stereophonic Music Separation Based on Non-negative Tensor Factorization with Cepstrum Regularization

    Shogo Seki, Tomoki Toda, Kazuya Takeda

    EUSIPCO2017     page: 1011-1015   2017.8

    Language:English   Publishing type:Research paper (scientific journal)  

  174. Stereophonic Music Separation Based on Non-negative Tensor Factorization with Cepstrum Regularization

    Seki Shogo, Toda Tomoki, Takeda Kazuya

    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)     page: 981-985   2017.8

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  175. Open Source Integrated Planner for Autonomous Navigation in Highly Dynamic Environments

    Darweesh Hatem, Takeuchi Eijiro, Takeda Kazuya, Ninomiya Yoshiki, Sujiwo Adi, Morales Luis Yoichi, Akai Naoki, Tomizawa Tetsuo, Kato Shinpei

    JOURNAL OF ROBOTICS AND MECHATRONICS   Vol. 29 ( 4 ) page: 668-684   2017.8

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  176. Continuous point cloud data compression using SLAM based prediction Reviewed

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima, Kazuya Takeda

    IEEE 2017 Intelligent Vehicles Symposium (IV '17)     page: 1744–1751   2017.6

    Language:English   Publishing type:Research paper (scientific journal)  

  177. Music Staging AI

    Kenta Niwa, Kento Ohtani, Kazuya Takeda

    2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017)     page: 6588-6589   2017.5

    Language:English   Publishing type:Research paper (scientific journal)  

  178. BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic sound event detection Reviewed

    T. Hayashi, S. Watanabe, T. Toda, T. Hori, J. Le Roux, K. Takeda

    Proc. ICASSP     page: 766-770   2017.3

    Language:English   Publishing type:Research paper (scientific journal)  

  179. Signal Processing for Smart Vehicle Technologies: Part 2 [From the Guest Editors] Reviewed

    John H.L. Hansen, Kazuya Takeda, Sanjeev M. Naik, Mohan M. Trivedi, Gerhard U. Schmidt, Yingying Jennifer Chen, Wade Trappe

    IEEE Signal Processing Magazine   Vol. 34 ( 2 ) page: 18 - 21   2017.3

    Language:Japanese   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    DOI: 10.1109/MSP.2017.2650299

    Web of Science



    Hayashi Tomoki, Tamamori Akira, Kobayashi Kazuhiro, Takeda Kazuya, Toda Tomoki


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science


    Hayashi Tomoki, Watanabe Shinji, Toda Tomoki, Hori Takaaki, Le Roux Jonathan, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  182. Continuous Point Cloud Data Compression Using SLAM Based Prediction Reviewed

    Tu Chenxi, Takeuchi Eijiro, Miyajima Chiyomi, Takeda Kazuya

    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017)     page: 1744-1751   2017

    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science


    Seki Shogo, Kameoka Hirokazu, Toda Tomoki, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  184. MUSIC STAGING AI Reviewed

    Niwa Kenta, Ohtani Kento, Takeda Kazuya


    Language:English   Publishing type:Research paper (scientific journal)  

    Web of Science

  185. Speaker-dependent WaveNet vocoder Reviewed

    Akira Tamamori, Tomoki Hayashi, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   Vol. 2017-August   page: 1118 - 1122   2017

    Language:Japanese   Publishing type:Research paper (international conference proceedings)   Publisher:ISCA-INT SPEECH COMMUNICATION ASSOC  

    Copyright © 2017 ISCA. In this study, we propose a speaker-dependent WaveNet vocoder, a method of synthesizing speech waveforms with WaveNet, by utilizing acoustic features from existing vocoder as auxiliary features of WaveNet. It is expected that WaveNet can learn a sample-by-sample correspondence between speech waveform and acoustic features. The advantage of the proposed method is that it does not require (1) explicit modeling of excitation signals and (2) various assumptions, which are based on prior knowledge specific to speech. We conducted both subjective and objective evaluation experiments on CMU-ARCTIC database. From the results of the objective evaluation, it was demonstrated that the proposed method could generate high-quality speech with phase information recovered, which was lost by a mel-cepstrum vocoder. From the results of the subjective evaluation, it was demonstrated that the sound quality of the proposed method was significantly improved from mel-cepstrum vocoder, and the proposed method could capture source excitation information more accurately.

    DOI: 10.21437/Interspeech.2017-314

    Web of Science


  186. Impact of acoustic similarity on efficiency of verbal information transmission via subtle prosodic cues Reviewed

    Bohan Chen, Norihide Kitaoka, Kazuya Takeda

    Eurasip Journal on Audio, Speech, and Music Processing   Vol. 2016 ( 1 )   2016.12

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:SPRINGER INTERNATIONAL PUBLISHING AG  

    © 2016, The Author(s). In this study, we investigate the effect of tiny acoustic differences on the efficiency of prosodic information transmission. Study participants listened to textually ambiguous sentences, which could be understood with prosodic cues, such as syllable length and pause length. Sentences were uttered in voices similar to the participant’s own voice and in voices dissimilar to their own voice. The participants then identified which of four pictures the speaker was referring to. Both the eye movement and response time of the participants were recorded. Eye tracking and response time results both showed that participants understood the textually ambiguous sentences faster when listening to voices similar to their own. The results also suggest that tiny acoustic features, which do not contain verbal meaning, can influence the processing of verbal information.

    DOI: 10.1186/s13636-016-0097-6

    Web of Science


  187. Investigation on Recurrent Neural Network Architectures for Daily Activity Recognition Reviewed

    Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda

    The 3rd International Conference on Universal Village (UV2016)     page: 00   2016.12

    Language:English   Publishing type:Research paper (scientific journal)  

  188. Sound enhancement system using selective binary filtering Reviewed

    Tomomi Suzuki, Takanori Nishino, Yoshio Ishiguro, Kazuya Takeda

    5th Joint Meeting of the ASA and ASJ     page: 00   2016.12

    Language:English   Publishing type:Research paper (scientific journal)  

  189. Convolutional Bidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection Reviewed

    Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda

    5th Joint Meeting of the ASA and ASJ     page: 00   2016.12

    Language:English   Publishing type:Research paper (scientific journal)  

  190. Signal Processing for Smart Vehicle Technologies [From the Guest Editors]

    John H.L. Hansen, Kazuya Takeda, Sanjeev M. Naik, Mohan M. Trivedi, Gerhard U. Schmidt, Yingying Chen

    IEEE Signal Processing Magazine   Vol. 33 ( 6 ) page: 12 - 13   2016.11


    DOI: 10.1109/MSP.2016.2600624

    Web of Science


  191. Compressing Continuous Point Cloud Data Using Image Compression Methods Reviewed

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima, Kazuya Takeda

    ITSC2016     page: 00   2016.11

    Language:English   Publishing type:Research paper (scientific journal)  

  192. AI framework to arrange audio objects according to listener preferences Reviewed

    Kento Ohtani, Kenta Niwa, Kazuya Takeda

    5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan     page: 00   2016.11


    Language:English   Publishing type:Research paper (scientific journal)  

  193. Stereo Channel Music Signal Separation Based on Nonnegative Tensor Factorization with Cepstrum Regularization Reviewed

    Shogo Seki, Kento Ohtani, Tomoki Toda, Kazuya Takeda

    5th Joint Meeting of the ASA and ASJ     page: 00   2016.11


    Language:English   Publishing type:Research paper (scientific journal)  

  194. Driver-Behavior Modeling Using On-Road Driving Data: A new application for behavior signal processing Reviewed

    Chiyomi Miyajima, Kazuya Takeda

    IEEE Signal Processing Magazine   Vol. 33 ( 6 ) page: 14 - 21   2016.11


    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC  

    © 2016 IEEE. This article reviews data-centric approaches for statistical modeling of driver behavior. Modeling driver behavior is challenging due to its stochastic nature and the high degree of inter- and intradriver variability. One way to deal with the highly variable nature of driving behavior is to employ a data-centric approach that models driver behavior using large amounts of driving data collected from numerous drivers in a variety of traffic conditions. To obtain large amounts of realistic driving data, several projects have collected real-world driving data. Statistical machine-learning techniques, such as hidden Markov models (HMMs) and deep learning, have been successfully applied to model driver behavior using large amounts of driving data. We have also collected on-road data recording hundreds of drivers over more than 15 years. We have applied statistical signal processing and machine-learning techniques to this data to model various aspects of driver behavior, e.g., driver pedal-operation, car-following, and lane-change behaviors for predicting driver behavior and detecting risky driver behavior and driver frustration. By reviewing related studies and providing concrete examples of our own research, this article is intended to illustrate the usefulness of such data-centric approaches for statistical driver-behavior modeling.

    DOI: 10.1109/MSP.2016.2602377

    Web of Science


  195. Analysis of driver workload when using speech interfaces Reviewed

    D Hayashi, C Miyajima, K Takeda

    The Journal of the Acoustical Society of America   Vol. 140 ( 4 ) page: 2961 - 2961   2016.10


    Authorship:Last author, Corresponding author   Language:English   Publishing type:Research paper (scientific journal)  

  196. Projection of a virtual speaker into a vehicle using sound field control Reviewed

    T Yamamura, Y Ishiguro, T Nishino, K Takeda

    The Journal of the Acoustical Society of America   Vol. 140 ( 4 ) page: 3063 - 3063   2016.10


    Authorship:Last author, Corresponding author   Language:English   Publishing type:Research paper (scientific journal)  

  197. Emotion recognition from speech using a physical model Reviewed

    Norihide Kitaoka, Shuhei Segawa, Kazuya Takeda

    Proc. ICA2016     page: 00   2016.9


    Language:English   Publishing type:Research paper (scientific journal)  

  198. Bidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection Reviewed

    Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda

    DCASE2016     page: 00   2016.9


    Language:English   Publishing type:Research paper (scientific journal)  

  199. Investigation on Recurrent Neural Network Architectures for Daily Activity Recognition Reviewed

    Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda

    UV2016     page: 00   2016.9


    Language:English   Publishing type:Research paper (scientific journal)  

  200. Investigation of DNN-based modeling for audio-visual speech recognition Reviewed

    Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu

    2015 First International Workshop on Spoken Language Processing (MLSLP2015)     page: 00   2016.9


    Language:English   Publishing type:Research paper (scientific journal)  

  201. Daily Activity Recognition Based on Recurrent Neural Networks

    Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda

    IEICE Technical Report   Vol. 116(189) ( 7 ) page: 12   2016.8


    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  202. Accelerated Deformable Part Models on GPUs Reviewed

    Manato Hirabayashi, Shinpei Kato, Masato Edahiro, Kazuya Takeda, Seiichi Mita

    IEEE Transactions on Parallel and Distributed Systems   Vol. 27 ( 6 ) page: 1589 - 1602   2016.6


    Language:English   Publishing type:Research paper (scientific journal)  

    © 2015 IEEE. Object detection is a fundamental challenge facing intelligent applications. Image processing is a promising approach to this end, but its computational cost is often a significant problem. This paper presents schemes for accelerating the deformable part models (DPM) on graphics processing units (GPUs). DPM is a well-known algorithm for image-based object detection, and it achieves high detection rates at the expense of computational cost. GPUs are massively parallel compute devices designed to accelerate data-parallel compute-intensive workload. According to an analysis of execution times, approximately 98 percent of DPM code exhibits loop processing, which means that DPM could be highly parallelized by GPUs. In this paper, we implement DPM on the GPU by exploiting multiple parallelization schemes. Results of an experimental evaluation of this GPU-accelerated DPM implementation demonstrate that the best scheme of GPU implementations using an NVIDIA GPU achieves a speed up of 8.6x over a naive CPU-based implementation.

    DOI: 10.1109/TPDS.2015.2453962

    Web of Science


  203. Prediction of Individual Driving Behavior on Highway Curves

    Naren Bao, Daiki Hayashi, Chiyomi Miyajima, and Kazuya Takeda

    The Third Workshop on Naturalistic Driving Data Analytics, IEEE Intelligent Vehicles Symposium     page: 00   2016.6


    Language:English   Publishing type:Research paper (scientific journal)  

  204. Integrating driving behavior and traffic context through signal symbolization Reviewed

    Yamazaki, S., Miyajima, C., Yurtsever, E., Takeda, K., Mori, M., Hitomi, K., & Egawa, M.

    2016 IEEE Intelligent Vehicles Symposium (IV)     page: 642 - 647   2016.6


    Language:English   Publishing type:Research paper (scientific journal)  

  205. Classification of driver's neutral and cognitive distraction states based on peripheral vehicle behavior in driver's gaze transition Reviewed

    Takatsugu Hirayama, Kenji Mase, Chiyomi Miyajima, Kazuya Takeda

    IEEE Transactions on Intelligent Vehicles   Vol. 1 ( 2 ) page: 148 - 157   2016.6


    Language:English   Publishing type:Research paper (scientific journal)  

    © 2016 IEEE. To support safe driving, numerous methods of detecting distractions using measurements of a driver's gaze have been proposed. These methods empirically focused on certain driving contexts and analyzed gaze behavior under particular peripheral vehicle conditions; therefore, numerous driving situations were not considered. To address this problem with hypothesis-testing approaches, we turn the problem around and propose a data-mining approach that analyzes peripheral vehicle behavior during gaze transitions of drivers in order to compare their neutral driving state with a cognitive distraction state. This change in thinking is the first contribution of this paper. The analysis results show that under the neutral condition, drivers generally turned their gaze to peripheral vehicles to be focused on; however, they did not do this consistently under the distracted condition. As the second contribution, we propose a simple classifier to discriminate between the cognitive distraction and neutral states by analyzing the peripheral vehicle behavior. The proposed classifier can manage various situations and provide high classification accuracy by focusing on gaze transitions from the front view toward other directions.

    DOI: 10.1109/TIV.2016.2599786


  206. Daily activity recognition based on acoustic signals and acceleration signals estimated with Gaussian process Reviewed

    Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda

    2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015     page: 279 - 282   2016.2


    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    © 2015 Asia-Pacific Signal and Information Processing Association. We have created a corpus of daily activities using wearable sensors. The corpus consists of sound and image data from a camera and motion signals from a smartphone for both indoor and outdoor activities over 72 continuous hours. We propose a method that can interpolate acceleration signals to any sample points with a Gaussian process in order to recognize daily activities. We conducted recognition experiments of daily activities using our corpus. Experimental results showed that the proposed method can improve recognition accuracy compared to a conventional method. This demonstrates the effectiveness of estimating acceleration signals with a Gaussian process to recognize daily activities.

    DOI: 10.1109/APSIPA.2015.7415520

    Web of Science


  207. Audio-visual speech recognition using deep bottleneck features and high-performance lipreading Reviewed

    Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu

    2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015     page: 575 - 582   2016.2


    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    © 2015 Asia-Pacific Signal and Information Processing Association. This paper develops an Audio-Visual Speech Recognition (AVSR) method, by (1) exploring high-performance visual features, (2) applying audio and visual deep bottleneck features to improve AVSR performance, and (3) investigating the effectiveness of voice activity detection in a visual modality. In our approach, many kinds of visual features are incorporated, subsequently converted into bottleneck features by deep learning technology. By using the proposed features, we successfully achieved 73.66% lipreading accuracy in a speaker-independent open condition, and about 90% AVSR accuracy on average in noisy environments. In addition, we extracted speech segments from visual features, resulting in 77.80% lipreading accuracy. It is found that VAD is useful in both audio and visual modalities, for better lipreading and AVSR.

    DOI: 10.1109/APSIPA.2015.7415335

    Web of Science


  208. Tracking driver signage observation using local feature matching and optical flow Reviewed

    Chiyomi Miyajima, Katsuya Sakoyama, Kazuya Takeda

    2015 IEEE/SICE International Symposium on System Integration, SII 2015     page: 479 - 482   2016.2


    Language:English   Publishing type:Research paper (international conference proceedings)  

    © 2015 IEEE. We investigate a method for identifying objects observed by drivers. Here we focus on roadside signage as an example, and track the driver's observation of signage while driving. A gaze tracking system and a forward-directed video camera are used to determine the driver's region of interest (ROI). The driver's observation of signage is detected by tracking the driver's ROI using optical flow, and by matching the driver's ROI with template images of signboards in a signage database using local feature matching. Driver and signage location information are used to limit candidate signboards for reducing computational cost for image matching. We conduct an experiment to evaluate our method and achieve a 66.2% detection rate of drivers' signboard observation with a false positive rate of 6.6%.

    DOI: 10.1109/SII.2015.7405026


  209. Modeling and detecting excessive trust from behavior signals: Overview of research project and results Reviewed

    Kazuya Takeda

    Human-Harmonized Information Technology, Volume 1: Vertical Impact     page: 57 - 75   2016.1


    Language:English   Publishing type:Part of collection (book)  

    © Springer Japan 2016. An approach which would allow us to better understand behavioral states inherent in observed behaviors is proposed, based on the development of a mathematical representation of driving behavior signals using our large driving behavior signal corpus. In particular, the project is aimed at developing technologies for preventing excessive trust in users of automated systems. Misuse/disuse of automation is introduced as a cognitive model of excessive trust, and methods of quantitative measurement are devised. PWARX and GMM models are proposed to represent discrete and continuous information in the cognition/decision/action process. We also develop a method of modeling visual behavior aiming at understanding environmental awareness while driving. We showed the effectiveness of the model experimentally through risky lane change detection. Finally, we show the effectiveness of the method to quantify excessive trust based on developed technology.

    DOI: 10.1007/978-4-431-55867-5_3


  210. Robust example search using bottleneck features for example-based speech enhancement Reviewed

    Atsunori Ogawa, Shogo Seki, Keisuke Kinoshita, Marc Delcroix, Takuya Yoshioka, Tomohiro Nakatani, Kazuya Takeda

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH   Vol. 08-12-September-2016   page: 3733 - 3737   2016


    Publishing type:Research paper (international conference proceedings)   Publisher:ISCA-INT SPEECH COMMUNICATION ASSOC  

    Copyright © 2016 ISCA. Example-based speech enhancement is a promising approach for coping with highly non-stationary noise. Given a noisy speech input, it first searches in noisy speech corpora for the noisy speech examples that best match the input. Then, it concatenates the clean speech examples that are paired with the matched noisy examples to obtain an estimate of the underlying clean speech component in the input. This framework works well if the noisy speech corpora contain the noise included in the input. However, it is impossible to prepare corpora that cover all types of noisy environments. Moreover, the example search is usually performed using noise sensitive mel-frequency cepstral coefficient features (MFCCs). Consequently, a mismatch between an input and the corpora is inevitable. This paper proposes using bottleneck features (BNFs) extracted from a deep neural network (DNN) acoustic model for the example search. Since BNFs have good noise robustness (invariance), the mismatch is mitigated and thus a more accurate example search can be performed. Experimental results on the Aurora4 corpus show that the example-based approach using BNFs greatly improves the enhanced speech quality compared with that using MFCCs. It also consistently outperforms a conventional DNN-based approach, i.e. a denoising autoencoder.

    DOI: 10.21437/Interspeech.2016-671

    Web of Science


  211. Symbolization for Large-Scale Driving Corpus and Its Applications Reviewed

    Masumi Egawa, Masataka Mori, Kazuhito Takenaka, Takashi Bando, Tadahiro Taniguchi, Chiyomi Miyajima, Kazuya Takeda

    Transactions of Society of Automotive Engineers of Japan   Vol. 47 ( 5 ) page: 1135 - 1140   2016


    Authorship:Last author, Corresponding author   Language:English   Publishing type:Research paper (scientific journal)  

  212. Compressing Continuous Point Cloud Data Using Image Compression Methods

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima, Kazuya Takeda



  213. Relationship between Speaker/Listener Similarity and Information Transmission Quality in Speech Communication Reviewed

    Bohan Chen, Norihide Kitaoka, Kazuya Takeda

    APSIPA ASC 2015     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  214. Audio-visual speech recognition using deep bottleneck features and high-performance lipreading Reviewed

    Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu

    APSIPA ASC 2015     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  215. Daily activity recognition based on acoustic signals and acceleration signals estimated with Gaussian process Reviewed

    Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda

    APSIPA ASC 2015     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  216. Tracking driver's observation using local feature matching and optical flow Reviewed

    Chiyomi Miyajima, Katsuya Sakoyama, Kazuya Takeda

    IEEE/SICE International Symposium on System Integration (SII 2015)     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  217. Development and evaluation of spherical microphone baffle with two hollows for binaural recording Reviewed

    Taishi Nakagiri, Toshiki Yamamura, Takanori Nishino, Hiroshi Naruse, and Kazuya Takeda

    Proc. 12th Western Pacific Acoustics Conference 2015 (WESPAC2015)     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  218. Single Dimensional Control of Spatial Audio Object Arrangement

    Kento Ohtani, Kenta Niwa, Kazuya Takeda

    12th Western Pacific Acoustics Conference 2015 (WESPAC2015)     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  219. Elderly person's emotional state estimation in conversation based on speech features for spoken dialogue systems Reviewed

    Shuhei Segawa, Norihide Kitaoka, Kazuya Takeda

    12th Western Pacific Acoustics Conference 2015 (WESPAC2015)     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  220. Tracking driver's observation using local feature matching and optical flow Reviewed

    Chiyomi Miyajima, Katsuya Sakoyama, and Kazuya Takeda

    2015 IEEE/SICE International Symposium on System Integration     page: 00   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  221. An Open Approach to Autonomous Vehicles

    S. Kato, E. Takeuchi, Y. Ishiguro, Y. Ninomiya, K. Takeda and T. Hamada

    IEEE Micro   Vol. 35 ( 6 ) page: 60-68   2015.12


    Language:English   Publishing type:Research paper (scientific journal)  

  222. Driving Scene Classification Using Vehicle Motion Estimated with Smartphone

    Masayuki Tsuboi, Chiyomi Miyajima, Kazuya Takeda

    The 7th Biennial Workshop on Digital Signal Processing for In-Vehicle Systems     page: 00   2015.10


    Language:English   Publishing type:Research paper (scientific journal)  

  223. Audio-visual processing toward robust speech recognition in cars Reviewed

    Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu

    7th Biennial Workshop on DSP for In-Vehicle Systems and Safety     page: 00   2015.10


    Language:English   Publishing type:Research paper (scientific journal)  

  224. Risky Lane Change Detection Based on Symbolization of Driving Data Reviewed

    Suguru Yamazaki, Chiyomi Miyajima, Masataka Mori, Takashi Bando, Kazuhito Takenaka, and Kazuya Takeda

    The 7th Biennial Workshop on Digital Signal Processing for In-Vehicle Systems     page: 00   2015.10


    Language:English   Publishing type:Research paper (scientific journal)  

  225. Effect of speaking rate and speech complexity on transmission quality during driving navigation task Reviewed

    Bohan Chen, Norihide Kitaoka, Kazuya Takeda

    DSP in Vehicle 2015     page: 00   2015.10


    Language:English   Publishing type:Research paper (scientific journal)  

  226. Integration of acoustic information in Google Street View using a spherical microphone array Reviewed

    Tomomi Suzuki, Yoshio Ishiguro, Takanori Nishino, and Kazuya Takeda

    Proc. AUN/SEED-Net Regional Conference for Computer and Information Engineering 2015 (RCCIE 2015)     page: 00   2015.10


    Language:English   Publishing type:Research paper (scientific journal)  

  227. Modelling of Physical Characteristics of Speech under Stress Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    IEEE Signal Processing Letters   Vol. 22 ( 10 ) page: 1801-1805   2015.10


    Language:English   Publishing type:Research paper (scientific journal)  

    DOI: 10.1109/LSP.2015.2434732

  228. Investigation of DNN-based audio-visual speech recognition Reviewed

    Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda

    IEICE Trans. Inf. & Syst.     page: 2444-2451   2015.9


    Language:English   Publishing type:Research paper (scientific journal)  

  229. Driving signature extraction Reviewed

    Ekim Yurtsever, Chiyomi Miyajima, Selpi Selpi, and Kazuya Takeda

    Proc. of 3rd International Symposium on Future Active Safety Technology towards zero traffic accidents     page: 00   2015.9


    Language:English   Publishing type:Research paper (scientific journal)  

  230. Integration of Deep Bottleneck Features for Audio-Visual Speech Recognition Reviewed

    Hiroshi Ninomiya, Norihide Kitaoka, Satoshi Tamura, Yurie Iribe, Kazuya Takeda

    Proc. INTERSPEECH2015     page: 00   2015.9


    Language:English   Publishing type:Research paper (scientific journal)  

  231. Traffic trajectory history and drive path generation using GPS data cloud Reviewed

    Ekim Yurtsever, Chiyomi Miyajima, and Kazuya Takeda

    Proc. of IEEE Intelligent Vehicles Symposium     page: 00   2015.7


    Language:English   Publishing type:Research paper (scientific journal)  

  232. Analyzing driver gaze behavior and consistency of decision making during automated driving Reviewed

    Chiyomi Miyajima, Suguru Yamazaki, Takashi Bando, Kentaro Hitomi, Hitoshi Terai, Hiroyuki Okuda, Takatsugu Hirayama, Masumi Egawa, Tatsuya Suzuki, and Kazuya Takeda

    2015 IEEE Intelligent Vehicles Symposium   Vol. 4 ( 1 ) page: 59-66   2015.7


    Language:English   Publishing type:Research paper (scientific journal)  

  233. Automatic lane change extraction based on temporal patterns of symbolized driving behavioral data Reviewed

    Masataka Mori, Kazuhito Takenaka, Takashi Bando, Tadahiro Taniguchi, Chiyomi Miyajima, Kazuya Takeda

    Proc. of 2015 IEEE Intelligent Vehicles Symposium (IV '15)     page: 00   2015.6


    Language:English   Publishing type:Research paper (scientific journal)  

  234. Analyzing driver gaze behavior and consistency of decision making during automated driving Reviewed

    Chiyomi Miyajima, Suguru Yamazaki, Takashi Bando, Kentarou Hitomi, Hitoshi Terai, Hiroyuki Okuda, Takatsugu Hirayama, Masumi Egawa, Tatsuya Suzuki, Kazuya Takeda

    Proc. of 2015 IEEE Intelligent Vehicles Symposium (IV '15)     page: 00   2015.6


    Language:English   Publishing type:Research paper (scientific journal)  

  235. Traffic trajectory history and drive path generation using GPS data cloud

    Yurtsever, E., Takeda, K., & Miyajima, C.

    2015 IEEE Intelligent Vehicles Symposium (IV)     page: 229 - 234   2015.6


    Language:English   Publishing type:Research paper (scientific journal)  

  236. Improving the Accuracy of Spoken Document Retrieval by Increasing the Robustness of Various Text Retrieval Models Reviewed


    IPSJ Journal   Vol. 56 ( 3 ) page: 00   2015.3


    Language:Japanese   Publishing type:Research paper (scientific journal)  

  237. Tracking Roadside Signage Observed by Drivers Reviewed

    Katsuya Sakoyama, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    2015 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing     page: 00   2015.2


    Language:English   Publishing type:Research paper (scientific journal)  

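  238. A Filter Coefficient Estimation Method for Dereverberation Using Complementary Wiener Filters Reviewed


    IEICE Transactions A (Japanese Edition), Special Section on Exploratory Research in Electronics, Information and Communication Engineering   Vol. J98-A ( 2 ) page: 178-189   2015.2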


    Language:Japanese   Publishing type:Research paper (scientific journal)  

  239. An evaluation method of aggressiveness of driving behavior using drive recorders Reviewed

    Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    IEEJ Journal of Industry Applications   Vol. 4 ( 1 ) page: 59-66   2015


    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  240. Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features Reviewed

    Norihide Kitaoka, Tomoki Hayashi, Kazuya Takeda

    APSIPA ASC 2014     page: 00   2014.12


    Language:English   Publishing type:Research paper (scientific journal)  

  241. Investigating the Robustness of Deep Bottleneck Features for Recognizing Speech of Speakers of Various Ages Reviewed

    Norihide Kitaoka, Tomoki Hayashi, Kazuya Takeda

    APSIPA 2014     page: 00   2014.12


    Language:English   Publishing type:Research paper (scientific journal)  

  242. Development and preliminary analysis of sensor signal database of continuous daily living activity over the long term Reviewed

    Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda

    APSIPA ASC 2014     page: 00   2014.12


    Language:English   Publishing type:Research paper (scientific journal)  

  243. Unsupervised energy disaggregation using conditional random fields Reviewed

    Panikos Heracleous, Pongtep Angkititrakul, Norihide Kitaoka, Kazuya Takeda

    IEEE ISGT Europe 2014     page: 00   2014.10


    Language:English   Publishing type:Research paper (scientific journal)  

  244. Measuring Aggressive Driving Behavior Using signals from drive recorders Reviewed

    Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    IEEE ITSC14     page: 00   2014.10


    Language:English   Publishing type:Research paper (scientific journal)  

  245. Investigating the Robustness of Deep Bottleneck Features for Recognizing Speech of Speakers of Various Ages Reviewed

    Tomoki Hayashi, Chiyomi Miyajima, Takanori Nishino, Kazuya Takeda

    FORUM ACUSTICUM 2014     page: 00   2014.9


    Language:English   Publishing type:Research paper (scientific journal)  

  246. Sound image perception for a sound source moving in musical contents Reviewed

    Kento Ohtani, Takanori Nishino, Kazuya Takeda

    FORUM ACUSTICUM 2014     page: 00   2014.9


    Language:English   Publishing type:Research paper (scientific journal)  

  247. Building Driving Intelligence using Traffic Big Data Reviewed

    Kazuya Takeda

    2014 World Congress on Intelligent Transport Systems     page: 00   2014.9


    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  248. Special Feature: Cyber-Physical Systems: Toward Field Operational Tests (FOT)


    IPSJ Magazine "Joho Shori"   Vol. 55 ( 9 ) page: 922-927   2014.8



  249. Various Investigations in Spoken Document Retrieval and Query Expansion with Automatically Determined Linear Interpolation Coefficients Reviewed


    IPSJ Journal   Vol. 55 ( 6 ) page: 1625-1636   2014.6


    Language:Japanese   Publishing type:Research paper (scientific journal)  

  250. Adaptive dereverberation method based on complementary wiener filter and modulation transfer function Reviewed

    Kento Ohtani, Tatsuya Komatsu, Takanori Nishino, Kazuya Takeda

    REVERB workshop     page: 00   2014.5


    Language:English   Publishing type:Research paper (scientific journal)  

  251. Evaluation method for aggressiveness of driving behavior using drive recorders Reviewed

    Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    IEEJ Journal of Industry Applications   Vol. 4 ( 1 ) page: 00   2014.5


    Language:English   Publishing type:Research paper (scientific journal)  

  252. Driving scene retrieval with an integrated similarity measure using driving behavior and environment information Reviewed

    Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    IEEJ Journal C   Vol. 134 ( 5 ) page: 1-8   2014.5


    Language:English   Publishing type:Research paper (scientific journal)  

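  253. Improving Speaker Diarization Accuracy through Evaluation of Utterance Segment Clusters and Improved Bottom-Up Clustering Based on That Evaluation

    Bohan Chen, Norihide Kitaoka, Kazuya Takeda

    IEICE Transactions D (Japanese Edition)   Vol. J97-D ( 3 ) page: 540-547   2014.3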


    Language:Japanese   Publishing type:Research paper (scientific journal)  

  254. Effective Frame Selection for Blind Source Separation based on Frequency Domain Independent Component Analysis Reviewed

    Yusuke Mizuno, Takanori Nishino, Kazunobu Kondo, Norihide Kitaoka, Kazuya Takeda

    IEICE Trans. Fundamentals   Vol. E97-A ( 3 ) page: 784-791   2014.3


    Language:English   Publishing type:Research paper (scientific journal)  

  255. Use of driver gaze information for detecting risky lane changes Reviewed

    Masataka Mori, Chiyomi Miyajima, Takatsugu Hirayama, Norihide Kitaoka, and Kazuya Takeda

    2014 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP '14)     page: 00   2014.2


    Language:English   Publishing type:Research paper (scientific journal)  

  256. Effect of acoustic and linguistic contexts on human and machine speech recognition

    Norihide Kitaoka, Daisuke Enami, Seiichi Nakagawa

    Computer Speech and Language   Vol. 28   page: 767-787   2014.2


    Language:English   Publishing type:Research paper (scientific journal)  

  257. Improvement of multimodal gesture and speech recognition performance using time intervals between gestures and accompanying speech

    Madoka Miki, Norihide Kitaoka, Chiyomi Miyajima, Takanori Nishino, Kazuya Takeda

    EURASIP Journal on Audio, Speech, and Music Processing   Vol. 2014 ( 2 ) page: 7pages   2014.1


    Language:English   Publishing type:Research paper (scientific journal)  

    DOI: 10.1186

  258. A graph-based spoken dialog strategy utilizing multiple understanding hypotheses

    Norihide Kitaoka, Yuji Kinoshita, Sunao Hara, Chiyomi Miyajima, Kazuya Takeda

    Information and Media Technologies   Vol. 29 ( 1 ) page: 1-10   2014.1


    Language:English   Publishing type:Research paper (scientific journal)  

  259. Per-Source Distance Estimation Focusing on the Eigenvalue Distribution of the Spatial Correlation Matrix Reviewed

    丹羽健太, 江崎知, 日岡裕輔, 西野隆典, 武田一哉

    IEICE Transactions (Japanese Edition)   Vol. J97-A ( 2 ) page: 68-76   2014


    Language:Japanese   Publishing type:Research paper (scientific journal)  

  260. Effective Frame Selection for Blind Source Separation based on Frequency Domain Independent Component Analysis

    Yusuke Mizuno, Takanori Nishino, Kazunobu Kondo, Norihide Kitaoka, Kazuya Takeda

    IEICE Trans. Fundamentals   Vol. E97-A ( 3 ) page: 784-791   2014

    Language:English   Publishing type:Research paper (scientific journal)  

  261. Driving scene retrieval with an integrated similarity measure using driving behavior and environment information

    Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    IEEJ Journal C   Vol. 134 ( 5 ) page: 1-8   2014

    Language:English   Publishing type:Research paper (scientific journal)  

  262. Modeling driver gaze and vehicle operation behavior during lane changes Reviewed

    Masataka Mori, Chiyomi Miyajima, Takatsugu Hirayama, Norihide Kitaoka, and Kazuya Takeda

    International Joint Workshop on Advanced Sensing/Visual Attention and Interaction - Toward Creation of Human-Harmonized Information Technology (ASVAI 2013)     page: 00   2013.11

    Language:English   Publishing type:Research paper (scientific journal)  

  263. Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing Reviewed

    Ken Ichikawa, Satoru Tsuge, Norihide Kitaoka, Kazuya Takeda, Kenji Kita

    APSIPA ASC 2013     page: 00   2013.10

    Language:English   Publishing type:Research paper (scientific journal)  

  264. Toward the development of a driving support system for repressing overtrust and overreliance Reviewed

    Yusuke Tanaka, Takashi Bando, Masumi Egawa, Hiroyuki Okuda, Hitoshi Terai, Takatsugu Hirayama, Chiyomi Miyajima, Daisuke Deguchi, Katsuhiko Kaji, Kazuya Takeda, and Tatsuya Suzuki

    20th ITS World Congress 2013     page: 00   2013.10

    Language:English   Publishing type:Research paper (scientific journal)  

  265. Integrated modeling of driver gaze and vehicle operation behavior to estimate risk level during lane changes Reviewed

    Masataka Mori, Chiyomi Miyajima, Takatsugu Hirayama, Norihide Kitaoka, Kazuya Takeda

    Proc. IEEE ITSC 2013     page: 00   2013.10

    Language:English   Publishing type:Research paper (scientific journal)  

  266. Modeling driver gaze and vehicle operation patterns to estimate long-term risk levels of driving behavior Reviewed

    Masataka Mori, Chiyomi Miyajima, Takatsugu Hirayama, Norihide Kitaoka, and Kazuya Takeda

    Sixth Biennial Workshop on Digital Signal Processing for In-Vehicle Systems (DSP in Vehicles 2013)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  267. An audio-visual in-car corpus "CENSREC-2-AV" for robust bimodal speech recognition: DSP, human-to-vehicle interfaces, driver behavior, and safety Reviewed

    Takuya Kawasaki, Satoshi Tamura, Satoru Hayamizu, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    Sixth Biennial Workshop on Digital Signal Processing for In-Vehicle Systems (DSP in Vehicles 2013)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  268. Toward well-balanced man-machine cooperation in vehicle Reviewed

    Kentarou Hitomi, Takashi Bando, Masumi Egawa, Hiroyuki Okuda, Hitoshi Terai, Takatsugu Hirayama, Chiyomi Miyajima, Daisuke Deguchi, Katsuhiko Kaji, Kazuya Takeda, and Tatsuya Suzuki

    Sixth Biennial Workshop on Digital Signal Processing for In-Vehicle Systems (DSP in Vehicles 2013)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  269. Adaptation techniques for stochastic driver-behavior modeling Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    Sixth Biennial Workshop on Digital Signal Processing for In-Vehicle Systems (DSP in Vehicles 2013)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  270. Analysis of driving behavior signals recorded from different types of vehicles using CAN and Smartphone Reviewed

    Chiyomi Miyajima, Hiroaki Ishikawa, Masataka Kaneko, Norihide Kitaoka, and Kazuya Takeda

    2nd International Symposium on Future Active Safety Technology toward zero traffic accidents (FAST-zero '13)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  271. Repressing overtrust: driver cooperated driving support systems Reviewed

    Takashi Bando, Masumi Egawa, Hiroyuki Okuda, Hitoshi Terai, Takatsugu Hirayama, Chiyomi Miyajima, Daisuke Deguchi, Katsuhiko Kaji, Kazuya Takeda, and Tatsuya Suzuki

    2nd International Symposium on Future Active Safety Technology toward zero traffic accidents (FAST-zero '13)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  272. An integrated similarity measure for driving scene retrieval using driving behavior and environmental information Reviewed

    Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    2nd International Symposium on Future Active Safety Technology toward zero traffic accidents (FAST-zero '13)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  273. Prediction of context-dependent deceleration behavior Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    2nd International Symposium on Future Active Safety Technology toward zero traffic accidents (FAST-zero '13)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  274. Comparison of lane change behavior of expert and non-expert drivers Reviewed

    Masataka Mori, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    2nd International Symposium on Future Active Safety Technology toward zero traffic accidents (FAST-zero '13)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  275. Analysis of lane change maneuvers based on driver gaze and vehicle operation behavior Reviewed

    Masataka Mori, Chiyomi Miyajima, Takatsugu Hirayama, Norihide Kitaoka, and Kazuya Takeda

    International Conference on Driver Distraction and Inattention 2013 (DDI 2013)     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  276. Modeling Safety of Lane Change Maneuvers Based on Driver Gaze and Vehicle Operation Behavior Reviewed

    Masataka Mori, Chiyomi Miyajima, Takatsugu Hirayama, Norihide Kitaoka, Kazuya Takeda

    DDI 2013     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  277. Classification of speech under stress based on physical modeling

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    Acoustical Science and Technology   Vol. 34 ( 5 ) page: 311-321   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  278. Modeling subjective evaluation of music similarity using tolerance Reviewed

    Shota Kawabuchi, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    Proc. EUSIPCO 2013     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  279. Measuring driving behavior on different types of vehicles Reviewed

    Chiyomi Miyajima, Hiroki Takeshita, Hiroaki Ishikawa, Norihide Kitaoka, Kazuya Takeda

    The SICE Annual Conference 201     page: 00   2013.9

    Language:English   Publishing type:Research paper (scientific journal)  

  280. Classification of speech under stress by modeling the aerodynamics of the laryngeal ventricle Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    Proc. INTERSPEECH2013     page: 00   2013.8

    Language:English   Publishing type:Research paper (scientific journal)  

  281. Classification of speech under stress based on modeling of the vocal folds and vocal tract Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    EURASIP Journal on Audio, Speech, and Music Processing     page: 000   2013.7

    Language:English   Publishing type:Research paper (scientific journal)  

  282. Objective and subjective evaluation of complementary Wiener filter for speech dereverberation Reviewed

    Kento Ohtani, Tatsuya Komatsu, Kazunobu Kondo, Takanori Nishino, and Kazuya Takeda

    21st International Congress on Acoustics (ICA2013)     page: 00   2013.6

    Language:English   Publishing type:Research paper (scientific journal)  

  283. Stochastic mixture modeling of driving behavior during car following Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    Journal of Information and Communication Convergence Engineering   Vol. 11 ( 2 ) page: 95-102   2013.6

    Language:English   Publishing type:Research paper (scientific journal)  

  284. Stochastic mixture modeling of driving behavior during car following Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    Journal of Information and Communication Convergence Engineering   Vol. 11 ( 2 ) page: 95-102   2013.6

    Language:English   Publishing type:Research paper (scientific journal)  

  285. Stochastic mixture modeling of driving behavior during car following Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    Journal of Information and Communication Convergence Engineering     page: 00   2013.6

    Language:English   Publishing type:Research paper (scientific journal)  

  286. Modeling Room Impulse Response via Composites of Spatial-Temporal GP's Reviewed

    Tatsuya Komatsu, Gareth W. Peters, Tomoko Matsui, Ido Nevat, Kazuya Takeda

    ICA 2013 Montreal     page: 00   2013.6

    Language:English   Publishing type:Research paper (scientific journal)  

  287. Computationally efficient single channel dereverberation based on complementary Wiener filter Reviewed

    Kazunobu Kondo, Yu Takahashi, Tatsuya Komatsu, Takanori Nishino, and Kazuya Takeda

    ICASSP2013     page: 00   2013.5

    Language:English   Publishing type:Research paper (scientific journal)  

  288. Modeling head-related transfer functions via spatial-temporal Gaussian process Reviewed

    Tatsuya Komatsu, Takanori Nishino, Gareth Peters, Tomoko Matsui, and Kazuya Takeda

    ICASSP2013     page: 00   2013.5

    Language:English   Publishing type:Research paper (scientific journal)  

  289. Estimation of tolerance in similarity judgments between musical pieces Reviewed

    川渕 将太, 宮島 千代美, 北岡 教英, 武田 一哉

    情報処理学会MUS/EC合同研究会   Vol. 2013-MUS-98 ( 2 ) page: 6 pages   2013.5

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  290. Estimation of vocal tract parameters for the classification of speech under stress

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    2013 IEEE International Conference     page: 00   2013.5

    Language:English   Publishing type:Research paper (scientific journal)  

  291. Analysis and modeling of entrainment in chorus singing Reviewed

    Motonari Kawagishi, Shota Kawabuchi, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    ICASSP2013     page: 00   2013.5

    Language:English   Publishing type:Research paper (scientific journal)  

  292. Collection and analysis of individuality data in similarity judgments between musical pieces Reviewed


    情報処理学会論文誌   Vol. 54 ( 4 ) page: 000   2013.4

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  293. Generative approach for robust acoustic model training for blindly separated speech recognition Reviewed

    Norihide Kitaoka, Yuto Dekiura, Kazuya Takeda

    ICA2013/ASA/CAA     page: 00   2013.3

    Language:English   Publishing type:Research paper (scientific journal)  

  294. Modeling the F0 dynamics of singing voices in a chorus using a spring-mass system Reviewed

    川岸 基成, 宮島 千代美, 北岡 教英, 武田 一哉

    情報処理学会MUS/EC合同研究会   Vol. 2013-MUS-98 ( 12 ) page: 6 pages   2013.3

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  295. Classification of speech under stress using physical features based on two-mass model Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

      Vol. SP2012 ( 128 ) page: 47-52   2013.3

    Language:English   Publishing type:Research paper (scientific journal)  

  296. Spoken document retrieval using combinational use of distances of multiple vector spaces and query expansion with optimized weight parameters Reviewed

    Satoru Tsuge, Hiromasa Ohashi, Norihide Kitaoka, Kazuya Takeda, Kenji Kita

    Proc. NCSP'13     page: 00   2013.3

    Language:English   Publishing type:Research paper (scientific journal)  

  297. Online detection of task incompletion in spoken dialog systems using utterance and behavior tag N-grams Reviewed

    原 直, 北岡教英, 武田一哉

    電子情報通信学会論文誌(D)   Vol. 96-D ( 1 ) page: 81-93   2013.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  298. Behavior Signal Processing for Vehicle Applications Reviewed

    Chiyomi Miyajima, Pongtep Angkititrakul, Kazuya Takeda

    APSIPA Transactions on Signal and Information Processing   Vol. 00 ( 00 ) page: 1-13   2013.1

    Language:English   Publishing type:Research paper (scientific journal)  

  299. Modeling and Analysis of Driving Behavior Based on a Probability-Weighted ARX Model Reviewed

    H Okuda, N Ikami, T Suzuki, Y Tazaki, K Takeda

    IEEE Trans. on Intelligent Transportation Systems   Vol. 14 ( 1 ) page: 98-112   2013.1

    Language:English   Publishing type:Research paper (scientific journal)  

  300. CENSREC-2-AV: An evaluation framework for bimodal speech recognition in real environments Reviewed

    Naoya Ukai, Takuya Kawasaki, Satoshi Tamura, Satoru Hayamizu, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    COCOSDA     page: 00   2012.12

    Language:English   Publishing type:Research paper (scientific journal)  

  301. Subjective similarity of music: Data collection for individuality analysis Reviewed

    Naoya Ukai, Takuya Kawasaki, Satoshi Tamura, Satoru Hayamizu, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    APSIPA     page: 00   2012.12

    Language:English   Publishing type:Research paper (scientific journal)  

  302. Self-coaching system based on recorded driving data: Learning from one's experiences Reviewed

    Kazuya Takeda, Chiyomi Miyajima, Tatsuya Suzuki, Pongtep Angkititrakul, Kenji Kurumida, Yuichi Kuroyanagi, Hiroaki Ishikawa, Ryuta Terashima, Toshihiro Wakita, Masato Oikawa, and Yuichi Komada

    IEEE Transactions on Intelligent Transportation Systems   Vol. 13   page: 1821-1831   2012.12

    Language:English   Publishing type:Research paper (scientific journal)  

  303. Acoustic model training using feature vectors generated by manipulating speech parameters of real speakers Reviewed

    Tetsuto Kawai, Norihide Kitaoka, Kazuya Takeda

    Proc. APSIPA ASC 2012     page: 00   2012.12

    Language:English   Publishing type:Research paper (scientific journal)  

  304. Acoustic model training using pseudo-speaker feature generated by MLLR transformations for robust speech recognition Reviewed

    Arata Itoh, Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    IEICE Trans. Inf. & Syst   Vol. E95-D ( 10 ) page: 2479-2485   2012.10

    Language:English   Publishing type:Research paper (scientific journal)  

  305. Measuring driver awareness based on correlation between gaze behavior and risks of surrounding vehicles Reviewed

    Masataka Mori, Chiyomi Miyajima, Pongtep Angkititrakul, Takatsugu Hirayama, Yiyang Li, Norihide Kitaoka, and Kazuya Takeda

    ITSC 2012     page: 00   2012.9

    Language:English   Publishing type:Research paper (scientific journal)  

  306. Analysis and prediction of deceleration behavior during car-following using stochastic driver-behavior model Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    ITSC 2012     page: 00   2012.9

    Language:English   Publishing type:Research paper (scientific journal)  

  307. Classification of stressed speech using physical parameters derived from two-mass model Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    INTERSPEECH 2012     page: 00   2012.9

    Language:English   Publishing type:Research paper (scientific journal)  

  308. Measuring driver awareness based on correlation between gaze behavior and risks of surrounding vehicles Reviewed

    Masataka Mori, Chiyomi Miyajima, Pongtep Angkititrakul, Takatsugu Hirayama, Yiyang Li, Norihide Kitaoka, Kazuya Takeda

    ITSC2012     page: 00   2012.9

    Language:English   Publishing type:Research paper (scientific journal)  

  309. Fast source separation based on selection of effective temporal frames Reviewed

    Yusuke Mizuno, Kazunobu Kondo, Takanori Nishino, Norihide Kitaoka, Kazuya Takeda

    Proc. EUSIPCO 2012     page: 00   2012.8

    Language:English   Publishing type:Research paper (scientific journal)  

  310. Impact of driving context on stochastic driver-behavior model: Quantitative analysis of car following task Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    2012 IEEE International Conference on Vehicular Electronics and Safety (ICVES 2012)     page: 00   2012.7

    Language:English   Publishing type:Research paper (scientific journal)  

  311. An improved driver-behavior model with combined individual and general driving characteristics Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    2012 IEEE Intelligent Vehicles Symposium (IV'12)     page: 00   2012.6

    Language:English   Publishing type:Research paper (scientific journal)  

  312. Causal analysis of task incompletion for spoken dialogs focused on interactions of user and the system Reviewed

    Sunao Hara, Norihide Kitaoka and Kazuya Takeda

    Proc. LREC 2012     page: 00   2012.5

    Language:English   Publishing type:Research paper (scientific journal)  

  313. Data collection for individuality analysis on subjective music similarity evaluation Reviewed

    Shota Kawabuchi, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    The Acoustics 2012     page: 00   2012.5

    Language:English   Publishing type:Research paper (scientific journal)  

  314. Multi-band speech recognition using band-dependent confidence measures of blind source separation Reviewed

    Atsushi Ando, Hiromasa Ohashi, Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    ACOUSTICS 2012     page: 00   2012.5

    Language:English   Publishing type:Research paper (scientific journal)  

  315. Physical characteristics of vocal folds during speech under stress Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    ICASSP2012     page: 00   2012.3

    Language:English   Publishing type:Research paper (scientific journal)  

  316. Estimating sound source depth using a small-size array Invited Reviewed

    Satoshi Esaki, Kenta Niwa, Takanori Nishino, and Kazuya Takeda

    ICASSP2012     page: 00   2012.3

    Language:English   Publishing type:Research paper (scientific journal)  

  317. International large-scale vehicle corpora for research on driver behavior on the road Reviewed

    Kazuya Takeda, John Hansen, Pinar Boyraz, Lucas Malta, Chiyomi Miyajima, and Huseyin Abut

    IEEE Transactions on Intelligent Transportation Systems   Vol. 12 ( 4 ) page: 1609-1623   2011.12

    Language:English   Publishing type:Research paper (scientific journal)  

  318. Robust seed model training for speaker adaptation using pseudo-speaker features generated by inverse CMLLR transformation Reviewed

    Arata Itoh, Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    2011 Automatic Speech Recognition and Understanding Workshop (ASRU 2011)     page: 00   2011.12

    Language:English   Publishing type:Research paper (scientific journal)  

  319. Training Robust Acoustic Models Using Features of Pseudo-Speakers Generated by Inverse CMLLR Transformations Reviewed

    Arata Itoh, Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    2011 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011)     page: 00   2011.10

    Language:English   Publishing type:Research paper (scientific journal)  

  320. Behavior signal processing for vehicular applications Reviewed

    Chiyomi Miyajima, Pongtep Angkititrakul, and Kazuya Takeda

    2011 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA 2011)     page: 00   2011.10

    Language:English   Publishing type:Research paper (scientific journal)  

  321. Improving driving behavior by allowing drivers to browse their own recorded driving data Reviewed

    Kazuya Takeda, Chiyomi Miyajima, Tatsuya Suzuki, Kenji Kurumida, Yuichi Kuroyanagi, Hiroaki Ishikawa, Pongtep Angkititrakul, Ryuta Terashima, Toshihiro Wakita, Masato Oikawa, and Yuichi Komada

    International IEEE Conference on Intelligent Transportation Systems (ITSC 2011)     page: 00   2011.10

    Language:English   Publishing type:Research paper (scientific journal)  

  322. Adaptation of driver-behavior model with application to car-following task Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, Kazuya Takeda, Ryuta Terashima, and Toshihiro Wakita

    First International Symposium on Future Active Safety Technology Toward Zero-Traffic-Accident (FAST-zero 2011)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  323. CENSREC-4: An evaluation framework for distant-talking speech recognition in reverberant environments Reviewed

    Takahiro Fukumori, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Takeshi Yamada, Kazumasa Yamamoto, Satoru Tsuge, Masakiyo Fujimoto, Tetsuya Takiguchi, Chiyomi Miyajima, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, and Satoshi Nakamura

    Acoustical Science and Technology   Vol. 32 ( 5 ) page: 201-210   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  324. Development and evaluation of Japanese Lombard speech corpus Reviewed

    Tetsuji Ogawa, Takanobu Nishiura, Takeshi Yamada, Norihide Kitaoka, and Tetsunori Kobayashi

    Inter-noise 2011     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  325. An analysis of the speech under stress using the two-mass vocal fold model Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    The 3rd International Workshop on Spoken Dialogue Systems Technology (IWSDS 2011)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  326. Efficient blind speech separation suitable for embedded devices Reviewed

    Kazunobu Kondo, Yu Takahashi, Seiichi Hashimoto, Hiroshi Saruwatari, Takanori Nishino, Kazuya Takeda

    19th European signal processing conference (EUSIPCO 2011)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  327. On the feasibility of the Mel frequency scale for sound event recognition in realistic conditions Reviewed

    Huy Dat Tran, Yi Ren Leng, Norihide Kitaoka and Haizhou Li

    Inter-noise 2011     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  328. Adaption of driver-behavior model to car-following tasks Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, Kazuya Takeda, Ryuta Terashima, and Toshihiro Wakita

    First International Symposium on Future Active Safety Technology Toward Zero-Traffic-Accident (FAST-zero 2011)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  329. Retrieval systems for recorded driving situations based on measuring similarity between driving behavior signals Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, Kazuya Takeda, Ryuta Terashima, and Toshihiro Wakita

    FAST-zero2011     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  330. Driving scene retrieval using integrated vehicle motion matching Reviewed

    Yiyang Li, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    5th Biennial Workshop on DSP for In-Vehicle Systems     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  331. An analysis of the speech under stress using the two-mass vocal fold model Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    The 3rd International Workshop on Spoken Dialogue Systems Technology (IWSDS 2011)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  332. A driving diagnosis and feedback system for next-generation drive recorders Reviewed

    Chiyomi Miyajima, Kazuya Takeda, Tatsuya Suzuki, Kenji Kurumida, Yuichi Kuroyanagi, Hiroaki Ishikawa, Pongtep Angkititrakul, Ryuta Terashima, Toshihiro Wakita, Masato Oikawa, and Yuichi Komada

    First International Symposium on Future Active Safety Technology Toward Zero-Traffic-Accident (FAST-zero 2011)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  333. Efficient blind speech separation suitable for embedded devices Reviewed

    Kazunobu Kondo, Yu Takahashi, Seiichi Hashimoto, Hiroshi Saruwatari, Takanori Nishino, and Kazuya Takeda

    EUSIPCO 2011 (19th European signal processing conference)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  334. On-line detection of task incompletion for spoken dialog systems using utterance and behavior tag N-gram vectors Reviewed

    Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    International Workshop on Spoken Dialog Systems (IWSDS2011)     page: 00   2011.9

    Language:English   Publishing type:Research paper (scientific journal)  

  335. Alternative Frequency Scale Cepstral Coefficient for Robust Sound Event Recognition Reviewed

    Yiren Leng, Huy Dat Tran, Norihide Kitaoka, Haizhou Li

    Proc. INTERSPEECH2011     page: 00   2011.8

    Language:English   Publishing type:Research paper (scientific journal)  

  336. Detection of task-incomplete dialogs based on utterance-and-behavior tag N-gram for spoken dialog systems Reviewed

    Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    Proc. INTERSPEECH2011     page: 00   2011.8

    Language:English   Publishing type:Research paper (scientific journal)  

  337. Music Recommendation System Based on Human-to-human Conversation Recognition Reviewed

    Hiromasa Ohashi, Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    2nd International Workshop on Human-Centric Interfaces for Ambient Intelligence (HCIAmI'11)     page: 00   2011.7

    Language:English   Publishing type:Research paper (scientific journal)  

  338. On the use of the two-mass vocal cord model in characterizing the stress speech Reviewed

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    Technical Report of IEICE     page: 00   2011.6

    Language:English   Publishing type:Research paper (scientific journal)  

  339. Detection of distracted driving using a Bayesian network Reviewed

    Hiroaki Ishikawa, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    ICIC Express Letters, Part B: Applications, An International Journal of Research and Surveys, ICIC International   Vol. 2 ( 3 ) page: 627-633   2011.6

    Language:English   Publishing type:Research paper (scientific journal)  

  340. Analysis and detection of potentially hazardous situation in real-world driving Reviewed

    Yuichi Kuroyanagi, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    ICIC Express Letters, Part B: Applications, An International Journal of Research and Surveys, ICIC International   Vol. 2 ( 3 ) page: 621-626   2011.6

    Language:English   Publishing type:Research paper (scientific journal)  

  341. Modeling and adaptation of stochastic driver-behavior model with application to car following Reviewed

    Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    2011 IEEE Intelligent Vehicles Symposium     page: 00   2011.6

    Language:English   Publishing type:Research paper (scientific journal)  

  342. Field data collection of a distributed spoken dialog system for music retrieval and its evaluation Reviewed

    Sunao Hara, Norihide Kitaoka, Kazuya Takeda

    Global Engineering, Science, and Technology Society International Transaction on Computer Science and Engineering   Vol. 64 ( 1 ) page: 33-58   2011.5

    Language:English   Publishing type:Research paper (scientific journal)  

  343. Analysis of real-world driver's frustration Reviewed

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    IEEE Transactions on Intelligent Transportation Systems   Vol. 12 ( 1 ) page: 109-118   2011.3

    Language:English   Publishing type:Research paper (scientific journal)  

  344. Blind source separation using dodecahedral microphone array under reverberant conditions Reviewed

    Motoki Ogasawara, Takanori Nishino, and Kazuya Takeda

    IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences   Vol. E94-A ( 3 ) page: 897-906   2011.3

    Language:English   Publishing type:Research paper (scientific journal)  

  345. Improved method of blind speech separation with low computational complexity Reviewed

    Kazunobu Kondo, Yu Takahashi, Seiichi Hashimoto, Hiroshi Saruwatari, Takanori Nishino, and Kazuya Takeda

    Advances in Acoustics and Vibration   Vol. 2011   page: Article ID 765429, 10 pages   2011.2

    Language:English   Publishing type:Research paper (scientific journal)  

  346. Detection of task-incomplete dialogs using utterance-sequence N-grams in spoken dialog systems Reviewed


    電子情報通信学会論文誌(D)     page: 00   2011.2

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  347. Current status and prospects of behavior signal processing Reviewed


    システム制御情報学会誌   Vol. 55 ( 1 ) page: 2-7   2011.1

  348. Blind source separation using dodecahedral microphone array under reverberant conditions

    Motoki Ogasawara, Takanori Nishino, and Kazuya Takeda

    IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences   Vol. 94-A ( 3 ) page: 897-906   2011.1

    Language:English   Publishing type:Research paper (scientific journal)  

  349. Construction and applications of a driving behavior database


    システム/制御/情報   Vol. 55   page: 00   2011.1

  350. Detection of distracted driving using a Bayesian network Reviewed

    Hiroaki Ishikawa, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    2010 International Conference on Innovative Computing, Information and Control (ICICIC2010)     page: 621-626   2010.12

    Language:English   Publishing type:Research paper (scientific journal)  

  351. Analysis and detection of potentially hazardous situations in real-world driving Reviewed

    Yuichi Kuroyanagi, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    2010 International Conference on Innovative Computing, Information and Control (ICICIC2010)451     page: 621-626   2010.12

    Language:English   Publishing type:Research paper (scientific journal)  

  352. Development of a multimodal signal recording system for automobile driving Reviewed


    電子情報通信学会論文誌(D)   Vol. J93-D ( 5 ) page: 1244-1252   2010.10

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  353. Automatic detection of task-uncompleted dialog for spoken dialog system based on dialog act N-gram Reviewed

    Sunao Hara, Norihide Kitaoka, and Kazuya Takeda

    Proc. INTERSPEECH 2010     page: 3034-3037   2010.9

    Language:English   Publishing type:Research paper (scientific journal)  

  354. CENSREC-1-AV: An audio-visual corpus for noisy bimodal speech recognition Reviewed

    Satoshi Tamura, Chiyomi Miyajima, Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Tetsuya Takiguchi, Kazumasa Yamamoto, and Kazuya Takeda

    AVSP2010     page: 00   2010.9

    Language:English   Publishing type:Research paper (scientific journal)  

  355. Rapid acoustic model adaptation using inverse MLLR-based feature generation Reviewed

    Arata Ito, Sunao Hara, Norihide Kitaoka, and Kazuya Takeda

    International Congress on Acoustics (ICA2010)     page: 942-947   2010.8

    Language:English   Publishing type:Research paper (scientific journal)  

  356. Visualization and dereverberation of head-related transfer function based on spatio-temporal frequency analysis Reviewed

    Yasuko Morimoto, Takanori Nishino, and Kazuya Takeda

    20th International Congress on Acoustics (ICA2010)     page: 00   2010.8

    Language:English   Publishing type:Research paper (scientific journal)  

  357. A Browsing and Retrieval System for Driving Data Reviewed

    Masashi Naito, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, and Kazuya Takeda

    2010 IEEE Intelligent Vehicles Symposium (IV'2010)     page: 00   2010.6

    Language:English   Publishing type:Research paper (scientific journal)  

  358. Use of on-road data in evaluating driver performance metrics Reviewed

    Lucas Malta, Akira Ozaki, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

    SAE International Journal of Passenger Cars Electronic and Electrical Systems     page: 00   2010.5

    Language:English   Publishing type:Research paper (scientific journal)  

  359. Estimation method of user satisfaction using N-gram-based dialog history model for spoken dialog system Reviewed

    Sunao Hara, Norihide Kitaoka, and Kazuya Takeda

    LREC2010     page: 78-83   2010.5

    Language:English   Publishing type:Research paper (scientific journal)  


    Kotaro Ogino, Takatoshi Jitsuhiro, Chiyomi Miyajima, Kazuya Takeda

    Motoki Ogasawara, Takanori Nishino, Kazuya Takeda

  362. Analysis of real-world driver's frustration Reviewed

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    IEEE Transactions on Intelligent Transportation Systems   Vol. 12 ( 1 ) page: 109-118   2010.3

    Language:English   Publishing type:Research paper (scientific journal)  

  363. A small dodecahedral microphone array for blind source separation Reviewed

    Motoki Ogasawara, Takanori Nishino, and Kazuya Takeda

    ICASSP2010     page: 00   2010.3

    Language:English   Publishing type:Research paper (scientific journal)  

  364. Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition Reviewed

    Makoto Sakai, Norihide Kitaoka, Yuya Hattori, Seiichi Nakagawa, and Kazuya Takeda

    IEICE Trans. on Information & Systems   Vol. 000 ( 000 ) page: 000   2010.2

    Language:English   Publishing type:Research paper (scientific journal)  

  365. Evaluation of Combinational Use of Discriminant Analysis-based Acoustic Feature Transformation and Discriminative Training Reviewed

    Makoto SAKAI, Norihide KITAOKA, Yuya HATTORI, Seiichi NAKAGAWA, and Kazuya TAKEDA

    IEICE Trans. on Information & Systems   Vol. E93-D ( 2 ) page: 395-398   2010.2

    Language:English   Publishing type:Research paper (scientific journal)  

  366. Modeling of lane-change trajectories using a stochastic method Reviewed


    情報処理学会論文誌   Vol. 51 ( 1 ) page: 131-140   2010.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  367. Representation and comparison of HRTF in spatio-temporal frequency domain Reviewed

    Yasuko Morimoto, Takanori Nishino, and Kazuya Takeda

    3rd International Universal Communication Symposium (IUCS2009)     page: 00   2009.12

    Language:English   Publishing type:Research paper (scientific journal)  

  368. Driver evaluation based on classification of rapid decelerating patterns Reviewed

    Atsumi Naito, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Kazuya Takeda

  369. Blind source separation using dodecahedral microphone array for selective listening point audio Reviewed

    Motoki Ogasawara, Takanori Nishino, Kazuya Takeda

  370. A Multimedia Corpus of Driving Behaviors Reviewed

    Lucas Malta, Akira Ozaki, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

  371. Analysis of measured head-related transfer functions based on spatio-temporal frequency characteristic Reviewed

    Yasuko Morimoto, Takanori Nishino, Kazuya Takeda

  372. Evaluation of driver-behavior models in real-world car-following task Reviewed

    Pongtep Angkititrakul, Ryuta Terashima, Toshihiro Wakita, Chiyomi Miyajima, Kazuya Takeda, and Tatsuya Suzuki

    ICVES2009     page: 00   2009.11

    Language:English   Publishing type:Research paper (scientific journal)  

  373. Analysis of measured head-related transfer functions based on spatio-temporal frequency characteristics Reviewed

    Yasuko Morimoto, Takanori Nishino, and Kazuya Takeda

    IWPASH2009     page: 00   2009.11

    Language:English   Publishing type:Research paper (scientific journal)  

  374. Driver evaluation based on the classification of rapid decelerating patterns Reviewed

    Atsumi Naito, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, and Kazuya Takeda

    ICVES2009     page: 00   2009.11

    Language:English   Publishing type:Research paper (scientific journal)  

  375. Blind source separation using dodecahedral microphone array for selective listening point audio Reviewed

    Motoki Ogasawara, Takanori Nishino, and Kazuya Takeda

    IWPASH2009     page: 00   2009.11

    Language:English   Publishing type:Research paper (scientific journal)  

  376. Automatic identification for singing style based on sung melodic contour characterized in phase plane Reviewed

    Tatsuya Kako, Yasunori Ohishi, Hirokazu Kameoka, Kunio Kashino, Kazuya Takeda

  377. Prediction Model of Driving Behavior Based on Traffic Conditions and Driver Types Reviewed

    Hideomi Amata, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Kazuya Takeda

    Tatsuya Kako, Yasunori Ohishi, Hirokazu Kameoka, Kunio Kashino, Kazuya Takeda

    International Conference on Music Information Retrieval (ISMIR2009)     page: 393-397   2009.10

    Language:English   Publishing type:Research paper (scientific journal)  

  379. A stochastic signal model for predicting the vehicle trajectory at lane change Reviewed

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    4th Biennial Workshop on Digital Signal Processing for In-Vehicle Systems and Safety     page: 00   2009.10

    Language:English   Publishing type:Research paper (scientific journal)  

  380. CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments Reviewed

    Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura

    Acoustical Science and Technology   Vol. 30 ( 5 ) page: 363-371   2009.9

    Language:English   Publishing type:Research paper (scientific journal)  

  381. Blind source separation based on acoustic pressure distribution and normalized relative phase using dodecahedral microphone array Reviewed

    Motoki Ogasawara, Takanori Nishino, Kazuya Takeda

  382. Analysis of head-related transfer functions based on spatio-temporal frequency characteristics Reviewed

    Yasuko Morimoto, Takanori Nishino, Kazuya Takeda

  383. A Study of Driver Behavior Under Potential Threats in Vehicle Traffic Reviewed

    Lucas Malta, Chiyomi Miyajima, Kazuya Takeda

    IEEE Trans. on ITS   Vol. 10 ( 2 ) page: 201-210   2009.6

    Language:English   Publishing type:Research paper (scientific journal)  

  384. Multimodal estimation of a driver's spontaneous irritation Reviewed

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

  385. A Stochastic Modeling of Vehicle and Analysis Reviewed

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

  386. Multimodal real-world driving data collection and analysis Reviewed

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

  387. Feature transformation based on discriminant analysis preserving local structure for speech recognition Reviewed

    Makoto Sakai, Norihide Kitaoka, Kazuya Takeda

  388. Stochastic modeling of vehicle trajectory during lane-changing Reviewed

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

  389. Spoken dialog strategy based on understanding graph search Reviewed

    Yuji Kinoshita, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

  390. *Selective Listening Point Audio based on Blind Signal Separation and Stereophonic Technology Reviewed

    Kenta Niwa, Takanori Nishino, Kazuya Takeda

    IEICE Trans. on Information and Systems   Vol. E92-D ( 3 ) page: 469-476   2009.3

    Language:English   Publishing type:Research paper (scientific journal)  


    Abdul Wahab, Chai Quek, Kazuya Takeda, Chin Tan

    IEEE Transactions on Neural Networks   Vol. 20 ( 4 ) page: 563-582   2009.2

    Language:English   Publishing type:Research paper (scientific journal)  

  392. Construction of a range data map using an in-vehicle laser scanner and high-accuracy vehicle localization Reviewed


    電子情報通信学会論文誌   Vol. 92-D(2)   page: 215-225   2009.2

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  393. Integrated Speech Enhancement Method using Noise Suppression and Dereverberation Reviewed

    Kenta Niwa, Takanori Nishino, Kazuya Takeda

    IEEE Trans. Audio, Speech, and Language Processing   Vol. 17(2)   page: 231-246   2009.2

    Language:English   Publishing type:Research paper (scientific journal)  

  394. On-Going Data Collection of Driving Behavior Signals, In-Vehicle Corpus and Signal Processing for Driver Behavior, (Book Chapter)

    Chiyomi Miyajima, Takashi Kusakawa, Takanori Nishino, Norihide Kitaoka, Katsunobu Itou, and Kazuya Takeda

    Springer     page: 45-54   2009.1

    Language:English   Publishing type:Research paper (scientific journal)  

  395. Speech dereverberation based on maximum likelihood estimation with time-varying Gaussian source model Reviewed

    IEEE Trans. Audio, Speech, and Language Processing   Vol. 16(8)   page: 1512-1527   2008.11

    Language:English   Publishing type:Research paper (scientific journal)  

  396. Human-friendly speech interfaces Reviewed

    鹿野清宏、河原達也、猿渡 洋、武田一哉、河原英紀、徳田恵一、西浦敬信、李 晃伸

    情報処理学会論文誌   Vol. 49 ( 11 ) page: 3789-3797   2008.11

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  397. An integrative recognition method for speech and gestures Reviewed

    Madoka Miki, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, and Kazuya Takeda

  398. 3D AV Integrated System Featuring Arbitrary Listening-Point And Viewpoint Generation Reviewed

    Mehrdad Panahpour Tehrani, Kenta Niwa, Norishige Fukushima, Yasushi Hirano, Toshiaki Fujii, Masayuki Tanimoto, Kazuya Takeda, Kenji Mase, Akio Ishikawa, Shigeyuki Sakazawa, Atsushi Koike

  399. Generating Lane-Change Trajectories of Individual Drivers Reviewed

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, Ryuta TERASHIMA, Toshihiro Wakita, Kazuya Takeda

  400. Parameter Estimation Method of F0 Control Model for Singing Voices Reviewed

    Yasunori Ohishi, Hirokazu Kameoka, Kunio Kashino, Kazuya Takeda

  401. Building and Combining Document and Music Spaces for Music Query-By-Webpage System Reviewed

    Ryoei Takahashi, Yasunori Ohishi, Norihide Kitaoka, Kazuya Takeda

  402. CENSREC-4: Development of Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments Reviewed

    M.Nakayama, Nishiura, Denda, Kitaoka, Yamamoto, Yamada, Tsuge, Miyajima, Fujimoto, Takiguchi, Tamura, Ogawa, Matsuda, Kuroiwa, Takeda, Nakamura

  403. CENSREC-AV: Evaluation frameworks for audio-visual speech recognition Reviewed

    Satoshi TAMURA, Chiyomi MIYAJIMA, Norihide KITAOKA, Satoru HAYAMIZU, and Kazuya TAKEDA

  404. Binaural sound localization for untrained directions based on a Gaussian mixture model Reviewed

    Takanori Nishino and Kazuya Takeda

  405. Automatic bronchial branch name labeling method based on dynamic selection of multiple models Reviewed


    電子情報通信学会論文誌(D)   Vol. J91-D(7)   page: 1851-1861   2008.7

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  406. Multi-modal real-world driving data collection, transcription, and integration using Bayesian network Reviewed

    Lucas Malta, Chiyomi Miyajima, and Kazuya Takeda

  407. Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: Newest Part of the CENSREC Series Reviewed

    Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda and Satoshi Nakamura

  408. In-car Speech Data Collection along with Various Multimodal Signals Reviewed

    Akira Ozaki, Sunao Hara, Takashi Kusakawa, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Katunobu Itou and Kazuya Takeda

  409. Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation Reviewed

    Tran HUY DAT, Kazuya TAKEDA, Fumitada ITAKURA

    IEICE TRANSACTIONS on Information and Systems   Vol. E91-D ( 3 ) page: 439-447   2008.3

  410. Encoding large array signals into a 3D sound field representation for selective listening point audio based on blind source separation Reviewed

    Kenta Niwa, Takanori Nishino and Kazuya Takeda

  411. Encoding large array signals into a 3D sound field representation for selective listening point audio based on blind source separation Reviewed

    Kenta Niwa, Takanori Nishino and Kazuya Takeda

    Abdul Wahab, Chai Quek, Kazuya Takeda, Chin Tan

    IEEE Transactions on Neural Networks     page: 000   2008.2

    Language:English   Publishing type:Research paper (scientific journal)  

  413. Head-Related Transfer Function measurement in sagittal and frontal coordinates (letter) Reviewed

    Takashi Nakado, Takanori Nishino, Kazuya Takeda

    Acoustical Science and Technology   Vol. 29 ( 5 ) page: 335-337   2008.2

    Language:English   Publishing type:Research paper (scientific journal)  

  414. Estimation of speaker and listener positions in a car using binaural signals Reviewed

    Madoka Takimoto, Takanori Nishino, Hiroyuki Hoshino and Kazuya Takeda

    Acoustical Science and Technology   Vol. 29 ( 1 ) page: 110-112   2008.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

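  415. *Fundamental frequency trajectories of singing voice drawn in the phase plane: estimation of the singer's intended target pitch sequence and application to query-by-humming retrieval Reviewed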

    大石 康智, 後藤 真孝, 伊藤 克亘, 武田 一哉

    情報処理学会論文誌   Vol. 49 ( 11 ) page: 3789-3797   2008.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  


  416. Free viewpoint and listening-point video generation using a multi-viewpoint, multi-listening-point data acquisition system Reviewed

    福嶋慶繁, 丹羽健太, 圓道知博, 藤井俊彰, 谷本正幸, 西野隆典, 武田一哉

    電子情報通信学会論文誌   Vol. J91-D ( 8 ) page: 2039-2041   2008.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  417. Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance Reviewed

    Norihide Kitaoka, Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, and Satoshi Nakamura

  418. Multimodal driving data integration for the analysis of driver's responses to hazardous situations Reviewed

    Lucas Malta, Chiyomi Miyajima, and Kazuya Takeda

  419. A data collection system for speech recognition system use in diverse acoustic environments Reviewed


    電子情報通信学会論文誌   Vol. J90-D ( 10 ) page: 2807-2816   2007.10

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  420. Score compensation method for speech recognition systems that selectively use multiple recognizers Reviewed


    電子情報通信学会論文誌   Vol. J90-D ( 7 ) page: 1773-1780   2007.7

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  421. On-going data collection for driving behavior signal Reviewed

    Chiyomi Miyajima, Takashi Kusakawa, Takanori Nishino, Norihide Kitaoka, Katsunobu Itou, and Kazuya Takeda

  422. Mining potentially hazardous situations in vehicle traffic using driver's reactions Reviewed

    Lucas Malta, Chiyomi Miyajima, Kazuya Takeda

  423. Generation of Pedal Operation Patterns of Individual Drivers in Car-Following for Personalized Cruise Control Reviewed

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, Katsunobu Itou, Kazuya Takeda

  424. Sound localization under conditions of covered ears on the horizontal plane Reviewed

    Madoka Takimoto, Takanori Nishino, Katsunobu Itou and Kazuya Takeda

    Acoustical Science and Technology   Vol. 28 ( 5 ) page: 335-342   2007.5

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  425. Modeling of individuality in driving behavior signals using spectral analysis Reviewed


    小澤晃史,西脇由博,脇田敏裕,宮島千代美,伊藤克亘,武田一哉

    電子情報通信学会論文誌   Vol. J90-D ( 4 ) page: 1115-1123   2007.4

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  426. Statistical Segmentation and Recognition of Fingertip Trajectories for a Gesture Interface Reviewed

    Kazuhiro MORIMOTO, Chiyomi MIYAJIMA, Norihide KITAOKA, Katunobu ITOU, Kazuya TAKEDA

  427. *Driver Modeling Based on Driving Behavior and Its Evaluation in Driver Identification Invited Reviewed

    C. Miyajima, Y. Nishiwaki, K. Ozawa, T. Wakita, K. Itou, K. Takeda and F. Itakura

    Proceedings of the IEEE   Vol. 95 ( 2 ) page: 427-437   2007.2

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  428. A Stochastic Representation of the Dynamics of Sung Melody Reviewed

    Yasunori Ohishi, Masataka Goto, Katunobu Itou, Kazuya Takeda

  429. A Virtual Button Interface Using Fingertip Movements Reviewed

    Kazuhiro Morimoto, Chiyomi Miyajima, Kazuya Takeda

  430. A study on constructing playlist-adapted acoustic models for a music retrieval system


  431. Estimation of HRTFs on the horizontal plane using physical features Reviewed

    Takanori Nishino, Naoya Inoue, Kazuya Takeda and Fumitada Itakura

    Applied Acoustics   Vol. 68 ( 8 ) page: 897-908   2007.1

    Language:English   Publishing type:Research paper (scientific journal)  

  432. Sound source direction estimation based on a Gaussian distribution model using the envelope of interaural level differences Reviewed


    日本音響学会誌   Vol. 63 ( 1 ) page: 3-12   2007.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  433. Robust in-car speech recognition based on nonlinear multiple regression Reviewed

    Weifeng Li, Kazuya Takeda, Fumitada Itakura

    EURASIP Journal on Advances in Signal Processing   Vol. 2007 ( Article ID 16921 ) page: 1-10   2007.1

    Language:English   Publishing type:Research paper (scientific journal)  

  434. CENSREC-3:An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments

    Masakiyo Fujimoto, Kazuya Takeda, Satoshi Nakamura

    IEICE Trans. Inf. and Syst.   Vol. E89-D ( 11 ) page: 2783-2793   2006.11

    Language:English   Publishing type:Research paper (scientific journal)  

  435. On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement Reviewed

    Tran Huy Dat, Kazuya Takeda, Fumitada Itakura

    Speech Communication   Vol. 48 ( 11 )   2006.11

    Language:English   Publishing type:Research paper (scientific journal)  

  436. A study of a virtual button input interface using fingertip movements Reviewed


    2006年情報科学技術レターズ   Vol. LK-013   page: 305-308   2006.9

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  437. Discrimination between singing and read speech using the spectral envelope and temporal changes in fundamental frequency Reviewed


    情報処理学会論文誌   Vol. 47 ( 6 ) page: 1822-1830   2006.6

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  438. Maximum a Posteriori Probability and Cumulative Distribution Function Methods for Speech Spectral Estimation with Application in Noise Suppression Filtering

    Tran Huy Dat, Kazuya Takeda, Fumitada Itakura

    Lecture Notes in Computer Science   Vol. 3817   page: 328-337   2006.2

    Weifeng Li, Katsunobu Itou, Kazuya Takeda

    Tran Huy Dat, Kazuya Takeda, Fumitada Itakura

    Mehrdad Panahpour Tehrani, Yasushi Hirano, Toshiaki Fujii, Shoji Kajita, Kazuya Takeda, Kenji Mase

  442. Robust In-car Speech Recognition Based On Nonlinear Multiple Regression

    Weifeng LI, Kazuya Takeda, Fumitada Itakura

    EURASIP Journal on Advances in Signal Processing   Vol. 2007 ( Article ID 16921 ) page: 1-10   2006.1

    Language:English   Publishing type:Research paper (scientific journal)  

  443. MULTIPOINT MEASURING SYSTEM FOR VIDEO AND SOUND - 100-camera and microphone system - Reviewed

    Toshiaki Fujii, Kensaku Mori, Kazuya Takeda, Kenji Mase, Masayuki Tanimoto, Yasuhito Suenaga

  444. Characterizing in-car Conversational Speech of Different Dialogue Modes Reviewed

    Hiroshi Fujimura, Chiyomi Miyajima, Nobuo Kawaguchi, Katsunobu Itou, Kazuya Takeda and Fumitada Itakura

  445. Single-Channel Multiple Regression for In-Car Spee

    Weifeng LI, Katsunobu ITOU, Kazuya TAKEDA, Fumitada ITAKURA

    IEICE Trans. on Inf. & Syst.   Vol. E89-D   page: 1032-1039   2006.1

    Language:English   Publishing type:Research paper (scientific journal)  

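  446. A study of a sound source direction estimation model that approximates the feature distribution of interaural level differences with Gaussian distributions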


    音講論集・日本音響学会(2006年秋季研究発表会)   ( 37263 ) page: 469-470   2006.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  447. Driver identification based on cepstral analysis of driving operation signals

    武田 一哉,小澤 晃史,西脇 由博,脇田 敏裕,宮島 千代美, 伊藤 克亘

    情報処理学会   Vol. 2006-SLP-60   page: 19-24   2006.1

    Authorship:Lead author   Language:Japanese  

  448. Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus

    Yasunori Ohishi, Katunobu Itoh, Kazuya Takeda, Atsushi Fujii

  449. Development of Micro-Dodecahedral Loudspeaker for Measuring Head-Related Transfer Functions In The Proximal Region

    Seiichiro Hosoe, Takanori Nishino, Katunobu Itoh, Kazuya Takeda

  450. Cepstral Analysis of Driving Behavioral Signals for Driver Identification

    Chiyomi Miyajima, Yoshihiro Nishiwaki, Koji Ozawa, Toshihiro Wakita, Katunobu Itoh, Kazuya Takeda

  451. Measurement of Head Related Transfer Functions with Micro-Dodecahedral Loudspeaker in the Proximal Region

    Takanori Nishino, Seiichiro Hosoe, Katunobu Itoh, Kazuya Takeda

  452. On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices

    Yasunori Ohishi, Masataka Goto, Katunobu Itoh, Kazuya Takeda

  453. Gamma Modeling of Speech Power and Its Online Estimation for Statistical Speech Enhancement Reviewed

    Tran Huy Dat, Kazuya Takeda, Fumitada Itakura

    IEICE Trans. Inf. and Syst.   Vol. E89-D ( 3 ) page: 1040-1049   2006.1

    Language:English   Publishing type:Research paper (scientific journal)  

  454. Driver Identification Using Driving Signals Reviewed

    Toshihiro Wakita, Chiyomi Miyajima, Katsunobu Itou, and Kazuya Takeda

    IEICE Trans. Inf. & Syst.   Vol. E89-D ( 3 ) page: 1188-1194   2006.1

    Language:English   Publishing type:Research paper (scientific journal)  

  455. Speech retrieval system for power dispatching telephone calls Reviewed


    電気学会論文誌C   Vol. 125-C ( 9 ) page: 1438-1443   2005.9

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  456. The Sound Wave Ray-Space Reviewed

    M. P. Tehrani, Y. Hirano, T. Fujii, S. Kajita, K. Takeda, M. Tanimoto, K. Mase

  457. CIAIR In-Car Speech Corpus - Influence of the Driving Status (letter)

    Nobuo Kawaguchi, Shigeki Matubara, Kazuya Takeda, and Fumitada Itakura

    IEICE-D(E)   Vol. E88-D ( 3 ) page: 578-582   2005.3

    Language:English   Publishing type:Research paper (scientific journal)  

  458. AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition

    Satoshi Nakamura, Kazuya Takeda, Kazumasa Yamamoto, Takeshi Yamada, Shingo Kuroiwa, Norihide Kitaoka, Takanobu Nishiura, Akira Sasou, Mitsunori Mizumachi, Chiyomi Miyajima, Masakiyo Fujimoto, and Toshiki Endo

    IEICE-D(E)   Vol. E88-D ( 3 ) page: 384-390   2005.3

    Language:English   Publishing type:Research paper (scientific journal)  

  459. Analysis and Recognition of Whispered Speech

    Taisuke Ito, Kazuya Takeda, and Fumitada Itakura

    Speech Communication   Vol. 45 ( 2 ) page: 139-152   2005.2

    Language:English   Publishing type:Research paper (scientific journal)  

  460. Speaker verification using Gaussian mixture models within changing real car environments Reviewed

    Xianxian Zhang, John Hansen, Pongtep Angkititrakul, Kazuya Takeda

  461. Subjective and Objective Quality Assessment of Regression-enhanced Speech in Real Car Environments Reviewed

    Weifeng Li, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura

  462. Modeling of individualities in driving through spectral analysis of behavioral signals Reviewed

    Koji Ozawa, Toshihiro Wakita, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda

  463. Speech Enhancement based on SNR-Dependent Empirical Statistical Estimation in Log-Spectral Magnitude domain Reviewed

    Tran Huy Dat, Kazuya Takeda and Fumitada Itakura

  464. Improved Noise Estimation and Log-spectral Regression for In-car Speech Recognition

    Weifeng LI, Katunobu ITOU, Kazuya TAKEDA

  465. Evaluation of learning effects during long-term use of a spoken dialog interface


  466. Construction of CENSREC-3, an in-car isolated-word speech database recorded during real driving, and a common evaluation environment

    藤本雅清,中村 哲,武田一哉 黒岩 眞吾,山田 武志,北岡教英,山本一公,水町光徳,西浦敬信,佐宗晃

    Toshiyuki KIMURA, Kazuhiko KAKEHI, Kazuya TAKEDA, Fumitada ITAKURA

    Tran Huy DAT, Kazuya TAKEDA, Fumitada ITAKURA

    Kazuya TAKEDA, Tran Huy DAT, Hiroshi FUJIMURA, Fumitada ITAKURA

    Hirosi FUJIMURA, Chiyomi MIYAJIMA, Katsunobu ITOU, Kazuya TAKEDA, Fumitada ITAKURA

    Weifeng Li, Katunobu Itou, Kazuya Takeda, Fumitada Itakura

  472. Speech enhancement based on MAP-log spectral magnitude estimation using the gamma prior of the speech power

    Tran Huy DAT, Kazuya TAKEDA, Fumitada ITAKURA

  473. MAP and cumulative distribution function equalization methods for the speech spectral estimation with application in noise suppression filtering

    Tran Huy DAT, Kazuya TAKEDA, Fumitada ITAKURA

  474. A speech enhancement system based on data clustering and cumulative histogram equalization

    Tran Huy DAT, Kazuya TAKEDA, Fumitada ITAKURA

  475. Subjective effects of the number of channels in wave field synthesis: the case of a sound source near the front


    日本バーチャルリアリティ学会 論文誌   Vol. 10 ( 2 ) page: 257-266   2005.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  476. Environmental Warping for In-car Speech Recognition

    Weifeng LI, Katunobu ITOU, Kazuya TAKEDA, Fumitada ITAKURA

  477. Sound field auralization system in free listening positions

    Toshiyuki KIMURA, Wataru MIZUNO, Takanori NISHINO, Kazuya TAKEDA

  478. HRTF modeling using physical features

    Naoya INOUE, Takanori NISHINO, Katsunobu ITOU, Kazuya TAKEDA

  479. Measurement of head-related transfer functions in the proximal region

    Seiichiro HOSOE, Takanori NISHINO, Katunobu ITOU, Kazuya TAKEDA

  480. Evaluation of sound localization under condition of covered ears

    Madoka TAKIMOTO, Takanori NISHINO, Katunobu ITOU, Kazuya TAKEDA

  481. Modeling method of a room impulse response with cepstrum analysis

    Takanori NISHINO, Fuminori SAITO, Katunobu ITOU, Kazuya TAKEDA

  482. Data Collection and Evaluation of Speech Recognition for Motorbike Riders

    Hirosi TANAKA, Hirosi FUJIMURA, Chiyomi MIYAJIMA, Takanori NISHINO, Katunobu ITOU, Kazuya TAKEDA

  483. Discrimination between Singing and Speaking Voices

    Yasunori OHISHI, Masataka GOTO, Katunobu ITO, Kazuya TAKEDA

  484. Driver identification based on spectral analysis of driving behavioral signals

    Yoshihiro NISHIWAKI, Koji OZAWA, Toshihiro WAKITA, Chiyomi MIYAJIMA, Katunobu ITOU, Kazuya TAKEDA

  485. Parametric Versus Non-Parametric Models of Driving Behavior Signals for Driver Identification

    Toshihiro WAKITA, Koji OZAWA, Chiyomi MIYAJIMA, Kazuya TAKEDA

  486. Driver Identification Using Driving Behavior Signals

    Toshihiro WAKITA, Koji OZAWA, Chiyomi MIYAJIMA, Kei IGARASHI, Katunobu ITOU, Kazuya TAKEDA

  487. Driver identification using driving behavior

    脇田敏裕,小澤晃史,宮島千代美,五十嵐圭,伊藤克亘, 武田一哉

  488. Evaluation of HRTFs estimated using physical features

    Naoya INOUE, Toshiyuki KIMURA, Takanori NISHINO, Katsunobu ITOU, Kazuya TAKEDA

  489. The present status progress and usage of speech databases in Japan Reviewed

    Hisao Kuwabara, Shuichi Itahashi, Mikio Yamamoto, Satoshi Nakamura, Toshiyuki Takezawa, Kazuya Takeda

    Acoustical Science and Technology   Vol. 26 ( 1 ) page: 62-66   2005.1

    Language:English   Publishing type:Research paper (scientific journal)  

  490. Spatial coding of sound fields with moving sound sources: reducing the transmission volume by sound source extraction


    日本バーチャルリアリティ学会 論文誌   Vol. 10   page: 101-109   2005.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  491. Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones

    Weifeng LI, Tetsuya SHINDE, Hiroshi FUJIMURA, Chiyomi MIYAJIMA, Takanori NISHINO, Katunobu ITOU, Kazuya TAKEDA, Fumitada ITAKURA

    IEICE Trans.   Vol. E88-D ( 3 ) page: 834-390   2005.1

    Language:English   Publishing type:Research paper (scientific journal)  

  492. Construction and Evaluation of a Large In-Car Speech Corpus

    Kazuya TAKEDA, Hiroshi FUJIMURA, Katunobu ITOU, Nobuo KAWAGUCHI, Shigeki MATSUBARA, Fumitada ITAKURA

    IEICE Trans.   Vol. E88-D ( 3 ) page: 553-561   2005.1

    Language:English   Publishing type:Research paper (scientific journal)  

  493. Speech Recognition Using Finger Tapping Timings

    Hiromitsu BAN, Chiyomi MIYAJIMA, Katunobu ITOU, Kazuya TAKEDA, Fumitada ITAKURA

    IEICE Trans.   Vol. E88-D ( 3 ) page: 667-670   2005.1

    Language:English   Publishing type:Research paper (scientific journal)  

  494. Modeling of individuality in real recorded driving behavior signals using cepstral analysis (letter)


    情報科学技術レターズ   Vol. LL-007   page: 289-292   2005.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  495. Adaptive Log-spectral Regression for In-Car Speech Recognition using Multiple Distributed Microphones (letter)

    Weifeng Li, Kazuya Takeda, and Fumitada Itakura

    IEEE Signal Processing Letters   Vol. 12 ( 4 ) page: 340-343   2005.1

    Language:English   Publishing type:Research paper (scientific journal)  

  496. Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition

    Weifeng Li, Chiyomi Miyajima, Takanori Nishino, Katunobu Itou, Kazuya Takeda, and Fumitada Itakura

    IEICE-A(E)   Vol. E88-A ( 7 ) page: 1716   2005.1

    Language:English   Publishing type:Research paper (scientific journal)  

  497. Speech recognition using synchronization between speech and finger tapping Reviewed

    H. Bann, C.Miyajima, K.Itou, K.Takeda, F.Itakura

  498. Example-based Spoken Dialogue System with Online Example Augmentation Reviewed

    H.Murao, N. Kawaguchi, S. Matsubara, Y. Yamaguchi, K. Takeda and Y. Inagaki

  499. Optimizing Regression for in-car Speech Recognition using Multiple Distributed Microphones Reviewed

    W. Li, K. Takeda and F. Itakura

  500. Recent Progress of Open-Source LVCSR Engine Julius and Japanese Model Repository Reviewed

    T.Kawahara, A.Lee, K.Takeda, K.Itou and K.Shikano

  501. Audio-Visual Speaker Localization for Car Navigation Systems Reviewed

    X.Zhang, K.Takeda, J.Hansen and T.Maeno

  502. CIAIR In-Car Speech Database Reviewed

    N.Kawaguchi, S.Matsubara, Y.Yamaguchi, K.Takeda and F. Itakura

  503. Analysis of In-car speech recognition experiments using a large-scale multi-mode dialogue corpus Reviewed

    K.Fujimura, K.Itou, K.Takeda,F.Itakura

  504. AURORA-2J: Japanese speech data collection for performance evaluation of speech recognition in noise Reviewed

    Satoshi Nakamura, Kazumasa Yamamoto, Kazuya Takeda,Shingo Kuroiwa, Norihide Kitaoka,Takeshi Yamada,Mitsunori Mizumachi, Takanobu Nishiura, Masakiyo Fujimoto, Akira Saso, Toshiki Endo

  505. Dialogue characteristics in different communication modes Reviewed

    Katsunobu Itou, Kouji Fujimura, Nobuo Kawaguchi, Kazuya Takeda, and Fumitada Itakura

  506. In-Car Spoken Dialogue Corpus and Its Application Reviewed

    Nobuo Kawaguchi, Shigeki Matsubara, Hiroya Murao, Itsuki Kishida, Yuki Irie, Yukiko Yamaguchi, Kazuya Takeda and Fumitada Itakura

  507. Biometric Identification Using Driving Behavioral Signals Reviewed

    Kei Igarashi, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura and Huseyin Abut

  508. Robust SNR estimation of noisy speech based on Gaussian mixtures modeling on log-power domain Reviewed

    Tran Huy Dat, Kazuya Takeda, Fumitada Itakura

  509. An Advanced Japanese Speech Corpus for In-car Spoken Dialogue Research Reviewed

    Yuki Irie, Nobuo Kawaguchi, Shigeki Matsubara, Itsuki Kishida, Yukiko Yamaguchi, Kazuya Takeda, Fumitada Itakura, and Yasuyoshi Inagaki

  510. Method for determining sound localization by auditory masking

    Kazuhiro UCHIDA,Takanori NISHINO,Kazuya TAKEDA,Fumitada ITAKURA

    Acoustical Science and Technology   Vol. 25 ( 6 ) page: 480-482   2004.1

    Language:English   Publishing type:Research paper (scientific journal)  

  511. In-car speech recognition experiments using a large-scale multi-mode dialogue corpus

    Hiroshi FUJIMURA,Katsunobu ITOU,Kazuya TAKEDA,Fumitada ITAKURA

  512. Subjective assessments for the effect of the number of channel signals on the sound field reproduction used in wavefield synthesis

    Toshiyuki KIMURA,Kazuhiko KAKEHI,Kazuya TAKEDA,Fumitada ITAKURA

  513. Biometric Identification Using Driving Behavioral Signals

    Kei IGARASHI,Chiyomi MIYAJIMA,Katsunobu ITOU,Kazuya Takeda,Fumitada ITAKURA, Huseyin ABUT

  514. On the Evaluation of Room Sound Fields from the Viewpoint of Speech Information Processing


  515. Multimedia Corpus of In-Car Speech Communication

    Nobuo KAWAGUCHI,Kazuya TAKEDA,Fumitada ITAKURA

  516. Statistical Analysis of Hierarchical Word Relations Using an Encyclopedia Corpus


  517. A Study on Individuality Contained in Driving Behavior Signals


  518. Audio-Visual Speaker Localization for Car Navigation Systems

    Xianxian ZHANG,Kazuya TAKEDA,John H. L. HANSEN,Toshiki MAENO

  519. Analysis of In-car speech recognition experiments using a larger-scale multi-mode dialogue corpus

    Hiroshi FUJIMURA,Katsunobu ITOU,Kazuya TAKEDA,Fumitada ITAKURA

    Tatsuya KAWAHARA,Akinobu LEE,Kazuya TAKEDA,Katsunobu ITOU,Kiyohiro SHIKANO

  521. Example-based Spoken Dialogue System with Online Example Augmentation


  522. Optimizing Regression for in-car Speech Recognition using Multiple Distributed Microphones

    Weifeng LI,Kazuya TAKEDA,Fumitada ITAKURA

  523. Speech enhancement based on magnitude estimation using the Gamma prior

    Tran Huy DAT,Weifeng LI,Kazuya TAKEDA,Fumitada ITAKURA

    Hiromitu BAN,Chiyomi MIYAJIMA,Katsunobu ITOU,Kazuya TAKEDA,Fumitada ITAKURA

  525. CIAIR In-Car Speech Database


  526. Music Retrieval System Using Spoken Dialogue


  527. Measurement of Head-Related Transfer Functions Using a Spark Sound Source


  528. Speech Recognition in Automobiles


    情報処理   Vol. 45 ( 10 ) page: 1038-1043   2004.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  529. Measurement of the Head Related Transfer Function using the Spark Noise

    Takanori NISHINO,Seiichiro HOSOE,Kazuya TAKEDA,Fumitada ITAKURA

  530. DSP for In-Vehicle and Mobile Systems

    Huseyin ABUT,John HANSEN,Kazuya TAKEDA(eds.)

  531. Data Collection and Evaluation of AURORA-2 Japanese Corpus Reviewed

    Satoshi Nakamura, Kazumasa Yamamoto, Kazuya Takeda, Shingo Kuroiwa, Norihide Kitaoka, Takeshi Yamada, Mitsunori Mizumachi, Takanobu Nishiura, Masakiyo Fujimoto, Akira Saso, Toshiki Endo

    Hideki Banno, Tetsuya Shinde, Kazuya Takeda and Fumitada Itakura

  533. Construction and analysis of the multi-layered in-car spoken dialogue corpus Reviewed

    Nobuo Kawaguchi, Shigeki Matsubara, Itsuki Kishida, Yuki Irie, Yukiko Yamaguchi, Kazuya Takeda, Fumitada Itakura

  534. In-car speech recognition using distributed microphones Reviewed

    Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura

  535. Is our driving behavior unique? Reviewed

    Kei Igarashi, Kazuya Takeda, Fumitada Itakura, Huseyin Abut

  536. A Study on Domain Recognition of Spoken Dialogue Systems Reviewed

    T. Isobe, S. Hayakawa, H. Murao, T. Mizutani, K. Takeda, F. Itakura

  537. Experiments on Recognition of Lavalier Microphone Speech and Whispered Speech in Real World Environments Reviewed

    K. Tatara, T. Ito, P. Zolfaghari, K. Takeda & F. Itakura

  538. Integration of Noise Reduction Algorithms for Aurora2 Task Reviewed

    Takeshi Yamada, Jiro Okada, Kazuya Takeda, Norihide Kitaoka, Masakiyo Fujimoto, Shingo Kuroiwa, Kazumasa Yamamoto, Takanobu Nishiura, Mitsunori Mizumachi, Satoshi Nakamura

  539. Acoustical analysis and Recognition of Whispered Speech Reviewed

    Taisuke Itoh, Kazuya Takeda and Fumitada Itakura

    Yoshihide Ban, Hideki Banno, Kazuya Takeda and Fumitada Itakura

  541. Recognition of Consonant-Vowel (CV) Units of Speech in a Broadcast News Corpus Using Support Vector Machines Reviewed

    C.Chandra Sekhar, Kazuya Takeda and Fumitada Itakura

  542. Acoustical analysis and Recognition of Whispered Speech Reviewed

    Taisuke Itoh, Kazuya Takeda and Fumitada Itakura

  543. Robust speech recognition based on selective use of missing frequency band HMMs Reviewed

    Y. Kawamura, K. Takeda, F. Itakura

  544. Multimedia data collection of in-car speech communication Reviewed

    N.Kawaguchi, N.Matsubara, K.Takeda, F.Itakura, Y.Inagaki

  545. Continuous speech recognition without end-point detection Reviewed

    O.Segawa, K. Takeda, F. Itakura

  546. A Study on perceptual distance measure for phase spectrum of stimuli Reviewed

    H. Banno, K. Takeda, F. Itakura

  547. Blind source separation combining frequency-domain ICA and beam forming Reviewed

    H. Saruwatari, K. Takeda, K.Shikano

  548. Direction of arrival estimation based on nonlinear microphone array Reviewed

    H. Saruwatari, H.Kamiyanagida, K. Takeda, F. Itakura, K.Shikano

  549. Close-class-set discrimination method for recognition of stop-consonant-vowel utterances using Support Vector Machines Reviewed

  550. Interpolation of Head-Related Transfer Functions in the Horizontal and Elevation Directions Reviewed


    日本音響学会誌   Vol. - ( - ) page: -   2001.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  551. Robust speech recognition based on selective use of missing frequency band HMMs

    Y. Kawamura, K. Takeda, F. Itakura

  552. Japanese Dictation Toolkit (FY1999 version) Reviewed

    河原達也,李晃伸,小林哲則,武田一哉,峯松信明,嵯峨山茂樹,伊藤克亘, 伊藤彰則,山本幹男,山田篤,宇津呂武仁,鹿野清宏

  553. Direction of Arrival Estimation Using Nonlinear Microphone Array Reviewed

    H.Kamiyanagida, H. Saruwatari, K. Takeda, F. Itakura, and K. Shikano

    IEICE Trans. Fundamentals   Vol. E84-A ( 4 ) page: 000   2001.1

  554. Development of a Speech Input System for Web-Based Courseware Reviewed


    情報処理学会論文誌   Vol. 23 ( 3 ) page: 605-613   2001.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  555. Development of an In-Car Spoken Dialogue Collection System Reviewed


    電子情報通信学会論文誌,DII, Vol. J84.DII, No.6, pp.909-916   Vol. J84-DII ( 6 ) page: 903-916   2001.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  556. The effect of group delay spectrum on timbre Reviewed

    Hideki Banno, Kazuya Takeda and Fumitada Itakura

    Acoustical Science and Technology   Vol. - ( - ) page: -   2001.1

    Language:English   Publishing type:Research paper (scientific journal)  

  557. Blind Source Separation based on Subband ICA and Beamforming Reviewed

    H.Saruwatari, S.Kurita, K.Takeda, F.Itakura and K.Shikano

  558. Construction of Speech Corpus in Moving Car Environment Reviewed

    N.Kawaguchi, S.Matsubara, H.Iwa, S.Kajita, K.Takeda, F.Itakura and Y.Inagaki

  559. Vector Space Representation of Language Probabilities through SVD of N-gram Matrix Reviewed

    S.Terashima, K.Takeda and F.Itakura

  560. Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition Reviewed

    T.Kawahara, A.Lee, T.Kobayashi, K.Takeda et al.

  561. Evaluation of Blind Signal Separation Method Using Directivity Pattern Under Reverberant Conditions Reviewed

    S.Kurita, H. Saruwatari, S. Kajita, K. Takeda, F. Itakura

  562. Speech Enhancement Using Nonlinear Microphone Array with Noise Adaptive Complementary Beamforming Reviewed

    H. Saruwatari, S. Kajita, K. Takeda, F. Itakura

  563. Speech Recognition Based on Space Diversity Using Distributed Multi-Microphone Reviewed

    Y. Shimizu, S. Kajita, K. Takeda, F. Itakura

  564. A New Phonetic Tied-Mixture Model for Efficient Decoding Reviewed

    A. Lee, T. Kawahara, K. Takeda, K. Shikano

  566. Estimation of Head-Related Transfer Characteristics Based on Multiple Regression Analysis Reviewed


    電子情報通信学会論文誌   Vol. J84-A ( 3 ) page: 260-268   2000.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  567. Ministry of Education COE Program: Center for Integrated Acoustic Information Research


    日本音響学会誌   Vol. 56 ( 11 ) page: 000   2000.1

  568. How Can Data Sharing Be Made Successful?


    情報処理   Vol. 41 ( 7 ) page: 781-786   2000.7

  569. Multimedia Information Processing (Review Article) (Special Issue: Annual Report on Image Information Media) Reviewed


  570. How Can Data Sharing Be Made Successful? (Review Article) (Special Issue: The State of Natural Language Processing: Collecting and Using Example Sentences) Reviewed


  571. Japanese Dictation Toolkit (FY1998 version) Reviewed

    河原達也 李晃伸 小林哲則 武田一哉 峯松信明 伊藤克亘 伊藤彰則 山本幹男 山田篤 宇津呂武仁 鹿野清宏

    日本音響学会誌(技術報告)   Vol. 56 ( 4 ) page: 255-259   2000.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  572. Japanese Dictation Toolkit: 1998 Version

    Kazuya Takeda

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  573. Large Vocabulary Continuous Speech Recognition Using Phonetic Tied-Mixture Models Reviewed


    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  


  574. Data Hiding in Audio Signals Using Band Elimination Reviewed


    Language:Japanese   Publishing type:Research paper (scientific journal)  

  575. Quality Improvement of the STRAIGHT Analysis-Synthesis System in Noisy Environments Using Lateral Inhibitive Weighting Reviewed


    電子情報通信学会論文誌   Vol. J83-DII ( 11 ) page: 2180-2189   2000.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  576. Space-Diversity Speech Recognition Considering Spatial Acoustic Characteristics Reviewed


    電子情報通信学会論文誌   Vol. J83-DII ( 11 ) page: 2448-2456   2000.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  577. Linear Space Representation of N-gram Probabilities Using SVD Reviewed


    電子情報通信学会論文誌   Vol. J83-DII ( 11 ) page: 2388-2396   2000.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  578. A Waveform Measure For Predicting Recognition Performance Degradation Reviewed

    M.Kondo, K.Takeda and F.Itakura

  579. Japanese Dictation Toolkit: Plug-and-play Framework For Speech Recognition R&D Reviewed

    T.Kawahara, T.Kobayashi, K.Takeda, N.Minematsu, K.Itou, M.Yamamoto, A.Yamada, T.Utsuro, K.Shikano

  580. A Speaker Verification Method with a Pre-specified False Acceptance Rate Reviewed

    早川昭二 武田一哉 板倉文忠

    電子情報通信学会論文誌   Vol. J82-DII ( 12 ) page: 2212-2220   1999.12

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  581. Speech enhancement using nonlinear microphone array under nonstationary noise conditions Reviewed

    H.Saruwatari, S.Kajita,K.Takeda and F.Itakura

  582. Voice conversion through nonlinear warping of STRAIGHT spectrum Reviewed

    N.Maeda, H.Banno, S.Kajita, K.Takeda and F.Itakura

  583. JNAS:Japanese speech corpus for large vocabulary continuous speech recognition research Reviewed

    Katunobu ITO,Mikio YAMAMOTO,Kazuya TAKEDA,Tosiyuki TAKEZAWA,Tatsuo MATSUOKA,Tetsunori KOBAYASHI,Kiyohiro SHIKANO and Shuichi ITAHASHI

    Journal of the Acoustical Society of Japan (E)   Vol. 20 ( 3 ) page: 199-206   1999.5

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  584. Compensating of Room Acoustic Transfer Functions Affected by Change of Room Temperature Reviewed

    M.Omura, M.Yada, H.Saruwatari, S.Kajita, K.Takeda and F.Itakura

  585. Speech Enhancement Using Nonlinear Microphone Array with Complementary Beamforming Reviewed

    H.Saruwatari, S.Kajita, K.Takeda and F.Itakura

  586. Speech Recognition Based on Matching with Segment Reference Patterns Prepared for Each Length Reviewed


    電子情報通信学会論文誌, (研究速報)   Vol. J82-DII ( 2 ) page: 308-311   1999.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  587. Japanese speech corpus for large vocabulary continuous speech recognition research

    Kazuya Takeda

    The Journal of the Acoustical Society of Japan   Vol. 20 ( 3 ) page: 199-206   1999.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  588. Speech Enhancement Using Nonlinear Microphone Array Based on Complementary Beamforming Reviewed

    H. Saruwatari, S. Kajita, K. Takeda, and F. Itakura

    IEICE Trans. Fundamentals   Vol. E82-A ( 8 ) page: 000   1999.1

  589. A Speaker Verification Method with a Pre-specified False Acceptance Rate Reviewed


    電子情報通信学会論文誌   Vol. J82-DII ( 12 ) page: 2212-2220   1999.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  590. Japanese Dictation Toolkit (FY1997 version) Reviewed

    河原達也,李晃伸,小林哲則,武田一哉,峯松信明,伊藤克亘, 伊藤彰則,山本幹男,山田篤,宇津呂武仁,鹿野清宏

    日本音響学会誌 (技術報告)   Vol. 20 ( 3 ) page: 233-239   1999.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  591. Interpolation of Head-Related Transfer Functions on the Horizontal Plane Reviewed


    日本音響学会誌   Vol. 55 ( 2 ) page: 91-99   1999.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  592. Estimating entropy of a language from optimal word insertion penalty Reviewed

    K.Takeda, A.Ogawa and F.Itakura

  593. The Design of the Newspaper-Based Japanese Large Vocabulary Continuous Speech Recognition corpus Reviewed

    K.Itou, M.Yamamoto, K.Takeda, K.Takezawa, T.Matsuoka, K.Kobayashi, K.Shikano

  594. Sharable Software Repository for Japanese Large Vocabulary Continuous Speech Recognition Reviewed

    T.Kawahara, T.Kobayashi, K.Takeda, N.Minematsu, K.Itou, M.Yamamoto, A.Yamada, T.Utsuro, K.Shikano

  595. Balancing Acoustic and Linguistic Probabilities Reviewed

    A.Ogawa, K.Takeda and F.Itakura

  596. Spectral Weighting of SBCOR for Noise Robust Speech Recognition. Reviewed

    S.Kajita, K.Takeda and F.Itakura

  597. Design and development of Japanese speech corpus for large vocabulary continuous speech recognition. Reviewed

    K.Itou, K.Takeda, T.Takezawa, T.Matsuoka, K.Shikano, T.Kobayashi and S.Itahashi

  598. Common platform of Japanese large vocabulary continuous speech recognizer assessment -proposal and initial results -. Reviewed

    T.Kawahara, A.Lee, T.Kobayashi, K.Takeda, N.Minematsu, K.Itou, A.Itou, M.Yamamoto, A.Yamada, T.Utsuro and K.Shikano

  599. Speech Morphing by Independent Manipulation of Envelope and Source Reviewed


    電子情報通信学会論文誌   Vol. J81-DII ( 9 ) page: 1360-1367   1998.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  600. The 3rd European Conference on Speech (Conference Report) Reviewed


  601. Blind Signal Separation for Recognizing Overlapped Speech Reviewed

    T.Taniguti, S.Kajita, K.Takeda, and F.Itakura

    The Journal of the Acoustical Society of Japan   Vol. 19 ( 6 ) page: 385-390   1998.1

    Language:English   Publishing type:Research paper (scientific journal)  

  602. Noise Robust Speech Recognition using Sub-band Cross-Correlation Analysis Reviewed

    Shoji Kajita, Kazuya Takeda, and Fumitada Itakura

    IEICE Transactions on Information and Systems.   Vol. E81-D ( 10 ) page: 1079-1086   1998.1

    Language:English   Publishing type:Research paper (scientific journal)  

  603. A Method for Correcting Language Probabilities Based on Generalized Bernoulli Trials Reviewed


    電子情報通信学会論文誌   Vol. J81-DII ( 12 ) page: 2703-2711   1998.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  604. Subband Autocorrelation Analysis Using Correlations at Integer Multiples of the Reciprocal of the Center Frequency Reviewed


    日本音響学会誌   Vol. 54 ( 2 ) page: 111-118   1998.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  605. Balancing stochastic knowledge on acoustics and linguistics. Reviewed

    K.Takeda, A.Ogawa and F.Itakura

    SSS'97     page: 133-138   1997.12

    Authorship:Lead author   Language:English  

  606. Language Modeling for Robust Balancing of Acoustic and Linguistic Probabilities Reviewed

    A.Ogawa, K.Takeda and F.Itakura

  607. Applying Blind Signal Separation to the Recognition of Overlapped Speech Reviewed

    T.Taniguchi, S.Kajita, K.Takeda and F.Itakura

  608. Speech Morphing by Progressive Interpolation of Spectra Reviewed

    H.Banno, S.Kajita, K.Takeda, K.Shikano, F.Itakura

  609. A Language Model Based on Generalized Bernoulli Trials Reviewed

    A.Ogawa, K.Takeda, F.Itakura

  610. Speaker Identification Using Harmonic Structure of LP-residual Spectrum Reviewed

    S.Hayakawa, K.Takeda, F.Itakura

  611. A Binaural Speech Processing Method Using Subband-Crosscorrelation Analysis for Noise Robust Recognition Reviewed

    S.Kajita, K.Takeda, F.Itakura

  612. A Study of Recognition Methods for Lombard Speech under Various Stationary Noise Conditions Reviewed

    若尾淳,武田一哉, 板倉文忠

    電子情報通信学会論文誌   Vol. J80-DII ( 7 ) page: 1643-1650   1997.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  613. Improving Speech Recognition Performance by Sound Source Separation Reviewed


  614. A Study of Speech Features Contained in Human Speech-like Noise Reviewed


    日本音響学会誌   Vol. 53 ( 5 ) page: 337-345   1997.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  615. A Study of Training Methods for Speaker Models Using Multiple Speaking Styles Reviewed


    電子情報通信学会論文誌DII   Vol. J80-DII ( 1 ) page: 10-17   1997.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  616. A Speech Detection Method Using Likelihood of Partial Sentence Hypothesis for Continuous Speech Recognition

    The Transactions of the Institute of Electronics, Information and Communication Engineers DII   Vol. J80-DII ( 11 ) page: 2895-2903   1997.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  617. Speaker Recognition Using Individuality Information Contained in the Harmonic Structure of the Linear Prediction Residual Spectrum Reviewed

    早川昭二,武田一哉, 板倉文忠

    電子情報通信学会論文誌   Vol. J80-A ( 9 ) page: 1360-1650   1997.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  618. Feature Space Reduction through Nonlinear Identity Mapping Reviewed

    H.Ito, S.Kajita, K.Takeda, F.Itakura

  619. On the Use of Zero-Crossing Analysis for Multi-Channel Signal Processing Reviewed

    T.Sugihara, S.Kajita, K.Takeda, F.Itakura

  620. Interpolating HRTF for Auditory Virtual Reality Reviewed

    T.Nishino, S.Mase, S.Kajita, K.Takeda, F.Itakura

  621. On the Problems in Applying Bell's Blind Separation to Real Environments Reviewed

    T.Taniguchi, H.Yehia, S.Kajita, K.Takeda, F.Itakura

  622. Variability of Lombard Effects under Different Noise Conditions Reviewed

    A.Wakao, K.Takeda, F.Itakura

  623. Extracting Speech Features from Human Speech-like Noise Reviewed

    D.Kobayashi, S.Kajita, K.Takeda, F.Itakura

  624. Subband-Crosscorrelation Analysis for Robust Speech Recognition Reviewed

    S.Kajita, K.Takeda, F.Itakura

  625. A Read Newspaper Article Speech Corpus for Large Vocabulary Continuous Speech Recognition


  626. An Acoustically Oriented Vocal-Tract Model Reviewed

    Hani C. Yehia, Kazuya Takeda and Fumitada Itakura

    IEICE Transactions on Information and Systems.   Vol. E79-D ( 8 ) page: 1198-1208   1996.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  627. A Prototype Large-Scale Telephone Extension Reception System Using N-Best Meaning Search and Rescoring Reviewed


    電子情報通信学会論文誌   Vol. J79-DII ( 12 ) page: 2132-2138   1996.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  628. A Prototype of a Realtime Japanese-Korean Speech Translation System Reviewed

    M.Suzuki, N.Inoue, K.Takeda, F.Yato, S.Yamamoto

  629. Voice-Activated Telephone Extension System for 5,000 Branch Phones Reviewed

    S.Yamamoto, K.Takeda, M.Naito, S.Kuroiwa

  630. Intelligent Network Services using Speech Recognition and Its Field Trials Reviewed

    S.Yamamoto, K.Takeda, N.Inoue, S.Kuroiwa

  631. Top-down Speech Detection and NBest Meaning Search in a Voice Activated Telephone Extension System Reviewed

    K.Takeda, S.Kuroiwa, M.Naito, S.Yamamoto

  632. Error Analysis of Field Trial Result of a Spoken Dialogue System for Telecommunications Reviewed

    Shingo Kuroiwa, Masaki Naito, Naomi Inoue, Seiichi Yamamoto

    IEICE Transactions on Information and Systems.   Vol. E78-D ( 6 ) page: 636-641   1995.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  633. A Study of the Relationship Between Differences in Speaking Style and Speaker Identification Reviewed

    後藤雅彦, 武田一哉, 板倉文忠

  634. Dimensionality Reduction of Speech Spectra Using an Hourglass-Type Neural Network Reviewed

    伊藤博紀, 武田一哉, 板倉文忠

  635. Optimal Choice of Focusing Matrices for Microphone Array Processing Reviewed

    Zheng Liu, Kazuya Takeda and Fumitada Itakura

  636. SBCOR Analysis Based on Cross-Correlation Between Two-Channel Signals Reviewed

    梶田将司, 武田一哉, 板倉文忠

  637. A Study of the Effectiveness of the Blind Separation Method Reviewed

    谷口友彦, Hani Yehia, 武田一哉, 板倉文忠

  638. Analysis of Human Speech-like Noise Reviewed

    小林大祐, 梶田将司, 武田一哉, 板倉文忠

  639. Sentence-Level Discriminative Training of HMMs Using Matrix-Based Trellis Computation Reviewed

    村上哲範, 武田一哉, 河井恒, 山本誠一

  640. Feature Extraction from Speech Signals Using Level-Crossing Time Information Reviewed

    杉原貴明, 梶田将司, 武田一哉, 板倉文忠

  641. Current Status and Challenges of Speech Research Reviewed


  642. Analysis of Speech Features in Human Speech-like Noise Reviewed

    小林大祐, 梶田将司, 武田一哉, 板倉文忠

  643. Text-Independent Speaker Recognition Using the High-Frequency Band of Speech Reviewed

    早川昭二, 武田一哉, 板倉文忠

  644. A vocal-tract area function trajectory representation oriented to the speech production inverse problem Reviewed

    Hani Yehia, Kazuya TAKEDA, Fumitada ITAKURA

  645. On the Robustness of Speaker Identification Systems to Variations in Speaking Style Reviewed

    後藤雅彦, 武田一哉, 板倉文忠

  646. Normalization of Lombard Speech Uttered under Different Noise Environments Reviewed

    若尾淳, 武田一哉, 板倉文忠

  647. A Voice-Activated Telephone Exchange System and Its Field Trial Reviewed

    S.Yamamoto, K.Takeda, N.Inoue, S.Kuroiwa, M.Naito

  648. A Trellis Based Implementation of Discriminative Training Reviewed

    T.Murakami, S.Kuroiwa, K.Takeda, S.Yamamoto

  649. Prototype and Evaluation of a Telephone Extension Reception System Based on Continuous Recognition of Telephone Speech


    電子情報通信学会誌   Vol. J77-A ( 2 ) page: 223-231   1994.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  650. Speaker verification using Gaussian mixture models within changing real car environments

    Xianxian Zhang, John Hansen, Pongtep Angkititrakul, Kazuya Takeda

  651. Prototype and Evaluation of a Telephone Extension Reception System Based on Continuous Speech Recognition (Co-authored) Reviewed

    黒岩眞吾 武田一哉 井ノ上直己 野垣内出 山本誠一他

    電子情報通信学会論文誌   Vol. J77A ( 2 ) page: 223-231   1994.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  652. A Method for Handling Extraneous Words in Spontaneous Utterances Using Garbage HMMs (Co-authored) Reviewed

    井ノ上直己, 武田一哉, 山本誠一

    電子情報通信学会論文誌   Vol. J77A ( 2 ) page: 215-222   1994.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  653. A Real Time Isolated Word Recognizer for Telephone Input

    Kazuya Takeda

    Journal of the Acoustical Society of Japan (E)   Vol. 15   page: 87-96   1994.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  654. Online Collection of Spontaneous Speech Using a Voice Activated Telephone Exchanger. Reviewed

    S.Kuroiwa, K.Takeda, N.Inoue, I.Nogaito, S.Yamamoto

  655. Improving Robustness of Network Grammar by Using Class HMM Reviewed

    K.Takeda, N.Inoue, S.Kuroiwa, T.Konuma, S.Yamamoto

    European Conference on Speech Communication Technology, Berlin   Vol. 3   page: 1623-1626   1993.10

    Authorship:Lead author   Language:English  

  656. A Voice Activated Extension Telephone Exchange System Reviewed

    S.Kuroiwa, K.Takeda, N.Inoue, I.Nogaito, S.Yamamoto

  657. A Continuous Speech Recognition Method for a Telephone Extension Connection Reception System Reviewed

    武田一哉, 黒岩眞吾, 井ノ上直己, 山本誠一

    KDD R&D 150     page: 39-46   1993.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  658. Control of Vowel Duration Using Linguistic Information (Co-authored) Reviewed


    電子情報通信学会論文誌   Vol. J75(A) ( 3 ) page: 467-473   1993.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  659. Implementation and Evaluation of an Extension Number Guidance System Utilizing Telephone Dialogue Reviewed


  660. Architecture and Algorithms of a Real-time Word Recognizer for Telephone Input Reviewed

    S.Kuroiwa, K.Takeda, F.Yato, S.Yamamoto, K.Owa, M.Shouzakai, R.Matsumoto

  661. The Control of Segmental Duration in Speech Synthesis Using Linguistic Properties Reviewed

    N.Kaiki, K.Takeda, Y.Sagisaka

  662. A Large-Scale Japanese Speech Database Reviewed

    Y.Sagisaka, M.Abe, K.Takeda, S.Katagiri, T.Umeda, H.Kuwabara

  663. On the Unit Search Criteria and Algorithms for Speech Synthesis Using Non-uniform Units Reviewed

    K.Takeda, K.Abe, Y.Sagisaka

  664. Statistical Analysis for Segmental Duration Rules in Japanese Speech Synthesis Reviewed

    N.Kaiki, K.Takeda, Y.Sagisaka

  665. On Unit Selection Algorithms and Their Evaluation in Non-uniform Unit Speech Synthesis Reviewed

    K.Takeda, K.Abe, Y.Sagisaka

  666. ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis

    Kazuya Takeda

    Speech Communication   Vol. 9 ( 4 ) page: 357-363   1990.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  667. Speech Synthesis by Rule Using Selectively Chosen Synthesis Units (Co-authored) Reviewed


    電子情報通信学会論文誌   Vol. J73DII ( 12 ) page: 1945-1950   1990.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  668. Adaptive Manipulation of Non-uniform Synthesis Units using Multi-level Unit Transcriptions Reviewed

    K.Takeda, K.Abe, Y.Sagisaka, H.Kuwabara

    Authorship:Lead author   Language:English  

  669. On Sentence-level Factors Governing Segmental Duration in Japanese Reviewed

    K.Takeda, Y.Sagisaka, H.Kuwabara

    Journal of the Acoustical Society of America   Vol. 86 ( 6 ) page: 2081-2096   1989.1

    Authorship:Lead author   Language:English   Publishing type:Research paper (scientific journal)  

  670. Construction of a Large-scale Japanese Speech Database and Its Management System Reviewed

    H.Kuwabara, K.Takeda, Y.Sagisaka, S.Katagiri, S.Morikawa, T.Watanabe

  671. Construction of a Japanese Speech Database for Research (Co-authored) Reviewed

    武田一哉 匂酒芳典 片桐滋 桑原尚夫

    日本音響学会誌   Vol. 44 ( 10 ) page: 747-754   1988.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  672. Acoustic-Phonetic Labels in a Japanese Speech Database Reviewed

    K.Takeda, Y.Sagisaka, S.Katagiri

    European Conference on Speech Technology, September 1987, Edinburgh   Vol. 2   page: 13-16   1987.9

    Authorship:Lead author   Language:English  

  673. A Speech Detection Method Using Likelihood of Partial Sentence Hypothesis for Continuous Speech Recognition Reviewed

    内藤正樹,黒岩眞吾, 山本 誠一, 武田一哉

    電子情報通信学会論文誌   Vol. J80-DII ( 11 ) page: 2895-2903   1997.1

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  674. A Study of Urban Residents' Responses to the Sound Environment of Their Dwellings Reviewed


    日本音響学会誌   Vol. 42 ( 10 ) page: 768-773   1986.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  675. On Areas Facing Roads Reviewed


    騒音制御   Vol. 10 ( 1 ) page: 40-43   1986.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  676. A Study on the Analysis of Residential Noise Exposure Patterns and Scales of Resident Response Reviewed


    日本音響学会誌   Vol. 41 ( 12 ) page: 870-876   1985.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  677. Study on Noise Environment of Residence in Urban Area Reviewed

    K.Kuno, D.Zheng, K.Takeda, K.Ikegaya and Y.Mishina

  678. A Study on Environmental Noise Exposure of Dwellings in the Nagoya City Area Reviewed


    日本音響学会誌   Vol. 40 ( 6 ) page: 388-396   1984.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

  679. An Analysis of Noise Exposure of Urban Dwellings Reviewed


    日本音響学会誌   Vol. 40 ( 8 ) page: 546-553   1984.1

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)  

Books 25

  1. Towards Human-Vehicle Harmonization International journal

    Huseyin Abut, Gerhard Schmidt, Kazuya Takeda, Jacob Lambert and John H.L. Hansen( Role: Joint author)

    De Gruyter  2023.3  ( ISBN:9783110994346 )

    Total pages:253   Language:English Book type:Scholarly book

    DOI: doi.org/10.1515/9783110981223-toc

  2. Frontiers of Digital Transformation: Applications of the Real-World Data Circulation Paradigm International journal

    Kazuya Takeda, Ichiro Ide, Victor Muhandiki( Role: Edit)

    Springer  2021.6 


  3. Autonomous Driving

    二宮, 芳樹, 武田, 一哉

    コロナ社  2021.1  ( ISBN:9784339027754 )

    Total pages:xi, 273p   Language:Japanese

    CiNii Books

  4. Vehicles, Drivers, and Safety International journal

    John Hansen, Kazuya Takeda, Gerhard Schmidt and Huseyin Abut( Role: Joint author)

    De Gruyter  2020 

    Language:English Book type:Scholarly book

    DOI: https://doi.org/10.1515/9783110669787

  5. Integrated modeling of driver gaze and vehicle operation behavior during lane changes in

    Chiyomi Miyajima, Masataka Mori, Takatsugu Hirayama, Norihide Kitaoka, and Kazuya Takeda( Role: Joint author)

    De Gruyter  2017.9 

    Language:English Book type:General book, introductory book for general audience

    De Gruyter,

  6. Vehicle Systems and Driver Modeling

    Abut, Huseyin, Hansen, John, Schmidt, Gerhard, Takeda, Kazuya, Ko, Hanseok( Role: Joint editor)

    De Gruyter  2017.9 


  7. Behavior Information Processing: Toward Symbiosis with Autonomous Driving Systems (Kyoritsu Smart Selection)

    武田 一哉, 土井 美和子( Role: Sole author)

    共立出版  2016.1 

  8. Human Harmonized Information Technology, Volume 1 Vertical Impact

    Kazuya TAKEDA( Role: Joint author, Chapter 3, Modeling and Detecting Excessive Trust from Behavior Signals: Overview of Research Project and Results)

    Springer  2016 

  9. Content-based driving scene retrieval using driving behavior and environmental driving Signals

    Yiyang Li, Ryo Nakagawa, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda( Role: Joint author)

    Springer Science-Business  2013 


  10. Digital Signal Processing for In-Vehicle Systems and Safety

    John H.L.Hansen, Pinar Boyraz, Huseyin Abut, Kazuya Takeda( Role: Joint author)

    Springer  2012.2 


  11. A stochastic approach for modeling lane-change trajectories

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda( Role: Joint author)

    Springer, Digital Signal Processing for In-Vehicle Systems and Safety  2012.2 


  12. Selective listening point audio based on blind signal separation and 3D audio effect

    Takanori Nishino, Motoki Ogasawara, Kenta Niwa, and Kazuya Takeda( Role: Joint author)

    World Scientific  2012.2  ( ISBN:13 978-981-43


  13. Principles and applications of spatial hearing Analysis of measured head-related transfer functions based on spatio-temporal frequency characteristics

    Yasuko Morimoto, Takanori Nishino, and Kazuya Takeda( Role: Joint author)

    World Scientific  2012.2  ( ISBN:13 978-981-4313


  14. Use of on-road data in evaluating driver performance metrics

    Lucas Malta, Akira Ozaki, Chiyomi Miyajima, and Kazuya Takeda( Role: Joint author)

    SAE International  2012.1 



  15. An analysis of the speech under stress using the two-mass vocal fold model

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda( Role: Joint author)

    Proceedings of the Paralinguistic Information and Its Integration in Spoken Dialogue Systems Workshop, Ramon Lopez-Cozar Delgado, Tetsunori Kobayashi eds  2011.9  ( ISBN:978-1-4614-133



  16. On-line detection of task incompletion for spoken dialog systems using utterance and behavior tag N-gram vectors

    Sunao Hara, Norihide Kitaoka, Kazuya Takeda( Role: Joint author)

    Proceedings of the Paralinguistic Information and Its Integration in Spoken Dialogue Systems Workshop, Ramon Lopez-Cozar Delgado, Tetsunori Kobayashi eds  2011.9  ( ISBN:978-1-4614-13



  17. Music Recommendation System Based on Human-to-human Conversation Recognition

    Hiromasa Ohashi, Sunao Hara, Norihide Kitaoka, Kazuya Takeda( Role: Joint author)

    Workshop Proceedings of the 7th International Conference on Intelligent Environments: Ambient Intelligence and Smart Environments vol.10,   2011.1  ( ISBN:978-1-60750-7



  18. Effective Multiple Regression for Robust Single and Multi-channel Speech Recognition

    Weifeng Li, Kazuya Takeda, Fumitada Itakura( Role: Joint author)

    Bentham Science Publishers, Recent Advances in Robust Speech Recognition Technology, J. Ramirez and J.M. Gorriz eds  2011.1  ( ISBN:eISBN: 978-1-608



  19. 確率と確率過程

    武田一哉,村瀬洋,中野良平,速水悟,中川聖一,菅谷保之( Role: Joint author)

    オーム社  2010.10 




  20. Driver Identification Based on Spectral Analysis of Driving Behavioral Signals, Advances for In-Vehicle and Mobile Systems, Challenges

    Yoshihiro Nishiwaki, Koji Ozawa, Toshihiro Wakita, Chiyomi Miyajima, Katsunobu Itou, and Kazuya Takeda( Role: Joint author)

    Springer  2007.4 



  21. Advances for In-Vehicle and Mobile Systems -Challenges for International Standards -

    Huseyin Abut, John H.L. Hansen, Kazuya Takeda (Eds)( Role: Joint author)

    Springer publisher  2007.1 



  22. 音響エレクトロニクス(基礎と応用)

    大賀寿郎,鎌倉友男,斉藤繁実,武田一哉( Role: Joint author)

    培風館  2005.5 



  23. DSP in Mobile and Vehicular Systems

    Huseyin Abut, John Hansen and Kazuya Takeda (Eds.)( Role: Joint author)

    Springer Publisher  2004.1 



  24. 音声情報処理

    春日正男,船田哲男,林伸二,武田一哉( Role: Joint author)

    コロナ社  2001.1 




  25. IT-Text 音声認識システム

    鹿野清宏,河原達也,伊藤克亘,武田一哉,山本幹男( Role: Joint author)

    オーム社  2001.1 






  1. An Evaluation of Speech Waveform Modification Methods towards Improvement of Speech Intelligibility in Noisy Environment

    武山 知弘, 小林 和弘, 戸田 智基, 武田 一哉

    電子情報通信学会技術研究報告 = IEICE technical report : 信学技報   Vol. 117 ( 368 ) page: 11 - 16   2017.12


    Language:Japanese   Publisher:電子情報通信学会  

  2. CNNを用いたEnd-to-Endナビゲーションシステムによるつくばチャレンジへの取り組み International coauthorship

    清谷竣也, CARBALLO Alexander, 竹内栄二朗, 宮島千代美, 武田一哉

    計測自動制御学会システムインテグレーション部門講演会(CD-ROM)   Vol. 18th   2017




  3. Investigation of DNN-Based Audio-Visual Speech Recognition (Special Section on Recent Advances in Machine Learning for Spoken Language Processing) Reviewed

    Tamura Satoshi, Ninomiya Hiroshi, Kitaoka Norihide, Osuga Shin, Iribe Yurie, Takeda Kazuya, Hayamizu Satoru

    IEICE Transactions on Information and Systems   Vol. 99 ( 10 ) page: 2444 - 2451   2016.10


    Language:English   Publisher:The Institute of Electronics, Information and Communication Engineers  


  4. Adaptation Methods for Daily Activity Recognition Based on Deep Neural Network

      Vol. 116 ( 189 ) page: 1 - 6   2016.8




  5. Daily Activity Recognition based on Recurrent Neural Networks

      Vol. 116 ( 189 ) page: 7 - 12   2016.8


    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  


  6. A-14-20 Comparison of Car-Following Behavior among Different Driver Categories

    Goto Taichi, Miyajima Chiyomi, Li Yiyang, Takeda Kazuya, Hiroike Shinya, Sakamoto Shinobu, Honda Shinichiro, Tsukahara Toshiya, Ito Masayoshi

    Proceedings of the IEICE Engineering Sciences Society/NOLTA Society Conference   Vol. 2016   2016.3


    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  


Presentations 441

  1. Dance with Rhythmic Frames:コマ送り提示によるVRダンス学習システム

    時田 聡実(名大),石黒 祥生(東大),大谷 健登(名大),西野 隆典(名城大),武田 一哉(名大)

    第27回 一般社団法人情報処理学会シンポジウム インタラクション2023  2023.3.10 


    Event date: 2023.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  2. UnifiedGeoMap:2D地図と3D地図の組み合わせによる空間情報の把握

    早川 達也,石黒 祥生,大谷 健登,西野 隆典,武田 一哉

    第27回 一般社団法人情報処理学会シンポジウム インタラクション2023  2023.3.8 


    Event date: 2023.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  3. サッカーにおけるイベント予測に基づく一般化されたチームの守備評価

    梅基 陸平, 中原 啓, 筒井 和詩, 藤井 慶輔

    2022年度スポーツデータサイエンスコンペティション  2023.1.8 


    Event date: 2023.1

    Language:Japanese   Presentation type:Oral presentation (general)  


  4. Efficient Training Method for Point Cloud-based Object Detection Models by Combining Environmental Transitions and Active Learning International conference

    Takumi Yamamoto, Kento Ohtani, Tomoki Hayashi, Alexander Carballo, Kazuya Takeda



    Event date: 2022.12

    Language:English   Presentation type:Oral presentation (general)  


  5. Exploring optimal cooperative behavior and its underlying cognitive and decision mechanisms using deep reinforcement learning



    Event date: 2022.10

    Language:Japanese   Presentation type:Oral presentation (general)  



    Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda



    Event date: 2022.7

    Language:English   Presentation type:Symposium, workshop panel (public)  


  7. Automatic faults detection in race walking using smartphone camera



    Event date: 2022.6

    Language:Japanese   Presentation type:Oral presentation (general)  


  8. 反実仮想シミュレーションを用いた野球におけるチーム打撃戦略の効果検証


    2022年度人工知能学会全国大会(第36回)  2022.6.16 


    Event date: 2022.6

    Language:Japanese   Presentation type:Oral presentation (general)  


  9. 軌道予測に基づいた味方の得点機会を創出するサッカー選手の評価

    寺西真聖, 筒井 和詩, 武田 一哉, 藤井 慶輔

    2022年度人工知能学会全国大会(第36回)  2022.6.16 


    Event date: 2022.6

    Language:Japanese   Presentation type:Oral presentation (general)  


    DOI: 10.11517/pjsai.JSAI2022.0_3G4OS15b05

  10. Diversity of behavioral strategy in cooperative hunting using multi-agent deep reinforcement learning



    Event date: 2022.6

    Language:Japanese   Presentation type:Oral presentation (general)  


  11. Player evaluation in a racket sport via deep reinforcement learning with technical and tactical contexts

    Ning DING, Kazuya TAKEDA, Keisuke FUJII

    Japanese Society for Artificial Intelligence  2022.6.14 


    Event date: 2022.6

    Language:English   Presentation type:Oral presentation (general)  


    DOI: 10.11517/pjsai.JSAI2022.0_1S1IS304

  12. A serial anomalous sound detection method using outlier exposure based on two types of binary classification



    Event date: 2022.5

    Language:Japanese   Presentation type:Oral presentation (general)  


  13. A serial anomalous sound detection method using outlier exposure based on two types of binary classification International conference

    Kuroyanagi, Ibuki; Hayashi, Tomoki; Takeda, Kazuya; Toda, Tomoki

    IEICE Technical Report; IEICE Tech. Rep.  2022.5.6  IEICE


    Event date: 2022.5


  14. Web service to automatically generate travel memory videos from dashcam video data



    Event date: 2021.12

    Presentation type:Oral presentation (general)  

  15. 機械学習を⽤いた3塁ベースコーチの判断評価

    中原 啓、武田 一哉、藤井 慶輔

    日本野球科学研究会  2021.11.27 


    Event date: 2021.11

    Presentation type:Poster presentation  


  16. 声質の可視化を用いた所望音声検索システムの提案


    音楽情報科学研究会  2021.11.25 


    Event date: 2021.11

    Presentation type:Oral presentation (general)  


  17. Anomalous Sound Detection Using a Binary Classification Model with Metric Learning



    Event date: 2021.9

    Presentation type:Oral presentation (general)  

  18. Extraction of swing motion contributing to prediction of shuttle drop position in badminton International conference

    Tatsuya Yoshikawa, Kazushi Tsutsui, Kazuya Takeda and Keisuke Fujii

    AI for Sports Analytics (AISA) Workshop IJCAI 2021  2021.8.17 


    Event date: 2021.8

    Presentation type:Oral presentation (general)  

  19. Extraction of swing motion contributing to prediction of shuttle drop position in badminton



    Event date: 2021.6

    Presentation type:Oral presentation (general)  

  20. Flexible prediction of target motion with internal representation in chase behavior



    Event date: 2021.6

    Presentation type:Oral presentation (general)  

  21. 複数の自動運転車両を少人数で遠隔監視するための能動的管理手法

    丁明, 竹内栄二郎, 石黒洋生, 二宮芳樹, 河口信夫, 武田一哉

    ロボティクス・メカトロニクス講演会2021(ROBOMECH2021)  2021.6.8 


    Event date: 2021.6

    Presentation type:Poster presentation  


  22. 電気式人工喉頭を用いた歌唱システムにおける自然な身体動作を利用した歌唱表現付与の提案

    大川舜平, 石黒祥生, 大谷健登, 西野隆典, 小林和弘, 戸田智基, 武田一哉

    インタラクション2021  2021.3.10 


    Event date: 2021.3



  23. Anomalous Sound Detection Using a Binary Classification Model Considering Class Centroids



    Event date: 2021.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  24. Comparison of speed following performance by intervention methods in autonomous driving using End-to-End learning



    Event date: 2020.12

    Language:Japanese   Presentation type:Poster presentation  



    Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda

    Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE Workshop) 


    Event date: 2020.11

    Language:Japanese   Presentation type:Oral presentation (general)  


  26. Cross-Lingual Voice Conversion using Cyclic Variational Autoencoder and WaveNet Vocoder



    Event date: 2020.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  27. 深層学習による三次元折り紙形状推定における視点位置の影響

    中垣内千晶, 西野隆典, 武田一哉



    Event date: 2020.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  28. Data-driven modeling of trajectory in human chase and escape behaviors



    Event date: 2020.6

    Language:Japanese   Presentation type:Oral presentation (general)  


  29. Visual-vibration Learning Based Road Characteristic Estimation Using Road Images


    Event date: 2020.5

    Language:Japanese   Presentation type:Oral presentation (general)  


  30. Scene-dependent Anomalous Acoustic-event Detection Based on Conditional Wavenet and I-vector International conference

    Tatsuya Komatsu, Tomoki Hayashi, Reishi Kondo, Tomoki Toda, Kazuya Takeda



    Event date: 2020.5

    Language:English   Presentation type:Oral presentation (general)  

    Venue:Virtual Barcelona   Country:Japan  

  31. FollowSelect: 準備動作が必要な機器の利用に適した 経路追従型メニュー選択手法

    榮井 優介, 石黒 祥生, 西野 隆典, 武田 一哉

    インタラクション 2020 


    Event date: 2020.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  32. CycleVAEを用いたクロスリンガル声質変換(Cross-Lingual Voice Conversion using Cyclic Variational Auto-encoder)

    中谷輝,Patrick Lumban Tobing,武田一哉,戸田智基



    Event date: 2020.3

    Language:Japanese   Presentation type:Poster presentation  


  33. Optimizing Learned Object Detection on Point Clouds from 3D Lidars Through Range and Sparsity Information International conference

    Jacob Lambert, Eijiro Takeuchi, Kazuya Takeda

    APSIPA ASC 2019 


    Event date: 2019.11

    Language:English   Presentation type:Poster presentation  

  34. Data-driven modeling of locomotor behaviors in game-based chase and escape interactions International conference

    Tsutsui, K., Fujii, K. and Takeda, K

    Asian Conference on Machine Learning 


    Event date: 2019.11

    Language:English   Presentation type:Poster presentation  

  35. Training Engineers in Autonomous Driving Technologies using Autoware International conference

    Carballo Alexander, Wong David, Ninomiya Yoshiki, Kato Shinpei, Takeda Kazuya

    2019 IEEE Intelligent Transportation Systems Conference (ITSC) 


    Event date: 2019.10


  36. Personalized Safety-Focused Control by Minimizing Subjective Risk International conference

    Bao Naren, Yang Dongfang, Carballo Alexander, Özgüner Ümit, Takeda Kazuya

    2019 IEEE Intelligent Transportation Systems Conference (ITSC) 


    Event date: 2019.10


  37. Trajectory prediction for human chase and escape task by data-driven modeling



    Event date: 2019.9

    Language:Japanese   Presentation type:Poster presentation  


  38. 全方位映像を用いた危険車線変更シーンの検出




    Event date: 2019.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:大同大学   Country:Japan  

  39. ドライバの視線を用いた認知的注意散漫状態の検出(Detection of driver cognitive distraction using the driver's gaze)

    梅田拓, 宮島千代美, 竹内栄二朗, 武田一哉



    Event date: 2019.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  40. Trajectory prediction for human chase and escape task by data-driven modeling



    Event date: 2019.9

    Language:Japanese   Presentation type:Poster presentation  


  41. ドライバの視覚的注意推定に基づく視行動の安全性評価(Safety evaluation of gaze behavior based on driver's visual attention estimation)




    Event date: 2019.7

    Language:Japanese   Presentation type:Poster presentation  


  42. Recognition Assistance Interface for Autonomous Vehicles International conference

    Atsushi Kuribayashi, Eijiro Takeuchi, Alexander Carballo, Kazuya Takeda

    FAST-zero '19 


    Event date: 2019.6


    Country:United States  

  43. End-to-End Driving using Point Cloud Features International conference

    Shunya Seiya, Alexander Carballo, Eijiro Takeuchi, Kazuya Takeda



    Event date: 2019.6

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  44. Point Cloud Compression for 3D LiDAR Sensor using Recurrent Neural Network with Residual Blocks International conference

    Chenxi Tu, Eijiro Takeuchi, Alexander Carballo, Kazuya Takeda



    Event date: 2019.5

    Language:English   Presentation type:Poster presentation  

  45. End-to-End Navigation with Branch Turning Support using Convolutional Neural Network International conference



    Event date: 2018.12

    Language:English   Presentation type:Oral presentation (general)  

  46. A Slope-robust Cascaded Ground Segmentation in 3D Point Cloud for Autonomous Vehicles International conference

    2018 21st International Conference on Intelligent Transportation Systems (ITSC) 


    Event date: 2018.11

    Language:English   Presentation type:Oral presentation (keynote)  

    Country:United States  

  47. Learning How to Drive in Blind Intersections from Human Data International conference



    Event date: 2018.10

    Language:English   Presentation type:Oral presentation (general)  

  48. Far-infrared images recognition for nighttime pedestrian detection based on transfer learning

    8th Biennial Workshop on DSP in Vehicles 


    Event date: 2018.10

    Language:English   Presentation type:Poster presentation  


  49. Classification of driving situations by risk level using a deep neural network

    8th Biennial Workshop on DSP in Vehicles 


    Event date: 2018.10

    Language:English   Presentation type:Poster presentation  


  50. Estimation of driver's risk feeling using information from the driving environment

    8th Biennial Workshop on DSP in Vehicles 


    Event date: 2018.10

    Language:English   Presentation type:Poster presentation  


  51. Modeling subjective driving risk feeling using ensemble learning methods

    8th Biennial Workshop on DSP in Vehicles 


    Event date: 2018.10

    Language:English   Presentation type:Oral presentation (keynote)  


  52. Modeling and evaluation of the gaze behavior of individual drivers

    8th Biennial Workshop on DSP in Vehicles 


    Event date: 2018.10

    Language:English   Presentation type:Poster presentation  


  53. Anomalous Sound Event Detection Based on WaveNet International conference



    Event date: 2018.9

    Language:English   Presentation type:Oral presentation (general)  

  54. ESPnet: End-to-End Speech Processing Toolkit International conference

    Proc. INTERSPEECH, pp. 2207-2211, 2018. 


    Event date: 2018.9

    Language:English   Presentation type:Poster presentation  


  55. Spectral clustering based approach for evaluating the effect of driving behavior on fuel economy International conference

    2018 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) 


    Event date: 2018.5

    Language:English   Presentation type:Poster presentation  

    Country:United States  

  56. Multi-Head Decoder for End-to-End Speech Recognition International conference



    Event date: 2018.4

    Language:English   Presentation type:Oral presentation (general)  

  57. Retrieving a Driving Model Based on Clustered Intersection Data International conference



    Event date: 2018.4

    Language:English   Presentation type:Oral presentation (general)  

  58. Estimating Subjective Driving Risk Feeling using Random Forest

    Naren Bao, Chiyomi Miyajima, Akira Tamamori, Eijiro Takeuchi, and Kazuya Takeda

    2018 IEICE General Conference 


    Event date: 2018.3

    Language:English   Presentation type:Oral presentation (general)  

    Venue:Tokyo   Country:Japan  

  59. WaveNetボコーダにおける学習データ量の影響に関する調査

    林 知樹, 小林 和弘, 玉森 聡, 武田 一哉, 戸田 智基



    Event date: 2018.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:日本工業大学宮代キャンパス   Country:Japan  

  60. 複数話者WaveNetボコーダに関する調査

    林 知樹, 小林 和弘, 玉森 聡, 武田 一哉, 戸田 智基



    Event date: 2018.1

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京大学   Country:Japan  

  61. End-to-Endシステムを用いた自立走行ナビゲーション

    清谷 竣也, Alexander Carballo, Jacob Lambert, Hatem Darweesh, Patiphon Narksri, Luis Y. Morales, 赤井 直紀, 竹内 栄二朗, 武田一哉

    つくばチャレンジシンポジウム, 2017. 


    Event date: 2018.1

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:つくば市   Country:Japan  

  62. CNNを用いたEnd-to-Endナビゲーションによるつくばチャレンジへの取り組み

    清谷 竣也, Alexander Carballo, 竹内栄二朗, 宮島千代美, 武田一哉

    第18回 計測自動制御学会システムインテグレーション部門講演会, 2017. 


    Event date: 2017.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:仙台市   Country:Japan  

  63. 名古屋大学におけるサーバ型紙レポート・LMS連携システムの開発

    清谷 竣也, 伊藤 瑠哉, 岡本 康佑, 谷川 右京, 大平 茂輝, 出口 大輔, 戸田 智基

    大学ICT推進協議会2017年度年次大会, 2017. 


    Event date: 2017.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:広島市   Country:Japan  

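  64. 雑音環境下における音声了解度向上に向けた音声波形加工手法の評価(An Evaluation of Speech Waveform Modification Methods towards Improvement of Speech Intelligibility in Noisy Environment)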

    武山 知弘, 小林 和弘, 戸田 智基, 武田 一哉



    Event date: 2017.12

    Language:Japanese   Presentation type:Poster presentation  

    Venue:早稲田大学   Country:Japan  

  65. DNN適応に基づく非可聴つぶやき認識用話者・環境依存音響モデルの構築

    野田 聖太, 林 知樹, 戸田 智基, 武田 一哉



    Event date: 2017.12

    Language:Japanese   Presentation type:Poster presentation  

    Venue:早稲田大学   Country:Japan  

  66. Prediction Method for Continuous Point Cloud Data Compression Using SLAM Information International conference

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima, Kazuya Takeda



    Event date: 2017.9

    Language:English   Presentation type:Poster presentation  

  67. 非可聴つぶやき認識のための深層学習に基づく音響モデリング

    野田聖太,林 知樹,戸田 智基,武田 一哉



    Event date: 2017.9

    Language:Japanese   Presentation type:Poster presentation  


    B2-1, 1 page, Sep., 2017

  68. CTCに基づく音響イベントから擬音語表現への変換

    宮崎 晃一, 林 知樹, 戸田 智基, 武田 一哉



    Event date: 2017.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:愛媛大学   Country:Japan  

  69. WaveNetボコーダ学習における複数話者音声データの利用に関する検討

    林 知樹, 玉森 聡, 小林 和弘, 武田 一哉, 戸田 智基



    Event date: 2017.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:愛媛大学   Country:Japan  

  70. 楽器音生成過程を考慮した畳み込みニューラルネットワークに基づく楽曲音源強調

    大谷健登, 丹羽健太, 西野隆典, 武田一哉



    Event date: 2017.9

    Language:Japanese   Presentation type:Poster presentation  


  71. Estimation of risk as perceived by individual drivers International conference

    Chiyomi Miyajima, Yuan Sheng, and Kazuya Takeda

    The Second Seminar on JSPS Core-to-Core Program 


    Event date: 2017.8

    Language:English   Presentation type:Oral presentation (general)  

    Venue:Bangkok,Thailand   Country:Thailand  

  72. イベント継続長を明示的に制御したBLSTM-HSMMハイブリッドモデルによる多重音響イベント検出

    林知樹, 渡部晋治, 戸田智基, 堀貴明, Jonathan Le Roux, 武田一哉

    応用音響研究会



    Event date: 2017.7

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北海道大学   Country:Japan  

    信学技報, vol. 117, no. 138, EA2017-2, pp. 9-14, Jul., 2017

  73. ケプストラム距離正則化を用いた半教師ありステレオチャネル楽曲音源分離




    Event date: 2017.6

    Language:Japanese   Presentation type:Poster presentation  

    Venue:御茶ノ水女子大学   Country:Japan  

    Vol. 2017-MUS-115, No. 18, pp. 1-6, Jun., 2017

  74. イベント区間検出統合型BLSTM-HMMハイブリッドモデルによる多重音響イベント検出

    林知樹, 渡部晋治, 戸田智基, 堀貴明, Jonathan Le Roux, 武田一哉



    Event date: 2017.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川   Country:Japan  

  75. 音声生成過程を考慮したWaveNetに基づく音声波形合成法

    玉森 聡, 林 知樹, 戸田 智基, 武田 一哉



    Event date: 2017.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川   Country:Japan  

  76. 非可聴つぶやき認識のための通常音声を活用したDNN音響モデル学習

    野田聖太, 林知樹, 戸田智基, 武田一哉



    Event date: 2017.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:神奈川   Country:Japan  

  77. 統計的音声波形変換に基づく雑音環境下における音声了解度向上




    Event date: 2017.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川   Country:Japan  

    1-6-12, pp. 231-232,Mar., 2017

  78. 時間領域信号推定に基づく音声スペクトログラムの欠損成分復元

    関 翔悟, 亀岡 弘和, 戸田 智基, 武田 一哉



    Event date: 2017.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:沖縄産業支援センター   Country:Japan  

    信学技報, Vol. 116, No. 475, EA2016-85, pp. 19-24, Mar. 2017.

  79. Analysis of Individual Risk Perception during Highway Lane-Change Scenes

    Naren Bao, Chiyomi Miyajima, Eijiro Takeuchi, and Kazuya Takeda

    The 79th National Convention of IPSJ.  


    Event date: 2017.3

    Language:English   Presentation type:Oral presentation (general)  

    Venue:Nagoya   Country:Japan  

  80. Compressing Continuous Point Cloud Data Using Image Compression Methods International conference

    Chenxi Tu, Eijiro Takeuchi, Chiyomi Miyajima, Kazuya Takeda



    Event date: 2016.12

    Language:English   Presentation type:Oral presentation (general)  

  81. 深層学習に基づく非可聴つぶやき認識用音響モデルの構築




    Event date: 2016.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:福岡博多   Country:Japan  

  82. Deep Neural Networkに基づく日常生活行動認識における適応手法(Adaptation Methods for Daily Activity Recognition Based on Deep Neural Network)


    電子情報通信学会 技術報告 


    Event date: 2016.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北海道札幌市   Country:Japan  

  83. 日常生活行動認識のためのRecurrent Neural Network構造の調査

    玉森 聡、林 友樹、戸田 智基、武田 一哉



    Event date: 2016.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:富山大学   Country:Japan  

  84. 波面合成法を用いた車内の座席位置における焦点音源生成の検討(Focused source synthesized on seating position of a vehicle using wave field synthesis)

    山村俊貴, 石黒祥生, 西野隆典, 武田一哉



    Event date: 2016.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:富山   Country:Japan  

  85. Passenger Anxiety Detection Using Eye Blinks Represented as Point Process


    Event date: 2016.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  86. Filter estimation towards modeling of audio mixing in music production


    Event date: 2016.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  87. Stereo Channel Music Signal Separation Based on Nonnegative Tensor Factorization with Cepstrum Regularization


    Event date: 2016.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  88. 音楽体験拡張AI ~深層学習を用いた雑音抑圧量推定に基づく楽器音信号強調~(Music staging AI -instrument source enhancement based on noise suppression ratio estimation using deep learning-.)

    大谷健登, 丹羽健太, 武田一哉



    Event date: 2016.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:富山   Country:Japan  

  89. Deep Neural Networkに基づく日常生活行動認識における適応手法(Adaptation Methods for Daily Activity Recognition Based on Deep Neural Network)

    林知樹, 北岡教英, 戸田智基, 武田一哉



    Event date: 2016.8

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:京都   Country:Japan  

  90. Recurrent Neural Networkに基づく日常生活行動認識(Daily Activity Recognition based on Recurrent Neural Networks)

    玉森聡, 林知樹, 戸田智基, 武田一哉



    Event date: 2016.8

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:京都   Country:Japan  

  91. Difference of prosodic information transmission efficiency caused by verbally meaningless acoustic difference : An experimental study


    Event date: 2016.3

    Language:English   Presentation type:Oral presentation (general)  


  92. 非負値テンソル因子分解を用いた劣決定ステレオチャネル音源分離(Underdetermined stereo channel source separation using nonnegative tensor factorization)

    関翔悟, 西野隆典, 戸田智基, 武田一哉



    Event date: 2016.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:桐蔭横浜大学 (神奈川県横浜市)   Country:Japan  

  93. Underdetermined stereo channel source separation using nonnegative tensor factorization

    Shogo Seki, Takanori Nishino, Tomoki Toda, and Kazuya Takeda


    Event date: 2016.3

    Language:English   Presentation type:Poster presentation  


  94. Method of presenting sound signals for selective listening system


    Event date: 2016.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  95. Application of informed music source separation to audio object control system


    Event date: 2016.3

    Language:Japanese   Presentation type:Poster presentation  


  96. 深層学習によるボトルネック特徴量を用いたマルチモーダル音声認識

    田村哲嗣, 二宮宏史, 北岡教英, 大須賀晋, 入部百合絵, 武田一哉, 速水 悟

    電子情報通信学会 技術研究報告 


    Event date: 2015.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:別府市 大分県   Country:Japan  

  97. 深層学習によるマルチモーダル音声認識 - 深層学習の活用法の調査

    田村哲嗣, 二宮宏史, 北岡教英, 大須賀晋, 入部百合絵, 武田一哉, 速水 悟



    Event date: 2015.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神戸大学   Country:Japan  

  98. 深層学習によるマルチモーダル音声認識 - 画像特徴量の改善

    田村哲嗣, 二宮宏史, 北岡教英, 大須賀晋, 入部百合絵, 武田一哉, 速水 悟



    Event date: 2015.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神戸大学   Country:Japan  

  99. A Large Scale Activity Sensing Environment of Daily Life for Data Circulation


    Event date: 2015.9

    Language:English   Presentation type:Oral presentation (general)  

    Venue:名古屋工業大学   Country:Japan  

  100. 話者交替の確率モデル化と情報量を用いた話者活性度の評価

    陳伯翰, 北岡教英, 大武美保子, 武田一哉



    Event date: 2015.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:福島県会津若松市   Country:Japan  

  101. 音声対話システムの対話戦略への応用を目的とした音声からの高齢者の感情識別




    Event date: 2015.9

    Language:English   Presentation type:Poster presentation  

    Venue:福島県会津若松市   Country:Japan  

  102. Investigation of Sound Quality Improvement of Lossy Audio Signals Based on Deep Neural Network


    Event date: 2015.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  103. 深層学習による音響・画像特徴量を用いたマルチモーダル音声認識




    Event date: 2015.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:会津大学 福島県   Country:Japan  

  104. Single Dimensional Control of Spatial Audio Object Arrangement based on Head Related Transfer Functions

     More details

    Event date: 2015.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  105. 話者交替行為の情報量を用いた話者活性度の評価

    陳伯翰, 北岡教英, 大武美保子, 武田一哉



    Event date: 2015.8

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:岩手県盛岡市   Country:Japan  

  106. 音響空間の聴覚的操作 ~ 超臨場化と個性化 ~

    武田一哉, 西野隆典, 丹羽健太, 羽田陽一, 猿渡洋, 西村竜一



    Event date: 2015.7

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:電気通信大学 (東京都調布市)   Country:Japan  

  107. Evaluation of speaker engagement using turn-taking behavior entropy

    Bohan Chen, Norihide Kitaoka, Mihoko Otake, Kazuya Takeda


    Event date: 2015.6

    Language:English   Presentation type:Oral presentation (general)  


  108. 音像空間配置のインタラクティブな制御手法




    Event date: 2015.5

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東京   Country:Japan  

  109. MIDI音源を用いた旋律の言語依存性に関する分析




    Event date: 2015.5

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東京   Country:Japan  

  110. Language-dependency analysis for local pitch transition


    Event date: 2015.3 - 2016.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  111. Noise suppression for musical instrument signal based on Gaussian process.


    Event date: 2015.3

    Language:Japanese   Presentation type:Poster presentation  


  112. 韻律補正した学習者の音声と日本語音節に基づく近似発音の提示による英語発音矯正手法




    Event date: 2015.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京   Country:Japan  

  113. 音声情報伝達における合理的な音声特徴制御とその伝達効率への影響

    陳伯翰, 北岡教英, 武田一哉


     More details

    Event date: 2015.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東京   Country:Japan  

  114. Combination of acoustic information and Google Street View using sphere-shaped microphone array

     More details

    Event date: 2015.3

    Language:Japanese   Presentation type:Poster presentation  


  115. DNNによる環境音と加速度信号を用いた日常生活行動認識



     More details

    Event date: 2015.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:日本,東京(中央大学 後楽園キャンパス)   Country:Japan  

  116. Effective receiver's path finding for 3D audio system

     More details

    Event date: 2015.3

    Language:Japanese   Presentation type:Poster presentation  


  117. Combination of acoustic information and Google Street View using sphere-shaped microphone array

    Tomomi Suzuki, Yoshio Ishiguro, Takanori Nishino, Kazuya Takeda

    Acoustical Society of Japan Annual Meeting

     More details

    Event date: 2015.3

    Language:English   Presentation type:Oral presentation (general)  


  118. Evaluation of Arranged Music Retrieval Performance for Different Music Structures

     More details

    Event date: 2015.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  119. 対話者間の音声特徴類似度と対話の情報伝達効果の関係

    陳伯翰, 北岡教英, 武田一哉


     More details

    Event date: 2014.12

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東京工業大学 東京   Country:Japan  

  120. 自動運転環境下におけるドライバの視行動分析

    山﨑駿 宮島千代美 寺井仁 奥田裕之 平山高嗣 鈴木達也 武田一哉(名古屋大学) 坂東誉司 人見謙太郎 江川万寿三(株式会社デンソー)


     More details

    Event date: 2014.11

    Language:Japanese   Presentation type:Poster presentation  

    Venue:岡山大学   Country:Japan  

  121. スマートフォンによるCAN信号のブラインド推定

    坪井優幸 宮島千代美 武田一哉(名古屋大学) 北岡教英(徳島大学)


     More details

    Event date: 2014.11

    Language:Japanese   Presentation type:Poster presentation  

    Venue:岡山大学   Country:Japan  

  122. (Estimation of Driving Signals on In-Vehicle Network Using Smartphone)

    Masayuki Tsuboi, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  123. (Tap Pattern Recognition using Acoustic Signal Processing)

     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  124. 実世界データ循環が作り出す新しい価値 -data-centric ITSを例として-



     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:中京大学 名古屋   Country:Japan  

  125. 大規模運転データに基づく運転行動の理解とモデル化



     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:九州産業大学   Country:Japan  

  126. ユビキタスセンシングに基づく日常生活行動データベースの構築



     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北海道札幌市北海学園大学豊平キャンパス   Country:Japan  

  127. (Interpolation of instrument waveforms based on Gaussian process)

     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Oral presentation (general)  


  128. (Effects of sound image movements to perception in musical contents)

     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Poster presentation  


  129. 深層学習を用いた音声特徴量の年齢の変動に対する頑健性の調査

    林 知樹, 北岡教英, 武田一哉


     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:北海道・日本   Country:Japan  

  130. 原曲の部分区間を用いたアレンジ曲検索 International conference

    森田一輝, 川渕翔太, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北海道・東京   Country:Japan  

  131. 対話者間音声の類似度と対話の情報伝達効率の関係

    陳伯翰, 北岡教英, 武田一哉


     More details

    Event date: 2014.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:北海道・札幌   Country:Japan  

  132. (Investigation of sound image perception for sound source movements and volume changes in musical contents)

     More details

    Event date: 2014.8

    Language:Japanese   Presentation type:Oral presentation (general)  


  133. 同時発話の高性能な音声認識ースペクトル減算による分離の高速化と分離音を用いた音響モデル学習ー

    出木浦悠人, 松本哲也, 竹内義則, 工藤博章, 大西 昇, 北岡教英, 武田一哉


     More details

    Event date: 2014.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京農工大学   Country:Japan  

  134. Kinectを用いたドライバの視線方向推定とその評価

    窄山勝也, 森 真貴, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2014.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:新潟大学   Country:Japan  

  135. ビット演算に基づく高速な音声ドキュメント検索語検出

    北 研二, 松本和幸, 吉田 稔, 柘植 覚, 北岡教英, 武田一哉


     More details

    Event date: 2014.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋市民センター   Country:Japan  

  136. ブラインド空間的サブトラクションアレーとマッチド音響モデルによる雑音下音声認識の評価

    齋藤 航,北岡 教英,武田 一哉


     More details

    Event date: 2014.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京   Country:Japan  

  137. 種々のテキスト検索モデルの頑健性向上による音声ドキュメント検索の高精度化

    北岡教英, 市川賢, 柘植覚, 武田一哉,北研二


     More details

    Event date: 2014.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋   Country:Japan  

  138. 合唱における歌声の引き込みを利用した歌声F0制御の検討

    川岸基成, 川渕将太, 宮島千代美, 北岡教英, 武田一哉

    情報処理学会 音楽情報科学研究会 

     More details

    Event date: 2014.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京   Country:Japan  

  139. 視行動と運転操作行動の統合モデルによる危険な車線変更の検出

    森 真貴, 宮島 千代美, 平山 高嗣, 北岡 教英, 武田 一哉


     More details

    Event date: 2013.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名古屋国際会議場   Country:Japan  

  140. 楽曲間主観的類似判定における個人性分析手法の検討



     More details

    Event date: 2013.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  

  141. 音声ドキュメント検索手法における拡張クエリの超平面によるモデル化と潜在意味解析の適用

    市川賢, 柘植覚, 北岡教英, 武田一哉,北研二


     More details

    Event date: 2013.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  


    Kazunobu Kondo, Yu Takahashi, Tatsuya Komatsu, Takanori Nishino, and Kazuya Takeda

    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2013) 

     More details

    Event date: 2013.5

    Language:English   Presentation type:Oral presentation (general)  


  143. 大規模コーパスを利用した信号処理の研究


    IEICE Technical Report 

     More details

    Event date: 2013.3



  144. 有効観測データ選択に基づくFDICA 音源分離に関する検討

    水野雄介, 近藤多伸, 西野隆典, 北岡教英, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京工科大学@八王子   Country:Japan  

  145. 単語空間と音節空間を併用した音声ドキュメント検索手法への潜在的意味解析の適用

    市川賢, 北岡教英, 柘植覚, 武田一哉, 北研二


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  146. Classification of speech under stress using physical features based on two-mass model

    Xiao Yao, Takatoshi Jitsuhiro, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda

     More details

    Event date: 2013.3

    Language:English   Presentation type:Oral presentation (general)  


  147. 楽曲間の類似判定における許容度の推定

    川渕将太, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:公立はこだて未来大学@函館   Country:Japan  

  148. ばね質量系を利用した合唱における歌声のF0 ダイナミクスのモデル化

    川岸基成, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:公立はこだて未来大学@函館   Country:Japan  

  149. 車載ネットワーク信号を用いたドライバ運転行動解析

    兼子政孝, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大   Country:Japan  

  150. 単一マイクロホンによる音響モデルを用いた発話者までの距離推定

    李津, 實廣貴敏, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京工科大学   Country:Japan  

  151. 相補ウィナーフィルタに基づく残響抑圧音声の主観評価実験

    大谷健登, 小松達也, 近藤多伸, 田邑元一, 西野隆典, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京工科大学   Country:Japan  

  152. 非負値行列因子分解を用いたモノラル音源の楽器音分離

    齋藤航, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大   Country:Japan  

  153. 仮装分離信号を用いた音響モデルによる複数人同時発話音声認識

    出木浦悠人, 北岡教英, 宮島千代美, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大   Country:Japan  

  154. Android端末とCANを用いたクラウド運転診断システム

    竹下裕基, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大   Country:Japan  

  155. 運転支援のための視認性と視行動の計算モデル



     More details

    Event date: 2013.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:宮城県仙台市   Country:Japan  

  156. 音源数評価に基づくFDICA音源分離の計算量削減

    水野雄介, 近藤多伸, 西野隆典, 北岡教英, 武田一哉


     More details

    Event date: 2012.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:国立情報学研究所@東京   Country:Japan  

  157. クラスタ選定によるボトムアップ話者ダイアライゼーションの高精度化

    陳伯翰, 北岡教英, 武田一哉


     More details

    Event date: 2012.12

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東工大@東京   Country:Japan  

  158. 特徴量領域音源分離のためのクロススペクトル抑圧

    安藤厚志, 丹羽健太, 北岡教英, 武田一哉


     More details

    Event date: 2012.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東工大@東京   Country:Japan  

  159. クラスタ検証による話者ダイアライゼーション高精度化

    陳伯翰, 北岡教英, 武田一哉


     More details

    Event date: 2012.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:信州大学   Country:Japan  

  160. 移動音源を対象とする音源分離におけるBlockwise ICA 法の性能評価

    常田諭史, 西野隆典, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2012.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  

  161. ノンパラメトリックベイズ法に基づく運転行動の個人性のモデル化

    中川諒, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2012.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  

  162. 安全性の異なるドライバの車線変更時の運転行動の比較



     More details

    Event date: 2012.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  

  163. Driving Scene Retrieval Based on Driving Behavior and Surrounding Environment

    李亦楊, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2012.9

    Language:English   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  

  164. 合唱における基本周波数軌跡のモデル化に関する研究

    川岸基成, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2012.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:信州大学   Country:Japan  

  165. 音声認識のための特徴量領域音源分離



     More details

    Event date: 2012.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:信州大学   Country:Japan  

  166. 相補ウィナーフィルタに基づく残響抑圧手法の性能評価実験

    小松達也, 近藤多伸, 西野隆典, 北岡教英, 武田一哉


     More details

    Event date: 2012.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:信州大学   Country:Japan  

  167. Evaluation for vowel-independent classification of speech under stress based on interaction between the vocal folds and the vocal tract

    姚 瀟, 實廣貴敏, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2012.9

    Language:English   Presentation type:Oral presentation (general)  

    Venue:信州大学   Country:Japan  

  168. 高速道路追い越し運転時のドライバの視行動の個人性の分析

    森 真貴, 宮島千代美, 北岡教英, 武田一哉

    ロボティクス・メカトロニクス講演会2012 (ROBOMEC 2012) 

     More details

    Event date: 2012.5

    Language:Japanese   Presentation type:Poster presentation  

    Venue:浜松   Country:Japan  

  169. 周波数帯域ごとの音源分離信頼度を利用したマルチバンド音声認識

    安藤厚志, 大橋宏正, 原 直, 北岡教英, 武田一哉


     More details

    Event date: 2012.3 - 2021.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川大学   Country:Japan  

  170. Detection for stressed speech based on two-mass model

     More details

    Event date: 2012.3

    Language:English   Presentation type:Oral presentation (general)  


  171. 楽曲間主観的類似度データの収集実験

    川渕将太, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2012.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川大学   Country:Japan  

  172. 話者パラメータの操作に基づく特徴量生成による音響モデル構築

    川合窒登, 北岡教英, 武田一哉


     More details

    Event date: 2012.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川大学   Country:Japan  

  173. 観測信号のパワーに基づくFDICA 音源分離の計算量削減

    水野雄介, 江崎 知, 近藤 多伸, 西野隆典, 北岡教英, 武田一哉


     More details

    Event date: 2012.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川大学   Country:Japan  

  174. クエリ拡張と音節認識の統合による音声ドキュメント検索

    大橋宏正, 柘植 覚, 北岡教英, 武田一哉, 北 研二


     More details

    Event date: 2012.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:神奈川大学   Country:Japan  

  175. ブラインド音源分離の信頼度を用いたマルチバンド音声認識

    安藤厚志, 大橋宏正, 原 直, 北岡教英, 武田一哉


     More details

    Event date: 2012.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東北大学   Country:Japan  

  176. 車載ネットワークを用いた運転データの収集と車種による運転行動の違いの分析


    電子情報通信学会技術報告ITS, IE, ITE-ME, ITE-HI, 

     More details

    Event date: 2012.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北海道大学   Country:Japan  

  177. 音声ドキュメント検索におけるクエリ拡張と音節認識の併用の効果

    大橋宏正, 柘植 覚, 北岡教英, 武田一哉, 北 研二


     More details

    Event date: 2012.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東北大学   Country:Japan  

  178. 運転の振り返りに基づく安全運転教育システムの開発

    宮島 千代美, 武田 一哉, 鈴木 達也, 耒見田 健二, 畔柳 雄一, 石川 博章, P. Angkititrakul, 寺嶌 立太, 脇田 敏裕, 及川 雅人, 駒田 悠一


     More details

    Event date: 2011.11

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  

  179. 自動車運転行動の信号処理



     More details

    Event date: 2011.11

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:豊橋技術科学大学   Country:Japan  

  180. プライバシー保護のための音源分離による音声抑圧手法



     More details

    Event date: 2011.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大学   Country:Japan  

  181. 楽曲間の主観的類似性判断における個人性の要因の分析

    川渕将太, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2011.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大学   Country:Japan  

  182. 振幅二乗コヒーレンス基準による独立成分分析音源分離の計算量削減の検討

    水野雄介, 江崎知, 近藤多伸, 西野隆典, 北岡教英, 武田一哉


     More details

    Event date: 2011.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:島根大学   Country:Japan  

  183. 車載ネットワークを用いた可搬型運転行動信号収録システムの開発



     More details

    Event date: 2011.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大学   Country:Japan  

  184. 運転行動信号間の類似度に基づいた類似運転状況検索

    中川 諒,宮島千代美,北岡教英,武田一哉


     More details

    Event date: 2011.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大学   Country:Japan  

  185. ベクトル量子化によるカバーソング認識技術の高速化

    永石陽祐, 宮島千代美, 北岡教英, 武田一哉


     More details

    Event date: 2011.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:三重大学   Country:Japan  

  186. 観測信号間のコヒーレンスに基づくICA音源分離の計算量の削減

    水野雄介, 江崎知, 近藤多伸, 西野隆典, 北岡教英, 武田一哉

    電子情報通信学会 技術報告 

     More details

    Event date: 2011.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北海道医療大学   Country:Japan  

  187. 非言語情報を基にしたストレス状態検出の検討

    松尾直司, 鷲尾信之, 原田将治, 釜野晃, 川昭二, 武田一哉

    電子情報通信学会 技術報告 

     More details

    Event date: 2011.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名古屋大学   Country:Japan  

  188. Improving head-related impulse response measured in noisy environments with spatio-temporal frequency analysis International conference

    Takanori Nishino and Kazuya Takeda

    ICASSP 2011  

     More details

    Event date: 2011.5

    Language:English   Presentation type:Oral presentation (general)  

    Country:Czech Republic  

  189. Driver risk evaluation based on acceleration deceleration and steering behavior International conference

    Chiyomi Miyajima, Hiroki Ukai, Atsumi Naito, Hideomi Amata, Norihide Kitaoka, and Kazuya Takeda

    2011 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), (poster)

     More details

    Event date: 2011.5

    Language:English   Presentation type:Poster presentation  

    Country:Czech Republic  

  190. 常時記録型ドライブレコーダを用いた運転診断教示システムの開発とその評価

    宮島 千代美, 武田 一哉, 鈴木 達也, 耒見田 健二, 畔柳 雄一, 石川 博章, P. Angkititrakul, 寺嶌 立太, 脇田 敏裕, 及川 雅人, 駒田 悠一


     More details

    Event date: 2011.5

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:パシフィコ横浜   Country:Japan  

  191. 潜在危険分析マニュアルに基づく実環境運転データの分析 International conference

    畔柳 雄一, 石川 博章, 宮島 千代美, 北岡 教英, 武田 一哉


     More details

    Event date: 2011.5

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:パシフィコ横浜   Country:Japan  

  192. 音声対話システムにおける発話・行動タグN-gramを用いた課題未達成対話の検出手法と分析



     More details

    Event date: 2011.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:早稲田大学 東京   Country:Japan  

  193. 雑談音声の認識に基づく楽曲連想再生システム



     More details

    Event date: 2011.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:早稲田大学 東京   Country:Japan  

  194. 合唱における歌声の基本周波数軌跡の分析



     More details

    Event date: 2011.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:早稲田大学 東京   Country:Japan  

  195. NMFを利用した楽曲間類似尺度の構成方法に関する検討



     More details

    Event date: 2011.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:早稲田大学 東京   Country:Japan  

  196. MLLR変換行列に基づいた音響特徴量生成による音響モデル学習



     More details

    Event date: 2011.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:早稲田大学 東京   Country:Japan  

  197. 車内情報インターフェースに関する国際基準の動向


    情報処理学会 技術報告 第7回音声言語情報処理技術デベロッパーズフォーラム 

     More details

    Event date: 2011.1

    Language:Japanese   Presentation type:Oral presentation (general)  


  198. ICA仮想音源の空間分布を用いた室内音場の解析


    電子情報通信学会 応用音響(EA)研究会 

     More details

    Event date: 2011.1

    Language:Japanese   Presentation type:Oral presentation (general)  


  199. 過去の走行データに基づく運転診断/教示システム



     More details

    Event date: 2011.1

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:立命館大学びわこキャンパス、滋賀   Country:Japan  

  200. MLLR変換行列により制約された音響特徴量生成による頑健な音響モデル



     More details

    Event date: 2010.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:国立オリンピック記念青少年総合センター   Country:Japan  

  201. 雑談音声の常時認識による楽曲提案システム



     More details

    Event date: 2010.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:幕張メッセ 千葉   Country:Japan  

  202. 音声対話システムの発話系列N-gramを用いた課題未達成対話のオンライン検出



     More details

    Event date: 2010.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:関西大学 大阪   Country:Japan  

  203. 相平面確率モデルを利用した歌唱・楽器演奏の基本周波数軌跡の分析



     More details

    Event date: 2010.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:関西大学 大阪   Country:Japan  

  204. 球形マイクロホンアレイを用いた時空間周波数解析による残響抑圧



     More details

    Event date: 2010.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:関西大学 大阪   Country:Japan  

  205. ベイジアンネットワークを用いた運転中の2次タスク有無の推定


    2010年電気関係学会東海支部連合大会、CD-ROM Proceedings 

     More details

    Event date: 2010.8

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:中部大学、愛知   Country:Japan  

  206. 両耳平均HRTFによる上昇角知覚



     More details

    Event date: 2010.8

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:中部大学 愛知   Country:Japan  

  207. 実環境運転データにおける潜在危険状況の分析


    2010年電気関係学会東海支部連合大会、CD-ROM Proceedings 

     More details

    Event date: 2010.8

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:中部大学、愛知   Country:Japan  

  208. 正十二面体マイクロホンアレイを用いた実環境における音源信号分離の検討



     More details

    Event date: 2010.5

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:甲南大学 兵庫   Country:Japan  

  209. MLLR 変換行列により生成した音声特徴量に基づく高速モデル適応



     More details

    Event date: 2010.3



  210. CENSREC-1-AV:マルチモーダル音声認識コーパスの構築



     More details

    Event date: 2010.3



  211. 多面体マイクロホンアレイを用いた実環境下における優決定ブラインド音源信号分離



     More details

    Event date: 2010.3

    Language:Japanese   Presentation type:Oral presentation (general)  


  212. 楽曲連想再生のための文書特徴量と音響特徴量の対応付け



     More details

    Event date: 2010.2

    Language:Japanese   Presentation type:Oral presentation (general)  


  213. 把持動作から認知状態を推定するための信号処理手法の検討



     More details

    Event date: 2010.1

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:京都大学   Country:Japan  

  214. 時系列アクティブ探索法に基づく運転行動の類似検索



     More details

    Event date: 2010.1

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:京都大学   Country:Japan  

  215. 交差点進入時の運転行動の推定



     More details

    Event date: 2010.1

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:京都大学   Country:Japan  

  216. 音声対話システムの対話履歴N-gramを利用したユーザ満足度推定手法


     More details

    Event date: 2009.12



  217. Prediction model of driving behavior based on traffic conditions and driver types International conference

    Hideomi Amata, Chiyomi Miyajima, Takanori Nishino, and Kazuya Takeda


     More details

    Event date: 2009.10

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  218. A multimedia corpus of driving behaviors International conference

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda


     More details

    Event date: 2009.10

    Language:English   Presentation type:Poster presentation  


  219. Comparison of measured head-related transfer functions using spatio-temporal frequency analysis International conference

    Yasuko Morimoto, Takanori Nishino, and Kazuya Takeda

    158th Meeting of the Acoustical Society of America 

     More details

    Event date: 2009.10

    Language:English   Presentation type:Poster presentation  

    Country:United States  

  220. Automatic identification for singing style based on sung melodic contour characterized in phase plane International conference

    Tatsuya Kako, Yasunori Ohishi, Hirokazu Kameoka, Kunio Kashino, and Kazuya Takeda


     More details

    Event date: 2009.10

    Language:English   Presentation type:Poster presentation  


  221. Multimodal real-world driving data collection and analysis International conference

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    4th Biennial Workshop on Digital Signal Processing for In-Vehicle Systems and Safety

     More details

    Event date: 2009.10

    Language:English   Presentation type:Poster presentation  


  222. 球形マイクロホンアレイを用いたブラインド音源分離



     More details

    Event date: 2009.9



  223. 信号なし交差点における運転行動予測モデル



     More details

    Event date: 2009.9



  224. 多面体マイクロホンアレイによる伝播特性類似度評価関数を用いたブラインド音響信号分離



     More details

    Event date: 2009.9



  225. 楽曲間の主観的類似と音響的類似度との関連付けに関する検討


     More details

    Event date: 2009.9



  226. 相平面確率モデルを用いた歌唱様式の自動分類に関する研究



     More details

    Event date: 2009.9



  227. HRTF測定座標系における時空間周波数特徴量の検討


    日本音響学会 2009年秋季研究発表会講演論文集 

     More details

    Event date: 2009.9



  228. サポートベクターマシンを用いた車線変更の意図の予測



     More details

    Event date: 2009.9



  229. 大規模実環境運転データベースのための閲覧・検索システムの開発



     More details

    Event date: 2009.9



  230. 複数音響モデルからの最適選択による音声認識



     More details

    Event date: 2009.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:愛知工業大学、愛知   Country:Japan  

  231. 急減速時の加速度パターンの分類に基づくドライバの危険性の評価



     More details

    Event date: 2009.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:愛知工業大学、愛知   Country:Japan  

  232. 認知状態識別のための把持ヤコビ行列の特徴抽出



     More details

    Event date: 2009.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:新潟大学、新潟   Country:Japan  

  233. Blind source separation based on acoustic pressure distribution and normalized relative phase using dodecahedral microphone array International conference

    Motoki Ogasawara, Takanori Nishino, and Kazuya Takeda


     More details

    Event date: 2009.8

    Language:English   Presentation type:Oral presentation (general)  

    Country:United Kingdom  

  234. 相平面の描かれるF0の動的変動成分を利用した歌唱様式の自動分類


    SIGMUS 第81回研究発表会 

     More details

    Event date: 2009.7

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:飯坂温泉、福島   Country:Japan  

  235. Analysis of head related transfer functions based on the spatio-temporal frequency characteristic International conference

    Yasuko Morimoto, Takanori Nishino, and Kazuya Takeda

    AES Tokyo Convention 2009

     More details

    Event date: 2009.7

    Language:English   Presentation type:Poster presentation  


  236. A stochastic signal model for predicting the vehicle trajectory at lane change International conference

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    4th Biennial Workshop on Digital Signal Processing for In-Vehicle Systems and Safety

     More details

    Event date: 2009.6

    Language:English   Presentation type:Poster presentation  

    Country:United States  

  237. Evaluation of Discriminant Analysis-based Feature Transformation and Discriminative Training for Speech Recognition International conference

    Makoto Sakai, Norihide Kitaoka, Yuya Hattori, Seiichi Nakagawa, and Kazuya Takeda


     More details

    Event date: 2009.6

    Language:English   Presentation type:Oral presentation (general)  

    Country:Russian Federation  

  238. 自動車運転コーパスにおける行動観測信号の統合と利用



     More details

    Event date: 2009.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:別府温泉、大分   Country:Japan  

  239. 車線変更軌跡の確率的予測モデル



     More details

    Event date: 2009.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:慶応義塾大学、神奈川   Country:Japan  

  240. 常時記録型ドライブレコーダで記録された前方映像の危険状況の類型化について



     More details

    Event date: 2009.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:琉球大学、沖縄   Country:Japan  

  241. Multimodal estimation of a driver's spontaneous irritation International conference

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda

    2009 IEEE Intelligent Vehicles Symposium (IV2009), pp. 573-577

     More details

    Event date: 2009.6

    Language:English   Presentation type:Poster presentation  


  242. 両耳室内インパルス応答の時空間周波数特性を利用した残響成分の解析



     More details

    Event date: 2009.5

    Language:Japanese   Presentation type:Poster presentation  

    Venue:兵庫県立大学、兵庫   Country:Japan  

  243. 運転挙動の統計的予測モデル



     More details

    Event date: 2009.5

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:パシフィコ横浜、神奈川   Country:Japan  

  244. Feature Transformation based on Discriminant Analysis Preserving Local Structure For Speech Recognition International conference

    Makoto Sakai, Norihide Kitaoka, and Kazuya Takeda


     More details

    Event date: 2009.4

    Language:English   Presentation type:Oral presentation (general)  

    Country:Taiwan, Province of China  

  245. Stochastic modeling of vehicle trajectory during lane-changing International conference

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda


     More details

    Event date: 2009.4

    Language:English   Presentation type:Poster presentation  

    Country:Taiwan, Province of China  

  246. Spoken dialog strategy based on understanding graph search International conference

    Yuji Kinoshita, Chiyomi Miyajima, Norihide Kitaoka, and Kazuya Takeda


     More details

    Event date: 2009.4

    Language:English   Presentation type:Poster presentation  

    Country:Taiwan, Province of China  

  247. 平均クラス誤り最小基準と最大クラス誤り最小基準を組み合わせた音響特徴変換



     More details

    Event date: 2009.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東京工業大学 東京   Country:Japan  

  248. 理解状態グラフの探索による音声対話戦略



     More details

    Event date: 2009.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京工業大学、東京   Country:Japan  

  249. 上方スピーカーによるトランスオーラルシステムの評価



     More details

    Event date: 2009.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京工業大学 東京   Country:Japan  

  250. 確率モデルに基づく単一チャネル音源分離を用いた背景音楽抑圧



     More details

    Event date: 2009.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東京工業大学 東京   Country:Japan  

  251. 実環境大規模運転データベース構築と閲覧システムの開発



     More details

    Event date: 2009.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:松山、愛媛   Country:Japan  

  252. 時空間周波数解析を用いたHRTFデータの比較



     More details

    Event date: 2009.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:東京工業大学 東京   Country:Japan  

  253. 音声対話システムのユーザ満足度推論におけるネットワークモデルの構築と評価



     More details

    Event date: 2009.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東京工業大学 東京   Country:Japan  

  254. ベイジアンネットワークを用いた単一チャネル信号による背景音楽の抑圧



     More details

    Event date: 2008.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:早稲田大学、東京   Country:Japan  

  255. Driver's irritation detection using speech recognition results

     More details

    Event date: 2008.12

    Language:English   Presentation type:Poster presentation  


  256. 音声認識システムの満足度評価におけるユーザモデル



     More details

    Event date: 2008.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:早稲田大学、東京   Country:Japan  

  257. Data collection and usability study of a PC-based speech application in various user environments International conference

    Sunao Hara, Chiyomi Miyajima, Katsunobu Ito, Kazuya Takeda

    Oriental-COCOSDA 2008 

     More details

    Event date: 2008.11

    Language:English   Presentation type:Oral presentation (general)  


  258. 理解状態のグラフ探索に基づいた音声対話戦略



     More details

    Event date: 2008.11

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:NICT 東京   Country:Japan  

  259. 自動車運転行動のマルチモーダル信号コーパス



     More details

    Event date: 2008.11

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:ソフトピアジャパン 大垣・岐阜   Country:Japan  

  260. 3D AV Integrated System Featuring Arbitrary Listening-Point And Viewpoint Generation International conference

    Mehrdad Panahpour Tehrani, Kenta Niwa, Norishige Fukushima, Yasushi Hirano, Toshiaki Fujii, Masayuki Tanimoto, Kazuya Takeda, Kenji Mase, Akio Ishikawa, Shigeyuki Sakazawa, Atsushi Koike

    Proc. of Multimedia Signal Processing, MMSP 2008, PID-213, pp. 855-860

     More details

    Event date: 2008.10

    Language:English   Presentation type:Oral presentation (general)  


  261. Modeling the vehicle trajectory generation process during lane changes with consideration of individuality


    電子情報通信学会 パターン認識・メディア理解研究会 

     More details

    Event date: 2008.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:徳島大学 徳島   Country:Japan  

  262. An Integrative Recognition Method for Speech and Gestures International conference

    Madoka Miki, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Kazuya Takeda

    Tenth International Conference on Multimodal Interfaces (ICMI2008) 

     More details

    Event date: 2008.10

    Language:English   Presentation type:Poster presentation  


  263. Multimodal estimation of a driver's affective state International conference

    Lucas Malta, Chiyomi Miyajima, Kazuya Takeda

    Workshop on affective interaction in natural environments in ICMI 

     More details

    Event date: 2008.10

    Language:English   Presentation type:Oral presentation (general)  


  264. A study of noise suppression for speech overlaid with known encoded music



     More details

    Event date: 2008.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:九州大学 福岡   Country:Japan  

  265. Building and Combining Document and Music Spaces for Music Query-By-Webpage System International conference

    Ryoei Takahashi, Yasunori Ohishi, Norihide Kitaoka, and Kazuya Takeda


     More details

    Event date: 2008.9

    Language:English   Presentation type:Oral presentation (general)  


  266. Generating lane-change trajectories of individual drivers International conference

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Norihide Kitaoka, Ryuta Terashima, Toshiro Wakita, and Kazuya Takeda


     More details

    Event date: 2008.9

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  267. Parameter Estimation Method of F0 Control Model for Singing Voices International conference

    Yasunori Ohishi, Hirokazu Kameoka, Kunio Kashino, Kazuya Takeda


     More details

    Event date: 2008.9

    Language:English   Presentation type:Poster presentation  


  268. CENSREC-AV: Evaluation frameworks for audio-visual speech recognition International conference

    Satoshi Tamura, Chiyomi Miyajima, Norihide Kitaoka, Satoru Hayamizu, and Kazuya Takeda


     More details

    Event date: 2008.9

    Language:English   Presentation type:Poster presentation  


  269. Word speech recognition combining discriminant-analysis-based acoustic features with discriminative training



     More details

    Event date: 2008.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:九州大学 福岡   Country:Japan  

  270. Proposal of a convolutive-HMM-based fundamental frequency control model for singing voices and maximum-likelihood estimation of its parameters



     More details

    Event date: 2008.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:九州大学 福岡   Country:Japan  

  271. A study of blind source signal separation using a polyhedral microphone array



     More details

    Event date: 2008.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:九州大学 福岡   Country:Japan  

  272. Measurement of head movement using wireless acceleration sensors



     More details

    Event date: 2008.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:慶応大学 神奈川   Country:Japan  

  273. Evaluation of sound source direction localization in transaural systems under different loudspeaker placement conditions



     More details

    Event date: 2008.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:愛知県立大学 愛知   Country:Japan  

  274. Efficient and natural spoken dialogue strategies for search tasks



     More details

    Event date: 2008.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:愛知県立大学 愛知   Country:Japan  

  275. Binaural sound localization for untrained directions based on a Gaussian mixture model International conference

    Takanori Nishino, and Kazuya Takeda

    16th European Signal Processing Conference (EUSIPCO2008)

     More details

    Event date: 2008.8

    Language:English   Presentation type:Poster presentation  


  276. Sound source separation based on binary masking using Bayesian networks



     More details

    Event date: 2008.7

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:ホテル紫苑 岩手   Country:Japan  

  277. Measurements of head-related transfer function in sagittal and frontal coordinates International conference

    Takashi Nakado, Takanori Nishino, and Kazuya Takeda


     More details

    Event date: 2008.7

    Language:English   Presentation type:Poster presentation  


  278. A study of solutions to the permutation problem of frequency-domain ICA using a regular dodecahedral microphone array



     More details

    Event date: 2008.7

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:NTTコミュニケーション科学基礎研究所   Country:Japan  

  279. On a typological model of braking behavior characteristics



     More details

    Event date: 2008.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:東北工業大学 宮城   Country:Japan  

  280. Multi-modal real-world driving data collection, transcription, and integration using Bayesian network International conference

    Lucas Malta, Pongtep Angkititrakul, Chiyomi Miyajima, and Kazuya Takeda

    IEEE Intelligent Vehicles Symposium 

     More details

    Event date: 2008.6

    Language:English   Presentation type:Poster presentation  


  281. Abrupt steering detection based on the road construction ordinance and vehicle acceleration captured with drive recorders International conference

    Hideomi Amata, Akira Ozaki, Chiyomi Miyajima, Takanori Nishino, and Kazuya Takeda

    2008 IEEE International Conference of Innovative Computing Information and Control (ICICIC2008) 

     More details

    Event date: 2008.6

    Language:English   Presentation type:Oral presentation (general)  


  282. Evaluation framework for distant-talking speech recognition under reverberant environments; Newest part of the CENSREC series International conference

    Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, and Kazuya Takeda

    The 6th edition of the Language Resources and Evaluation Conference (LREC2008)

     More details

    Event date: 2008.5

    Language:English   Presentation type:Poster presentation  


  283. In-car Speech Data Collection along with Various Multimodal Signals International conference

    Akira Ozaki, Sunao Hara, Takashi Kusakawa, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, and Kazuya Takeda

    The 6th edition of the Language Resources and Evaluation Conference (LREC2008)

     More details

    Event date: 2008.5

    Language:English   Presentation type:Poster presentation  


  284. Encoding large array signals into a 3D sound field representation for selective listening point audio based on blind source separation International conference

    Kenta Niwa, Takanori Nishino, and Kazuya Takeda

    IEEE International Conference on Acoustics, Speech, and Signal Processing

     More details

    Event date: 2008.4

    Language:English   Presentation type:Poster presentation  

    Country:United States  

  285. Field test and evaluation of a spoken dialogue system for music retrieval in diverse usage environments



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:筑波大学 茨城   Country:Japan  

  286. Estimation of abrupt steering based on lateral acceleration recorded by in-vehicle drive recorders



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:筑波大学 茨城   Country:Japan  

  287. A new representation of singing-voice F0 trajectories using the phase plane



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北九州市立大学 福岡   Country:Japan  

  288. Development of a vehicle for synchronized recording of driving data and measurement in real driving environments



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北九州市立大学 福岡   Country:Japan  

  289. Recognition of driving characteristics using vehicle acceleration recorded by drive recorders



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北九州市立大学 福岡   Country:Japan  

  290. Word speech recognition combining discriminant-analysis-based acoustic features with discriminative training



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:千葉工業大学 千葉   Country:Japan  

  291. Encoding of large microphone array signals using blind source separation for selective-listening-point sound field reproduction



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:NTT武蔵野R&Dセンター 東京   Country:Japan  

  292. A study of the distance dependence of HRTFs near the head



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:NTT武蔵野R&Dセンター 東京   Country:Japan  

  293. Construction of an HRTF database in sagittal and frontal coordinate systems



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:千葉工業大学 千葉   Country:Japan  

  294. Estimation of abrupt steering based on lateral acceleration recorded by in-vehicle drive recorders



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北九州市 福岡   Country:Japan  

  295. An acoustic space encoding method for selective-listening-point audio in free-viewpoint TV



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:千葉工業大学 千葉   Country:Japan  

  296. Analysis of spoken dialogue data for music retrieval using Bayesian networks



     More details

    Event date: 2008.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:千葉工業大学 千葉   Country:Japan  

  297. Collection and recognition of utterances accompanied by gestures



     More details

    Event date: 2008.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:ホテル暖香園 静岡   Country:Japan  

  298. A music recommendation system that associates texts describing songs with acoustic features



     More details

    Event date: 2008.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:ホテル暖香園 静岡   Country:Japan  

  299. An automatic system for discriminating singing and speaking voices



     More details

    Event date: 2008.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:ホテル暖香園 静岡   Country:Japan  

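  300. Activity report of the Working Group on Evaluation of Noisy Speech Recognition: Environments for individually evaluating factors that affect recognition (2)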



     More details

    Event date: 2007.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:NTけいはんな 京都   Country:Japan  

  301. Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance International conference

    Norihide Kitaoka, Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, and Satoshi Nakamura

    Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU2007), pp. 607-612

     More details

    Event date: 2007.12

    Language:English   Presentation type:Poster presentation  


  302. Multimodal driving data integration for the analysis of driver's responses to hazardous situations International conference

    Lucas Malta, Chiyomi Miyajima, Norihide Kitaoka, Katsunobu Itou, Kazuya Takeda

    International Workshop on Tagging, Mining and Retrieval of Human Related Activity Information 

     More details

    Event date: 2007.11

    Language:English   Presentation type:Poster presentation  


  303. Statistical segmentation and recognition of fingertip trajectories for a gesture interface International conference

    Kazuhiro Morimoto, Chiyomi Miyajima, Norihide Kitaoka, Katsunobu Itou, Kazuya Takeda

    International Conference on Multimodal Interfaces 

     More details

    Event date: 2007.11

    Language:English   Presentation type:Poster presentation  


  304. Statistical segmentation and normalization of fingertip trajectories for a gesture interface



     More details

    Event date: 2007.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名古屋工業大学 愛知   Country:Japan  

  305. A Study of driver's reactions to hazard utilizing speech and brake pedal force

    Lucas Malta, Chiyomi Miyajima, and Kazuya Takeda

     More details

    Event date: 2007.9

    Language:English   Presentation type:Poster presentation  


  306. Development of Selectable viewpoint and listening point system for musical performance International conference

    Kenta Niwa, Takanori Nishino, and Kazuya Takeda

    19th International Congress on Acoustics 

     More details

    Event date: 2007.9

    Language:English   Presentation type:Poster presentation  


  307. Development of small sound equipment with micro-dynamic-type loudspeakers for HRTF measurement International conference

    Yoshihide Hayakawa, Takanori Nishino, and Kazuya Takeda

    19th International Congress on Acoustics 

     More details

    Event date: 2007.9

    Language:English   Presentation type:Poster presentation  


  308. A Stochastic Representation of the Dynamics of Sung Melody International conference

    Yasunori OHISHI, Masataka GOTO, Katsunobu ITOU, Kazuya Takeda


     More details

    Event date: 2007.9

    Language:English   Presentation type:Oral presentation (general)  


  309. A study on the analysis of real-environment speech data from a field test of a spoken dialogue system



     More details

    Event date: 2007.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:山梨大学 山梨   Country:Japan  

  310. A study on the collection and recognition of utterances accompanied by gestures



     More details

    Event date: 2007.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名古屋大学 愛知   Country:Japan  

  311. A method for constructing a free-listening-point sound field for free viewpoint and listening-point TV



     More details

    Event date: 2007.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名古屋大学 愛知   Country:Japan  

  312. A study of a stochastic representation for characterizing the melody and dynamic variation of singing voices


    情報処理学会 音楽情報科学研究会 

     More details

    Event date: 2007.8

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:ロジワールホテル長崎 長崎   Country:Japan  

  313. A virtual button interface using fingertip movements International conference

    Kazuhiro Morimoto, Chiyomi Miyajima, Katsunobu Itou, and Kazuya Takeda

    2007 IEEE International Conference of Machine Learning and Cybernetics 

     More details

    Event date: 2007.8

    Language:English   Presentation type:Oral presentation (general)  


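  314. Production of free viewpoint and listening-point content of a musical performance using blind source separation and head-related transfer functions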



     More details

    Event date: 2007.6

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:北海道大学 北海道   Country:Japan  

  315. Mining potentially hazardous situations in vehicle traffic using driver's reactions International conference

    Lucas Malta, Chiyomi Miyajima, and Kazuya Takeda

    2007 IEEE Intelligent Vehicles Symposium (IV2007)

     More details

    Event date: 2007.6

    Language:English   Presentation type:Oral presentation (general)  


  316. Generation of pedal operation patterns of individual drivers in car-following for personalized cruise control International conference

    Yoshihiro Nishiwaki, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Katsunobu Itou, and Kazuya Takeda

    2007 IEEE Intelligent Vehicles Symposium (IV2007)

     More details

    Event date: 2007.6

    Language:English   Presentation type:Poster presentation  


  317. On-going data collection for driving behavior signal International conference

    Takashi Kusakawa, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Katsunobu Itou, and Kazuya Takeda

    2007 Biennial on DSP for in-Vehicle and Mobile Systems (DSPINCARS2007)

     More details

    Event date: 2007.6

    Language:English   Presentation type:Poster presentation  


  318. HMM-based speech synthesis with fluctuations that follow the training data distribution



     More details

    Event date: 2007.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名城大学 愛知   Country:Japan  

  319. A study of signer differences in continuous fingerspelling recognition



     More details

    Event date: 2007.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名城大学 愛知   Country:Japan  

  320. Content production for a free viewpoint and listening-point system



     More details

    Event date: 2007.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:芝浦工業大学 東京   Country:Japan  

  321. Development and comparison of small sound sources combining small dynamic loudspeakers for HRTF measurement



     More details

    Event date: 2007.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:芝浦工業大学 東京   Country:Japan  

  322. A study of associating music reviews with acoustic features



     More details

    Event date: 2007.3

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:芝浦工業大学 東京   Country:Japan  

  323. A study on building acoustic models adapted to the recognition vocabulary



     More details

    Event date: 2007.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:芝浦工業大学 東京   Country:Japan  

  324. A study on improving the performance of a spoken dialogue system in motorbike riding environments



     More details

    Event date: 2007.3

    Language:Japanese   Presentation type:Poster presentation  

    Venue:芝浦工業大学 東京   Country:Japan  

  325. Statistical segmentation and recognition of motion trajectory signals for a gesture interface



     More details

    Event date: 2007.2

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:伊豆長岡温泉ニュー八景園(静岡)   Country:Japan  

  326. A study on estimating talker and listener positions in a car using binaural room transfer functions


    電子情報通信学会 応用音響研究会 

     More details

    Event date: 2007.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  327. Driver modeling based on driving behavior and its evaluation in driver identification International conference

    Proceedings of the IEEE 

     More details

    Event date: 2007.1

    Language:English   Presentation type:Oral presentation (general)  

  328. Sound source direction estimation based on a Gaussian distribution model using the envelope of interaural level differences



     More details

    Event date: 2007.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  329. Evaluation of a spoken dialogue system for music retrieval



     More details

    Event date: 2007.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  330. A study of a method for building playlist-adapted acoustic models in a music retrieval system



     More details

    Event date: 2007.1

    Language:Japanese   Presentation type:Oral presentation (general)  


  331. Modeling individuality in driving behavior signals using spectral analysis

    小澤晃史,西脇由博,脇田敏裕,宮島千代美,伊藤克亘, 武田一哉


     More details

    Event date: 2007.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  332. Speech collection and evaluation with a spoken dialogue system used on general-purpose PCs


    情報処理学会研究報告(音声言語情報処理研究会) 第8回音声言語シンポジウム 

     More details

    Event date: 2006.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名古屋大学 愛知   Country:Japan  

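  333. Activity report of the Working Group on Evaluation of Noisy Speech Recognition: Environments for individually evaluating factors that affect recognition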

    北岡 教英, 山田 武志, 滝口 哲也, 柘植 覚, 山本 一公, 宮島 千代美, 西浦敬信, 中山 雅人, 傳田 遊亀, 藤本 雅清, 田村 哲嗣, 黒岩 眞吾, 武田 一哉, 中村 哲


     More details

    Event date: 2006.12

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:名古屋大学 愛知   Country:Japan  

  334. Collection of multimodal data in real-world driving International conference

    Takashi Kusakawa, Chiyomi Miyajima, Takanori Nishino, Katsunobu Itou, and Kazuya Takeda

    The 4th joint meeting ASA and ASJ 

     More details

    Event date: 2006.11

    Language:English   Presentation type:Poster presentation  


  335. Free listening-point synthesizing method in a large microphone array using acoustic transfer function International conference

    Mehrdad Panahpour, Yasushi Hirano, Shoji Kajita, Kenji Mase, Takanori Nishino, Kazuya Takeda, and Toshiaki Fujii

    The 4th joint meeting ASA and ASJ 

     More details

    Event date: 2006.11

    Language:English   Presentation type:Poster presentation  


  336. Comparison of visual features for audio-visual speech recognition using the AURORA-2J-AV database International conference

    Takahito Togo, Yukitaka Nimura, Takayuki Kitasaka, Kensaku Mori, Yasuhito Suenaga, Chiyomi Miyajima, and Kazuya Takeda

    ASA&ASJ joint meeting

     More details

    Event date: 2006.11

    Language:English   Presentation type:Poster presentation  


  337. Towards the detection of potentially hazardous situations in vehicle traffic using driver speech and brake pedal International conference

    Lucas Malta, Chiyomi Miyajima, Katsunobu Itou, and Kazuya Takeda

    The 4th joint meeting ASA and ASJ 

     More details

    Event date: 2006.11

    Language:English   Presentation type:Poster presentation  


  338. CENSREC-1-C: Construction of an evaluation framework for voice activity detection in noisy environments

    北岡 教英, 山田 武志, 柘植 覚, 宮島 千代美, 西浦 敬信, 中山 雅人, 傳田 遊亀, 藤本 雅清, 山本 一公, 滝口 哲也, 黒岩 眞吾, 武田 一哉, 中村 哲


     More details

    Event date: 2006.10

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:早稲田大学 東京   Country:Japan  

  339. Estimation of pedal operation amounts in car following with consideration of individuality



     More details

    Event date: 2006.9

    Language:Japanese   Presentation type:Oral presentation (general)  

    Venue:愛知県立大学 愛知   Country:Japan  

  340. A study of source signal separation of musical instrument sounds and relocation of sound images using HRTFs



     More details

    Event date: 2006.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:金沢大学、石川   Country:Japan  

  341. Construction of an evaluation framework for voice activity detection in noisy environments

    北岡 教英, 西浦 敬信, 中山 雅人, 藤本 雅清, 山田 武志, 滝口 哲也, 山本一公, 宮島 千代美, 柘植 覚, 中村 哲, 武田 一哉, 黒岩 眞吾


     More details

    Event date: 2006.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:金沢大学、石川   Country:Japan  

  342. Comparison of visual features for bimodal speech recognition

    東郷 高浩, 二村 幸孝, 北坂 孝幸, 森 健策, 末永 康仁, 宮島 千代美, 武田 一哉


     More details

    Event date: 2006.9

    Language:Japanese   Presentation type:Poster presentation  

    Venue:金沢大学、石川   Country:Japan  

  343. On the human capability and acoustic cues for discriminating the singing and the speaking voices International conference

    Yasunori OHISHI, Masataka GOTO, Katsunobu ITOU, and Kazuya TAKEDA


     More details

    Event date: 2006.8

    Language:English   Presentation type:Oral presentation (general)  


  344. Characterizing in-car Conversational Speech of Different Dialogue Modes International conference

    1st Intl. Conf. on Innovative Computing, Information and Control (ICICIC2006)

     More details

    Event date: 2006.8

    Language:English   Presentation type:Poster presentation  


  345. Analysis of changes in driving behavior signals for the detection of potentially hazardous situations in vehicle traffic

    Lucas Malta, Chiyomi Miyajima, Katsunobu Itou, and Kazuya Takeda

     More details

    Event date: 2006.7

    Language:English   Presentation type:Oral presentation (general)  


  346. Multipoint measuring system for video and sound: 100 camera and microphone system International conference

    Toshiaki Fujii, Kensaku Mori, Kazuya Takeda, Kenji Mase, Masayuki Tanimoto, Yasuhito Suenaga

    Proc. of International Conference on Multimedia and Expo (ICME2006)

     More details

    Event date: 2006.7

    Language:English   Presentation type:Poster presentation  


  347. Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus International conference

    Yasunori OHISHI, Katsunobu ITOU, Kazuya Takeda, and Atsushi FUJII


     More details

    Event date: 2006.5

    Language:English   Presentation type:Oral presentation (general)  



    Tran Huy Dat, Kazuya Takeda, and Fumitada Itakura


     More details

    Event date: 2006.5

    Language:English   Presentation type:Poster presentation  


  349. Cepstral analysis of driving behavioral signals for driver identification International conference

    Yoshihiro Nishiwaki, Koji Ozawa, Toshihiro Wakita, Chiyomi Miyajima, Katsunobu Itou, and Kazuya Takeda


     More details

    Event date: 2006.5

    Language:English   Presentation type:Poster presentation  



    Mehrdad Panahpour Tehrani, Yasushi Hirano, Toshiaki Fujii, Shoji Kajita, Kazuya Takeda, Kenji Mase


     More details

    Event date: 2006.5

    Language:English   Presentation type:Poster presentation  


  351. Development of micro-dodecahedral loudspeaker for measuring head-related transfer functions in the proximal region International conference

    Seiichiro Hosoe, Takanori Nishino, Katsunobu Itoh, Kazuya Takeda


     More details

    Event date: 2006.5

    Language:English   Presentation type:Poster presentation  



    Weifeng Li, Katsunobu Itou, Kazuya Takeda


     More details

    Event date: 2006.5

    Language:English   Presentation type:Poster presentation  


  353. Development and evaluation of a small dodecahedral loudspeaker for measuring HRTFs near the head

    細江 誠一郎,西野 隆典,伊藤 克亘,武田 一哉


     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  354. A music system enabling seamless query-by-humming and title search by voice alone


    情報処理学会 音楽情報科学研究会 

     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  355. A study of a virtual button input interface using fingertip movements


    FIT2006 第5回情報科学技術フォ-ラム 

     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  356. Analysis of discriminative features of singing and read speech using polynomial kernels



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  357. Detection of hazardous situations using driver utterances and driving behavior


    情報処理学会 情処研報 ISSN 0919-6072 

     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  358. Statistical automatic identification of hierarchical word relations using word co-occurrence and syntactic information


    情報処理学会 音声言語情報処理研究会 

     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  359. A study of a spoken-dialogue music retrieval system for online subject experiments



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  360. Identification based on cepstral analysis of driving operation signals

    小澤晃史,西脇由博,脇田敏裕,宮島千代美,伊藤克亘, 武田一哉


     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  361. Sound source direction estimation using statistical models



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  362. A speech retrieval system for power dispatch command telephone calls



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

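  363. A study of a sound source direction estimation model that approximates the feature distribution of interaural level differences with Gaussian distributions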



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  364. A study of source signal separation of musical instrument sounds and relocation of sound images using HRTFs



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  365. Implementation of a system for synchronized recording of in-car dialogue speech, video, driving behavior, and biosignals



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  366. Discrimination between singing and read speech using temporal changes in spectral envelope and fundamental frequency



     More details

    Event date: 2006.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  367. Driver identification using driving behavior International conference

    Toshihiro Wakita, Koji Ozawa, Chiyomi Miyajima, Kazuya Takeda

    Proc. of the 8th International IEEE Conference on Intelligent Transportation Systems (ITSC 05) 

     More details

    Event date: 2005.9

    Language:English   Presentation type:Oral presentation (general)  


  368. Performance Evaluation of H.264 Video Streaming over Inter-Vehicular 802.11 Ad Hoc Networks International conference

    P. Bucciol, E. Masala, N. Kawaguchi, K. Takeda, J.C. De Martin

    Proc. of 16th Annual IEEE International Symposium on Personal Indoor and Mobile Radio Communications (PIMRC)

     More details

    Event date: 2005.9

    Language:English   Presentation type:Oral presentation (general)  


  369. Speaker verification using Gaussian mixture models within changing real car environments International conference

    Xianxian Zhang, John Hansen, Pongtep Angkititrakul, Kazuya Takeda

    presentation at The ninth European Conference on Speech Communication and Technology (INTERSPEECH2005)

     More details

    Event date: 2005.9

    Language:English   Presentation type:Oral presentation (general)  


  370. Data Collection and Evaluation of Speech Recognition for Motorbike Riders International conference

    Hiroshi Tanaka, Hiroshi Fujimura, Chiyomi Miyajima, Takanori Nishino, Katsunobu Itou, Kazuya Takeda

    presentation at The ninth European Conference on Speech Communication and Technology (INTERSPEECH2005)

     More details

    Event date: 2005.9

    Language:English   Presentation type:Oral presentation (general)  


  371. Discrimination between Singing and Speaking Voices International conference

    Yasunori Ohishi, Masataka Goto, Katsunobu Itou, Kazuya Takeda

    presentation at The ninth European Conference on Speech Communication and Technology (INTERSPEECH2005)

     More details

    Event date: 2005.9

    Language:English   Presentation type:Oral presentation (general)  


  372. Subjective and Objective Quality Assessment of Regression-enhanced Speech in Real Car Environments International conference

    Weifeng Li, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura

    presentation at The ninth European Conference on Speech Communication and Technology (INTERSPEECH2005)

     More details

    Event date: 2005.9

    Language:English   Presentation type:Oral presentation (general)  


  373. Modeling of individualities in driving through spectral analysis of behavioral signals International conference

    Koji Ozawa, Toshihiro Wakita, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda

    Proc. of The Eighth International Symposium on Signal Processing and Its Applications, (ISSPA'2005) 

     More details

    Event date: 2005.8

    Language:English   Presentation type:Oral presentation (general)  


  374. Parametric Versus Non-Parametric Models of Driving Behavior Signals for Driver Identification International conference

    Toshihiro Wakita, Kouji Ozawa, Chiyomi Miyajima, Kazuya Takeda

    Audio- and Video-based Biometric Person Authentication 2005 (AVBPA2005) 

     More details

    Event date: 2005.7

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  375. The Sound Wave Ray-Space International conference

    M. P. Tehrani, Y. Hirano, T. Fujii, S. Kajita, K. Takeda, M. Tanimoto, K. Mase

    IEEE Conference on Multimedia and Expo (ICME 2005)

     More details

    Event date: 2005.7

    Language:English   Presentation type:Oral presentation (general)  


  376. Analysis of In-car speech recognition experiments using a large-scale multi-mode dialogue corpus International conference

    Hiroshi Fujimura, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura

    Proc. of International Conference on Acoustics, Speech and Signal Processing (ICASSP2005)

     More details

    Event date: 2005.3

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  377. Two-stage noise spectra estimation and regression based in-car speech recognition using single distant microphone International conference

    Weifeng Li, Katsunobu Itou, Kazuya Takeda

    Proc. of International Conference on Acoustics, Speech and Signal Processing (ICASSP2005)

     More details

    Event date: 2005.3

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  378. Spatial coding based on the extraction of moving sound sources in wavefield synthesis International conference

    Toshiyuki Kimura, Kazuhiko Kakehi, Kazuya Takeda, Fumitada Itakura

    Proc. of International Conference on Acoustics, Speech and Signal Processing (ICASSP2005)

     More details

    Event date: 2005.3

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  379. SNR and local noise power estimations based on Gaussian mixture modeling on the log-power domain International conference

    Kazuya Takeda, Tran Dat, Hiroshi Fujimura, Fumitada Itakura

    Proc. of International Conference on Acoustics, Speech and Signal Processing (ICASSP2005)

     More details

    Event date: 2005.3

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  380. Generalized Gamma modeling of speech and its online estimation for speech enhancement International conference

    Tran Dat, Kazuya Takeda, Fumitada Itakura

    Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP2005)

    Event date: 2005.3

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  381. Modeling individuality in real-world recorded driving behavior signals using cepstral analysis

    西脇 由博,小澤 晃史,宮島 千代美,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  382. Speech enhancement based on a MAP-log spectral magnitude estimation International conference


    Event date: 2005.1

    Language:English   Presentation type:Oral presentation (general)  

  383. A study on discriminating singing voice from read speech using fundamental frequency and spectral envelope information

    大石康智,宮島千代美,西野隆典,伊藤克亘,武田一哉, 後藤真孝

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  384. Construction and evaluation of a speech database recorded while riding a motorcycle

    田中寛,藤村浩司,宮島千代美,西野隆典,伊藤克亘, 武田一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  385. Construction and evaluation of a free-listening-point 3D sound field using head-related transfer functions

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  386. Evaluation of learning effects during long-term use of a spoken dialogue interface

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  387. Speech enhancement based on cumulative distribution function equalization using log-normal distributions modeling in the sub-band power domain International conference


    Event date: 2005.1

    Language:English   Presentation type:Oral presentation (general)  

  388. A theoretical study of wave field synthesis using directional microphones

    木村 敏幸,筧 一彦,武田 一哉,板倉 文忠

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  389. Construction and evaluation of a database of head-related transfer functions near the head

    細江 誠一郎,西野 隆典,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  390. Investigation of the effect of covering the outer ear on sound source localization

    瀧本 まどか,西野 隆典,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  391. In-car speech recognition: Single-channel and multi-channel approaches International conference

    Event date: 2005.1

    Language:English   Presentation type:Oral presentation (general)  

  392. A study on building a spoken dialogue system for large-scale subject experiments

    原 直,勅使河原 三保子,宮島 千代美,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  393. Construction of a corpus for HMM-based continuous fingerspelling recognition and synthesis

    江本 祐太,宮島 千代美,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  394. Corpus-based noise suppression method

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  395. A method for improving dialogue speech recognition performance by stretching and compressing phoneme durations

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  396. Construction of a corpus for Japanese fingerspelling recognition and synthesis

    江本 祐太,宮島 千代美,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  397. A theoretical study of a free-listening-point sound field reproduction system using a microphone array

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  398. Modeling individuality in driving behavior signals

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  399. Investigation and discussion of human discrimination ability, toward building a system that discriminates singing voice from read speech

    大石 康智,後藤 真孝,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

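  400. Discrimination between singing voice and read speech using local and global features

    大石 康智,後藤 真孝,伊藤 克亘,武田 一哉

    Information Processing Society of Japan (IPSJ), Special Interest Group on Music and Computer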

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  401. Speech recognition while riding a motorcycle

    田中 寛,宮島 千代美,西野 隆典,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  402. Construction of a bimodal database for in-car speech recognition

    宮島 千代美,根木 大輔,伊藤 克亘,武田 一哉,佐野 昌己, 二宮 芳樹

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  403. Emotion and personality expressed in the voices of Japanese anime: a phonetic study of vocal stereotypes

    勅使河原 三保子,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  404. Modeling individuality in observed vehicle driving signals

    小澤 晃史,脇田 敏裕,宮島 千代美,伊藤 克亘,武田 一哉

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  405. A study on modeling room transfer functions using cepstral analysis

    Event date: 2005.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  406. Optimizing Regression for in-car Speech Recognition using Multiple Distributed Microphones International conference

    W. Li, K. Takeda and F. Itakura

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004) 


    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  407. Speech enhancement based on magnitude estimation using the Gamma prior International conference

    Tran Huy Dat, Weifang Lee, Kazuya Takeda and Fumitada Itakura

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004) 


    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  408. Analysis of In-car speech recognition experiments using a large-scale multi-mode dialogue corpus International conference

    K. Fujimura, K. Itou, K. Takeda, F. Itakura

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004)

    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  409. CIAIR In-Car Speech Database International conference

    N.Kawaguchi, S.Matsubara, Y.Yamaguchi, K.Takeda and F. Itakura

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004) 


    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  410. Audio-Visual Speaker Localization for Car Navigation Systems International conference

    X.Zhang, K.Takeda, J.Hansen and T.Maeno

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004) 


    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  411. Recent Progress of Open-Source LVCSR Engine Julius and Japanese Model Repository International conference

    T.Kawahara, A.Lee, K.Takeda, K.Itou and K.Shikano

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004) 


    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  412. Speech recognition using synchronization between speech and finger tapping International conference

    H. Bann, C. Miyajima, K. Itou, K. Takeda, F. Itakura

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004)

    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  413. AURORA-2J: Japanese speech data collection for performance evaluation of speech recognition in noise International conference

    Satoshi Nakamura, Kazumasa Yamamoto, Kazuya Takeda, Shingo Kuroiwa, Norihide Kitaoka, Takeshi Yamada, Mitsunori Mizumachi, Takanobu Nishiura, Masakiyo Fujimoto, Akira Saso, Toshiki Endo

    International Conference on Speech and Language Technology/Oriental-COCOSDA 2004

    Event date: 2004.11

    Language:English   Presentation type:Oral presentation (general)  


  414. Biometric Identification Using Driving Behavioral Signals International conference

    Kei Igarashi, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura and Huseyin Abut

    The 2004 IEEE International Conference on Multimedia and Expo (ICME'2004)

    Event date: 2004.1

    Language:English   Presentation type:Oral presentation (general)  


  415. Example-based Spoken Dialogue System with Online Example Augmentation International conference

    W. Li, K. Takeda and F. Itakura

    Proc. of International Conference on Spoken Language Processing, (INTERSPEECH/ICSLP 2004) 


    Event date: 2004.1

    Language:English   Presentation type:Oral presentation (general)  

    Country:Korea, Republic of  

  416. An Advanced Japanese Speech Corpus for In-car Spoken Dialogue Research International conference

    Yuki Irie, Nobuo Kawaguchi, Shigeki Matsubara, Itsuki Kishida, Yukiko Yamaguchi, Kazuya Takeda, Fumitada Itakura, and Yasuyoshi Inagaki

    Proc. of International Coordinating Committee on Speech Databases and Speech I/O System Assessment (O-COCOSDA 2004) 


    Event date: 2004.1

    Language:English   Presentation type:Oral presentation (general)  


  417. Robust SNR estimation of noisy speech based on Gaussian mixtures modeling on log-power domain International conference

    Tran Huy Dat, Kazuya Takeda, Fumitada Itakura

    COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction  


    Event date: 2004.1

    Language:English   Presentation type:Oral presentation (general)  


  418. Measurement of head-related transfer functions using a spark sound source

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  419. An acoustic model training system using data stored in a distributed manner

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  420. Spatial acoustic signal processing for intelligent integration of audio and video

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  421. Evaluation of SNR-dependent acoustic models for in-car dialogue speech

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  422. Subjective effects of a moving sound source reproduction method based on position information

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  423. Estimation of binaural room impulse responses using the image method

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  424. Evaluation of head-related transfer functions estimated from anthropometric features

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  425. Analysis of spoken dialogue system users based on task completion rate

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  426. A study on building a spoken dialogue interface for music retrieval

    Event date: 2004.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  427. Data Collection and Evaluation of AURORA-2 Japanese Corpus International conference

    Satoshi Nakamura, Kazumasa Yamamoto, Kazuya Takeda, Shingo Kuroiwa, Norihide Kitaoka, Takeshi Yamada, Mitsunori Mizumachi, Takanobu Nishiura, Masakiyo Fujimoto, Akira Saso, Toshiki Endo

    IEEE Workshop on Automatic Speech Recognition and Understanding

    Event date: 2003.12

    Language:English   Presentation type:Oral presentation (general)  



    Hideki Banno, Tetsuya Shinde, Kazuya Takeda and Fumitada Itakura

    Proc. of IEEE International Conference on Acoustics Speech and Signal Processing ( 


    Event date: 2003.5

    Language:English   Presentation type:Oral presentation (general)  

    Country:Hong Kong  

  429. Experiments on Recognition of Lavalier Microphone Speech and Whispered Speech in Real World Environments International conference

    K. Tatara, T. Ito, P. Zolfaghari, K. Takeda and F. Itakura

    Event date: 2002.9

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  


    Yoshihide Ban, Hideki Banno, Kazuya Takeda and Fumitada Itakura



    Event date: 2002.5

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  431. Acoustical analysis and Recognition of Whispered Speech International conference

    Taisuke Itoh, Kazuya Takeda and Fumitada Itakura



    Event date: 2002.5

    Language:English   Presentation type:Oral presentation (general)  

    Country:United States  

  432. A Study on Domain Recognition of Spoken Dialogue Systems International conference

    T. Isobe, S. Hayakawa, H. Murao, T. Mizutani, K. Takeda, F. Itakura

    Proc. of 8th European Conference on Speech Communication and Technology

    Event date: 2002.2

    Language:English   Presentation type:Oral presentation (general)  


  433. Integration of Noise Reduction Algorithms for Aurora2 Task International conference

    Takeshi Yamada, Jiro Okada, Kazuya Takeda, Norihide Kitaoka, Masakiyo Fujimoto, Shingo Kuroiwa, Kazumasa Yamamoto, Takanobu Nishiura, Mitsunori Mizumachi, Satoshi Nakamura

    Proc. of 8th European Conference on Speech Communication and Technology

    Event date: 2002.2

    Language:English   Presentation type:Oral presentation (general)  


  434. Recognition of Consonant-Vowel (CV) Units of Speech in a Broadcast News Corpus Using Support Vector Machines International conference

    C. Chandra Sekhar, Kazuya Takeda and Fumitada Itakura

    Proc. of SVM2002, LNCS 2388

    Event date: 2002.1

    Language:English   Presentation type:Oral presentation (general)  


  435. Multidimensional scaling analysis of speaker characteristics for each speaking style

    後藤雅彦, 武田一哉, 板倉文忠

    Event date: 1996.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  436. Extraction of speech-like features through analysis of human-speech-like noise

    小林大祐, 梶田将司, 武田一哉, 板倉文忠

    Event date: 1996.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  437. Text-independent speaker recognition combining low-band and high-band speech information

    早川昭二, 武田一哉, 板倉文忠

    Event date: 1996.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  438. Investigation of Lombard speech uttered under different presented noises and a study of methods for recognizing it

    若尾淳, 武田一哉, 板倉文忠

    Event date: 1996.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  439. A study of a Lombard speech recognition method robust to the presented noise

    若尾淳, 武田一哉, 板倉文忠

    Event date: 1995.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  440. Effect of variation over time in feature parameters on speaker recognition using wideband speech

    早川昭二, 武田一哉, 板倉文忠

    Event date: 1995.1

    Language:Japanese   Presentation type:Oral presentation (general)  

  441. A study of SBCOR analysis based on cross-correlation between two-channel signals

    梶田将司, 武田一哉, 板倉文忠

    Event date: 1995.1

    Language:Japanese   Presentation type:Oral presentation (general)  


Research Project for Joint Research, Competitive Funding, etc. 50

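  1. Academic consulting on learning methods effective for developing qualities and abilities

    2020 - 2021


  2. Practical Data Scientist Training Program (for graduate students and working professionals): Academic consulting on analysis of conditions for non-defective products in the zinc plating manufacturing process

    2020 - 2021


  3. Practical Data Scientist Training Program (for graduate students and working professionals): Academic consulting on data-driven digital marketing

    2020 - 2021


  4. Practical Data Scientist Training Program (for graduate students and working professionals): Academic consulting on logistics data analysis

    2020 - 2021


  5. Prediction of hazards and anomalies in fully autonomous driving

    2019 - 2022

    Japan Science and Technology Agency (JST)  Commissioned research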

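  6. (Major theme) Automatic summarization and generation of production process information (G017); (Middle theme) Automatic generation of manufacturing processes (G018); (Minor theme) Proposal of similar instruction sheets based on data mining of machining history (G020)

    2019 - 2022


  7. (Major theme) Acquisition of cost-competitive precision machining and assembly technology using advanced techniques (G023); (Middle theme) Development of precision hole machining technology with a low-rigidity small robot (G024); (Minor theme) Machining state estimation using sound and vibration data (G027)

    2019 - 2022


  8. Next-generation utilization of accounting information

    2019 - 2020


  9. Data analysis for improving production efficiency in poultry farming

    2019 - 2020


  10. Support for demand forecasting of new car sales volume

    2019 - 2020


  11. Consulting on computational fluid dynamics (CFD)

    2019 - 2020


  12. Environment setup for developing AI (reinforcement learning and inverse reinforcement learning) control functions for various autonomous mobile platforms

    2019 - 2020


  13. Research on learning methods effective for developing qualities and abilities

    2019 - 2020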


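  14. Joint research toward product integration of research on driving behavior modeling and virtual space immersion (continued)



  15. Estimation of risk factors based on driving data



  16. Study toward creating new services using smart meter data



  17. Autonomous driving demonstration experiment



  18. Construction of a data logger system for measurement vehicles

    2018.10 - 2019.3

    Japan Automobile Research Institute (JARI)  Commissioned research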

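  19. Next-generation ubiquitous sensing system

    2018.8 - 2019.3


  20. Research on analysis of time-series data from drivers while driving

    2018.7 - 2019.3


  21. Basic research on video annotation technology integrating speech recognition, image recognition, and related techniques

    2018.5 - 2020.3


  22. Supervision of driving scoring technology development

    2018.5 - 2019.3


  23. Detection of dangerous driving situations and extraction of risk factors

    2018.4 - 2019.3


  24. Research and development project toward establishing safety evaluation technology for automated driving systems

    2018 - 2021

    Japan Automobile Research Institute (JARI)  Commissioned research

  25. Research and development of artificial intelligence that verbalizes the basis for its decisions

    2018 - 2020


  26. Improvement of the data collection system for measurement vehicles

    2018 - 2019


  27. Research and development by the Tokai National Higher Education and Research System on elucidating the mechanisms of harmonization between humans and intelligent machines and creating fundamental technologies for building new social systems based on the value of that harmonization

    2016 - 2020

    Japan Science and Technology Agency (JST)  Commissioned research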

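  28. Graduate Program for Real-World Data Circulation Leaders

    2013 - 2019


  29. Prevention of overtrust based on behavior models

    2009.5 - 2014.9

    Grant type:Competitive


  30. Research and development of a next-generation drive recorder based on sensing and understanding of driving behavior

    2008.8 - 2011.3

    Ministry of Internal Affairs and Communications (MIC)  SCOPE

    Grant type:Competitive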


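  31. Driving behavior prediction based on probabilistic models

    2007.7 - 2008.3


  32. Modeling of automobile driving behavior based on signal processing methods

    2007.4 - 2008.3


  33. Physiological monitoring while driving a real vehicle

    2007.4 - 2008.3


  34. Research on speech recognition technology for motorcycles

    2007.1 - 2008.12


  35. Driving behavior prediction based on probabilistic models

    2006.12 - 2007.3


  36. Study of methods for estimating emotion from driving behavior data

    2006.11 - 2008.2


  37. Modeling of automobile driving behavior based on signal processing methods

    2006.6 - 2007.3


  38. Generation of driving behavior signals using probabilistic models

    2006.5 - 2008.3

  39. Physiological monitoring while driving a real vehicle

    2006.4 - 2007.3


  40. Research on speech recognition technology for motorcycles

    2006.3 - 2007.12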


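  41. Modeling of automobile driving behavior based on signal processing methods

    2005.6 - 2006.3


  42. Research on speech recognition technology for motorcycles

    2005.4 - 2006.12


  43. Driving behavior prediction based on probabilistic models

    2005.4 - 2006.3


  44. Research on methods for creating acoustic models based on speaker adaptation

    2004.10 - 2005.3


  45. Development of driving behavior understanding technology

    2004.10 - 2005.3


  46. Research on car navigation using an in-car speech database recorded during real driving

    2004.8 - 2005.3


  47. Research and development of next-generation communication robots

    2004.8 - 2005.3


  48. Research on methods for creating diverse acoustic models using speech data stored in a distributed manner

    2003.5 - 2006.3

  49. Methods for integrating acoustic and language probabilities in probabilistic speech recognition

    1999.5 - 2001.3

  50. Research on language modeling using signal processing methods

    1997.5 - 1999.3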


KAKENHI (Grants-in-Aid for Scientific Research) 21

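  1. Development of observation and action control methods based on hierarchical machine learning models for multiple real-world autonomous mobile agents

    Grant number:24K22322  2024.6 - 2026.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Challenging Research (Exploratory)

    武田 一哉

    Authorship:Principal investigator 

    Grant amount:¥6500000 ( Direct Cost: ¥5000000 , Indirect Cost:¥1500000 )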

  2. Cross-disciplinary research on the prediction and control of real-world interactions based on evidence and causality

    Grant number:21H04892  2021.4 - 2024.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (A)

    Authorship:Principal investigator 

    Grant amount:¥41210000 ( Direct Cost: ¥31700000 , Indirect Cost:¥9510000 )

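  3. Research and development of artificial intelligence that verbalizes the basis for its decisions

    2018.8 - 2023.3

    NEDO  Realization of a Smart Society by Applying Artificial Intelligence Technologies

    Authorship:Principal investigator  Grant type:Competitive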

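  4. Elucidation of the mechanisms of harmonization between humans and intelligent machines and creation of fundamental technologies for building new social systems based on the value of that harmonization

    2016.6 - 2021.3

    JST  Program on Open Innovation Platform with Enterprises, Research Institute and Academia (OPERA)

    Authorship:Principal investigator  Grant type:Competitive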

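  5. Application of constructive technology assessment based on actor-network theory to automated vehicles

    Grant number:16K12802  2016.4 - 2019.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Exploratory Research

    杉原 桂太、久木田 水生、綾部 広則、 伊勢田 哲治、戸田山 和久、黒田 光太郎 、武田 一哉、二瓶 真理子、森山 花鈴、比屋根 均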



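  6. Estimation of hazardous driving states based on modeling of driver gaze behavior

    Grant number:15K00231  2015.4 - 2018.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (C)

    宮島千代美、武田 一哉

    Grant amount:¥4680000 ( Direct Cost: ¥3600000 , Indirect Cost:¥1080000 )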


  7. Multi-directional simultaneous binaural recording system for creating highly realistic sound fields

    Grant number:15K00235  2015.4 - 2018.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (C)

    西野隆典、武田 一哉

    Authorship:Collaborating Investigator(s) (not designated on Grant-in-Aid) 

    Grant amount:¥4680000 ( Direct Cost: ¥3600000 , Indirect Cost:¥1080000 )


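  8. Research on noise-robust estimation of the distance to the speaker for spoken dialogue systems

    Grant number:26330211  2014.4 - 2018.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (C)

    實廣 貴敏、武田 一哉、鹿野 清宏

    Authorship:Collaborating Investigator(s) (not designated on Grant-in-Aid) 

    Grant amount:¥4810000 ( Direct Cost: ¥3700000 , Indirect Cost:¥1100000 )

    As one way for a spoken dialogue system to grasp its surroundings, we propose a method that estimates the approximate distance from the speaker's mouth to the microphone using a single microphone, by estimating and classifying characteristics of the speech itself. Speech data recorded at each distance is used to train a type of Deep Neural Network (DNN). At run time, short-time speech frames are input to the DNN, which outputs an estimated distance. The estimated distance for one utterance is obtained by majority vote over the estimates for all frames. In a classification experiment on speech recorded at 0.2 m and 5 m, a classification rate of about 85% was obtained.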
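
    The utterance-level decision in the summary above (one distance estimate per frame, then a majority vote over all frames) can be sketched roughly as follows. This is only an illustration: the function name `utterance_distance` and the example frame labels are hypothetical, and the per-frame DNN itself is not shown.

```python
from collections import Counter


def utterance_distance(frame_labels):
    """Return the distance label chosen by majority vote over per-frame estimates.

    frame_labels: per-frame distance classes (for example 0.2 or 5.0, in metres),
    assumed here to come from some frame-level classifier that is not shown.
    """
    if not frame_labels:
        raise ValueError("no frames to vote on")
    counts = Counter(frame_labels)
    # most_common(1) returns [(label, count)] for the most frequent label;
    # on a tie, the label that was seen first wins.
    return counts.most_common(1)[0][0]


# Hypothetical per-frame outputs for one utterance.
frames = [0.2, 0.2, 5.0, 0.2, 5.0]
print(utterance_distance(frames))
```

    Voting over frames trades time resolution for robustness: a few misclassified frames do not change the utterance-level label as long as most frames agree.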

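  9. Creation of privacy-preserving technology for determining personal identity

    Grant number:26540089  2014.4 - 2016.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Challenging Exploratory Research

    松井 知子、武田 一哉、南和弘

    Grant amount:¥3640000 ( Direct Cost: ¥2800000 , Indirect Cost:¥840000 )

    We examined kernel means and showed that, for time-series data such as speech for which the i.i.d. assumption is hard to justify, individuality determination using kernel means does not work. We considered the wild bootstrap method as a remedy, but confirmed that applying it is also problematic because the strength of correlation between speech samples changes dynamically. We also examined individuality determination using DNNs and confirmed experimentally that, when a large amount of speech data several minutes long is not available for each individual, the DNN cannot be trained well and sufficient accuracy is not obtained. For PIPA, new directions will need to be considered in the future.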

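  10. Research on discriminative prediction of sound data using spatio-temporal Gaussian process models

    Grant number:25280067  2013.4 - 2016.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (B)

    松井 知子、マルコフ コンスタンティン 、上野 玄太、ピータース ギャレス W.、アイド ネバット、武田 一哉

    Grant amount:¥16900000 ( Direct Cost: ¥13000000 , Indirect Cost:¥3900000 )

    We developed the Monte Carlo Dynamic Classifier (MCDC) tool, general-purpose software for classification and regression with spatio-temporal Gaussian process models. As its spatio-temporal Gaussian process model, MCDC uses a state-space model in which the state and observation functions are represented by Gaussian processes. For modeling acoustic space with spatio-temporal Gaussian process models, we designed a new kernel function based on the wave equation that takes the phase of sound waves into account, and confirmed experimentally that it yields higher SDR values than the conventional Gaussian kernel. We further applied spatio-temporal Gaussian process models to music genre classification and music mood estimation and confirmed higher performance than conventional methods.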

  11. Research on an interactive auditory augmentation system that transcends space

    Grant number:25280060  2013.4 - 2016.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (B)

    武田一哉、羽田 陽一、西村 竜一、西野 隆典,猿渡 洋

    Authorship:Principal investigator 

    Grant amount:¥17030000 ( Direct Cost: ¥13100000 , Indirect Cost:¥3930000 )


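  12. Analysis of individuality in music similarity based on triangularization of response matrices

    Grant number:25540168  2013.4 - 2015.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Challenging Exploratory Research

    川渕 翔太 、武田一哉

    Authorship:Principal investigator 

    Grant amount:¥3770000 ( Direct Cost: ¥2900000 , Indirect Cost:¥870000 )

    Twenty-seven subjects rated the similarity of 200 song pairs selected from the "Popular Music" collection of the RWC Music Database. In addition to overall similarity, similarity ratings were collected for melody, tempo and rhythm, voice quality, and instrumentation. Analysis of the responses suggested that the decision boundary between "similar" and "not similar" varies widely from person to person. We ran experiments that estimate each individual's subjective inter-song similarity by learning a personalized inter-song distance function (a weighted Euclidean distance). Learning the distance function improved the accuracy of similarity estimation for "voice quality", which demonstrates the effect of individual adaptation using the weighted Euclidean distance.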
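
    The personalized distance function named in the summary above, a weighted Euclidean distance, has the form d(x, y) = sqrt(sum_i w_i (x_i - y_i)^2). A minimal sketch follows; the feature vectors and weights are hypothetical placeholders, and the procedure that learns per-listener weights from similarity ratings is not shown.

```python
import math


def weighted_euclidean(x, y, w):
    """Weighted Euclidean distance between feature vectors x and y.

    w holds one non-negative weight per feature dimension; in a personalized
    setting each listener would have their own w (assumed given here).
    """
    if not (len(x) == len(y) == len(w)):
        raise ValueError("x, y and w must have the same length")
    if any(wi < 0 for wi in w):
        raise ValueError("weights must be non-negative")
    return math.sqrt(sum(wi * (xi - yi) ** 2 for xi, yi, wi in zip(x, y, w)))


# Hypothetical two-dimensional song features and two listeners' weights.
song_a = [0.0, 0.0]
song_b = [3.0, 4.0]
print(weighted_euclidean(song_a, song_b, [1.0, 1.0]))  # both features count equally
print(weighted_euclidean(song_a, song_b, [1.0, 0.0]))  # second feature ignored
```

    Setting a weight to zero removes that feature from the comparison, which is one way two listeners can disagree about whether the same pair of songs is similar.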

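  13. Research on speech enhancement methods based on a corpus of real-world degraded speech

    Grant number:19300060  2007.5 - 2010.3

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (B)

    Authorship:Principal investigator 

    Grant amount:¥18980000 ( Direct Cost: ¥14600000 , Indirect Cost:¥4380000 )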

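  14. Natural spoken dialogue processing technology that achieves speaker and environment adaptability without burdening the user (field test of a speech recognition system)

    2007.4 - 2008.3

    Grants-in-Aid for Scientific Research  70-J-J6802

    武田 一哉

    Authorship:Principal investigator 

  15. Pioneering research on signal processing of automobile driving behavior based on large-scale real-world data

    2006.10 - 2007.3

    Grants-in-Aid for Scientific Research  175IS119-02

    武田 一哉

    Authorship:Principal investigator 

  16. Natural spoken dialogue processing technology that achieves speaker and environment adaptability without burdening the user (field test of a speech recognition system)

    2006.4 - 2007.3

    Grants-in-Aid for Scientific Research  70-J-J6802

    武田 一哉

    Authorship:Principal investigator 

  17. Pioneering research on signal processing of automobile driving behavior based on large-scale real-world data

    2006.4 - 2006.9

    Grants-in-Aid for Scientific Research  175IS119-01

    武田 一哉

    Authorship:Principal investigator 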

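  18. Pioneering research on signal processing of automobile driving behavior based on large-scale real-world data

    2005.10 - 2008.9

    Grants-in-Aid for Scientific Research  175IS119-02

    武田 一哉

    Authorship:Principal investigator 


  19. Pioneering research on signal processing of automobile driving behavior based on large-scale real-world data

    2005.10 - 2006.3

    Grants-in-Aid for Scientific Research  175IS119-01

    武田 一哉

    Authorship:Principal investigator 

  20. Natural spoken dialogue processing technology that achieves speaker and environment adaptability without burdening the user (field test of a speech recognition system)

    2005.4 - 2006.3

    Grants-in-Aid for Scientific Research  70-J-J6802

    武田 一哉

    Authorship:Principal investigator 

  21. Comprehensive development of fundamental software (field test of a speech recognition system)

    2004.4 - 2005.3

    Grants-in-Aid for Scientific Research  70-J-J6802

    武田 一哉

    Authorship:Principal investigator 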


Industrial property rights 32

  1. Acoustic processing device

    近藤多伸, 武田一哉

    Application no:Japanese Patent Application No. 2011-40014  Date applied:2011.2

    Announcement no:Japanese Unexamined Patent Publication No. 2012-178679 

    Country of applicant:Domestic  

  2. Psychological state estimation device and psychological state estimation method

    Application no:2008-270024  Date applied:2008.10

    Country of applicant:Domestic  

  3. Lane change warning device and program

    Application no:2008-195156  Date applied:2008.7

    Country of applicant:Domestic  

  4. Driving assistance device

    Application no:2008-090755  Date applied:2008.1

    Country of applicant:Domestic  

  5. Driving assistance device and driving assistance method

    Application no:2008-090777  Date applied:2008.1

    Country of applicant:Domestic  

  6. Speech analysis method and speech analysis device

    Date applied:2007

    Announcement no:2007-079389 

    Country of applicant:Domestic  

  7. Driver model creation device, driving assistance device, and driving behavior determination device

    Date applied:2007

    Announcement no:2007-272834 

    Country of applicant:Domestic  

  8. Driving behavior estimation device, driving assistance device, and vehicle evaluation system

    Date applied:2007

    Announcement no:2007-176396 

    Country of applicant:Domestic  

  9. Speech Recognition Method

    Kazuya Takeda


    Application no:US Patent:5,425,127  Date applied:2006

    Country of applicant:Domestic  

  10. Speech endpoint detection method and apparatus and continuous speech recognition

    Kazuya Takeda

    Application no:US Patent:5,740,318  Date applied:2006

    Patent/Registration no:US Patent:5,740,318  Date registered:2006 

    Country of applicant:Domestic  

  11. Speech endpoint detection method and apparatus, and continuous speech recognition method and apparatus

    Application no:3004883  Date applied:2006

    Patent/Registration no:3004883  Date registered:2006 

    Country of applicant:Domestic  

  12. Target sound detection method, signal input delay time detection method, and sound signal processing device

    Application no:2003-072451  Date applied:2003

    Patent/Registration no:Japanese Patent No. 3925734  Date registered:2007 

    Country of applicant:Domestic  

  13. Noise suppression method for speech input, noise suppression control program, recording medium, and speech input device

    Application no:2003-140686  Date applied:2003

    Country of applicant:Domestic  

  14. Speech segment detection method, signal input delay time detection method, and speech signal processing device

    Application no:2003-072451  Date applied:2003

    Country of applicant:Domestic  

  15. Model data generation device, model data generation method, and these methods

    Application no:2003-112684  Date applied:2003

    Country of applicant:Domestic  

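  16. Speech recognition method and device for spoken dialogue

    Application no:H08-006590  Date applied:1996

    Patent/Registration no:Japanese Patent No. 3285704  Date registered:2000 

    Country of applicant:Domestic  

  17. Environment adaptation method for speech recognition and environment-adaptive speech recognition device

    Application no:Japanese Unexamined Patent Publication No. H08-211888 

  18. Speaker recognition method and device

    Application no:Japanese Unexamined Patent Publication No. H10-097274 

  19. Speaker verification device and method

    Application no:Japanese Unexamined Patent Publication No. 2000-284798 

    Country of applicant:Domestic  

  20. Service device for automatic notification of call destination information

    Application no:Japanese Unexamined Patent Publication No. H08-181779 

  21. Automatic guidance device for country codes and area codes

    Application no:Japanese Unexamined Patent Publication No. H08-139813 

  22. Speech endpoint detection method and apparatus, and continuous speech recognition method and apparatus

    Application no:Japanese Unexamined Patent Publication No. H08-115093 

  23. Speech recognition method and device for spoken dialogue

    Application no:Japanese Unexamined Patent Publication No. H08-006590 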

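  24. Continuous speech recognition method and device

    Application no:Japanese Unexamined Patent Publication No. H07-261786 

  25. Spontaneous speech recognition method

    Application no:Japanese Unexamined Patent Publication No. H07-036479 

  26. Speech recognition method

    Application no:Japanese Unexamined Patent Publication No. H06-214595 

  27. Speech recognition method

    Application no:Japanese Unexamined Patent Publication No. H06-214594 

  28. Connection method for a telephone exchange and speech recognition method

    Application no:Japanese Unexamined Patent Publication No. H06-214590 

  29. Continuous speech recognition method and device using the method

    Application no:Japanese Unexamined Patent Publication No. H06-180593 

  30. Continuous speech recognition method

    Application no:Japanese Unexamined Patent Publication No. H06-118989 

  31. Speech recognition method

    Application no:Japanese Unexamined Patent Publication No. H06-118987 

  32. Speech recognition method

    Application no:Japanese Unexamined Patent Publication No. H05-027794 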



Teaching Experience (On-campus) 5

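  1. Basic Seminar A


  2. Information and Communication Engineering 1


  3. Mathematics 2 with Exercises


  4. Information and Communication Engineering 1


  5. Mathematics 2 with Exercises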


Teaching Experience (Off-campus) 10

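  1. Speech Information Processing

    2003.4 - 2004.3 (Aichi Prefectural University)

  2. Media Information Processing 1

    2003.4 - 2004.3 (Chukyo University)

  3. Speech Information Processing

    2002.4 - 2003.3 (Aichi Prefectural University)

  4. Media Information Processing 1

    2002.4 - 2003.3 (Chukyo University)

  5. Speech Information Processing

    2001.4 - 2002.3 (Aichi Prefectural University)

  6. Speech Information Processing

    2000.4 - 2001.3 (Aichi Prefectural University)

  7. Advanced Lectures on Speech, Acoustic, and Behavior Signal Processing


  8. Mathematics 2 with Exercises (Laplace analysis, Fourier analysis, partial differential equations)


  9. Mathematics 1 with Exercises (ordinary differential equations, vector analysis)


  10. Information Theory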




Social Contribution 2

  1. Lecture for the Nagoya University Kyoryokukai (cooperation association)


  2. Nagoya University Relay Seminar
