次世代ネットワークにおける超臨場感音響相互通信の実現 -Ultra-realistic acoustic interactive communication on next-generation Internet-

International Conferences

Nhut Minh Ngo and Masashi Unoki, "ROBUST AND RELIABLE AUDIOWATERMARKING BASED ON PHASE CODING," Proc. ICASSP2015, April 2015 (accepted).
Taku Shimizu, Jorge Trevino, Shuichi Sakamoto, and Yoiti Suzuki, "Multi-zone sound field reproduction algorithm based on spherical harmonic analysis," in Proceedings of the 2015 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing, 28AM2-1-1, pp. 70-73, Kuala Lumpur, Malaysia. [2015/02/28] (口頭発表)
Xiao Han, Reda Elbarougy, Masato Akagi, Junfeng Li, Thi Duyen Ngo, and The Duy Bui, "A study on perception of emotional states in multiple languages on Valence-Activation approach," Proc NCSP2015, 28AM2-1-5, pp. 86-89, Kuala Lumpur, Malaysia. [2015/02/28]
Zhi Zhu, Ryota Miyauchi, and Masashi Unoki, "Analysis of Speaker Individual Differences on Modulation Spectrum," Proc. 2015 RISP International workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP15), 28AM1-1-5, pp.18-21, Kuala Lumpur, Malaysia. [2015/02/28]
Masato Akagi, Xiao Han, Reda Elbarougy, Yasuhiro Hamada, and Junfeng Li, "Toward Affective Speech-to-Speech Translation: Strategy for Emotional Speech Recognition and Synthesis in Multiple Languages," Proc. APSIPA2014, Siem Reap, Cambodia. [2014/12/10] (招待講演)
Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee and Chai Wuthiwiwatchai, "An Audio Watermarking Scheme based on Singular-Spectrum Analysis," Proc. 13th International Workshop on Digital-forensics and Watermarking (IWDW2014), Taipei, Taiwan. [2014/10/01-04] (to be appear at Lecture Notes in Computer Science (LNCS) by Springer, 2015).
Nhut Minh Ngo and Masashi Unoki, "Watermarking for Digital Audio based on Adaptive Phase Modulation," Proc. 13th International Workshop on Digital-forensics and Watermarking (IWDW2014), Taipei, Taiwan. [2014/10/01-04] (to be appear at Lecture Notes in Computer Science (LNCS) by Springer, 2015)
Shengbei Wang, Masashi Unoki, and Nam Soo Kim, "Formant Enhancement based Speech Watermarking for Tampering Detection," Proc. Interspeech2014, Singapore, pp. 1366-1370. [2014/09/14-18]
Shota Morita, Masshi Unoki, Xugang Lu, and Masato Akagi, "Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments," Proc. ISCSLP2014, pp. 108-112, Singapore, Sept. 2014. [2014/09/12-14]
Shota Morita, Xugang Lu, and Masashi Unoki, "Signal to noise ratio estimation based on an optimal design of subband voice activity detection," Proc. ISCSLP2014, pp. 560-564, Singapore. [2014/09/12-14]
Shota Morita, Masashi Unoki, Xugang Lu, and Masato Akagi, "Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments," Proc. ISCSLP2014, pp. 108-112, Singapore. [2014/09/12-14]
Reda Elbarougy, Xiao HAN, Masato Akagi, and Junfeng Li, "Toward relaying an affective speech-to-speech translator: Cross-language perception of emotional state represented by emotion dimensions," Proc. O-COCOSDA2014, pp. 48-53, Phuket, Thailand. [2014/09/10]
Wang Shengbei and Masashi Unoki, "Watermarking of Speech Signals Based on Formant Enhancement," Proc. 22nd Europian Signal Processing Conference (EUSIPCO2014), Lisbon, Portugal. [2014/09/01-05]
Akinori Ito, "Assessing the intended enthusiasm of singing voice using energy variance," Proceedings of Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Kitakyushu, pp. 558-561. [2014/08/28] (招待講演)
Yukio Iwaya, Makoto Otani, Takao Tsuchiya and Junfeng Li, "Visual Auditory Display by Remote Rendering Via Computer Network," Proceedings of Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Kitakyushu, pp. 598-601. [2014/08/28] (招待講演)
Kazumichi Yoshida, Takashi Nose, Akinori Ito, "Analysis of English pronunciation of singing voices sung by Japanese speakers," Proceedings of Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Kitakyushu, pp. 554-557. [2014/08/28] (招待講演)
Shuichi Sakamoto, Yoshiki Satou, Jorge Trevino and Yoiti Suzuki, "Representation of indivisual HRTFs using weighting coefficients of SENZI," Proceedings of Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Kitakyushu, pp. 606-609. [2014/08/28] (招待講演)
Xing Yang, Risheng Xia, Zhonghua Fu, Junfeng Li, Yonghong Yan, Shuichi Sakamoto and Yoiti Suzuki, "On the Performance and Robustness of Crosstalk Cancelation with Multiple Loudspeakers," Proceedings of Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Kitakyushu, pp. 618-621. [2014/08/28] (招待講演)
Wang Shengbei and Masashi Unoki, "Hybrid Speech Watermarking based on Formant Enhancement and Cochlear Delay," Proc. IIHMSP2014, pp. 272-275, Kitakyushu, Japan. [2014/08/27-29]
Daisuke Morikawa, "Effect of interaural difference for localization of spatially segregated sound," Proc. IIHMSP2014, pp.602-605, Kita-kyusyu, Japan. [2014/08/27-29] (招待講演)
Masato Akagi, Xiao HAN, Reda Elbarougy, Yasuhiro Hamada, and Junfeng Li, "Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System," Proc. IIHMSP2014, pp. 574-577, Kitakyushu, Japan. [2014/08/27-29] (招待講演)
Masato Akagi, and Reda Elbarougy. "Toward Relaying Emotional State for Speech-To-Speech Translator: Estimation of Emotional State for Synthesizing Speech with Emotion," Proc. ICSV2014. Beijing, China. [2014/07/14-17] (招待講演)
Akinori Ito, Yuki Igarashi, Masashi Ito, Takashi Nose, "Tempo modification of music signal using sinusoidal model and LPC-based residue model," Proceedings of the 21st International Congress on Sound and Vibration, Beijing, pp. 1-8. [2014/07/16] (招待講演)
Masashi Unoki, Jessada Karnjana, Shengbei Wang, Nhut Minh Ngo, and Ryota Miyauchi, "COMPARATIVE EVALUATIONS OF INAUDIBLE AND ROBUST WATERMARKING FOR DIGITAL AUDIO SIGNALS," Proc. The 21st International Congress on Sound and Vibration (ICSV21), Beijing, China. [2014/07/14-17]
Naushin Nower, Yang Liu, and Masashi Unoki, "Restoration of Instantaneous Amplitude and Phase using Kalman filter for Speech Enhancement," Proc. 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2014), pp. 4666-4670, Florence, Italy. [2014/05/04-09]
C. D. Salvador, S. Sakamoto, J. Treviño, and Y. Suzuki, "Embedding distance information in binaural renderings of far field recordings," in Proceedings of the EAA Joint Symposium on Auralization and Ambisonics, ISBN 978-3-7983-2704-7, pp. 133–139, Berlin, Germany. [2014/04/05] (ポスター発表)
Yang Liu and Masashi Unoki, "MTF based Kalman filtering with linear prediction for power envelope restoration," 2013 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS 2013), Okinawa Jichi-Kaikan, Naha, Okinawa, Japan, Proc. ISPACS2013, pp. 198-203, [2013/11/13]
Elbarougy, R. and Akagi, M. "Cross-lingual speech emotion recognition system based on a three-layer model for human perception," Proc. APSIPA2013, Kaohsiung, Taiwan. [2013/11/01]
Kenichiro Miwa and Masashi Unoki, "Study on Method for Estimating F0 of Steady Complex Tone in Noisy Reverberant Environments," The Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Beijing, China, Proc. IIHMSP2013, pp. 456-459, [2013/10/18]
Keizo Kato and Akinori Ito, Acoustic Features and Auditory Impressions of Death Growl and Screaming Voice. Proc. Int. Conf. on Intelligent Information Hiding and Multimedia Signal Processing, Beijing, Oral presentation, pp. 460-463, [2013/10/18]
Yuki Igarashi, Masashi Ito and Akinori Ito, Evaluation of Sinusoidal Modeling for Polyphonic Music Signal. Proc. Int. Conf. on Intelligent Information Hiding and Multimedia Signal Processing, Beijing, Oral presentation, pp. 464-466, [2013/10/18]
Kiho Cho, Soo Hyun Bae, In Kyu Choi, Nam Soo Kim, and Masashi Unoki, "Robust Audio Data Hiding Method Based on Phase of Modulated Complex Lapped Transform," The Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Beijing, China, Proc. IIHMSP2013, pp. 263-266, [2013/10/17]
Yohei Abe and Akinori Ito, Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal. Int. Conf. on Intelligent Information Hiding and Multimedia Signal Processing, Beijing, Oral presentation, pp. 271-274, [2013/10/17]
Jorge TREVINO, Takuma OKAMOTO, Yukio IWAYA, Junfeng LI and Yôiti SUZUKI, "Extrapolation of horizontal Ambisonics data from mainstream stereo sources," 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Beijing, China, oral presentation, Proc. IIH-MSP 2013, pp. 302&8211;305, [2013/10/17]
Akagi, M. and Hisatsune, H. "Admissible range for individualization of head-related transfer function in median plane," Proc. IIHMSP2013, Beijing. [2013/10/17]
Shengbei Wang and Masashi Unoki, "Watermarking method for speech signals based on modifications to LSFs," Proc. IIHMSP 2013, pp. 283-286, Beijing, China, [2013/10]
Masashi Unoki, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, and Nam Soo Kim, "BLIND METHOD OF ESTIMATING SPEECH TRANSMISSION INDEX FROM REVERBERANT SPEECH SIGNALS," 21st European Signal Processing Conference, Marrakech, Morocco, Proc. EUSIPCO2013, CDROM, [2013/09/11]
Phung, T. N., Luong, M. C., and Akagi, M. "A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions," Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain 281-284. [2013/09/02]
Yasuaki KANAI, Shota MORITA, and Masashi UNOKI, "Concurrent processing of voice activity detection and noise reduction using empirical mode decomposition and modulation spectrum analysis," Interspeech, Lyon France, Proc. Interspeech2013, pp. 742-746, [2013/08/26]
Li, J, Chen, F., Akagi, M., and Yan, Y., "Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese," Proc. InterSpeech2013, Lyon, France, 1184-1187. [2013/08/27]
Unoki, M., Ikeda, T., Sasaki, K., Miyauchi, R., Akagi, M., and Kim, N-S. "Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function," Proc. ChinaSIP2013, Beijing, 308-312. [2013/07/08]
Chau, D. T., Li, J., and Akagi, M. "Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio," Proc. ChinaSIP2013, Beijing, 322-326. [2013/07/08]
Li, J., Akagi, M., and Yan, Y. "Objective Japanese intelligibility prediction for noisy speech signals before and after noise-reduction processing," Proc. ChinaSIP2013, Beijing, 352-355. [2013/07/09]
Masashi Unoki, Tomohiro Ikeda, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, and Nam Soo Kim, "BLIND METHOD OF ESTIMATING SPEECH TRANSMISSION INDEX IN ROOM ACOUSTICS BASED ON CONCEPT OF MODULATION TRANSFER FUNCTION," IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP 2013), Beijing, China, Proc. ChinaSIP2013, pp. 308-312, [2013/07/08]
Jorge TREVINO, Takuma OKAMOTO, Yukio IWAYA and Yôiti SUZUKI, "Evaluation of different spatial windows for a multi-channel audio interpolation system," 21st International Congress on Acoustics, Montreal, Canada, poster presentation, POMA 055028, 9 pages, [2013/06/03]
Cesar SALVADOR, Shuichi SAKAMOTO, Jorge TREVINO, Junfeng LI, Yonghong YAN and Yôiti SUZUKI, "Accuracy of head-related transfer functions synthesized with spherical microphone arrays," 21st International Congress on Acoustics, Montreal, Canada, oral presentation, POMA 055085, 9 pages, [2013/06/06]
Shin Jae Kang, Chang Woo Han, Kang Hyun Lee, Nam Soo Kim, and Masashi Unoki, "IMM-BASED FEATURE COMPENSATION ROBUST TO SLOWLY TIME-VARYING NOISE AND REVERBERATION," IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP 2013), Beijing, China, Proc. ChinaSIP2013, pp. 313-317, [2013/07/08]
Hisatsune, H. and Akagi, M. "A Study on individualization of Head-Related Transfer Function in the median plane," Proc. NCSP2013, Hawaii, USA, 161-164. [2013/03/05]
Phung, T. N., Luong, M. C., and Akagi, M. "Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages," Proc. O-COCOSDA2012, Macau, 129-134. [2012/12/12]
Masashi Unoki and Xugang Lu, "Unified denoising and dereverberation method used in restoration of MTF-based power envelope," Proc. ISCSLP2012, pp. 215-219, Hong Kong, Dec. [2012/12]
Yasuaki Kanai and Masashi Unoki, "Study on robust voice activity detection using empirical mode decomposition and modulation spectrum analysis," Proc. ISCSLP2012, pp. 400-404, Hong Kong, [2012/12]
Phung, T. N., Luong, M. C., and Akagi, M., "A concatenative speech synthesis for monosyllabic languages with limited data," Proc. APSIPA2012, Hollywood, USA [2012/12/06]
Nhut Minh Ngo, Shengbei Wang, and Masashi Unoki, "Method of Digital-audio Watermarking Based on Cochlear Delay in Sub-bands," Proc. ITC-CSCC2012, D-W1-03, Sapporo, [2012/07] (CDROM)
Masashi Unoki and Ryota Miyauchi, "Detection of Tampering in Speech Signals with InaudibleWatermarking Technique," Proc. IIHMSP2012, pp. 118-121, Greece, [2012/07]
Nhut Minh NGO, Masashi Unoki, Ryota Miyauchi, and Yôiti Suzuki, "Data-hiding scheme for digital-audio in amplitude modulation domain," Proc. IIHMSP2012, pp. 114-117, Greece, [2012/07]【Best Paper Award】