Blind Speech Separation Using SRPPHAT Localization and Optimal Beamformer in TwoSpeaker Environments

<?xml version="1.0" encoding="UTF-8"?> <article key="pdf/10005265" mdate="2016-07-03 00:00:00"> <author>Hai Quang Hong Dam and Hai Ho and Minh Hoang Le Ngo</author> <title>Blind Speech Separation Using SRPPHAT Localization and Optimal Beamformer in TwoSpeaker Environments</title> <pages>1529 - 1533</pages> <year>2016</year> <volume>10</volume> <number>8</number> <journal>International Journal of Computer and Information Engineering</journal> <ee>https://publications.waset.org/pdf/10005265</ee> <url>https://publications.waset.org/vol/116</url> <publisher>World Academy of Science, Engineering and Technology</publisher> <abstract>This paper investigates the problem of blind speech separation from the speech mixture of two speakers. A voice activity detector employing the Steered Response Power Phase Transform (SRPPHAT) is presented for detecting the activity information of speech sources and then the desired speech signals are extracted from the speech mixture by using an optimal beamformer. For evaluation, the algorithm effectiveness, a simulation using real speech recordings had been performed in a doubletalk situation where two speakers are active all the time. Evaluations show that the proposed blind speech separation algorithm offers a good interference suppression level whilst maintaining a low distortion level of the desired signal.</abstract> <index>Open Science Index 116, 2016</index> </article>

CINXE.COM

Blind Speech Separation Using SRPPHAT Localization and Optimal Beamformer in TwoSpeaker Environments