Speech Technology

Channel address:

Categories: Technologies

Language: English

Subscribers: 652

▲ Vote (1)

Ratings & Reviews

2.67

3 reviews

Reviews can be left only by registered users. All reviews are moderated by admins.

5 stars

4 stars

3 stars

2 stars

1 stars

The latest Messages 13

2023-02-03 01:05:34 Respected guys

https://arxiv.org/abs/2301.13341

Neural Target Speech Extraction: An Overview

Katerina Zmolikova, Marc Delcroix, Tsubasa Ochiai, Keisuke Kinoshita, Jan Černocký, Dong Yu

Humans can listen to a target speaker even in challenging acoustic conditions that have noise, reverberation, and interfering speakers. This phenomenon is known as the cocktail-party effect. For decades, researchers have focused on approaching the listening ability of humans. One critical issue is handling interfering speakers because the target and non-target speech signals share similar characteristics, complicating their discrimination. Target speech/speaker extraction (TSE) isolates the speech signal of a target speaker from a mixture of several speakers with or without noises and reverberations using clues that identify the speaker in the mixture. Such clues might be a spatial clue indicating the direction of the target speaker, a video of the speaker's lips, or a pre-recorded enrollment utterance from which their voice characteristics can be derived. TSE is an emerging field of research that has received increased attention in recent years because it offers a practical approach to the cocktail-party problem and involves such aspects of signal processing as audio, visual, array processing, and deep learning. This paper focuses on recent neural-based approaches and presents an in-depth overview of TSE. We guide readers through the different major approaches, emphasizing the similarities among frameworks and discussing potential future directions.

297 views22:05

Open / Comment

2023-02-02 23:53:54 https://zenodo.org/record/7389996

289 views20:53

Open / Comment

2023-02-02 02:08:27 For example

329 views23:08

Open / Comment

2023-02-02 02:05:53 IWSLT has nice lecture channel too

https://www.youtube.com/@sigslt

317 views23:05

Open / Comment

2023-02-02 02:04:01

IWSLT also has many speech translation tracks

https://iwslt.org/2023/#shared-tasks

381 views23:04

Open / Comment

2023-02-01 20:02:17 https://twitter.com/shinjiw_at_cmu/status/1620766409390448641

343 views17:02

Open / Comment

2023-01-31 17:48:19 https://github.com/yangdongchao/InstructTTS

248 views14:48

Open / Comment

2023-01-31 15:00:40 https://twitter.com/chrisdonahuey/status/1620232090066497536

285 views12:00

Open / Comment

2023-01-29 01:26:04

https://sites.google.com/view/merlion-ccs-challenge/

About
The inaugural MERLIon CCS Challenge focuses on developing robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous code-switched, child-directed speech collected via Zoom

333 views22:26

Open / Comment

2023-01-26 00:04:57 speechbrain funded by OVH

https://twitter.com/mirco_ravanelli/status/1618345249675542528

521 views21:04

Open / Comment

Speech Technology

Ratings & Reviews

The latest Messages 13

Popular Channels

Related Chats

Popular Channels

Login