🔥 Burn Fat Fast. Discover How! 💪

Speech Technology

Logo of telegram channel speechtech — Speech Technology S
Logo of telegram channel speechtech — Speech Technology
Channel address: @speechtech
Categories: Technologies
Language: English
Subscribers: 652

Ratings & Reviews

2.67

3 reviews

Reviews can be left only by registered users. All reviews are moderated by admins.

5 stars

1

4 stars

0

3 stars

0

2 stars

1

1 stars

1


The latest Messages 13

2023-02-03 01:05:34 Respected guys

https://arxiv.org/abs/2301.13341

Neural Target Speech Extraction: An Overview

Katerina Zmolikova, Marc Delcroix, Tsubasa Ochiai, Keisuke Kinoshita, Jan Černocký, Dong Yu

Humans can listen to a target speaker even in challenging acoustic conditions that have noise, reverberation, and interfering speakers. This phenomenon is known as the cocktail-party effect. For decades, researchers have focused on approaching the listening ability of humans. One critical issue is handling interfering speakers because the target and non-target speech signals share similar characteristics, complicating their discrimination. Target speech/speaker extraction (TSE) isolates the speech signal of a target speaker from a mixture of several speakers with or without noises and reverberations using clues that identify the speaker in the mixture. Such clues might be a spatial clue indicating the direction of the target speaker, a video of the speaker's lips, or a pre-recorded enrollment utterance from which their voice characteristics can be derived. TSE is an emerging field of research that has received increased attention in recent years because it offers a practical approach to the cocktail-party problem and involves such aspects of signal processing as audio, visual, array processing, and deep learning. This paper focuses on recent neural-based approaches and presents an in-depth overview of TSE. We guide readers through the different major approaches, emphasizing the similarities among frameworks and discussing potential future directions.
297 views22:05
Open / Comment
2023-02-02 23:53:54 https://zenodo.org/record/7389996
289 views20:53
Open / Comment
2023-02-02 02:08:27 For example



329 views23:08
Open / Comment
2023-02-02 02:05:53 IWSLT has nice lecture channel too

https://www.youtube.com/@sigslt
317 views23:05
Open / Comment
2023-02-02 02:04:01
IWSLT also has many speech translation tracks

https://iwslt.org/2023/#shared-tasks
381 views23:04
Open / Comment
2023-02-01 20:02:17 https://twitter.com/shinjiw_at_cmu/status/1620766409390448641
343 views17:02
Open / Comment
2023-01-31 17:48:19 https://github.com/yangdongchao/InstructTTS
248 views14:48
Open / Comment
2023-01-31 15:00:40 https://twitter.com/chrisdonahuey/status/1620232090066497536
285 views12:00
Open / Comment
2023-01-29 01:26:04
https://sites.google.com/view/merlion-ccs-challenge/

About
The inaugural MERLIon CCS Challenge focuses on developing robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous code-switched, child-directed speech collected via Zoom
333 views22:26
Open / Comment
2023-01-26 00:04:57 speechbrain funded by OVH

https://twitter.com/mirco_ravanelli/status/1618345249675542528
521 views21:04
Open / Comment