EtherSpace of Nian

Home Publications Blogs About me

Welcome to my site!
I am a PhD. student @ Westlake University. My research interests include sound event detection ; semi-supervised / self-supervised learning in audio processing.
Feel free to contact me: sao_year@126.com

Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks

2023-06-30

[Accepted by TASLP] We introduce a novel Self-Supervised Learning model for audio processing, named ATST-Frame. Thorough experiments on clip/frame-level downstream tasks are implemented. SOTA performances are obtained on most task.

Authors

Xian Li; Nian Shao; Xiaofei Li

ATST Self-supervised plus RCT Semi-supervised Sound Event Detection: Submission to DCASE 2022 Challenge Task 4

2022-06-01

[DCASE 2022 tech. report] The technical report for our submitted system in the DCASE 2022 challenge, task 4. We integrate the pretrained ATST-Clip with a CRNN model and obtain 4th place in the challenge (single model + using extra dataset).

Authors

Nian Shao; Xian Li; Xiaofei Li

RCT: Random Consistency Training for Sound Event Detection

2022-04-01

[Accepcted by Interspeech 2022] We introduce a semi-supervised learning method for SED and test its performance over DESED dataset. We obtain SOTA performance in all CRNN-based models, surpassing SCT and ICT methods.

Authors

Nian Shao; Erfan Loweimi; Xiaofei Li