You (Neil) Zhang  张优 bio photo

You (Neil) Zhang

I am a PhD candidate at the Audio Information Research Lab, University of Rochester, NY, USA. I am fortunate to work with Prof. Zhiyao Duan. Take a look at my CV.

My research focuses on applied machine learning, particularly in speech and audio processing. This includes topics such as audio deepfake detection, personalized spatial audio, and audio-visual rendering and analysis. My research contributions have been showcased at prestigious venues such as ICASSP, SPL, Interspeech, WASPAA, TMM. I received recognition through the Rising Star Program in Signal Processing at ICASSP 2023 and the Graduate Research Fellowship Program from National Institute of Justice.

In my spare time, I am fond of paddle boarding, traveling, and movies.

I am actively seeking faculty positions in academia as well as full-time research scientist roles in industry. Please feel free to reach out if you have opportunities or would like to connect!

News

[2024/09] Our new WildSVDD challenge is ongoing at MIREX 2024 @ ISMIR!

[2024/08] Our SVDD challenge overview paper accepted by SLT 2024!

[2024/08] One paper accepted by MMSP 2024. Congrats, Kyungbok!

[2024/07] We presented our tutorial on “Multimedia Deepfake Detection” at ICME 2024.

[2024/05] I joined Meta Reality Labs Research Audio as a Research Scientist Intern, working with Dr. Ishwarya Ananthabhotla.

[2024/04] I gave a 1-hour tutorial on “Personalizing Spatial Audio: Machine Learning for Personalized Head-Related Transfer Functions (HRTFs) Modeling in Gaming” at 2024 AES International Conference on Audio for Games. [Slides]

[2024/04] I attended NEMISIG 2024, NYC Computer Vision Day 2024 and ICASSP 2024.

[2024/04] Exciting milestone achieved! I have successfully passed my PhD proposal/qualifying exam on “Generalizing Audio Deepfake Detection”. Looking forward to embarking on this fruitful road!

[2024/04] Our inaugural Singing Voice Deepfake Detection (SVDD) 2024 Challenge proposal has been accepted by IEEE Spoken Language Technology Workshop (SLT) 2024! Check out the challenge website here! Registration deadline: June 8th.

[2024/03] I gave a talk at GenAI Spring School and AI Bootcamp on “Audio Deepfake Detection”.

[2024/02] I was delighted to share insights on detecting audio deepfakes in a recent piece by NBC News.

[2023/12] I gave a talk at Nanjing University on “Improving Generalization Ability for Audio Deepfake Detection”. [Slides]

[2023/12] Two papers accepted by ICASSP 2024. (SingFake and Speech Emotion AV Learning) Congrats, Yongyi and Enting!

[2023/11] I sat down with Berkeley Brean at News10NBC to talk about audio deepfake detection. [WHEC-TV Link] [Tweet1] [Tweet2] [Hajim Highlights]

[2023/11] I attended WASPAA, SANE, NRT Annual Meeting, and BASH to present our work on personalized spatial audio. Busy but exciting two weeks!

[2023/10] Received National Institute of Justice’s (NIJ) Graduate Research Fellowship Award. [NIJ Description] [UR News Center] [Hajim Highlights]

[2023/07] One paper accepted by WASPAA 2023. Congrats, Yutong!

[2023/06] Our paper “HRTF Field” was recognized as one of the top 3% of all papers accepted at ICASSP 2023. [Hajim Highlights]

[2023/05] Recognized as one of the ICASSP Rising Stars in Signal Processing. [poster]

[2023/05] One paper accepted by Interspeech 2023. Congrats, Yongyi!

[2023/02] Two papers accepted by ICASSP 2023. (HRTF Field and SAMO) Congrats, Siwen!

[2023/02] Delivered a talk at ISCA SIG-SPSC webinar, titled “Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks”. [slides]

Selected Publications

(For full list, see Publications)

[3] You Zhang, Yongyi Zang, Jiatong Shi, Ryuichi Yamamoto, Tomoki Toda, and Zhiyao Duan, SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge, in Proc. IEEE Spoken Language Technology Workshop (SLT), 2024. [arXiv] [code] [webpage]

[2] You Zhang, Yuxiang Wang, and Zhiyao Duan, HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields, in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023. [DOI] [arXiv] [code] [video] (Recognized as one of the top 3% of all papers accepted at ICASSP 2023)

[1] You Zhang, Fei Jiang, and Zhiyao Duan, One-Class Learning Towards Synthetic Voice Spoofing Detection, IEEE Signal Processing Letters, vol. 28, pp. 937-941, 2021. [DOI] [arXiv] [code] [video] [poster] [slides] [project]

Flag Counter