I am a PhD student at the Audio Information Research Lab, University of Rochester, working with Prof. Zhiyao Duan. Take a look at my CV.

My research interests lie in speech and audio processing for virtual and augmented reality. My recent work has focused on personalized spatial audio, synthetic speech detection, and audio-visual rendering and analysis. In my spare time, I am fond of movies, paddle boarding, and traveling.

If you are interested in my research, or would like to collaborate with me, you are welcome to email me.

Selected Publications

(For the full list, see Publications)

[1] You Zhang, Fei Jiang, and Zhiyao Duan, One-Class Learning Towards Synthetic Voice Spoofing Detection, IEEE Signal Processing Letters, vol. 28, pp. 937-941, 2021. [link] [arXiv] [code] [video] [poster] [slides] [project]

[2] You Zhang, Yuxiang Wang, and Zhiyao Duan, HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields, 2022. [arXiv] [code]

[3] Sefik Emre Eskimez, You Zhang, and Zhiyao Duan, Speech Driven Talking Face Generation From a Single Image and an Emotion Condition, IEEE Transactions on Multimedia, vol. 24, pp. 3480-3490, 2022. [link] [arXiv] [code] [project]

