Qianyi Wu ("吴潜溢" in Chinese)
Email : wqy9619 [at] gmail.com    
Github     Resume     Scholar     Twitter    

About Me

Qianyi Wu is currently a PhD student at Monash University, Department of Data Science and AI. Qianyi received B.S degree from Special Class for the Gifted Youth at University of Science and Technology of China (USTC) in 2016. And he received M.Sc degree from Graphics and Geometric Computing Laboratory of School of Mathematical Sciences at USTC in 2019, under the supervision of Prof. Juyong Zhang. He spent one year as research intern at Nanyang Technological University, mentored by Prof. Jianfei Cai and Prof. Jianmin Zheng. He worked as a research scientist at SenseTime from 2019 to 2021, working closely with Dr. Wayne Wu.


  • 2022: 9 papers are accepted. (1 NeurIPS, 3 ECCV, 2 CVPR, 2 SIGGRAPH/SIGGRAPH Asia, 1 MICCAI)
  • 07/21: After two wonderful years in Sensetime, I start pursuing my PhD at Monash University! 🐨

  • 2020: 2 papers are accepted. (1 ECCV, 1 NeurIPS)
  • 2019: 1 paper is accepted (1 CVPR)
  • 2018: 1 paper is accepted (1 CVPR spotlight)


Audio-Driven Co-Speech Gesture Image Generation.
Xian Liu, Qianyi Wu, Hang Zhou, Yuanqi Du, Wayne Wu, Dahua Lin, Ziwei Liu
Neural Information Processing Systems (NeurIPS), 2022
[PDF](Coming Soon)
Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers.
Yasheng Sun*, Hang Zhou*, Kaisiyuan Wang, Qianyi Wu, Zhibin Hong, Jingtuo Liu, Errui Ding, Jingdong Wang, Ziwei Liu, Hideki Koike
ACM SIGGRAPH Asia 2022 (Conference Proceedings)
[PDF](Coming Soon)
Object-Compositional Neural Implicit Surfaces.
Qianyi Wu, Xian Liu, Yuedong Chen, Kejie Li, Chuanxia Zheng, Jianfei Cai, Jianmin Zheng
European Conference on Computer Vision (ECCV), 2022
[PDF] [Project Page] [Code]
Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields.
Yuedong Chen, Qianyi Wu, Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai
European Conference on Computer Vision (ECCV), 2022
[PDF] [Project Page] [Code]
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation.
Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, Bolei Zhou
European Conference on Computer Vision (ECCV), 2022

Oral Presentation

[PDF] [Project Page] [Code]
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model.
Xinya Ji, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Wayne Wu, Feng Xu, Xu Cao
ACM SIGGRAPH 2022 (Conference Proceedings)
[PDF] [Code]
Exploring Smoothness and Class-Separation for Semi-supervised Medical Image Segmentation.
Yicheng Wu, Zhonghua Wu, Qianyi Wu, Zongyuan Ge, Jianfei Cai
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2022
[PDF] [Code]
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.
Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[PDF] [Project Page] [Code]
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing.
Yanbo Xu*, Yueqin Yin*, Liming Jiang, Qianyi Wu, Chengyao Zheng, Chen Change Loy, Bo Dai, Wayne Wu,
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[PDF] [Project Page] [Code]
AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection.
Hao Zhu*, Chaoyou Fu*, Qianyi Wu, Wayne Wu, Chen Qian, Ran He
Neural Information Processing Systems (NeurIPS), 2020
[PDF] [Project Page]
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking Face Generation.
Kaisiyuan Wang Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, Chen Qian, Ran He, Yu Qiao, Chen Change Loy
European Conference on Computer Vision (ECCV), 2020
[PDF] [Project Page] [Code]
Disentangled Representation Learning for 3D Face Shape
Zi-Hang Jiang, Qianyi Wu, Keyu Chen, Juyong Zhang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
[PDF] [Code]
Alive Caricature from 2D to 3D.
Qianyi Wu, Juyong Zhang, Yu-Kun Lai, Jianmin Zheng, Jianfei Cai.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018

Spotlight Presentation

[PDF] [Data]

Industrial experience

SenseAR DigitalHuman - Audio-Driven Virtual Human
SenseAR Digital Human is a human-like intelligent multi-modal interactive system. As a primary member, I and my colleagues research and develop several key algorithms for audio-driven virtual human.
[Product] [Press(China Daily)]


  • National Scholarship, USTC, 2018.