Weijia Mao
Ph.D. Candidate, Show Lab
I am a fourth-year Ph.D. candidate in Show Lab @ NUS, advised by Prof. Mike Shou.
My research focuses on vision-language models, multimodal reinforcement learning, and video generation.
Selected topics and representative projects include:
I was fortunate to intern at TikTok, where I worked with Hao Chen and Zhenheng Yang.
I am open to discussions and collaborations, and I am actively seeking industry opportunities. Feel free to reach out.
Show-o: A Unified Multimodal Model Integrating Diffusion and Autoregressive Modeling
Jinheng Xie†, Weijia Mao†, Zechen Bai†, David Junhao Zhang†, Weihao Wang, Kevin Qinghong Lin, Yuchao Gu, Zhijie Chen, Zhenheng Yang, Mike Zheng Shou*.
ICLR, 2025.
Adv-GRPO: Adversarial Reinforcement Learning for Image Quality Optimization
Weijia Mao, Hao Chen*, Zhenheng Yang, Mike Zheng Shou*.
CVPR, 2026.
BitDance: Scaling Autoregressive Generative Models with Binary Tokens
Yuang Ai, Jiaming Han, Shaobin Zhuang, Weijia Mao, Xuefeng Hu, Ziyan Yang, Zhenheng Yang, Huaibo Huang, Xiangyu Yue, Hao Chen.
[project page]
[paper]
[code]
EgoExo-V: Bridging Egocentric and Exocentric Video via NeRF and Diffusion Models
Jia-Wei Liu†, Weijia Mao†, Zhongcong Xu, Jussi Keppo, Mike Zheng Shou.
NeurIPS, 2024.
Long-Context Autoregressive Video Modeling with Next-Frame Prediction
Yuchao Gu, Weijia Mao, Mike Zheng Shou.
arXiv, 2025.
Conference Reviewer: CVPR, ICCV, NeurIPS, ICML, ICLR, etc.
Teaching Assistant: EE4309 Robot Perception