Chao-Yuan Wu

I currently work at World Labs as a member of technical staff.

I was a Research Scientist at FAIR, Meta, from 2021 to 2023. I received my CS PhD from UT Austin, advised by Philipp Krähenbühl. I did my M.S. at MLD, CMU, advised by Alex Smola. I received the Facebook Fellowship in 2019.

email / CV / Google Scholar / Twitter

Publications



2024

SAM 2: Segment Anything in Images and Videos
Nikhila Ravi, Valentin Gabeur, Yuan-Ting Hu, Ronghang Hu, Chaitanya Ryali, Tengyu Ma, Haitham Khedr, Roman Rädle, Chloe Rolland, Laura Gustafson, Eric Mintun, Junting Pan, Kalyan Vasudev Alwala, Nicolas Carion, Chao-Yuan Wu, Ross Girshick, Piotr Dollár, Christoph Feichtenhofer
arXiv, 2024
[paper] [website] [demo] [code]



PointInfinity: Resolution-Invariant Point Diffusion Models
Zixuan Huang, Justin Johnson, Shoubhik Debnath, James M. Rehg, Chao-Yuan Wu
CVPR, 2024
[paper] [website] [code]

2023




Multiview Compressive Coding for 3D Reconstruction
Chao-Yuan Wu, Justin Johnson, Jitendra Malik, Christoph Feichtenhofer, Georgia Gkioxari
CVPR, 2023
[paper] [project page] [code]

2022




MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Chao-Yuan Wu*, Yanghao Li*, Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer*
CVPR, 2022 (Oral)
[paper] [code]



Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li*, Chao-Yuan Wu*, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer*
CVPR, 2022
[paper] [code]



A ConvNet for the 2020s
Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie
CVPR, 2022
[paper] [code & models]



Masked Feature Prediction for Self-Supervised Visual Pre-Training
Chen Wei*, Haoqi Fan, Saining Xie, Chao-Yuan Wu, Alan Yuille, Christoph Feichtenhofer*
CVPR, 2022
[paper] [code]



Reversible Vision Transformers
Karttikeya Mangalam, Haoqi Fan, Yanghao Li, Chao-Yuan Wu, Bo Xiong, Christoph Feichtenhofer, Jitendra Malik
CVPR, 2022 (Oral)
[paper] [project page] [code]

2021



Towards Long-Form Video Understanding
Chao-Yuan Wu, Philipp Krähenbühl
CVPR, 2021
[paper] [project page & dataset] [code & models]



Memory Optimization for Deep Networks
Aashaka Shah, Chao-Yuan Wu, Jayashree Mohan, Vijay Chidambaram, Philipp Krähenbühl
ICLR, 2021 (Spotlight)
[paper] [code]

2020




Lossless Image Compression through Super-Resolution
Sheng Cao, Chao-Yuan Wu, Philipp Krähenbühl
arXiv, 2020
[paper] [code & models]



A Multigrid Method for Efficiently Training Video Models
Chao-Yuan Wu, Ross Girshick, Kaiming He, Christoph Feichtenhofer, Philipp Krähenbühl
CVPR, 2020 (Oral)
[paper] [code & models]

2019




Fashion++: Minimal Edits for Outfit Improvement
Wei-Lin Hsiao, Isay Katsman*, Chao-Yuan Wu*, Devi Parikh, Kristen Grauman
ICCV, 2019
[paper] [code & models]
Media coverage: [Facebook AI Blog] [Vogue] [VentureBeat] [WIRED] [deeplearning.ai]



Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu, Christoph Feichtenhofer, Haoqi Fan, Kaiming He, Philipp Krähenbühl, Ross Girshick
CVPR, 2019 (Oral)
[paper] [code & models] [CVPR oral talk]

2018



Video Compression through Image Interpolation
Chao-Yuan Wu, Nayan Singhal, Philipp Krähenbühl
ECCV, 2018
[paper] [details] [code & models]



Compressed Video Action Recognition
Chao-Yuan Wu, Manzil Zaheer, Hexiang Hu, R. Manmatha, Alexander J Smola, Philipp Krähenbühl
CVPR, 2018 (Spotlight)
[paper] [details] [code] [spotlight talk]

2017 or earlier



Sampling Matters in Deep Embedding Learning
Chao-Yuan Wu, R Manmatha, Alexander J Smola, Philipp Krähenbühl
ICCV, 2017
[paper] [details] [code] [code (third-party 1)] [code (third-party 2)]



Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization
Qi Lei, Ian En-Hsu Yen, Chao-Yuan Wu, Inderjit S Dhillon, Pradeep Ravikumar
ICML, 2017


Recurrent Recommender Networks
Chao-Yuan Wu, Amr Ahmed, Alex Beutel, Alexander J Smola, How Jing
WSDM, 2017


Predicting Latent Structured Intents from Shopping Queries
Chao-Yuan Wu, Amr Ahmed, Gowtham Ramani Kumar, Ritendra Datta
WWW, 2017


Joint Training of Ratings and Reviews with Recurrent Recommender Networks
Chao-Yuan Wu, Amr Ahmed, Alex Beutel, Alexander J Smola
ICLR, 2017 (Workshop)


Spectral Methods for Nonparametric Models
Hsiao-Yu Fish Tung, Chao-Yuan Wu, Manzil Zaheer, Alexander J Smola
arXiv preprint, 2017


Explaining reviews and ratings with PACO: Poisson Additive Co-Clustering
Chao-Yuan Wu, Alex Beutel, Amr Ahmed, Alexander J Smola
WWW, 2016 (Poster)


Using Navigation to Improve Recommendations in Real-time
Chao-Yuan Wu, Christopher V Alvino, Alexander J Smola, Justin Basilico
RecSys, 2016


Jointly Modeling Aspects, Ratings and Sentiments for Movie Recommendation (JMARS)
Qiming Diao, Minghui Qiu, Chao-Yuan Wu, Alexander J Smola, Jing Jiang, Chong Wang
KDD, 2014