ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refin | Gradient Dude

ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement

This paper proposes an improved way to project real images into the StyleGAN latent space (which is required for further image manipulation).

Instead of directly predicting the latent code of a given real image in a single pass, the encoder is tasked with predicting a residual with respect to the current estimate. The initial estimate is simply the average latent code across the dataset. Inversion is performed with multiple forward passes, iteratively feeding the encoder the output of the previous step along with the original input.
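To make the loop concrete, here is a minimal sketch of the residual update described above. It is not the authors' code: `restyle_invert`, `toy_generator`, and `toy_encoder` are hypothetical names, and the toy functions are tiny stand-ins for the real StyleGAN generator and the trained encoder (which, in the actual method, sees the input image together with the current reconstruction). The toy encoder is built to correct only part of the error per pass, so you can watch the estimate improve step by step.

```python
from typing import Callable, List, Tuple

Vector = List[float]


def restyle_invert(
    x: Vector,
    encoder: Callable[[Vector, Vector], Vector],
    generator: Callable[[Vector], Vector],
    w_avg: Vector,
    num_steps: int = 5,
) -> Tuple[Vector, List[Vector]]:
    """Iteratively refine a latent estimate by adding predicted residuals."""
    w = list(w_avg)          # start from the average latent code
    y = generator(w)         # reconstruction for the current estimate
    history = [w]
    for _ in range(num_steps):
        delta = encoder(x, y)                       # residual, not a full latent code
        w = [wi + di for wi, di in zip(w, delta)]   # w_{t+1} = w_t + delta_t
        y = generator(w)                            # fed back to the encoder next step
        history.append(w)
    return w, history


# Hypothetical stand-ins (assumptions, not StyleGAN): a linear "generator"
# and an "encoder" that deliberately fixes only half of the latent error per pass.
def toy_generator(w: Vector) -> Vector:
    return [2.0 * wi + 1.0 for wi in w]


def toy_encoder(x: Vector, y: Vector) -> Vector:
    return [0.25 * (xi - yi) for xi, yi in zip(x, y)]


target_w = [1.0, -2.0]
x = toy_generator(target_w)  # the "real image" we want to invert
w_hat, history = restyle_invert(x, toy_encoder, toy_generator, w_avg=[0.0, 0.0])

# Largest per-coordinate latent error after each step (index 0 is the initial estimate).
errors = [max(abs(a - b) for a, b in zip(w, target_w)) for w in history]
print(errors)
```

In this toy setup the error halves on every pass, which mirrors the idea that a handful of cheap forward passes can get close to the target without per-image optimization.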

Notably, at inference time ReStyle converges after a small number of steps (e.g., < 5), taking less than 0.5 seconds per image, compared with several minutes per image for optimization-based inversion.

The results are impressive! The L2 and LPIPS loss values are comparable to those of optimization-based techniques, while inference is two orders of magnitude faster!

Paper
Code
Colab

Gradient Dude


TL;DR for DL/CV/ML/AI papers from an author of publications at top-tier AI conferences (CVPR, NIPS, ICCV, ECCV). Most ML feeds go for fluff; we go for the real meat. YouTube: youtube.com/c/gradientdude...

