Zhu, X., Zhou, J., You, L., Yang, X., Chang, J., Zhang, J. J. and Zeng, D., 2024. DFIE3D: 3D-Aware Disentangled Face Inversion and Editing Via Facial-contrastive Learning. IEEE Transactions on Circuits and Systems for Video Technology. (In Press)
Full text available as:
PDF
24TCSVT-DFIE3D-Accepted-low.pdf - Accepted Version Restricted to Repository staff only until 18 March 2026. Available under License Creative Commons Attribution Non-commercial. 5MB | |
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material. |
DOI: 10.1109/TCSVT.2024.3377121
Abstract
Recent advances in NeRF-based 3D-aware GANs have achieved outstanding performance, especially in the realm of human facial representations, making projection of facial images back into their latent space superior and preferable compared to 2D GAN inversion. However, the direct application of 2DGAN inversion techniques to 3DGAN raises challenges due to potential appearance distortions and geometric inconsistences. To tackle these issues, this work presents a novel integrated framework that combines a composite inversion pipeline in both the SS and W+ spaces and integrates a contrastive-based training strategy, ensuring proficient disentanglement within the module. Moreover, we design a facial semantic manipulation technique based on dimensional analysis of the latent code, which is fully compatible with the proposed 3DGAN inversion pipeline. Comprehensive experimental validations substantiate the effectiveness of the proposed approach in executing 3d-aware face inversion and semantic editing tasks, presenting a robust technological solution for a diverse array of digital human modeling applications in the downstream.
Item Type: | Article |
---|---|
ISSN: | 1051-8215 |
Uncontrolled Keywords: | 3DGAN; GAN Inversion; Face Editing |
Group: | Faculty of Media & Communication |
ID Code: | 40106 |
Deposited By: | Symplectic RT2 |
Deposited On: | 16 Jul 2024 13:00 |
Last Modified: | 16 Jul 2024 13:00 |
Downloads
Downloads per month over past year
Repository Staff Only - |