Ning, X., Li, Y., Feng, Z., Liu, J. and Ding, Y., 2024. An efficient multi-scale attention feature fusion network for 4K video frame interpolation. Electronics, 13 (6), 1037.
Full text available as: PDF (open access), Published version.pdf - Published Version, available under License Creative Commons Attribution (2MB).
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material.
DOI: 10.3390/electronics13061037
Abstract
Video frame interpolation aims to generate intermediate frames in a video to showcase finer details. However, most methods are trained and tested only on low-resolution datasets, and 4K video frame interpolation remains little studied. This limitation makes it challenging to handle high-frame-rate video processing in real-world scenarios. In this paper, we propose a 4K video dataset at 120 fps, named UHD4K120FPS, which contains large motion. We also propose a novel framework for the 4K video frame interpolation task, based on a multi-scale pyramid network structure. We introduce self-attention to capture long-range dependencies and self-similarities in pixel space, which overcomes the limitations of convolutional operations. To reduce computational cost, we use a simple mapping-based approach to lighten self-attention, while still allowing for content-aware aggregation weights. Through extensive quantitative and qualitative experiments, we demonstrate the excellent performance of our proposed model on the UHD4K120FPS dataset and illustrate the effectiveness of our method for 4K video frame interpolation. In addition, we evaluate the robustness of the model on low-resolution benchmark datasets.
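The abstract's concern with the computational cost of self-attention at 4K can be made concrete with some rough arithmetic. The sketch below is illustrative only and is not the authors' implementation. It assumes a UHD frame of 3840×2160 pixels (the abstract says only "4K"), a pyramid that halves each dimension per level, and unrestricted pixel-to-pixel attention. The function names are hypothetical.

```python
def attention_pairs(width: int, height: int) -> int:
    """Count the pairwise weights in full self-attention over every pixel.

    With N = width * height positions, full attention needs N * N weights.
    """
    n = width * height
    return n * n


def pyramid_levels(width: int, height: int, levels: int) -> list[tuple[int, int]]:
    """Return (width, height) per level, halving each dimension per level.

    The halving schedule is an assumption made for illustration only.
    """
    return [(width >> i, height >> i) for i in range(levels)]


# Assumed frame size: UHD "4K" is commonly 3840x2160.
UHD_4K = (3840, 2160)

for w, h in pyramid_levels(*UHD_4K, levels=4):
    positions = w * h
    pairs = attention_pairs(w, h)
    print(f"{w}x{h}: {positions:,} positions, {pairs:,} pairwise weights")
```

Under these assumptions, full-resolution attention would need about 6.9 × 10^13 pairwise weights per frame, and each halving of resolution cuts that count by a factor of 16. This is the kind of cost that motivates running at multiple scales and lightening the attention operation.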
| Item Type: | Article |
|---|---|
| ISSN: | 2079-9292 |
| Uncontrolled Keywords: | 4K video frame interpolation; 4K video dataset; self-attention; multi-scale; high frame rate |
| Group: | Faculty of Science & Technology |
| ID Code: | 41459 |
| Deposited By: | Symplectic RT2 |
| Deposited On: | 29 Oct 2025 16:23 |
| Last Modified: | 29 Oct 2025 16:23 |