Artificial Intelligence-Discipline-FmRead Academic Frontier

Object Detection in 360° Images using FOV IoU 360° cameras have gained widespread use in various fields such as virtual reality, autonomous driving, and security monitoring in recent years. With the increase in 360° image data, the demand for 360° image recognition tasks, especially object detection, is continuously growing. Due to the shortcomings...

dvmark: a deep multiscale framework for video watermarking

DVMark: A Multi-Scale Deep Learning Framework for Video Watermarking Video watermarking technology achieves data hiding by embedding information into the cover video. The DVMark model proposed in this paper is a multi-scale video watermarking solution based on deep learning that boasts high robustness and practicality, capable of resisting various ...

Stacked Deconvolutional Network for Semantic Segmentation Introduction Semantic segmentation is a critical task in the field of computer vision, aiming to classify each pixel in an image and predict its category. However, existing Fully Convolutional Networks (FCNs) have limitations in handling spatial resolution, often leading to problems such as ...

FP-AGE: Leveraging Face Parsing Attention for Facial Age Estimation in the Wild

FP-Age: Face Parsing Attention Mechanism for Facial Age Estimation in the Wild Research Background Age estimation on facial images is a significant computer vision task with extensive applications in forensics, security, health welfare, and social media. However, due to diverse factors such as head pose, facial expressions, and occlusions, the perf...

TGFuse: An Infrared and Visible Image Fusion Approach Based on Transformer and Generative Adversarial Network

TGFuse: A Transformer and Generative Adversarial Network-Based Method for Infrared and Visible Image Fusion Background Introduction With the development of imaging devices and analysis methods, multimodal visual data is rapidly emerging, with many practical applications. In these applications, image fusion plays a significant role in helping the hu...

Unsupervised Temporal Correspondence Learning for Unified Video Object Removal

Unsupervised Temporal Consistency Learning for Consistent Video Object Removal Background and Motivation In the fields of video editing and restoration, Video Object Removal is an essential task with the goal of erasing target objects throughout an entire video, filling the gaps with plausible content. Existing solutions are mainly divided into two...