Seaformer++: Squeeze-Enhanced Axial Transformer for Mobile Visual Recognition

SEAFormer++ - An Efficient Transformer Architecture Designed for Mobile Visual Recognition Research Background and Problem Statement In recent years, the field of computer vision has undergone a significant shift from Convolutional Neural Networks (CNNs) to Transformer-based methods. However, despite Vision Transformers demonstrating excellent glob...

Lidar-guided Geometric Pretraining for Vision-centric 3D Object Detection

Lidar-guided Geometric Pretraining for Vision-centric 3D Object Detection

Lidar-Guided Geometric Pretraining Enhances Performance of Vision-Centric 3D Object Detection Background Introduction In recent years, multi-camera 3D object detection has garnered significant attention in the field of autonomous driving. However, vision-based methods still face challenges in precisely extracting geometric information from RGB imag...

An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-training

An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-training Academic Background In recent years, self-supervised learning (SSL) has made significant progress in the field of computer vision. In particular, the successful application of masked image modeling (MIM) pre-training methods on large-sca...

A Memory-Assisted Knowledge Transferring Framework with Curriculum Anticipation for Weakly Supervised Online Activity Detection

Research Background and Significance In recent years, weakly supervised online activity detection (WS-OAD), as an important topic in high-level video understanding, has garnered widespread attention. Its primary goal is to detect ongoing activities frame-by-frame in streaming videos using only inexpensive video-level annotations. This task holds si...

Sample Correlation for Fingerprinting Deep Face Recognition

Report on Academic Paper: “Sample Correlation for Fingerprinting Deep Face Recognition” Background and Research Problem In recent years, the rapid advancements in deep learning technologies have significantly propelled the development of face recognition. However, commercial face recognition models face increasing intellectual property (IP) threats...