CVPR2022论文速递（2022.4.15）！共16篇！内含2篇Oral！-CFANZ编程社区

整理：AI算法与图像处理

CVPR2022论文和代码整理：https://github.com/DWCTOD/CVPR2022-Papers-with-Code-Demo

大家好, 最近正在优化每周分享的CVPR论文, 目前考虑按照不同类别去分类,方便不同方向的小伙伴挑选自己感兴趣的论文哈

欢迎大家留言其他想法, 合适的话会采纳哈! 求个三连支持一波哈

Updated on : 15 Apr 2022

total number : 16

CVPR2022 Oral - 2 篇

Joint Forecasting of Panoptic Segmentations with Difference Attention (Oral)

标题：差异关注联合预测全景分割

论文/Paper: http://arxiv.org/pdf/2204.07157
代码/Code: None

Deformable Sprites for Unsupervised Video Decomposition (Oral)

标题：无监督视频分解的可变形Sprites

论文/Paper: http://arxiv.org/pdf/2204.07151
代码/Code: None

目标跟踪/Object Tracking - 2 篇

BEHAVE: Dataset and Method for Tracking Human Object Interactions

标题：BEHAVE：数据集和用于跟踪人类对象交互的方法

论文/Paper: http://arxiv.org/pdf/2204.06950
代码/Code: None

SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos

标题：SoccerNet跟踪：多个对象跟踪数据集和足球视频中的基准

论文/Paper: http://arxiv.org/pdf/2204.06918
代码/Code: None

语义分割/Segmentation - 2 篇

Joint Forecasting of Panoptic Segmentations with Difference Attention (Oral)

标题：差异关注联合预测全景分割

论文/Paper: http://arxiv.org/pdf/2204.07157
代码/Code: None

Cross-Image Relational Knowledge Distillation for Semantic Segmentation

标题：语义分割的交叉图像关系知识蒸馏

论文/Paper: http://arxiv.org/pdf/2204.06986
代码/Code: https://github.com/winycg/cirkd

超分/Super-Resolution - 1 篇

Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling

标题：来回看：视频超分辨率与明确的时间差异建模

论文/Paper: http://arxiv.org/pdf/2204.07114
代码/Code: None

Transformers - - 2 篇

MiniViT: Compressing Vision Transformers with Weight Multiplexing

标题：MinioIVIT：压缩具有重量复用的视觉Transformers

论文/Paper: http://arxiv.org/pdf/2204.07154
代码/Code: https://github.com/microsoft/cream

ViTOL: Vision Transformer for Weakly Supervised Object Localization

标题：Vitol：视觉Transformers，用于弱监督对象本地化

论文/Paper: http://arxiv.org/pdf/2204.06772
代码/Code: https://github.com/Saurav-31/ViTOL

行人重识别/Person Re-Identification - 2 篇

Implicit Sample Extension for Unsupervised Person Re-Identification

标题：无监督行人重识别的隐式示例扩展

论文/Paper: http://arxiv.org/pdf/2204.06892
代码/Code: https://github.com/PaddlePaddle/PaddleClas

Clothes-Changing Person Re-identification with RGB Modality Only

标题：改变的人只用RGB模态行人重识别

论文/Paper: http://arxiv.org/pdf/2204.06890
代码/Code: https://github.com/guxinqian/Simple-CCReID.

其他/Other - 6 篇

What's in your hands? 3D Reconstruction of Generic Objects in Hands

标题：你手中的是什么？三维重建在手中的通用物体

论文/Paper: http://arxiv.org/pdf/2204.07153
代码/Code: None

GIFS: Neural Implicit Function for General Shape Representation

标题：GIFS：一般形状表示的神经隐式功能

论文/Paper: http://arxiv.org/pdf/2204.07126
代码/Code: None

The multi-modal universe of fast-fashion: the Visuelle 2.0 benchmark

标题：fast-fashion的多模态宇宙：VISUELLE 2.0基准

论文/Paper: http://arxiv.org/pdf/2204.06972
代码/Code: None

Semi-Supervised Training to Improve Player and Ball Detection in Soccer

标题：半监督训练，以改善足球运动员和球侦查

论文/Paper: http://arxiv.org/pdf/2204.06859
代码/Code: https://github.com/rvandeghen/SST

Pyramidal Attention for Saliency Detection

标题：显着性检测的金字塔Attention

论文/Paper: http://arxiv.org/pdf/2204.06788
代码/Code: https://github.com/tanveer-hussain/EfficientSOD2

OccAM's Laser: Occlusion-based Attribution Maps for 3D Object Detectors on LiDAR Data

标题：Incam的激光：LIDAR数据上的3D对象探测器的基于遮挡的归属映射

论文/Paper: http://arxiv.org/pdf/2204.06577
代码/Code: https://github.com/dschinagl/occam