Recurrent Transformer Networks for Semantic Correspondence
Seungryong Kim, Stephen Lin, Sangryul Jeon, Dongbo Min, and Kwanghoon Sohn
Neural Information Processing Systems (NeurIPS) 2018
Semantic Correspondence
• Establishing dense correspondences between semantically similar images (different instances within the same object class)
Introduction
Background
Recurrent Transformer Networks
Experimental Results and Discussion
Challenges in Semantic Correspondence
• Photometric/geometric deformations, lack of supervision
Problem Formulation
• Given a pair of images I_s and I_t, infer a field of affine transformations T_i = (A_i, f_i) for each pixel i that maps pixel i to i′ = i + f_i
Intuition of RTNs
Network Configuration
Feature Extraction Networks
• To extract the features D_s and D_t, the input images I_s and I_t are passed through convolutional networks with parameters W_F such that D_s = F(I_s | W_F), using CAT-FCSS, VGGNet (conv4-4), or ResNet (conv4-23) as the backbone
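As a toy illustration of D = F(I | W_F) (not the actual CAT-FCSS/VGG/ResNet backbones from the paper), a single convolution-plus-ReLU layer in NumPy can stand in for the feature extractor; shapes and weights here are hypothetical:

```python
import numpy as np

def conv2d_relu(x, w):
    """Naive 'valid' convolution followed by ReLU.
    x: (H, W, Cin) input image; w: (k, k, Cin, Cout) filter bank."""
    k = w.shape[0]
    h, wd = x.shape[0] - k + 1, x.shape[1] - k + 1
    out = np.zeros((h, wd, w.shape[3]))
    for i in range(h):
        for j in range(wd):
            patch = x[i:i + k, j:j + k, :]
            # contract patch against every output filter at once
            out[i, j] = np.tensordot(patch, w, axes=([0, 1, 2], [0, 1, 2]))
    return np.maximum(out, 0.0)  # ReLU nonlinearity

rng = np.random.default_rng(0)
img = rng.standard_normal((8, 8, 3))     # toy "image" I
w_f = rng.standard_normal((3, 3, 3, 4))  # hypothetical parameters W_F
feat = conv2d_relu(img, w_f)             # toy feature map D
print(feat.shape)  # (6, 6, 4)
```

A real backbone stacks many such layers; here one layer suffices to show the mapping from image to dense feature map.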
Recurrent Geometric Matching Networks
• Constrained correlation volume
C(D_i^s, D^t(T_i)) = ⟨D_i^s, D^t(T_i)⟩ / √( Σ_{i′∈N_i} ⟨D_i^s, D^t(T_{i′})⟩² )
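A minimal NumPy sketch of this normalized correlation, assuming a single source feature vector and a window of K candidate target features (shapes here are hypothetical):

```python
import numpy as np

def constrained_correlation(d_s, d_t_candidates):
    """Correlation of a source feature d_s (C,) against target features
    sampled at K candidate transformations, d_t_candidates (K, C),
    normalized over the candidate set as  c_k / sqrt(sum_k c_k^2)."""
    raw = d_t_candidates @ d_s               # (K,) inner products
    denom = np.sqrt(np.sum(raw ** 2)) + 1e-8  # normalize over candidates
    return raw / denom

rng = np.random.default_rng(0)
d_s = rng.standard_normal(16)
cands = rng.standard_normal((9, 16))  # e.g. a 3x3 window of candidates
corr = constrained_correlation(d_s, cands)
print(corr.shape)  # (9,)
```

The normalization keeps the correlation scores comparable across pixels: the squared scores over the candidate window sum to (approximately) one.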
• Recurrent geometry estimation
T_i^k − T_i^{k−1} = F(C(D_i^s, D^t(T_i^{k−1})) | W_G)
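The recurrent update can be sketched as an iterative residual refinement. In the paper the residual comes from a learned matching network with parameters W_G; here a hypothetical stand-in function plays that role:

```python
import numpy as np

def recurrent_refine(t_init, residual_fn, num_iters=5):
    """Iteratively refine a per-pixel transformation field:
    T^k = T^{k-1} + F(...), where residual_fn stands in for the
    learned residual predictor F(C(D_s, D_t(T^{k-1})) | W_G)."""
    t = t_init.copy()
    history = [t.copy()]
    for _ in range(num_iters):
        t = t + residual_fn(t)  # add the predicted residual
        history.append(t.copy())
    return t, history

# toy stand-in: residuals that shrink toward a fixed point t* = 1
target = np.ones((4, 4, 2))   # hypothetical 4x4 field of 2-d offsets
t0 = np.zeros_like(target)
t_final, hist = recurrent_refine(t0, lambda t: 0.5 * (target - t), num_iters=5)
print(np.max(np.abs(t_final - target)))  # 0.5**5 = 0.03125
```

With this toy residual the error halves per step, mirroring how a few recurrent iterations suffice in practice (the ablation below reports convergence in 3-5 iterations).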
Weakly-supervised Learning
• Intuition: the matching score between the source feature D^s at each pixel i and the target feature D^t(T_i) should be maximized, while keeping the scores of other transformation candidates low
L(D_i^s, D^t) = − Σ_{j∈N_i} p_j* log(p(D_i^s, D^t(T_j)))
where p(D_i^s, D^t(T_j)) is a softmax probability
p(D_i^s, D^t(T_j)) = exp(C(D_i^s, D^t(T_j))) / Σ_{l∈N_i} exp(C(D_i^s, D^t(T_l)))
and p_j* denotes a class label defined as 1 if j = i, 0 otherwise
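Since the label is one-hot at the identity candidate j = i, the loss reduces to a cross-entropy over the softmaxed correlation scores. A NumPy sketch (toy scores, hypothetical candidate indexing):

```python
import numpy as np

def matching_loss(corr_scores, true_idx):
    """Cross-entropy over transformation candidates: softmax of the
    correlation scores with a one-hot label at the correct candidate."""
    c = corr_scores - np.max(corr_scores)  # numerically stable softmax
    p = np.exp(c) / np.sum(np.exp(c))
    return -np.log(p[true_idx])

scores = np.array([0.1, 2.0, 0.3])  # suppose candidate j=1 is the identity T_i
print(matching_loss(scores, 1) < matching_loss(scores, 0))  # True
```

Minimizing this loss pushes the correct candidate's score up and all other candidates' scores down, which is exactly the stated intuition and needs only image pairs (no ground-truth correspondences) as supervision.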
Ablation Study
• RTNs converge in 3-5 iterations
• Accuracy improves up to a 9 × 9 candidate window, but larger window sizes reduce accuracy
Results on TSS Benchmark
Results on PF-WILLOW/PF-PASCAL Benchmarks
• Methods with geometric invariance in the regularization step — geometric matching methods [Rocco'17,'18]: inference uses both the source and target images; W_G is learned w/o W_F, using self- or meta-supervision
• Methods with geometric invariance in the feature extraction step — STN-based methods [Choy'16, Kim'18]: inference is based on only the source or target image; the transformation parameters are learned w/ W_F
• Recurrent Transformer Networks (RTNs) weave the advantages of both existing STN-based methods and geometric matching methods: W_G is learned jointly w/ W_F
[Qualitative comparisons: Source, Target, DCTM, SCNet, CAT-FCSS, Gmat. w/Inl., and RTNs]
• ResNet features exhibit the best performance
• Fine-tuned features show improved accuracy
• Learning the feature extraction networks and geometric matching networks jointly further boosts accuracy
• RTNs achieve state-of-the-art performance
Project webpage: http://diml.yonsei.ac.kr/~srkim/RTNs