ICLR 2017 Highlights
Natural Language Processing

Maha ELBAYAD

June 2, 2017

Overview

When: April 24 - 26, 2017

Where: Toulon, France

What: Relevant topics

- Representation learning
- Reinforcement learning
- Metric & kernel learning
- Applications in CV, NLP, Robotics, ...

Statistics

Overall: a total of 491 papers were submitted and the decisions were:

- Oral: 15 (3%)

- Poster: 183 (37.3%)

- Suggested for workshop: 48 (9.8%)

- Rejected: 245 (49.9%)

Top rated papers:

- (9.67) C. Zhang et al. "Understanding deep learning requires rethinking generalization." (Google)

- (9.0) B. Zoph et al. "Neural architecture search with reinforcement learning." (Google)

- (8.75) M. Arjovsky et al. "Towards principled methods for training generative adversarial networks." (Facebook)

sources: prlz77.github.io, medium.com/@karpathy

Selected papers

- Zhilin Yang et al. "Words or Characters? Fine-grained Gating for Reading Comprehension." (Carnegie Mellon)

- Zhouhan Lin et al. "A structured self-attentive sentence embedding." (MILA & IBM)

- David Krueger et al. "Zoneout: Regularizing RNNs by randomly preserving hidden activations." (MILA)

Words or Characters? Fine-grained Gating for Reading Comprehension
Yang, Z., Dhingra, B., Yuan, Y., Hu, J., Cohen, W. W., & Salakhutdinov, R. (2016)

Motivation: to combine word-level and character-level representations, use a fine-grained gating mechanism for a dynamic combination.

Word-level representations:
- Obtained from a lookup table; each unique token is represented as a vector.
- Good at memorizing semantics.
- Require a large amount of data to learn similarities.

Character-level representations:
- Obtained by applying an RNN or CNN to the character sequence; the hidden states are combined to form the representation.
- Suitable for modeling sub-word morphologies.
- Capture similarities by design.
- Ease the modeling of out-of-vocabulary tokens.

Words or Characters? Fine-grained Gating for Reading Comprehension (cont.)

Reading comprehension setting:

Given a document P = (p1, p2, ..., pM) and a query Q = (q1, ..., qN), we want to find the index or span of indices in the document that matches the query Q. A token pi is denoted (wi, Ci) where:

- wi ≡ the word index in the vocabulary

- Ci ≡ the character indices.

In addition, for each word w, we feed the model a vector v encoding its properties, e.g. the concatenation of named entity tags, part-of-speech tags, binned document frequency vectors, and the word-level representation.
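As a rough illustration (not from the slides), the property vector v can be assembled as a concatenation of one-hot tag encodings, a binned-frequency indicator, and the word embedding. The tag-set sizes, dimensions, and helper name below are made up, and PyTorch is used only for convenience:

```python
import torch

# Illustrative sizes only; the slides do not specify the tag sets or dimensions.
NUM_NER_TAGS, NUM_POS_TAGS, NUM_FREQ_BINS, WORD_DIM = 8, 17, 10, 100

def property_vector(ner_id, pos_id, freq_bin, word_emb):
    """Build v = concat(NER tag, POS tag, binned document frequency,
    word-level representation) for a single token."""
    ner = torch.zeros(NUM_NER_TAGS); ner[ner_id] = 1.0        # one-hot named-entity tag
    pos = torch.zeros(NUM_POS_TAGS); pos[pos_id] = 1.0        # one-hot part-of-speech tag
    freq = torch.zeros(NUM_FREQ_BINS); freq[freq_bin] = 1.0   # binned document frequency
    return torch.cat([ner, pos, freq, word_emb])              # 8 + 17 + 10 + 100 = 135 dims

v = property_vector(ner_id=2, pos_id=5, freq_bin=3, word_emb=torch.randn(WORD_DIM))
```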

Words or Characters? Fine-grained Gating for Reading Comprehension (cont.)

Approach:

c = Encode(C)
Ew = Encode(w)
g = σ(Wg v + bg)
h = f(c, w) = g ⊙ c + (1 − g) ⊙ Ew
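A minimal sketch of this gate in PyTorch (framework and dimensions are my choice, not the authors'): c and Ew are assumed to be computed elsewhere by the character and word encoders, and v is the token's property vector (135 dims, matching the sketch above).

```python
import torch
import torch.nn as nn

class FineGrainedGate(nn.Module):
    """Sketch of the fine-grained gate h = g ⊙ c + (1 − g) ⊙ Ew,
    where g = σ(Wg v + bg) is computed from the token's property vector v."""
    def __init__(self, feat_dim, hidden_dim):
        super().__init__()
        self.gate = nn.Linear(feat_dim, hidden_dim)  # Wg, bg

    def forward(self, c, e_w, v):
        # c:   character-level representation (RNN/CNN over the characters)
        # e_w: word-level embedding (lookup table)
        # v:   property vector (NER / POS / frequency / word features)
        g = torch.sigmoid(self.gate(v))              # one gate value per dimension
        return g * c + (1.0 - g) * e_w               # element-wise interpolation

# Usage with made-up dimensions:
gate = FineGrainedGate(feat_dim=135, hidden_dim=100)
h = gate(torch.randn(100), torch.randn(100), torch.randn(135))
```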

Words or Characters? Fine-grained Gating for Reading Comprehension (cont.)

Performance: results on the Twitter dataset (figure not included in the transcript).

A structured self-attentive sentence embedding
Lin, Z., Feng, M., Santos, C. N. D., Yu, M., Xiang, B., Zhou, B., & Bengio, Y. (2017)

Motivation: extract an interpretable sentence embedding as a 2-D matrix, with each row attending to a part of the sentence. Existing approaches are either:

- Universal sentence embeddings: SkipThought, ParagraphVector, ...

- Task-specific: usually the final hidden state of an RNN

A structured self-attentive sentence embedding (cont.)

Approach:

M can suffer from redundancy if the model yields the same summation weights for all rows of A.

Regularization:
1) Maximize the KL divergence between any two rows of A (unstable).
2) Minimize the penalty P = ‖A Aᵀ − I‖²_F
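Sketched below in PyTorch (my choice of framework; d_a and r values are illustrative): the attention matrix A = softmax(Ws2 tanh(Ws1 Hᵀ)) is computed from the LSTM hidden states H, the 2-D embedding is M = A H, and the Frobenius penalty above is returned alongside it so it can be added to the task loss.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class StructuredSelfAttention(nn.Module):
    """Sketch: A = softmax(Ws2 tanh(Ws1 Hᵀ)), M = A H, with the penalty
    P = ‖A Aᵀ − I‖²_F discouraging redundant attention rows."""
    def __init__(self, hidden_dim, d_a=350, r=30):
        super().__init__()
        self.ws1 = nn.Linear(hidden_dim, d_a, bias=False)  # Ws1
        self.ws2 = nn.Linear(d_a, r, bias=False)           # Ws2

    def forward(self, h):
        # h: (n, hidden_dim) hidden states of a (bi)LSTM over the sentence
        a = F.softmax(self.ws2(torch.tanh(self.ws1(h))).t(), dim=1)  # (r, n) attention
        m = a @ h                                                    # (r, hidden_dim) embedding M
        penalty = torch.norm(a @ a.t() - torch.eye(a.size(0)), p='fro') ** 2
        return m, penalty

# Usage: a 40-token sentence encoded by a bi-LSTM with 600-dim states;
# the penalty is added to the downstream classification loss.
attn = StructuredSelfAttention(hidden_dim=600)
m, p = attn(torch.randn(40, 600))
```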

Model for sentiment analysis.

A structured self-attentive sentence embedding (cont.)

Experiments:
- Sentiment analysis (Yelp): take the review as input and predict the associated number of stars.
- Author profiling (Age): take a tweet as input and predict the age range of the author.

A structured self-attentive sentence embedding (cont.)

Experiments: Sentiment analysis

A structured self-attentive sentence embedding (cont.)

Advantages:

The final sentence embedding M has direct access to all hidden states, so the LSTM is allowed to focus on shorter-term context; longer-term dependencies are managed by the attention mechanism.

Concerns

The experiments do not fully support the claim that this model will improve end-task performance over more standard attention mechanisms.

Zoneout: Regularizing RNNs by randomly preserving hidden activations
Krueger, D., Maharaj, T., Kramar, J., Pezeshki, M., Ballas, N., & Courville, A. (2016)

Motivation & Approach:

Zoneout stochastically forces some hidden units to maintain their previous values. Like dropout, zoneout uses random noise to train a pseudo-ensemble, improving generalization.

Comparison to recurrent dropout
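The comparison figure is not included in the transcript; roughly, zoneout mixes the previous state h(t−1) back in per unit, whereas recurrent dropout drops units of the candidate update instead. A minimal sketch of the zoneout operation (PyTorch, hypothetical helper name, rate z = 0.15 chosen arbitrarily):

```python
import torch

def zoneout(h_prev, h_new, z=0.15, training=True):
    """Zoneout on a hidden state: with probability z each unit keeps its
    previous value instead of the new one. At evaluation time the two
    states are mixed deterministically by their expectation."""
    if training:
        d = (torch.rand_like(h_new) < z).float()   # per-unit Bernoulli(z) mask
        return d * h_prev + (1.0 - d) * h_new      # masked units "zone out" to h(t-1)
    return z * h_prev + (1.0 - z) * h_new          # expectation at test time

# Inside an RNN loop (sketch): h = zoneout(h, cell(x_t, h), z=0.15, training=True)
```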

Zoneout: Regularizing RNNs by randomly preserving hidden activations (cont.)

Performance improvement

Thank you!