Active Content-Based Crowdsourcing Task Selection

25
Active Content-Based Crowdsourcing Task Selection Piyush Bansal, Carsten Eickhoff, Thomas Hofmann ETH Zurich 1

Transcript of Active Content-Based Crowdsourcing Task Selection

Page 1: Active Content-Based Crowdsourcing Task Selection

Active Content-Based Crowdsourcing Task

Selection

Piyush Bansal, Carsten Eickhoff, Thomas HofmannETH Zurich

1

Page 2: Active Content-Based Crowdsourcing Task Selection

Outline● Past work

○ Exploiting Document content for vote aggregation

● Ongoing extensions○ Crowdsourcing in extreme budget constraints.○ Information theoretic approaches○ Experiments and results○ Conclusion

2

Page 3: Active Content-Based Crowdsourcing Task Selection

State of the Art● Crowdsourced relevance assessment cheap and effective

● Quality control via redundancy yields strong performance

● Untapped source of information: document content

● Key idea: Locality of relevance

Davtyan et al. 2015: Exploiting Document Content for Efficient Aggregation of Crowdsourcing Votes

3

Page 4: Active Content-Based Crowdsourcing Task Selection

Clustering Hypothesis for relevance assessment

4

Page 5: Active Content-Based Crowdsourcing Task Selection

Methods ● (informal) Problem statement: Given a set of relevance assessments,

how accurately can we infer the relevance of unjudged Web pages?

○ Solution ideas: ■ Assign same relevance assessment label as nearest neighbor.■ Borrow relevance assessments from <n> nearest neighbors and

then assign the majority label.■ Smooth expected relevance across similarity space (KDE, GPs)

○ Baseline:■ Majority Voting for label aggregation, and coin toss for unjudged

Web pages.

5

Page 6: Active Content-Based Crowdsourcing Task Selection

Davtyan et. al. - Results

6

Page 7: Active Content-Based Crowdsourcing Task Selection

Motivation for our workConsider the task of search relevance assessment

● Extremely budget-constrained scenario

● Can only ask humans to rate a few Web pages per query

● In previous figure: Number of votes < 1

7

Page 8: Active Content-Based Crowdsourcing Task Selection

A Generic Model of Crowdsourcing

8

Page 9: Active Content-Based Crowdsourcing Task Selection

A Generic Model of Crowdsourcing

9

Page 10: Active Content-Based Crowdsourcing Task Selection

A Generic Model of Crowdsourcing

Difallah et al. 2013: Pick-a-crowd,

Nushi et al. 2015: Crowd Access Path Optimization10

Page 11: Active Content-Based Crowdsourcing Task Selection

A Generic Model of Crowdsourcing

11

Page 12: Active Content-Based Crowdsourcing Task Selection

A Generic Model of Crowdsourcing

12

Kazai et al. 2011: Worker types and personality traits in crowdsourcing relevance labels

Davtyan et al. 2015: Exploiting Document Content for Efficient Aggregation of Crowdsourcing Votes

Page 13: Active Content-Based Crowdsourcing Task Selection

A Generic Model of Crowdsourcing

13

Page 14: Active Content-Based Crowdsourcing Task Selection

Preliminaries● RequestVote

○ Sample radom vote from crowd● AggregateVotes

○ Gaussian Processes (GP) for inferring relevance labels for unjudged documents.

○ Described by mean function (here: constant), ○ and covariance function (here: linear covariance).

14

Page 15: Active Content-Based Crowdsourcing Task Selection

PickDocument● What subset of documents to select for labeling?

○ Typical Active learning problem○ Focus on optimal data acquisition○ Baseline: Random sampling

● Select points that the classifier is most uncertain about○ uncertainty based sampling.

15

Page 16: Active Content-Based Crowdsourcing Task Selection

Solution ● Variance-based sampling:

○ Proxy for “uncertainty”, as entropy is a measure of uncertainty○ Variance-based sampling as approximation to max entropy sampling. ○ In Gaussian processes, the posterior variance does not depend on

the actual observed values of random variables.

16

Page 17: Active Content-Based Crowdsourcing Task Selection

Solution● Selecting points that maximise variance is NP complete2

● However, this criterion is “submodular"○ Submodularity (informally): In mathematics, a submodular set function (also

known as a submodular function) is a set function whose value, informally, has the property that the difference in the incremental value of the function that a single element makes when added to an input set decreases as the size of the input set increases.

○ However, due to Nemhauser (1978), an approximate solution (1 - 1/e) OPT to this is achieved via a greedy algorithm.

2 Krause et al. 2008: Near-optimal sensor placements in Gaussian processes

3 Nemhauser et al. 1978: An analysis of approximations for maximizing submodular set functions17

Page 18: Active Content-Based Crowdsourcing Task Selection

Algorithm: Variance based sampling

18

Page 19: Active Content-Based Crowdsourcing Task Selection

Mutual Information based sampling● Variance-based sampling is only concerned with reducing

uncertainty at sampled points. ● We care about system-wide uncertainty. ● Maximise Mutual Information b/w selected documents and

rest of space.

● Equivalent to maximally minimising the entropy between selected documents, and the rest of space (D\A).

19

Page 20: Active Content-Based Crowdsourcing Task Selection

Algorithm: MI based sampling

20

Page 21: Active Content-Based Crowdsourcing Task Selection

Experiments

● TREC Crowdsourcing Track 2011 data

● 30 (28) topics

● ~100 documents (ClueWeb’09) to be judged per topic

● ~15 historic votes per query-document pair

● Project documents in 100D doc2vec space

21

Page 22: Active Content-Based Crowdsourcing Task Selection

Results - on TREC2011 CrowdSourcing Dataset.

22

Page 23: Active Content-Based Crowdsourcing Task Selection

Qualitative Analysis

23

Page 24: Active Content-Based Crowdsourcing Task Selection

Conclusions● Active Learning for Crowdsourcing Vote Sampling

● Two information-theoretic criteria○ Variance○ Mutual information

● Saves up to 25% budget at constant quality● Can be computed efficiently (greedy)● Does not depend on sampled observations

● In the future: application to other modalities (images, videos)

24

Page 25: Active Content-Based Crowdsourcing Task Selection

Thank you!

Questions?

25