Leveraging Textual Features for Best Answer Prediction in Community-based Question Answering
-
Upload
george-gkotsis -
Category
Internet
-
view
287 -
download
0
Transcript of Leveraging Textual Features for Best Answer Prediction in Community-based Question Answering
GEORGE GKOTSIS 1 , MARIA LIAKATA 2 , CARLOS PEDRINACI 3 , JOHN DOMINGUE 3
Leveraging Textual Features for Best Answer Prediction in
Community-based Question Answering
1King’s College London2Department of Computer Science, University of Warwick3Knowledge Media Institute, The Open University
ICCSS 2015
Questions on social networking sites
8-11June 2015
Recommendations &opinions
Authoritative responses
Expert & Empirical knowledge
ICCSS 2015
Reputation based Answer Rating based
8-11June 2015
“…we observe significant assortativity in the reputations of co-answerers, relationships between reputation and answer speed, and that the probability of an answer being chosen as the best one strongly depends on temporal characteristics of answer arrivals.”
Ashton Anderson, Daniel Huttenlocher, Jon Kleinberg, Jure Leskovec
Discovering Value from Community Activity on Focused Question Answering Sites: A Case Study of Stack Overflow.
KDD 2012
“When available, scoring (or rating) features improve prediction results significantly, which demonstrates the value of community feedback and reputation for identifying valuable answers.”
Grégoire Burel, Yulan He, Harith Alani.
Automatic Identification of Best Answers in Online Enquiry
CommunitiesESWC 2012
State of the art solutions
ICCSS 2015
Best answer prediction in Social Q&A
8-11June 2015
Binary classification problem
Is it solved? Yes, partially
Current solutions depend on:
Answer Ratings
• Score, #comments
Knowledge is Future & Unknown
User Ratings
• User Reputation• UpVotes etc• Preferential
attachment
Knowledge is Past & Not always available
ICCSS 2015
State of the art solutionsSummary
8-11June 2015Our solution
Linguistic User Ratings Answer ratings0.00%
10.00%
20.00%
30.00%
40.00%
50.00%
60.00%
70.00%
80.00%
90.00%
100.00%
Average Precision
ICCSS 2015
StackExchange network
8-11June 2015
SE “is all about getting answers, it’s not a
discussion forum, there’s no chit-chat”
123 Q&A sites5,622,330 users9.5 million questions16.3 million answers9.3 million visits per day
20 June 2014:
ICCSS 2015 8-11June 2015
StackOver-flow91%
The Rest9%
stackoverflow0
1,000,000
2,000,000
3,000,000
4,000,000
5,000,000
6,000,000
7,000,000
8,000,000
3,375,817
3,795,276
Non Accepted Answers
Accepted Answers
September 2013 dumpQuestions with Accepted Answers
ICCSS 2015
Shallow Linguistic features
8-11June 2015
Long history, coming from studies on readability1. Average number of characters per word2. Average number of words per sentence3. Number of words in the longest sentence4. Answer length5. Log Likehood:
Pitler &Nenkova, 2008
ICCSS 2015
Shallow features: Observations
8-11June 2015
Accepted answers tend to be: Longer Differ more from the community vocabulary Contain shorter words Have longer longest sentences Have more words per sentence
But how good are shallow features?
ICCSS 2015
But how good are shallow features?
8-11June 2015
58% macro precision (our baseline)
Possible reasons1. Evolution of language characteristics
Language becomes more eloquent
2. Variance is huge3. Universal classifier looks unreachable, e.g.:
SuperUser average length is 577 Skeptics average length is 2,154
Bad
Good
ICCSS 2015
Objectives
8-11June 2015
Build a classifier which is:
1. Based on linguistic features solely2. Robust
Performs equally well to other classifiers that use user ratings (past knowledge) or answer ratings (future knowledge)
3. Universal Same classifier applicable to as many SE websites
possible (domain agnostic)
ICCSS 2015
Feature discretisationExample for Length
8-11June 2015
Group by question
Question Id
1
5
Answer Id
6
7
Length
2 200
3 150
4 250
150
100
Sort by Length in descending order
Rank
LengthD
1
2
3
1
2
ICCSS 2015
Feature discretisation
8-11June 2015
Category Name Information Gain
Linguistic
Length 0.0226
LongestSentence 0.0121
LL 0.0053
WordsPerSentence 0.0048
CharactersPerWord
0.0052
Linguistic Discretisation
LengthD 0.2168
LongestSentenceD 0.1750
LLD 0.1180
WordsPerSentenceD
0.1404
CharactersPerWordD
0.1162
20x increase
ICCSS 2015
User and answer rating features
8-11June 2015
Category Name
Other
Age
CreationDateD
AnswerCount
User Rating
UserReputation
UserUpVotes
UserDownVotes
UserViews
UserUpDownVotes
Answer rating
Score
CommentCount
ScoreRatio
ICCSS 2015
Evaluation Comparison
8-11June 2015
Case Features Used P R FM AUC
1 Linguistic 0.58 0.60 0.56 0.60
2 Linguistic & Discretisation
0.81 0.70 0.74 0.84
3 Linguistic & Discretisation & Other
0.84 0.7 0.76 0.87
4 Linguistic & Other & User Rating(no discretisation)
0.82 0.69 0.75 0.86
5 Linguistic & Other & User Rating(with discretisation)
0.82 0.72 0.77 0.88
6 All features(Answer and User Rating with discretisation)
0.88 0.85 0.86 0.94
ICCSS 2015 8-11June 2015
ACQUAAutomatic Community-based Question Answering
https://acqua.kmi.open.ac.uk/
ICCSS 2015
Read more about our work
8-11June 2015
It’s All in the Content: State of the Art Best Answer Prediction based on Discretisation of Shallow Linguistic Features. WebSci ’14
ACQUA: Automated Community-based Question Answering through the Discretisation of Shallow Linguistic Features. The Journal of Web Science, 1(1) (preprint available)