User Interests Driven Web Personalization based on...

18
User Interests Driven Web Personalization based on Multiple Social Networks Yi Zeng, Ning Zhong, Xu Ren, Yan Wang International WIC Institute, Beijing University of Technology P.R. China

Transcript of User Interests Driven Web Personalization based on...

Page 1: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

User Interests Driven Web Personalization based on Multiple Social Networks

Yi Zeng, Ning Zhong, Xu Ren, Yan Wang

International WIC Institute, Beijing University of TechnologyP.R. China

Page 2: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

From large scale Web pages to large scale linked open semantic data

June, 2011: 12 Billion RDF Triples from the Web

“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

October, 2011: 31.6 Billion RDF Triples

March, 2010: 13 Billion RDF Triples

Number of Web Pages that Google indexes1998: 270 million2000: 1 billion2008: 1 trillion

Semantic Data at Web Scale

Page 3: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

3

11 Countries、13 Research Institutions and Universities

The Large Knowledge Collider (LarKC) Project

Page 4: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Personalization for Large scale and Web Enabled Semantic Data Processing (cont.)

• An illustration of the basic idea:

Selected triple set that are related to

user interests

Spyros Kotoulas

Frank van Harmelen’s Ranked Interests

…KnowledgeRDF

Semantic

…DERIIvan Herman

[s, p, “semantic Web mining”]

[s, p, “RDF triple store”]

[s, p, “Spyros Kotoulas”]

Interests related triples

Interests analysis, evaluation and ranking

Original datasets (Semantic Web Dog Food, Twitter, SwetoDBLP)

•Yi Zeng, Erzhong Zhou, Yan Wang, Xu Ren, Yulin Qin, Zhisheng Huang, Ning Zhong. Research Interests : Their Dynamics, Structures and Applications in Unifying Search and Reasoning. Journal of Intelligent Information Systems, Volume 37, Number 1, 65-88, Springer, 2011. •Yi Zeng, Ning Zhong, Yan Wang, Yulin Qin, Zhisheng Huang, Haiyan Zhou, Yiyu Yao, and Frank van Harmelen. User-centric Query Refinement and Processing Using Granularity Based Strategies. Knowledge and Information Systems, Volume 27, Number 3, 419-450, Springer, 2011. •Yi Zeng, Zhisheng Huang, Fenrong Liu, Xu Ren, Ning Zhong. Interest Logic and Its Application on the Web. Proceedings of the 5th International Conference on Knowledge Science, Engineering, and Management (KSEM 2011). Lecture Notes in Artificial Intelligence, Springer, Irvine, California, USA, 2011.

For more details:

Page 5: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Personalization for Large scale and Web Enabled Semantic Data Processing (cont.)

A Comparative Study of Query Time and Efficiency for Different Strategies

Participants 7 DBLP authors:

• Preference order 100% :

• Preference order 100% :

• Preference order 83.3% :

• Preference order 16.7% :

2, 3 1List List List?

2 3List List≈

2 3 1List List List> ?

3 2 1List List List> ? See references in the previous page

SwetoDBLP dataset:1.49x107RDF Triples

Page 6: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Massive Semantic Data from the Social Web• The social Web platforms and the microblog platforms adopt and benefit

from semantic techniques• The semantic Web gets huge data from these Social Web platforms.

• From Web of Contents to Web of People• Users play more and more important roles

• Friends• Personal Notes

• Likes

845 million active usershttp://en.wikipedia.org/wiki/Facebook

Cyber-Social Sensors

• Following, Followers• Real time personal information• interesting news

• Following, Followers• Real time personal information• interesting news

• Following, Followers• Real time personal information• interesting news

• Following, Followers• Real time personal information• interesting news

• Following, Followers• Real time personal information• interesting news

350 million users• 300 million tweets per day• 1.6 billion queries per datehttp://en.wikipedia.org/wiki/Twitter

• Interesting Places• Interesting Events

60 million users

• Friends• Professional Interests• Education Information• Work Experiences

150 million users

Page 7: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Personal Interests Data Fusion Strategies

1( ) ( )

mn n

nI i w I i

== ×∑Weighted Fusion Strategy:

• Average fusion strategy

• Time-sensitive fusion strategy

1 2 .1/

. . 1n

n

w wn

ww

+ + + ==

1 2 1 2

1 2

: :. . .: : :. . .:. . . 1

n n

n

w w w f f fw w w

=+ + + =

Slides 7-10 are from our following paper:Yunfei Ma, Yi Zeng, Xu Ren, and Ning Zhong. User Interest Modeling Based on Multi-source Personal Information Fusion and Semantic Reasoning. Proceedings of the 2011 International Conference on Active Media Technology, Lecture Notes in Computer Science 6890, 195-205, Springer, Lanzhou, China, September 7-9, 2011.

Page 8: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

A comparative study of interests from three single sources

An Illustration of Multi-source Personal Interests Fusion

• User: Frank van Harmelen

• Data Source:

Evolution of Scientific Information Sharing

“Open Science”Challenges Journal Tradition with Web Collaboration

0102030405060

Lin

ked

data

Ope

n da

ta

Web

RD

F

Sem

antic

Web

Lar

KC

SPA

RQ

L

RD

Fa

Scie

nce

Proj

ect

Sear

ch E

ngin

e

Sym

posi

um

PhD

Dru

pal

Info

rmat

ion

Com

pute

r

Indu

stry

Res

earc

h

Am

ster

dam

Uni

vers

ity

Edu

catio

nal I

nstit

ute

Kno

wle

dge

Rep

rese

ntat

ion

Prof

esso

r

Scie

ntif

ic D

irec

tor

Interest Terms

Inte

rest

Val

ues Twitter

FacebookLinkedIn

Top-K interests from different sources• Some of the interests have overlaps among each other.• Diversities among these Top-K interests are even more obvious.

Page 9: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

• Average Fusion : Twitter(7)、Facebook(7),LinkedIn(2)• Time Sensitive Fusion: (1) Top-10 overlaps with Twitter;(2) Values are very close to the ones from Twitter, but entirely different;(3) No interests from Facebook and LinkedIn.

Update frequency:Twitter: f1=2.5, Facebook: f2=0.2, LinkedIn: f3=0.0004 (per day)

1 2 3( ) 0 . 9 2 5 8 ( ) 0 . 0 7 4 1 ( ) 0 . 0 0 0 1 ( )I i I i I i I i+ += × × ×Weighted Interests Fusion Function:

05

10152025303540

Link

ed d

ata

Ope

n da

ta

Web

RD

F

Sem

antic

Web LarK

C

SPAR

QL

RD

Fa

Scie

nce

Proj

ect

Sear

chEn

gine

Sym

posi

um PhD

Interest Terms

Inte

rest

Val

ues Twitter

Average FusionTime-sensitive Fusion

A comparative study of interests from a single source and multiple interests sources

An Illustration of Multi-source Personal Interests Fusion

Page 10: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Interests Representation and Reasoning about Interests

<foaf:Person rdf:about="http://www.cs.vu.nl/~frankh/"><foaf:name>Frank van Harmelen</foaf:name><e-foaf:interest><rdf:Description rdf:about="http://www.wici-lab.org/wici/wiki/index.php/RDF">

<dc:title>RDF</dc:title><e-foaf:cumulative_interest_value rdf:parseType="Resource">

<rdf:value rdf:datatype="&xsd;number"> 21.293 </rdf:value></e-foaf:cumulative_interest_value>

</rdf:Description></e-foaf:interest>

...</foaf:Person>

Interests Representation using e-FOAF:interest

<rdfs:Class rdf: ID="Graph-based Representation"><rdfs:subClassOf rdf: resource="Knowledge Representation"/>

</rdfs:Class><rdfs:Class rdf: ID="RDF">

<rdfs:subClassOf rdf: resource="Graph-based Representation"/></rdfs:Class>

RDF representation of AI Ontology

A Fragment of AI Ontology

Frank van Harmelen is interested in RDF in a certain degree

Reasoning about interests from RDF to Knowledge Representation

Appeared on Frank van Harmelen’s homepage, but not elsewhere.

(http://wiki.larkc.eu/e-foaf:interest)

Page 11: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Active Academic Visit Recommendation Application (AAVRA)

Data Sources: Twitter Data, Semantic Web Dog Food data, Google Maps API

• Collaboration network is already too complex, but…

• Academic collaboration candidates not only appear on publication data, but also on many other social networking environment such as Twitter.

A Snapshot from Semantic Web Dog Food Affiliation Map

Page 12: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

AAVRA: Data Acquisition

Twitter data acquisition

Twitter data acquisition to :• Locate the end user;• Find agents that the end user follows;• User real time interests analysis;• Locating followings and their

interests

Page 13: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

AAVRA: Data Acquisition from SWDF

SELECT DISTINCT $person $person_name $affiliation $affiliation_nameWHERE {

$person a foaf:Person.$person foaf:name $person_name.$person foaf:made $InProceedings.$InProceedings foaf:maker $person_url.$person_url foaf:name "Frank van Harmelen".$person swrc:affiliation $affiliation.$affiliation foaf:name $affiliation_name

}

Real time acqusition by SPARQL end point

Page 14: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

AAVRA: Generating Levels of Recommendation

Interest Levels

Formula Result Sets

1 T1

2 T2

3 T3

4 T4

5 T5

( , ) ( , )p uS WD FC o a u t h o r T F i n g u p∧( , ) ( , )p uS WDFC o a u t h o r T F i n g u p∧ ¬

( , )( , ) p uS WD FT F i n g u p P C o a u t h o r∧( , ) ( , , ) ( )T F i n g u p S I T p u K S WD F p∧ ∧ ¬( , ) ( , , ) ( )T F i n g u p S I T p u K S W D F p∧ ¬ ¬

Interpretations on different groups of data from SWDF and Twitter

Page 15: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

AAVRA: Recommendation Results Analysis

Interest Level Recommendation Ratio(%)

Results Examples

1 0.014 Paul Groth 2 0.210 Spyros Kotoulas(3), Jacopo Urbani(3), Eyal Oren(2),

Henri Bal(2), Zharko Aleksovski(2), Zhisheng Huang(1),...

3 0.154 Kalina Bontcheva, Lynda Hardman, Peter Mika, Steffen Staab, Denny Vrandecic, Ivan Herman, Michael Hausenblas, ...

4 0.505 Stefano Bertolo, Dan Brickley, DERI Galway, Web Foundation, Ontotext AD...

Recommendation Ratio = Recommended Results / Candidate SpaceCandidate Space: 7131 persons (SWDF+Twitter)Calculation of SIT(p,u,K), Top-10 interests, K=1

0.8835% candidates are recommended overall.

Page 16: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Active Academic Visit Recommendation: A Snapshot

( , )( , ) p uS WD FT F i n g u p P C o a u t h o r∧The 3rd level of recommendation:

• University of Sheffield (Kalina Bontcheva)• University of the West of England (Richard McClatchey)

Page 17: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Into the FutureA conservative estimate would be that it would

take 10,000 triples just to describe each human, which gives us 100 trillion (1014).

Pictures from Prof. Ning Zhong’s plenary talk at Web Intelligence 2011

Page 18: User Interests Driven Web Personalization based on ...wic.litislab.fr/2012/slides/WIandC_2012_Yi_online.pdf · User Interests Driven Web Personalization based on Multiple Social Networks

Thank You!