Ankush Gola Riley Thomasson Sam Payne Aaron Himelman Akshaya Uttamadoss.

Page 1: Ankush Gola, Riley Thomasson, Sam Payne, Aaron Himelman, Akshaya Uttamadoss

Page 2: Motivation

• Inconvenient to understand browsing history
• Weak content discovery tools

Page 3: Why Use Histograph?

• Visualize how you use the web using graphs
• Discover new content tailored to your browsing habits

Page 4: Demo: Typical User Flow

histograph.us

Page 5: System Overview

[Diagram components: Ext, Server, DB, Front End]

Page 6: Discover

• Friends Graph Demo
• Suggestions

Page 7: System Overview + Friends

[Diagram components: Ext, Server, DB, Front End, FB]

Page 8: User Options

• Time Filtering
• URL Filtering
• Upvote/Downvote

Page 9: URL Recommendation Algorithm

• Collaborative filtering algorithm based on URL frequencies at each level.
• Hierarchical URL tree designed for efficiency.

Page 10: URL Recommendation Algorithm

• Goal: recommend URLs for User A.
• Construct a tree of User A's URLs and a tree of another user's URLs.
• Recursively traverse the trees and accumulate scores. If a node is a leaf, store its score in a Rank Table.
• Repeat for every other user in the system.
• Filter out websites User A has already seen.
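The steps above can be sketched roughly as follows. This is only an illustration under stated assumptions: the nested-dict tree layout, the function names `traverse`, `leaves`, and `recommend`, and the per-level score (the overlap of the two users' frequencies) are all hypothetical stand-ins, not the scoring Histograph actually uses.

```python
from collections import defaultdict

# Assumed tree layout: {label: {"freq": float, "children": {...}}}

def traverse(node_a, node_b, score, rank_table):
    """Walk the other user's tree alongside User A's, accumulating a score per path."""
    for label, b_child in node_b.items():
        a_child = node_a.get(label)
        a_freq = a_child["freq"] if a_child else 0.0
        # Hypothetical per-level score: overlap of the two frequencies.
        level_score = score + min(a_freq, b_child["freq"])
        if b_child["children"]:
            a_children = a_child["children"] if a_child else {}
            traverse(a_children, b_child["children"], level_score, rank_table)
        else:
            # Leaf node: store the accumulated score in the rank table.
            rank_table[label] += level_score

def leaves(tree):
    """Collect the leaf labels of a tree (the pages a user actually visited)."""
    found = set()
    for label, node in tree.items():
        if node["children"]:
            found |= leaves(node["children"])
        else:
            found.add(label)
    return found

def recommend(tree_a, other_trees):
    rank_table = defaultdict(float)
    for tree_b in other_trees:          # repeat for each other user
        traverse(tree_a, tree_b, 0.0, rank_table)
    seen = leaves(tree_a)               # filter out what User A already saw
    ranked = [(url, s) for url, s in rank_table.items() if url not in seen]
    return sorted(ranked, key=lambda item: item[1], reverse=True)
```

Calling `recommend(tree_a, [tree_b, tree_c])` returns `(url, score)` pairs ordered from highest to lowest score, excluding every leaf URL already present in User A's own tree.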

Page 11: Hierarchical Tree

[Figure: per-user URL trees with visit frequencies at each level]

Level 1
• User A: google 40%, reddit 30%
• User B: google 33%, reddit 20%

Level 2 (children of reddit)
• reddit/gifs 67%, reddit/funny 33%
• reddit/politics 50%, reddit/sports 50%
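One way such a tree and its per-level percentages could be derived from a raw list of visited URLs is sketched below. The helper names (`path_prefixes`, `build_tree`, `level_frequencies`) and the choice to define a child's frequency as its share of the parent's visits are assumptions made for illustration; the slides do not say how the frequencies are computed.

```python
from urllib.parse import urlsplit

def path_prefixes(url):
    """'https://reddit.com/r/gifs' -> ['reddit.com', 'reddit.com/r', 'reddit.com/r/gifs']."""
    parts = urlsplit(url)
    segments = [parts.netloc] + [s for s in parts.path.split("/") if s]
    return ["/".join(segments[:i + 1]) for i in range(len(segments))]

def build_tree(urls):
    """Count visits at every level of the URL hierarchy."""
    root = {"count": 0, "children": {}}
    for url in urls:
        node = root
        node["count"] += 1
        for prefix in path_prefixes(url):
            node = node["children"].setdefault(prefix, {"count": 0, "children": {}})
            node["count"] += 1
    return root

def level_frequencies(node):
    """Assumed definition: each child's share of its parent's visits."""
    total = node["count"]
    return {label: child["count"] / total for label, child in node["children"].items()}
```

With this definition, `level_frequencies(root)` gives the level-1 distribution over sites, and calling it on a site's node gives the level-2 distribution over that site's sections.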

Page 12: Scoring

• The final score for a URL is accumulated by traversing the tree: the score at each level is proportional to the Bhattacharyya distance, your frequency, and the weights from upvotes/downvotes.
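For reference, the standard Bhattacharyya distance between two discrete distributions p and q is D = -ln(BC), where BC = Σ sqrt(p_i · q_i). A minimal sketch follows, assuming each level's frequencies are held in a dict keyed by URL label. How Histograph combines this value with the user's frequency and the vote weights is not given in the slides, so that step is left out.

```python
import math

def bhattacharyya_coefficient(p, q):
    """Sum of sqrt(p_i * q_i) over labels present in both distributions."""
    return sum(math.sqrt(p[k] * q[k]) for k in p.keys() & q.keys())

def bhattacharyya_distance(p, q):
    """-ln(BC); infinite when the two distributions share no labels."""
    bc = bhattacharyya_coefficient(p, q)
    return math.inf if bc == 0 else -math.log(bc)
```

Identical distributions give a coefficient of 1 and a distance of 0; distributions with no labels in common give a coefficient of 0 and an infinite distance.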

Page 13: Challenges in Design

• Chrome Extension Limitations
• Extremely Large Datasets (~40,000 nodes per user)
  – Processing
  – Efficient Display
  – Hierarchical tree uses hashing to find children
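The last point, hashing to find children, might look something like the sketch below: each node keeps its children in a hash map, so locating a child costs roughly constant time per level regardless of how many siblings it has. The `Node` class and the `insert` and `find` helpers are hypothetical names for illustration, not Histograph's actual code.

```python
class Node:
    def __init__(self, label):
        self.label = label
        self.visits = 0
        self.children = {}  # hash map: path segment -> Node

    def child(self, segment):
        """Return the child for `segment`, creating it if absent."""
        node = self.children.get(segment)
        if node is None:
            node = self.children[segment] = Node(segment)
        return node

def insert(root, segments):
    """Record one visit along the path given by `segments`."""
    node = root
    for segment in segments:
        node = node.child(segment)
        node.visits += 1

def find(root, segments):
    """Follow `segments` from the root; return None if the path is missing."""
    node = root
    for segment in segments:
        node = node.children.get(segment)
        if node is None:
            return None
    return node
```

With tens of thousands of nodes per user, a dict lookup per level avoids scanning a list of siblings on every insert or traversal step.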

Page 14: Future Steps

• Content Aware Suggestions
• Get More Users
• Improved Server Platform
• More Options
  – Time period selection
  – Algorithm adjustment
• Search Functionality
• Firefox extension
• Map Reduce