
The Last Lecture

• Agenda
  – 1:40-2:00pm: Integrating XML and Search Engines—the Niagara way
  – 2:00-2:10pm: My concluding remarks (if any)
  – 2:10-2:45pm: Interactive summarization of the semester
  – Teaching evaluations (I leave)

This part is based on the Niagara slides.

Niagara

Generating a SEQL Query from XML-QL

A different kind of Containment

“Review”

Main Topics

• Approximately three equal parts:– Information retrieval– Information integration/Aggregation– Information mining– other topics as permitted by time

• Useful course background– CSE 310 Data structures

• (Also 4xx course on Algorithms)

– CSE 412 Databases – CSE 471 Intro to AI

What I said on 1/17

What we did by 4/30

Information Retrieval

• Traditional model
  – Given:
    • a set of documents
    • a query expressed as a set of keywords
  – Return:
    • a ranked set of documents most relevant to the query
  – Evaluation (sketch below):
    • Precision: fraction of returned documents that are relevant
    • Recall: fraction of relevant documents that are returned
    • Efficiency
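To make the two evaluation measures concrete, here is a minimal sketch; the document IDs and the relevance judgments are made up for illustration:

```python
# Minimal precision/recall sketch over made-up document IDs.
returned = {1, 2, 3, 4, 5}        # documents the engine returned
relevant = {2, 3, 5, 8, 9, 10}    # documents actually relevant to the query

true_positives = returned & relevant

precision = len(true_positives) / len(returned)   # 3/5 = 0.6
recall = len(true_positives) / len(relevant)      # 3/6 = 0.5

print(f"precision={precision:.2f}, recall={recall:.2f}")
```

Note the tension the slide implies: returning more documents can only raise recall, but it usually lowers precision.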

• Web-induced headaches
  – Scale (billions of documents)
  – Hypertext (inter-document connections)
• Consequently
  – Ranking that takes link structure into account
    • Authority/Hub (sketch below)
  – Indexing and retrieval algorithms that are ultra-fast
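The Authority/Hub ranking referred to here is the HITS-style mutual-reinforcement iteration: a page's authority score sums the hub scores of pages linking to it, and a page's hub score sums the authority scores of pages it links to. A minimal sketch over a made-up link matrix:

```python
import numpy as np

# HITS-style authority/hub iteration on a tiny hypothetical web graph.
# A[i, j] = 1 means page i links to page j.
A = np.array([[0, 1, 1, 0],
              [0, 0, 1, 0],
              [1, 0, 0, 1],
              [0, 0, 1, 0]], dtype=float)

hubs = np.ones(A.shape[0])
for _ in range(50):
    auths = A.T @ hubs            # authority = sum of hub scores of in-links
    auths /= np.linalg.norm(auths)
    hubs = A @ auths              # hub = sum of authority scores of out-links
    hubs /= np.linalg.norm(hubs)

print("authorities:", auths.round(3))
print("hubs:       ", hubs.round(3))
```

The iteration converges to the dominant eigenvectors of A^T A and A A^T, which is the eigenvalue connection mentioned in the interactive-review example below.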

Database Style Retrieval

• Traditional model (relational)
  – Given:
    • a single relational database (schema and instances)
    • a relational (SQL) query
  – Return:
    • all tuples satisfying the query (example below)
• Evaluation
  – Soundness/completeness
  – Efficiency
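As a contrast with ranked retrieval, here is a minimal sketch of the "return all satisfying tuples" model, using Python's built-in sqlite3 and a made-up emp schema:

```python
import sqlite3

# Hypothetical single-database example with a made-up schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (name TEXT, dept TEXT, salary INTEGER)")
conn.executemany("INSERT INTO emp VALUES (?, ?, ?)",
                 [("Ann", "CS", 90), ("Bo", "CS", 70), ("Cy", "EE", 80)])

# A relational (SQL) query: every tuple satisfying the predicate comes
# back, unranked -- soundness and completeness, not relevance.
for row in conn.execute("SELECT name FROM emp WHERE dept = 'CS' AND salary > 60"):
    print(row)
```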

• Web-induced headaches
  – Many databases
  – All are partially complete
  – Overlapping
  – Heterogeneous schemas
  – Access limitations
  – Network (un)reliability
• Consequently
  – Newer models of DB
  – Newer notions of completeness
  – Newer approaches for query planning

What about “mining”?

• Didn’t do too much “data” mining
  – But did do some “web” mining
• Mining the link structure
  – A/H computation, etc.
• Clustering the search engine results
  – k-means; agglomerative clustering (sketch below)
• Classification as part of focused crawling
  – The “distiller” approach
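Since the slide calls out k-means by name, here is a minimal k-means sketch; the 2-D points stand in for whatever feature vectors (e.g., tf-idf rows) the search results are clustered on, and both the data and k are made up for illustration:

```python
import numpy as np

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means: alternate nearest-centroid assignment and centroid update."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), k, replace=False)]
    for _ in range(iters):
        # Assign each point to its nearest centroid.
        dists = np.linalg.norm(points[:, None] - centroids[None, :], axis=2)
        labels = dists.argmin(axis=1)
        # Move each centroid to the mean of its assigned points.
        for j in range(k):
            if (labels == j).any():
                centroids[j] = points[labels == j].mean(axis=0)
    return labels, centroids

# Made-up 2-D stand-ins for search-result feature vectors.
pts = np.array([[0.1, 0.2], [0.0, 0.1], [0.9, 1.0], [1.0, 0.8], [0.5, 0.5]])
labels, cents = kmeans(pts, k=2)
print(labels)
```

Agglomerative clustering would instead start from singleton clusters and repeatedly merge the closest pair, so it needs no k up front.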

Interactive Review (2:10-2:45): an interactive summarization of the class.

Rather than me showing you the list of topics we covered, I thought up a more interesting approach: summarizing the class in *your* collective words. Here is how it will go: *Everyone* in the class will be called on to list one topic/technique/issue that they felt they learned from the course.

Generic answers like "I learned about search engines" are discouraged in favor of specific answers (such as "I thought the connection between the dominant eigenvalues and the way the A/H computation works was quite swell").

It is okay to list topics/issues that you got interested in even if those were just a bit beyond what we actually covered.

Note that there is an expectation that when your turn comes you will mention something that has not been mentioned by folks who spoke ahead of you.

Since I get to decide the order in which to call on you, it is best if you jot down up to 5 things you thought you learned, so the chance that you will say something different is higher.

Learning Patterns (Web/DB mining)

• Traditional classification learning (supervised)
  – Given:
    • a set of structured instances of a pattern (concept)
  – Induce the description of the pattern (sketch below)
• Evaluation:
  – Accuracy of classification on the test data
  – (Efficiency of learning)
• Mining headaches
  – Training data is not obvious
  – Training data is massive
  – Training instances are noisy and incomplete
• Consequently
  – Primary emphasis on fast classification
    • Even at the expense of accuracy
  – 80% of the work is “data cleaning”
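To make the "given labeled instances, induce a pattern description, evaluate by test accuracy" loop concrete, here is a minimal sketch with a deliberately cheap learner, a nearest-centroid rule; the features and labels are made up for illustration:

```python
import numpy as np

# Made-up labeled instances: two features per instance, binary labels.
X_train = np.array([[0.1, 0.9], [0.2, 0.8], [0.9, 0.1], [0.8, 0.3]])
y_train = np.array([0, 0, 1, 1])
X_test = np.array([[0.15, 0.85], [0.85, 0.2]])
y_test = np.array([0, 1])

# "Induce the description of the pattern": here, one centroid per class.
centroids = {c: X_train[y_train == c].mean(axis=0) for c in np.unique(y_train)}

# Classify by nearest centroid; evaluate by accuracy on held-out data.
def predict(x):
    return min(centroids, key=lambda c: np.linalg.norm(x - centroids[c]))

accuracy = np.mean([predict(x) == y for x, y in zip(X_test, y_test)])
print(f"test accuracy = {accuracy:.2f}")
```

A nearest-centroid rule is about as fast as classification gets, which matches the slide's point that mining settings often trade some accuracy for speed.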