Intern Presentation - Amit Jaspal
Free Text Search within Hive
- Amit Jaspal
- Software Engineering Intern, Cloudera Search
- Graduate Student at University of Illinois at Urbana-Champaign
- Worked for D. E. Shaw & Co. before joining UIUC
- Undergrad from Indian Institute of Information Technology
Motivation: Enabling Analysis of Unstructured Data
Integrating Solr with Hive
SolrStorageHandler - Integration Framework between Hive and Solr
How to use SolrStorageHandler in Hive
● CREATE EXTERNAL TABLE sales_contracts_solr ( id int, title string ... )
  STORED BY 'org.apache.hadoop.hive.solr.SolrStorageHandler'
  TBLPROPERTIES (
    'solr.zookeeper.service.ensemble' = '127.0.0.1:2181/solr',
    'solr.collection.name' = 'sales_contracts',
    'solr.query' = 'termsandconditions:*Sections 19 U.S.C. 1304*');
● SELECT * FROM sales_contracts_hive JOIN sales_contracts_solr ON sales_contracts_hive.id = sales_contracts_solr.id WHERE sales_contracts_hive.interest_rate > 10;
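The table definition above follows a fixed pattern (column list, storage handler class, three Solr properties), so it can be generated from parameters. Below is a minimal sketch, assuming nothing beyond what the slide shows: the handler class name and the three property keys are taken from the slide, while the helper name `solr_table_ddl`, its arguments, and the two-column list are hypothetical and only for illustration.

```python
def solr_table_ddl(table, columns, zk_ensemble, collection, query):
    """Render a CREATE EXTERNAL TABLE statement for a Solr-backed Hive table.

    Hypothetical helper: it only builds the statement text and does not
    connect to Hive or Solr.
    """
    def quote(value):
        # Escape backslashes and single quotes for a single-quoted literal.
        return "'" + value.replace("\\", "\\\\").replace("'", "\\'") + "'"

    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    # Property keys as they appear on the slide.
    props = {
        "solr.zookeeper.service.ensemble": zk_ensemble,
        "solr.collection.name": collection,
        "solr.query": query,
    }
    prop_lines = ",\n  ".join(
        f"{quote(key)} = {quote(value)}" for key, value in props.items()
    )
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols})\n"
        f"STORED BY 'org.apache.hadoop.hive.solr.SolrStorageHandler'\n"
        f"TBLPROPERTIES (\n  {prop_lines}\n);"
    )


# Illustrative values; the real table has more columns than shown here.
ddl = solr_table_ddl(
    table="sales_contracts_solr",
    columns=[("id", "int"), ("title", "string")],
    zk_ensemble="127.0.0.1:2181/solr",
    collection="sales_contracts",
    query="termsandconditions:*Sections 19 U.S.C. 1304*",
)
print(ddl)
```

Each distinct Solr query gets its own external table in this scheme, so a generator like this may be handy when several queries against the same collection need to be joined with Hive data.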
Thanks
- Patrick Hunt (Manager and Mentor)
- Search Team
- Hive Team