Pentaho Data Integration with Kettle
Uploaded by: roberto-marchetto
Pentaho Data Integration (Kettle)
www.robertomarchetto.com
PDI Overview (Kettle)
● An entry-level tool for data manipulation (ETL)
● PDI (Kettle) reads procedures stored in XML format
● Spoon is the graphical tool used to develop those procedures
● Procedures are designed by linking components
● Many data sources can be used: JDBC, files, web services
● JavaScript and Java support for complex routines
Procedure users_dimension
Query users:
SELECT u.id, CONCAT(u.first_name, ' ', u.last_name) AS fullname, u.title
FROM users u
WHERE u.first_name IS NOT NULL AND u.last_name IS NOT NULL
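The NOT NULL filter matters because concatenating with a NULL name part yields NULL rather than a partial name. The sketch below illustrates this on an in-memory SQLite database. It is not the PDI step itself: the sample rows are invented, and SQLite's `||` operator stands in for the `CONCAT` call used in the slide.

```python
import sqlite3

# In-memory stand-in for the source database (sample rows are hypothetical).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE users (id TEXT, first_name TEXT, last_name TEXT, title TEXT)"
)
conn.executemany(
    "INSERT INTO users VALUES (?, ?, ?, ?)",
    [
        ("u1", "Ada", "Rossi", "Sales Manager"),
        ("u2", None, "Bianchi", "Account Executive"),  # missing first name
    ],
)

# Without the filter, the incomplete row produces a NULL fullname.
unfiltered = conn.execute(
    "SELECT u.id, u.first_name || ' ' || u.last_name AS fullname "
    "FROM users u ORDER BY u.id"
).fetchall()

# With the filter from the slide, only complete names reach the dimension.
filtered = conn.execute(
    """
    SELECT u.id, u.first_name || ' ' || u.last_name AS fullname, u.title
    FROM users u
    WHERE u.first_name IS NOT NULL AND u.last_name IS NOT NULL
    ORDER BY u.id
    """
).fetchall()

print(unfiltered)
print(filtered)
```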
Procedure accounts_dimension
Query accounts:
SELECT a.id, a.name, a.industry, a.billing_address_postalcode, a.billing_address_city, a.billing_address_country
FROM accounts a
Procedure opportunities_fact
Query opportunities:
SELECT o.id, o.date_entered, o.date_closed, o.assigned_user_id, o.sales_stage, o.name, o.amount
FROM opportunities o
WHERE o.sales_stage IN ('Closed Won', 'Closed Lost')
ORDER BY o.id
Using JNDI
● Edit the JNDI settings in /simple-jndi/jdbc.properties or C:/Documents and Settings/<user>/.pentaho/simple-jndi/default.properties
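For orientation, a simple-jndi data source is normally described by five `name/key=value` lines in that properties file. The sketch below renders such an entry. The data source name, driver class, URL, and credentials are hypothetical placeholders, and `jndi_entry` is an illustrative helper, not part of PDI.

```python
def jndi_entry(name, driver, url, user, password):
    """Render one simple-jndi data source as jdbc.properties lines."""
    fields = {
        "type": "javax.sql.DataSource",
        "driver": driver,
        "url": url,
        "user": user,
        "password": password,
    }
    return "\n".join(f"{name}/{key}={value}" for key, value in fields.items())


# Hypothetical MySQL source; replace every value with your own settings.
entry = jndi_entry(
    name="crm_source",
    driver="com.mysql.jdbc.Driver",
    url="jdbc:mysql://localhost:3306/crm",
    user="etl_user",
    password="changeme",
)
print(entry)
```

The printed lines can be pasted into the properties file. The connection is then referenced in Spoon by the name used as the prefix (here `crm_source`).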
Running procedures
● Directly from Spoon
● From Pentaho BI Suite
● Using the command line (Kitchen, Pan)
kitchen.bat /file:D:\Jobs\jobname.kjb /level:Basic
● In a clustered environment
● Using a web service (Carte)
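Kitchen runs jobs (.kjb files) and Pan runs transformations (.ktr files). The sketch below picks the launcher from the file extension and builds the argument list in the same style as the command above. `pdi_command` is a hypothetical helper, and it assumes the .bat launchers are on the PATH.

```python
from pathlib import PureWindowsPath


def pdi_command(path, level="Basic"):
    """Build the Kitchen or Pan command line for a job or transformation."""
    tools = {".kjb": "kitchen.bat", ".ktr": "pan.bat"}
    suffix = PureWindowsPath(path).suffix.lower()
    if suffix not in tools:
        raise ValueError(f"expected a .kjb or .ktr file, got: {path}")
    return [tools[suffix], f"/file:{path}", f"/level:{level}"]


job_cmd = pdi_command(r"D:\Jobs\jobname.kjb")
print(" ".join(job_cmd))
```

The resulting list can be passed to `subprocess.run` on a machine where PDI is installed.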
Scheduling
● Using Pentaho's scheduler
● Using an external scheduler (cron)
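With cron, scheduling comes down to a crontab line that calls Kitchen at a fixed time. The sketch below renders such a line for a daily run. The install path, job file, and log file are hypothetical, and `cron_line` is an illustrative helper. The `-file=` and `-level=` options follow the Linux form of the Kitchen command line.

```python
def cron_line(minute, hour, kitchen, job_file, log_file):
    """Render a crontab entry that runs a PDI job once a day."""
    command = f"{kitchen} -file={job_file} -level=Basic"
    # Append stdout and stderr to a log file so failed runs can be inspected.
    return f"{minute} {hour} * * * {command} >> {log_file} 2>&1"


# Hypothetical paths: run the job every day at 02:30.
line = cron_line(
    minute=30,
    hour=2,
    kitchen="/opt/pentaho/data-integration/kitchen.sh",
    job_file="/opt/etl/jobs/jobname.kjb",
    log_file="/var/log/etl/jobname.log",
)
print(line)
```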