Open Source BI Overview
-
Upload
alex-meadows -
Category
Technology
-
view
3.645 -
download
2
description
Transcript of Open Source BI Overview
![Page 1: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/1.jpg)
Open Source Business Intelligence Overview
From Data Source to Analytics and Beyond
![Page 2: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/2.jpg)
Agenda
● Open Source and BI● Data sources● Data Integration● Reporting/Frontend● Analytics● Data Quality● Data Governance
![Page 3: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/3.jpg)
![Page 4: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/4.jpg)
![Page 5: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/5.jpg)
Source: https://www.informs.org/ORMS-Today/Public-Articles/October-Volume-37-Number-5/Back-in-Business
![Page 6: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/6.jpg)
Data Sources
Traditional○ PostgreSQL - http://www.postgresql.org/
■ Pivotal Greenplum - http://gopivotal.com/
○ MySQL - http://www.mysql.com/
■ Percona - http://www.percona.com/
■ MariaDB - https://mariadb.org/
Columnar○ MySQL Derivatives
■ InfiniDB - http://infinidb.org/
■ Infobright - https://www.infobright.com/
○ MonetDB - http://www.monetdb.org/Home
![Page 7: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/7.jpg)
Relational vs Columnar
Source: http://www.calpont.com/images/column-oriented-database.jpg
![Page 8: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/8.jpg)
Data Sources
NoSQL○ Cassandra - http://cassandra.apache.org/
○ MongoDB - http://www.mongodb.org/
○ CouchDB - http://couchdb.apache.org/
○ Infinispan - http://www.jboss.org/infinispan/
○ Hadoop - http://hadoop.apache.org/
■ HBase - http://hbase.apache.org/
■ Hive - http://hive.apache.org/
OLAP○ Mondrian - http://mondrian.pentaho.com/
![Page 9: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/9.jpg)
Source: http://gerardnico.com/wiki/database/oracle/oracle_olap
![Page 10: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/10.jpg)
The Next Wave of Data Sources
Virtualization○ Teiid - http://www.jboss.org/teiid/
Semantic Web/Graph○ Sesame - http://www.openrdf.org/
○ Neo4j - http://www.neo4j.org/
○ OrientDB - http://www.orientdb.org/
○ Infogrid - http://infogrid.org/trac/
![Page 11: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/11.jpg)
Source: http://www.ebizq.net/blogs/guest_session/2009/12/putting-data-to-work-for-cloud-bpm-mdm-and-soa-projects.php
![Page 12: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/12.jpg)
Graph Database
Source: http://en.wikipedia.org/wiki/Graph_database
![Page 13: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/13.jpg)
Data Integration
Kettle - http://kettle.pentaho.com/
Talend - http://www.talend.com/
CloverETL - http://www.cloveretl.com/
![Page 14: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/14.jpg)
Reporting
BIRT (Actuate) - http://www.eclipse.org/birt/phoenix/
Pentaho - http://reporting.pentaho.com/
Jaspersoft - http://community.jaspersoft.com/
Saiku - http://meteorite.bi/saiku
![Page 15: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/15.jpg)
Full Stacks
SpagoBI - http://www.spagoworld.org/xwiki/bin/view/SpagoBI/#
Pentaho - http://www.pentaho.com/
Jaspersoft - http://www.jaspersoft.com/
![Page 16: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/16.jpg)
Analytics
R - http://www.r-project.org/
Weka - http://www.cs.waikato.ac.nz/ml/weka/
RapidMiner - http://rapid-i.com/content/view/181/
![Page 17: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/17.jpg)
Data Quality
Profiling○ DataCleaner - http://datacleaner.org/
○ DQGuru - http://www.sqlpower.ca/page/dqguru
Suites○ Talend - http://www.talend.com/products/data-quality
Testing○ SQLUnit - http://sqlunit.sourceforge.net/
○ dbFit - http://benilovj.github.io/dbfit/
○ etlUnit - https://github.com/dbaAlex/etlUnit (shameless plug :p )
![Page 18: Open Source BI Overview](https://reader033.fdocuments.net/reader033/viewer/2022052505/55503f7eb4c905b2788b4849/html5/thumbnails/18.jpg)
Data Governance
MDM○ Talend - http://www.talend.com/resource/data-governance.html
Business Rules Engine○ JBoss Drools - http://www.jboss.org/drools/ ○ Open Rules - http://openrules.com/