BigInsights 4.0 Data Scientist Module Install

21
BigInsights 4.0 Data Scientist Module Install IBM Cloud Infrastructure for Analytics November 5, 2015

Transcript of BigInsights 4.0 Data Scientist Module Install

Page 1: BigInsights 4.0 Data Scientist Module Install

BigInsights 4.0 Data Scientist

Module Install IBM Cloud Infrastructure for Analytics

November 5, 2015

Page 2: BigInsights 4.0 Data Scientist Module Install

2

Table of Contents

Prerequisites……………………………………………………………………………………………………………………………………………3

Install R……………………………………………………………………………………………………………………………………………………3

Install BigR…………………………………………………………………………………………………………………………………………….12

Prerequisites…………………………………………………………………………………………………………………………….12

Install Text Analytics………………………………………………………………………………………………………………………………17

Page 3: BigInsights 4.0 Data Scientist Module Install

3

1. Prerequisites:

i. BI 4.0 Cluster should be up and running with following services:

a. IOP

b. BigInsights Home

c. BigSheets

d. Bigsql.

ii. Refer section 5.2 in Known Issues and Workaround document for accessing web UI of BigInsights

Home, BigSheets and BigSQL.

2. Install R

Actions -> Add Service

Page 4: BigInsights 4.0 Data Scientist Module Install

4

Note: Keep default selected parameters in Assign Masters Wizard for all the services as it selects master

hostname automatically.

Assign Slaves and Client Wizard

It should be assigned to both master and compute node of the cluster.

Page 5: BigInsights 4.0 Data Scientist Module Install

5

Accept by typing Y in user.accept.R.Licenses section of Advanced r.

Page 6: BigInsights 4.0 Data Scientist Module Install

6

You may see this config validation error. It can be ignored for the time being. This is because the YARN container has grown beyond its heap size setting.

Page 7: BigInsights 4.0 Data Scientist Module Install

7

The config review page should look as below.

Note: Repositories refer to locally provided repo path for installation.

This should be the final state if all the above steps were followed correctly

Page 8: BigInsights 4.0 Data Scientist Module Install

8

Restart the services that show the restart icon on Ambari UI.

Page 9: BigInsights 4.0 Data Scientist Module Install

9

Note: After restarting all the services with restart icon, it should look as below.

Quick way to restart services:

Go to ambari ui:

Hosts -> Master node -> Restart

Repeat the same for compute nodes

Page 10: BigInsights 4.0 Data Scientist Module Install

10

Note: After restart if there is any service that is stopped (marked as red), then start it again before proceeding further.

Page 11: BigInsights 4.0 Data Scientist Module Install

11

Page 12: BigInsights 4.0 Data Scientist Module Install

12

3. Install BigR

3.1 Prerequisites :

i. R should be installed before installing BigR.

ii. Download below mentioned tars on Master and copy it to all compute nodes where BigR needs

to be installed.

base64enc_0.1-2.tar.gz

rJava_0.9-7.tar.gz

chron_2.3-45.tar.gz

data.table_1.9.6.tar.gz

[root@vmkdev0007 ~]# mkdir BigR_Packages

[root@vmkdev0007 ~]# cd BigR_Packages/

Download all the packages inside this directory using wget [Note: Internet access is needed on

Master node].

wget https://cran.r-project.org/src/contrib/base64enc_0.1-3.tar.gz

wget https://cran.r-project.org/src/contrib/rJava_0.9-7.tar.gz

wget https://cran.r-project.org/src/contrib/chron_2.3-47.tar.gz

wget https://cran.r-project.org/src/contrib/data.table_1.9.6.tar.gz

Copy it to all the compute nodes using below command:

ssh root@hostname ‘mkdir –p /root/BigR_Packages/’ && scp * root@hostname:/root/

BigR_Packages/

Sample command:

ssh root@vmkdev0008 ‘mkdir –p /root/BigR_Packages/’ && scp * root@vmkdev0008:/root/

BigR_Packages/

Confirm all the tars are available inside the BigR_Packages directory using below command:

[root@vmkdev0007 BigR_Packages]# ls -ltr

total 4364

-rw-r--r-- 1 root root 3709070 Sep 19 16:13 data.table_1.9.6.tar.gz

-rw-r--r-- 1 root root 36242 Sep 30 02:53 chron_2.3-45.tar.gz

-rw-r--r-- 1 root root 7191 Sep 30 02:53 base64enc_0.1-2.tar.gz

-rw-r--r-- 1 root root 711181 Oct 13 02:54 rJava_0.9-7.tar.gz

Page 13: BigInsights 4.0 Data Scientist Module Install

13

[root@vmkdev0007 BigR_Packages]# R CMD INSTALL base64enc_0.1-2.tar.gz

* installing to library â/usr/lib64/R/libraryâ

* installing *source* package âbase64encâ ...

Similarly, install all the other tars using below commands with exact sequence:

R CMD INSTALL base64enc_0.1-2.tar.gz R CMD INSTALL rJava_0.9-7.tar.gz

R CMD INSTALL chron_2.3-45.tar.gz R CMD INSTALL data.table_1.9.6.tar.gz

After installing all the tars go to ambari UI:

Add Service -> BigR ->

Select compute node for Big R client installation.

Set bigr user password to ambari services password.

Page 14: BigInsights 4.0 Data Scientist Module Install

14

This should be the final state if all the above steps were followed correctly.

Page 15: BigInsights 4.0 Data Scientist Module Install

15

Restart the services that show the restart icon on Ambari UI.

Page 16: BigInsights 4.0 Data Scientist Module Install

16

Page 17: BigInsights 4.0 Data Scientist Module Install

17

4. Install Text Analytics:

Actions -> Add Service -> Text Analytics ->

Set database password to ambari services password. You can change the database username, but any changes to the database hostname are ignored.

Page 18: BigInsights 4.0 Data Scientist Module Install

18

Page 19: BigInsights 4.0 Data Scientist Module Install

19

This should be the final state if all the above steps were followed correctly

Page 20: BigInsights 4.0 Data Scientist Module Install

20

Restart the services that show the restart icon on Ambari UI. Restart BigInsights Home service again to reload all the value add-on services.

Final dashboard after installing all 3 services should look as below:

Page 21: BigInsights 4.0 Data Scientist Module Install

21