OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio...
![Page 1: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/1.jpg)
OpenNebula implementation in TeideHPC infrastructure
Carlos I. González Vila
OpenNebulaConf 2015 - Barcelona
![Page 2: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/2.jpg)
ITER
Founded in 1990 by the Cabildo Insular de Tenerife, the island's administrative authority. Its objectives are to promote research and technological development, especially in relation to the use of renewable energies.
![Page 3: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/3.jpg)
ITER
[Organization chart]
● Management
● Dissemination area
● Maintenance
● Renewable energies
– Photovoltaics
– Sustainable architecture
– Wind energy
● Information and new technologies
– Electronics
– Informatics
● Environment science
– Groundwater resources
– Environmental quality
– Volcanology
![Page 4: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/4.jpg)
TeideHPC
![Page 5: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/5.jpg)
TeideHPC
● Weather forecasting
– Wind power generation
– Volcanic smoke dispersion prediction
– Solar power generation
● Bioinformatics
● Computational Fluid Dynamics
– Aerodynamics, aeroacoustics
– Industrial design
![Page 6: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/6.jpg)
TeideHPC
● Astrophysics
– Solar composition
– Cosmological parameter estimation
– Analysis of solar spectro-polarimetric observations
● Computer sciences
● Streaming services
● Render
● Cloud computing
![Page 7: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/7.jpg)
OpenNebula
● Implementation
● Study cases
– Infiniband virtualization
– Interoperability
● Use cases
– Multidatastore (lbox)
– Remote visualization
– Render farm
![Page 8: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/8.jpg)
OpenNebula Implementation
● API
● Cobbler
● Chef
● NetApp Storage
● Monitoring
● Repository
![Page 9: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/9.jpg)
OpenNebula Implementation
High Performance Computing Cloud computing
● Ethernet network configuration
● Infiniband configuration
● User auth & permissions
API
![Page 14: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/14.jpg)
Study case – Infiniband virtualization
![Page 15: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/15.jpg)
Study case – Infiniband virtualization
● OpenNebula Ecosystem
● KVM SR-IOV Driver
– Single root I/O Virtualization
Virtual functions
PCIe device
![Page 16: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/16.jpg)
Study case – Infiniband virtualization
● OpenNebula Ecosystem
● KVM SR-IOV Driver
– Single root I/O Virtualization
Virtual functions
PCIe device
    # lspci
    b0:00.0 Network controller: Mellanox Technologies MT27500 Family [ConnectX-3]
    b0:00.1 Network controller: Mellanox Technologies MT27500/MT27520 Family [ConnectX-3/ConnectX-3 Pro Virtual Function]
    b0:00.2 Network controller: Mellanox Technologies MT27500/MT27520 Family [ConnectX-3/ConnectX-3 Pro Virtual Function]

● Enable option in BIOS
● Enable IO virtualization in kernel boot line
● Define number of Virtual Functions in kernel module load
● Burn SR-IOV capable firmware in the HCA (OEM)
● Upgrade OFED version (2.3-2.0.5-rhel6.6)
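Once the steps above are done, a quick check is whether the virtual functions appear next to the physical function in the `lspci` listing. The following is a minimal sketch of such a check, using the listing from this slide as sample input. The helper names `parse_lspci` and `split_functions` are illustrative assumptions and not part of any OpenNebula or Mellanox tool.

```python
import re

# Sample input: the lspci lines shown on the slide.
SAMPLE = """\
b0:00.0 Network controller: Mellanox Technologies MT27500 Family [ConnectX-3]
b0:00.1 Network controller: Mellanox Technologies MT27500/MT27520 Family [ConnectX-3/ConnectX-3 Pro Virtual Function]
b0:00.2 Network controller: Mellanox Technologies MT27500/MT27520 Family [ConnectX-3/ConnectX-3 Pro Virtual Function]
"""

# bus:device.function, then the device class, then the description.
LINE_RE = re.compile(
    r"^(?P<addr>[0-9a-f]{2}:[0-9a-f]{2}\.[0-7])\s+(?P<cls>[^:]+):\s+(?P<desc>.+)$"
)


def parse_lspci(text):
    """Parse short-form lspci output into a list of device dicts."""
    devices = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            devices.append({
                "address": match["addr"],
                "class": match["cls"],
                "description": match["desc"],
                # Assumption: VFs are recognised by this substring, as in the slide.
                "is_vf": "Virtual Function" in match["desc"],
            })
    return devices


def split_functions(devices):
    """Return (physical_addresses, virtual_addresses)."""
    physical = [d["address"] for d in devices if not d["is_vf"]]
    virtual = [d["address"] for d in devices if d["is_vf"]]
    return physical, virtual


if __name__ == "__main__":
    pf, vf = split_functions(parse_lspci(SAMPLE))
    print("physical functions:", pf)
    print("virtual functions:", vf)
```

On a real host the input would come from running `lspci` rather than from the embedded sample.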
![Page 17: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/17.jpg)
Study case – Infiniband virtualization
● OpenNebula Ecosystem
● KVM SR-IOV Driver
● Great A'Tuin
– GPU devices support was funded by SURFsara
OpenNebula 4.14 release notes: "... Support for GPU consumables, giving the ability to give exclusive PCI passthrough access to VMs to GPU cards, for HPC computing."
![Page 18: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/18.jpg)
Study case – Interoperability limitations
![Page 19: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/19.jpg)
Study case – Interoperability limitations
● 2 research centers
– Share resources
● Connectivity
● Federation
● Cloud bursting
![Page 20: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/20.jpg)
Study case – Interoperability limitations
● 2 research centers
– Share resources
● Connectivity
● Federation
● Cloud bursting
● End-to-end connection
– L2-L3
– Jumbo frames
● 1 lambda - 10 Gbps
● Ping < 40 ms
● VPN IPsec
● +600Mbps between 1Gbps hosts
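To put the link figures above in perspective, the sketch below computes how much of a 1 Gbps host link the measured 600 Mbps represents, and the bandwidth-delay product of a 10 Gbps path with a 40 ms round trip. The function names and the 20 GB image size used in the demo are illustrative assumptions. The slide gives "+600 Mbps" and "< 40 ms" as bounds, so the results are rough estimates.

```python
def link_utilization(measured_bps, nominal_bps):
    """Fraction of the nominal link rate that the measured throughput reaches."""
    return measured_bps / nominal_bps


def bandwidth_delay_product_bytes(bandwidth_bps, rtt_seconds):
    """Bytes that must be in flight to keep a path of this rate and RTT full."""
    return bandwidth_bps * rtt_seconds / 8


def transfer_seconds(size_bytes, throughput_bps):
    """Time to move size_bytes at a sustained throughput, ignoring protocol overhead."""
    return size_bytes * 8 / throughput_bps


if __name__ == "__main__":
    # Figures from the slide: 600 Mbps measured between 1 Gbps hosts,
    # one 10 Gbps lambda, ping below 40 ms.
    print("utilization:", link_utilization(600e6, 1e9))
    print("BDP (bytes):", bandwidth_delay_product_bytes(10e9, 0.040))
    # Hypothetical 20 GB disk image copied through the IPsec tunnel.
    print("20 GB copy (s):", transfer_seconds(20e9, 600e6))
```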
![Page 21: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/21.jpg)
Study case – Interoperability limitations
● 2 research centers
– Share resources
● Connectivity
● Federation
● Cloud bursting
Tightly coupled
![Page 23: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/23.jpg)
Study case – Interoperability limitations
● 2 research centers
– Share resources
● Connectivity
● Federation
● Cloud bursting
OpenNebula 4.8 docs... “The remote provider could be a commercial Cloud service, such as Amazon EC2, IBM SoftLayer or Microsoft Azure, or a partner infrastructure running a different OpenNebula instance”
![Page 24: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/24.jpg)
Use case – Multiple datastore
![Page 25: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/25.jpg)
Use case – Multiple datastore
● Host load balancing required
● Migrate / live migrate
● Time of migration (live)
– VM RAM size
– Host bandwidth (IB)
– Network
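The dependence of live-migration time on RAM size and bandwidth can be approximated with a first-order model: the memory is streamed over the link while the guest keeps dirtying pages, so the effective rate is the link rate minus the dirty rate. The sketch below is an estimate under that assumption, not a description of how KVM or OpenNebula schedule the copy. The 40 Gbit/s link speed and the dirty rate in the demo are hypothetical values.

```python
def estimate_live_migration_seconds(ram_bytes, link_bps, dirty_bps=0.0):
    """Rough pre-copy estimate: RAM size over (link rate - page dirty rate).

    ram_bytes: guest memory to transfer, in bytes.
    link_bps:  usable migration bandwidth, in bits per second.
    dirty_bps: rate at which the guest rewrites memory, in bits per second.
    """
    if link_bps <= 0:
        raise ValueError("link_bps must be positive")
    if dirty_bps < 0:
        raise ValueError("dirty_bps must not be negative")
    if dirty_bps >= link_bps:
        # Memory is dirtied at least as fast as it can be sent.
        raise ValueError("migration would not converge in this model")
    return ram_bytes * 8 / (link_bps - dirty_bps)


if __name__ == "__main__":
    ram = 16 * 2**30  # hypothetical 16 GiB guest
    print("idle guest:", estimate_live_migration_seconds(ram, 40e9))
    print("busy guest:", estimate_live_migration_seconds(ram, 40e9, dirty_bps=20e9))
```

In this model, doubling the RAM size doubles the estimate, and a guest that dirties memory at half the link rate takes twice as long as an idle one.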
![Page 26: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/26.jpg)
Use case – Multiple datastore
● Storage overloaded
● Delete-recreate & deploy to new datastore
● Take image snapshot (same DS) & create template & deploy
– VM HDD size
● Manual migration:
– Stop VM + copy image to new DS + delete-recreate + deploy + replace image
![Page 27: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/27.jpg)
Use case – Remote visualization
● Lack of rendering hardware
● Low performance with ssh -X (even with compression)
● SPICE
– Audio + video
– Low BW requirements
● QXL driver (variable desktop resolution)
– 1024 x 768 … 2560 x 1600
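As an illustration of how SPICE and QXL might be requested for a VM, the sketch below builds a fragment of an OpenNebula VM template as text. The `GRAPHICS` and `RAW` attributes exist in the OpenNebula template format, but the exact values used here, in particular passing the QXL video model through `RAW`, are assumptions and not taken from the TeideHPC configuration. The function name `spice_template_fragment` is hypothetical.

```python
def spice_template_fragment(listen="0.0.0.0", use_qxl=True):
    """Return OpenNebula template lines requesting a SPICE display.

    Assumption: the QXL video model is injected as raw libvirt XML;
    a given installation may configure this elsewhere.
    """
    lines = [f'GRAPHICS = [ TYPE = "SPICE", LISTEN = "{listen}" ]']
    if use_qxl:
        # libvirt domain XML for a QXL video device (single quotes keep
        # the outer template string valid).
        xml = "<devices><video><model type='qxl'/></video></devices>"
        lines.append(f'RAW = [ TYPE = "kvm", DATA = "{xml}" ]')
    return "\n".join(lines)


if __name__ == "__main__":
    print(spice_template_fragment())
```

The resulting text would be merged into a full VM template; it is not a complete template on its own.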
![Page 28: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/28.jpg)
Use case – Remote visualization
![Page 29: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/29.jpg)
Use case – Remote visualization
![Page 30: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/30.jpg)
Capture the Flag
● 404 VMs
● Connected to client private infrastructure through IPsec tunnel
● 20.8 TB HDD
● 12.8 TB RAM
● 12800 cores
● 100 kW
![Page 32: OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure - Carlos Ignacio González Vila](https://reader031.fdocuments.net/reader031/viewer/2022030306/58cf21381a28abc05f8b6093/html5/thumbnails/32.jpg)
References
● http://blog.scottlowe.org/2009/12/02/what-is-sr-iov/
● http://docs.opennebula.org/4.12/advanced_administration/data_center_federation/introf.html
● http://docs.opennebula.org/4.14/release_notes/release_notes/whats_new.html#opennebula-core