Grid, Globus Toolkit, and OGSA

61

Transcript of Grid, Globus Toolkit, and OGSA

  • 1.

2. McKean CEO & Editorial Director InfoWorld Media Group Kevin 3. 4.

  • Platinum Sponsors

Gold Sponsors 5. 6. Foster Professor, Computer Science University of Chicago Assoc. Director, Mathematics & Computer Science Division Argonne National Laboratory Ian 7. The Grid inYour Future 8. Three Questions

  • What is the Grid?
  • Where is it today, and where is it going?
  • Why should you care?

9. The Grid Is

  • A collaboration & resource sharing infrastructure for scientific applications
  • A distributed service integration and management technology
  • A disruptive technology that enables a virtualized, collaborative, distributed world
  • An open source technology & community
  • A marketing slogan
  • All of the above

10. Grid Past, Present, Future

  • Past
    • Origins and broad adoption in eScience, fueled by open source Globus Toolkit
  • Present
    • Rapidly growing commercial adoption
    • Open Grid Services Architecture (OGSA)
  • Future
    • Key enabler of new applications & industries based on resource virtualization and distributed service integration

11. Why You Should Care

  • 1) Grids address pain points now, e.g.
    • Cost of provisioning for peak demand
    • Data federation and integration
  • 2) Grids are a disruptive technology
    • Usher in (or solve problems of) a virtualized, collaborative, distributed world
    • Potentially significant competitive advantages
  • 3) An open Grid is to your advantage
    • Insist that your suppliers embrace OGSA, refuse proprietary solutions!

12. Overview

  • The power grid analogy & its limitations
  • Grid past: From eScience to eBusiness
  • Grid present: OGSA
  • Grid future: virtualization & ubiquitization
  • Summary

13. The Power Grid Time Quality, economies of scale 14. By Analogy, A Computing Grid

  • Decouple production and consumption
    • Enable on-demand access
    • Achieve economies of scale
    • Enhance consumer flexibility
    • Enable new devices
  • On a variety of scales
    • Department
    • Campus
    • Enterprise
    • Internet

15. Not Exactly a New Idea

  • The time-sharing computer system can unite a group of investigators . one can conceive of such a facility as an intellectual public utility.
    • ~ Fernando Corbato and Robert Fano, 1966
  • We will perhaps see the spread of computer utilities, which, like present electric and telephone utilities, will service individual homes and offices across the country.
    • ~ Len Kleinrock, 1967

16. But, Things are Different Now

  • Networks are far faster (and cheaper)
    • Faster than computer backplanes
  • Computing is very different than pre-Net
    • Our computers have already disintegrated
    • E-commerce increases size of demand peaks
    • Entirely new applications & social structures
  • Weve learned a few things about software

17. But Wait A Minute Computing isnt Really Like Electricity!

  • I import electricity but must export data
  • Computing is not interchangeable but highly heterogeneous
    • Computers, data, sensors, services,
  • Ok, so the story is more complicated
  • But more significantly, the sum can be greater than the parts
    • Real opportunity: Construct new capabilities dynamically from distributed services
    • Virtualization & distributed service mgmt

18. Virtualization & Distributed Service Management Less capable, integrated Less connected User service locus Larger, more integrated More connected Dynamically provisionedDevice Continuum Resource & service aggregation Delivery of virtualized services with QoS guarantees Dynamic, secure service discovery & composition Distributed service management 19. The Fundamental Questions

  • Can I build effective virtualized services?
  • Can I achieve QoS across services?
  • Can I achieve economies of scale?
  • Can I identify applications that yield real competitive advantage?

20. Overview

  • The power grid analogy & its limitations
  • Grid past: From eScience to eBusiness
  • Grid present: OGSA
  • Grid future: virtualization & ubiquitization
  • Summary

21. Origins: Revolution in Science

  • Pre-Internet
    • Theorize &/or experiment, alone or in small teams; publish paper
  • Post-Internet
    • Construct and mine large databases ofobservational or simulation data
    • Develop simulations & analyses
    • Access specialized devices remotely
    • Exchange information withindistributed multidisciplinary teams

22. Science Grids Tier0/1 facility Tier2 facility 10 Gbps link 2.5 Gbps link 622 Mbps link Other link Tier3 facility Cambridge Newcastle Edinburgh Oxford Glasgow Manchester Cardiff Soton London Belfast DL RAL Hinxton 23. 24. NSF TeraGrid

  • NCSA, SDSC, Argonne, Caltech
  • Unprecedented capability
    • 13.6 trillion flop/s
    • 600 terabytes of data
    • 40 gigabits per second
    • Accessible to thousands of scientists working on advanced research
  • www.teragrid.org

25. Data Grids for Physics

  • Enable international community of 1000s to access & analyze petabytes of data
  • Harness computing & storage worldwide
  • Virtual data concepts: manage programs,data, workflow
  • Distributed systemmanagement

26. NEESgrid Earthquake Engineering Collaboratory U.Nevada Reno www.neesgrid.org 27. NASA: Aviation Safety

  • Lift Capabilities
  • Drag Capabilities
  • Responsiveness
  • Deflection capabilities
  • Responsiveness
  • Thrust performance
  • Reverse Thrust performance
  • Responsiveness
  • Fuel Consumption
  • Braking performance
  • Steering capabilities
  • Traction
  • Dampening capabilities

Crew Capabilities - accuracy - perception - stamina - re-action times - SOPs Engine Models Airframe Models Wing Models Landing Gear Models Stabilizer Models Human Models 28. Access Grid Collaboration

  • Enable collaborative work at dozens of sites worldwide,with strong sense of shared presence
  • Combination of commodity audio/video tech + Grid technologies for security, discovery, etc.
  • 150+ sites worldwide,number rising rapidly

Ambient mic (tabletop) Presenter mic Presenter camera Audience camera 29. The Need for New Technology

  • Resource sharing & coordinated problem solving in dynamic, multi-institutional virtual organizations

30. The Grid/eScience World: Status

  • Dozens of major Grid projects in scientific & technical computing/research & education
    • Deployment, application, technology
    • www.mcs.anl.gov/~foster/grid-projects
  • Globus Toolkit broadly adopted as de facto standard for major protocols & services
  • Global Grid Forum a significant force for community building and standardization
    • GGF8: Tokyo, March 2003, 850 people
    • www.gridforum.org; 200+ organizations; Boeing, Merck, Ford, J&J, IBM, Platform,

31. Revolution in Business

  • Pre-Internet
    • Central data processing facility
  • Post-Internet
    • Enterprise computing is highly distributed, heterogeneous, inter-enterprise (B2B)
    • Business processes increasinglycomputing- & data-rich
    • Outsourcing becomes feasible =>service providers of various sorts

32. The New Enterprise Computing Environment 33. eScience/eBusiness Vision

  • Link dynamically acquired resources
    • From collaborators, customers, eUtilities, (members of evolving virtual organization)
  • Into a virtual computing system
    • Dynamic, multi-faceted system spanning institutions and industries
    • Configured to meet instantaneous needs, for:
  • Multi-faceted QoS for demanding workloads
    • Security, performance, reliability,

34. Overview

  • The power grid analogy & its limitations
  • Grid past: From eScience to eBusiness
  • Grid present: OGSA
  • Grid future: virtualization & ubiquitization
  • Summary

35. Grids and Open Standards Increased functionality, standardization Time Custom solutions App-specific Services Open Grid Services Arch GGF: OGSI, (+ OASIS, W3C) Multiple implementations, including Globus Toolkit Web services Globus Toolkit Defacto standards GGF: GridFTP, GSI X.509, LDAP, FTP, 36. Open Grid Services Architecture

  • Service-oriented architecture
    • Key to virtualization, discovery, composition, local-remote transparency
  • Leverage industry standards
    • Internet, Web services
  • Distributed service management
    • A component model for Web services
  • A framework for the definition of composable, interoperable services

The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration, Foster, Kesselman, Nick, Tuecke, 2002 37. Transient Service Instances

  • Web services address discovery & invocation ofpersistent services
    • Interface to persistent state of entire enterprise
  • In Grids, must also supporttransient service instances , created/destroyed dynamically
    • Interfaces to the states of distributed activities
    • E.g. workflow, video conf., dist. data analysis
  • Significant implications for how services are managed, named, discovered, and used
    • In fact, much of Grid is concerned with the management of service instances

38. OGSA Structure

  • A standard substrate: the Grid service
    • Standard interfaces and behaviors that address key distributed system issues
    • A refactoring and extension of the Globus Toolkit protocol suite
  • supports standard service specifications
    • Resource management, databases, workflow, security, diagnostics, etc., etc.
    • Target of current & planned GGF efforts
  • and arbitrary application-specific services based on these & other definitions

39. Open Grid Services Infrastructure Implementation Hosting environment/runtime (C, J2EE, .NET, ) Data access

  • Lifetime management
  • Explicit destruction
  • Soft-state lifetime

Service data element Other standard interfaces: factory, notification, collections Service data element Service data element GridService (required)

  • Introspection:
  • What port types?
  • What policy?
  • What state?

Client Grid Service Handle Grid Service Reference handle resolution 40. Open Grid Services Infrastructure GWD-R (draft-ggf-ogsi- gridservice-23)Editors: Open Grid Services Infrastructure (OGSI)S. Tuecke, ANL http://www.ggf.org/ogsi-wgK. Czajkowski, USC/ISI I. Foster, ANL J. Frey, IBM S. Graham, IBM C. Kesselman, USC/ISI D. Snelling, Fujitsu Labs P. Vanderbilt, NASA February 17, 2003 Open Grid Services Infrastructure (OGSI) 41. The OGSA Platform OGSI GWD-R (draft-ggf-ogsa-platform-3) Editors: Open Grid Services Architecture Platform I. Foster, Argonne & U.Chicago http://www.ggf.org/ogsa-wgD. Gannon, Indiana U. Transport Protocol Hosting Environment Hosting Environment Host. Env.& Protocol Bindings OGSA Platform services: registry, authorization, monitoring, data access, etc., etc.More specialized& domain-specific services Models for resources & other entities Other models Environment- specific profiles Domain- specific profiles OGSA Platform 42. OGSA: Next Steps

  • Technical specifications
    • Open Grid Services Infrastructure is complete
    • Security, data access, Java binding, common resource models, etc., etc., in the pipeline
  • Implementations and compliant products
    • Here: OGSA-based Globus Toolkit v3,
    • Announced: IBM, Avaki, Platform, Sun, NEC, HP, Oracle, UD, Entropia, Insors, ,
  • Rich set of service defns & implementations

43. Globus Toolkit v3 (GT3)

  • Implement core OGSI interfaces
  • Support primary GT2 interfaces
    • High degree of backward compatibility
  • Multiple platforms & hosting environments
    • J2EE, Java, C, .NET, Python
  • New services
    • SLA negotiation (GRAM-2), registry, replica location, community authorization, data,
  • Growing external contributions & adoption

44. Globus Toolkit History NASA initiates Information Power Grid Globus Project wins Global Information Infrastructure Award MPICH-G released The Grid: Blueprint for a New Computing Infrastructurepublished GT 1.0.0 Released Early Application Successes Reported GT 1.1.1 Released GT 1.1.2 Released GT 1.1.3 Released NSF & European Commission Initiate Many New Grid Projects GT 1.1.4 and MPICH-G2 Released Anatomy of the Grid Paper Released First EuroGlobus Conference Held in Lecce Significant Commercial Interest in Grids NSF GRIDS Center Initiated GT 2.0 beta Released Physiology of the Grid Paper Released GT 2.0 Released GT 2.2 Released Only Globus.Org; not downloads from: NMI UK eScience EU DataGrid IBM Platform etc. DARPA, NSF begin funding Grid work 45. Overview

  • The power grid analogy & its limitations
  • Grid past: From eScience to eBusiness
  • Grid present: OGSA
  • Grid future: virtualization & ubiquitization
  • Summary

46. Virtualization & Ubiquitization 47. Converging on Standards-Based Grids Complexity of Workload DepartmentEnterpriseCollaborationInternet OGSA-based Grids Industrial Grid Deployments Scientific Grid Deployments 48. What I Expect from OGSA

  • True open source/industry partnership
    • Globus Toolkit 3.0 provides common Grid middleware framework
    • Industry adds value in services & platforms
  • New applications (and industries?)
    • Grid middleware will create virtual services, servers, and storage from pools of services, servers, clients & storage dispersed throughout the Internet
    • Grid computing utilities will emerge and become like power utilities

49. Platform Symphony Servers: Execution Application Services: Distribution Applications: Delivery 50. Platform Symphony Servers: Execution Application Services: Distribution Applications: Delivery Application Virtualization

  • Automatically connect applications to services
  • Dynamic & intelligent
  • provisioning

51. Platform Symphony Servers: Execution Application Services: Distribution Applications: Delivery Application Virtualization

  • Automatically connect applications to services
  • Dynamic & intelligent
  • provisioning

Infrastructure Virtualization

  • Dynamic & intelligent
  • provisioning
  • Automatic failover

52. A Cross-Institutional GridNY Financial Institution Insurance Group UK Financial Institution NYFinancial Institution Capital Markets Group 53. Overview

  • The power grid analogy & its limitations
  • Grid past: From eScience to eBusiness
  • Grid present: OGSA
  • Grid future: virtualization & ubiquitization
  • Summary

54. Recap: The Grid Is

  • A collaboration & resource sharing infrastructure for scientific applications
  • A distributed service integration and management technology
  • A disruptive technology that enables a virtualized, collaborative, distributed world
  • An open source technology & community
  • A marketing slogan
  • All of the above

55. Grid Past, Present, Future

  • Past
    • Origins and broad adoption in eScience, fueled by open source Globus Toolkit
  • Present
    • Rapidly growing commercial adoption
    • Open Grid Services Architecture (OGSA)
  • Future
    • Key enabler of new applications & industries based on resource virtualization and distributed service integration

56. Why You Should Care

  • 1) Grids address pain points now, e.g.
    • Cost of provisioning for peak demand
    • Data federation and integration
  • 2) Grids are a disruptive technology
    • Usher in (or solve problems of) a virtualized, collaborative, distributed world
    • Potentially significant competitive advantages
  • 3) An open Grid is to your advantage
    • Insist that your suppliers embrace OGSA, refuse proprietary solutions!

57. Summary

  • Grids : Enabling resource sharing & coordinated problem solving in dynamic, multi-institutional virtual organizations
  • Major relevance not to eScience but also eBusiness, as an enabler ofcomputing on demand ,eUtilities ,B2B communities ,
  • Open Grid Services Architecturedefines standards for distributed system integration
  • Globus Toolkitas open source, open architecture solution to key Grid problems

58. For More Information

  • The Globus Project
    • www.globus.org
  • Global Grid Forum
    • www.gridforum.org
  • Background information
    • www.mcs.anl.gov/~foster
  • [email_address]

59. 60. 61.