Leveraging deep learning hyperparameter tuning frameworks · Leveraging deep learning...

iWet: The Intelligent WRF Ensemble ToolLeveraging deep learning hyperparameter tuning frameworks

Derek Jensen, Donald Lucas, Clifford Anderson-Bergman, Sonia WhartonLawrence Livermore National Laboratory

1. Ensembles Make the World a Better Place

Ensemble methods produce more accurate predictions thanindividual runs and provide a measure of uncertainty

Common Types of Meteorological Ensembles– Initial/Boundary conditions (ICBC)

–Data assimilation strategy (DAS)

–Multi model (MM)

–Multi physics (MP)

– Stochastic physics (SP)

Problem 1: The size of the Ensemble grows exponentially withthe number of dimensions considerede.g. 4ICBC∗ 2DAS∗ (3MP)4 = 648 WRF Runs

Problem 2: Effective ensemble runs rely on choosing anappropriate set of input parameters

Problem 3: WRF is expensive with many discrete inputs;traditional gradient-descent optimization will not work

Figure: Trivial example of a four-member WRFensemble, generated using WET, validated againstradiosonde data collected at three locations from27 January – 1 February. Lines indicate the meanvalues and shading represents ±1 standarddeviation.

3. Sequential Model-Based hyperparameter Optimization (SMBO). See [1, 2]

Like a good WRF ensemble, deep neural network (DNN)performance is strongly dependent on tuning

The DNN community has developed many tools to efficientlyoptimize expensive, non-differentiable objective functions

SMBO builds a probability model of the DNN performance anduses it to select the most promising hyperparameters toevaluate next

Table: SMBO for DNN tuning is analogous to tuning a WRF EnsembleAspects of SMBO DNN Tuning WRF Ensemble1. Parameter Space n. layers, dropout, etc. ICBC, DAS, MP, SP2. Objective Function DNN Training Running WRF3. Surrogate Model e.g. Gaussian Process e.g. Gaussian Process4. Acquisition Function e.g. Expected Improvement e.g. Expected Improvement5. Performance History X X

Figure: (left) Example of the unknown objectivefunction and surrogate model and (right)acquisition function, which balancesexploration (gathering more information) andexploitation (making the best decision givencurrent information) to choose the next set ofparameters [2].

2. The WRF Ensemble Tool (WET) – Developed at LLNL

Figure: WRF model workflow. Geogrid.exe creates theterrestrial dataset, Ungrib.exe unpacks the initial andboundary condition datasets, Metgrid.exe horizontallyinterpolates the initial and boundary condition dataonto the model domain, Real.exe verticallyinterpolates data to the model coordinates, andWRF.exe generates the model forecast.

WET:– Automoatically fetches ICBC datasets– Generates the ensemble directory structure–Modifies the WPS and WRF namelists– Allows for namelist consistency logic– Executes each step of the WRF workflow in parallel– Handles restarts

Lists define ensemble iterables:WRF_NAMELIST_CHANGES = pd.Series((’time_control’, ’run_days’): "0, ",(’time_control’, ’run_hours’): "12, ",(’physics’, ’mp_physics’): [’1’, ’2’, ’3’, ’4’],(’physics’, ’bl_pbl_physics’): [’1’, ’2’], })

Currently supports exhaustive and random sampling

References1. Koehrsen, Will “A Conceptual Explanation of Bayesian Hyperparameter Optimization for Machine Learning” Toward Data Science, Jun 24, 20182. Krasser, Martin “Bayesian Optimization” krasserm.github.io, March 21, 20183. Liaw et al., “Tune: A Research Platform for Distributed Model Selection and Training” arXiv preprint arXiv:1807.05118, 2018

This work performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.Release Number: LLNL-POST-772377

4. Putting the i in iWet

Tune is a scalable framework for deep learninghyperparameter search, particularly SMBO

To run Tune, couple a search algorithm with a trialschedulerSearch Algorithm: Where should we sample next? e.g.» Grid Search and Random Search» HyperOpt: SMBO with Tree-structured Parzen Estimators

Trial Scheduler: Where/when to run the trials and when to stopthem? e.g.» Population Based Training» Asynchronous HyperBand

A call to WET and a defined score (e.g. MSE computedbetween WRF output and observations) constitutes theobjective function

Figure: Tune is built on Ray, a flexible,high-performance distributed execution framework[3]

Feasibility Question: Can we transfer state-of-the-art methods used for tuning Deep Learning architectures,such as SMBO, to the problem of tuning ensemble parameters?

Leveraging deep learning hyperparameter tuning frameworks · Leveraging deep learning...

Documents

Transcript of Leveraging deep learning hyperparameter tuning frameworks · Leveraging deep learning...

HyperTuner: Visual Analytics for Hyperparameter Tuning by ... · HyperTuner: Visual Analytics for Hyperparameter Tuning by Professionals ... User Experience Design Cloudera Haley

Open-Source Asynchronous Distributed Hyperparameter ... · Open-Source Asynchronous Distributed Hyperparameter Optimization Based on Apache Spark FOSDEM’20 02.02.2020. ... Inner

Hyperparameter optimization with approximate gradient

Hyperparameter Optimization in Black-box Image Processing ...

Bilevel Programming for Hyperparameter Optimization and ...

REDUCING THE SEARCH SPACE FOR HYPERPARAMETER OPTIMIZATION ...home.engineering.iastate.edu/~chinmay/files/papers/pgsrICASSP19.… · of hyperparameter optimization (HPO) addresses

Hyperparameter Optimization via Sequential Uniform Designs

Leveraging Comparisons between Cultural Frameworks ...ceur-ws.org/Vol-1432/cats_pap1.pdf · Leveraging Comparisons between Cultural Frameworks: Preliminary Investigations of the MAUOC

eGovernment for Citizen: Leveraging Open SOA Standards and Interoperability Frameworks

Building and Leveraging Content Frameworks and Collateral Maps

GreenHadoop : Leveraging Green Energy in Data-Processing Frameworks

Leveraging Frameworks to Slide Heading Develop an ... 2014 Framework... · Leveraging Frameworks to Slide Heading . Develop an Effective Control Environment . ... • Hardware Loss,

Speeding up Automatic Hyperparameter Optimization of Deep ...

Automatically Leveraging MapReduce Frameworks for Data ... · MapReduce is a popular programming paradigm for running large-scale data-intensive computation. Recently, many frameworks

Bayesian Hyperparameter Optimization for Ensemble Learning

Bilevel Programming for Hyperparameter …proceedings.mlr.press/v80/franceschi18a/franceschi18a.pdfBilevel Programming for Hyperparameter Optimization and Meta-Learning Table 1. Links

Hyperparameter Estimation in Exponential Family State ... · Hyperparameter Estimation in Exponential Family State Space Models Sonderforschungsbereich 386, Paper 6 (1995) ... tial

Provenance Supporting Hyperparameter Analysis in Deep ...

Multi-Objective Multi-Fidelity Hyperparameter Optimization ...

Leveraging Telco Frameworks for Smart Grid - · PDF fileLeveraging Telco Frameworks for Smart Grid ... Network GSM Mobile PSTN Data IP Tx ... • RA KPI Framework