NeuralTools6 En

download NeuralTools6 En

of 104

Transcript of NeuralTools6 En

  • 8/11/2019 NeuralTools6 En

    1/104

    NeuralTools Neural Network Add-In for

    Microsoft

    Excel

    Version 6February, 2013

    Palisade Corporation798 Cascadilla St.Ithaca, NY USA 14850(607) 277-8000(607) 277-8001 (fax)http://www.palisade.com (website)[email protected] (e-mail)

  • 8/11/2019 NeuralTools6 En

    2/104

    Copyright NoticeCopyright 2013, Palisade Corporation.

    Trademark AcknowledgmentsMicrosoft, Excel and Windows are registered trademarks of Microsoft Corporation.IBM is a registered trademark of International Business Machines, Inc.Palisade, TopRank, BestFit and RISKview are registered trademarks of PalisadeCorporation.

  • 8/11/2019 NeuralTools6 En

    3/104

    Welcome to NeuralTools for Excel i

    Welcome to NeuralTools forExcel

    WelcomeNeuralTools gives Microsoft Excel - the industry-standard dataanalysis and modeling tool - a new, powerful modeling toolset!NeuralTools is a Microsoft Excel neural networks add-in, allowingyou to analyze data in Excel worksheets and work in the familiarMicrosoft Office environment. By combining a powerful datamanager, along with state-of-the-art neural networks algorithms,NeuralTools brings you the best of two worlds: Microsoft Office ease-

    of-use and reporting, with robust, accurate predictions from neuralnetworks.

    Work Where You're ComfortableIf you know Excel, you'll know NeuralTools! NeuralTools works justas Excel does, with toolbars, menus and custom worksheet functions,all inside of Excel. Unlike stand-alone neural networks software,there's no steep learning curve and upfront training costs withNeuralTools, because you work just as you are used to working inExcel. Your data and variables are in Excel spreadsheets. You canutilize standard Excel formulas for calculations, along with Excelsorting and pivot tables. Reports and charts from your analyses are in

    standard Excel format and can utilize all of Excel's built-in formattingcapabilities.

  • 8/11/2019 NeuralTools6 En

    4/104

    ii Welcome

    NeuralTools AnalysesNeural Networks are capable of learning complex relationships indata. By mimicking the functions of the brain, they can discernpatterns in data, and then extrapolate predictions when given newdata. The problems Neural Networks are used for can be divided intwo general groups:

    Classification Problems: Problems in which you are trying todetermine what type of category an unknown item falls into.Examples include medical diagnoses and prediction of creditrepayment ability.

    Numeric Problems: Situations where you need to predict aspecific numeric outcome. Examples include stock priceforecasting and predicting the level of sales during a futuretime period.

    Neural Networks are used in a broad range of applications including:stock market prediction, credit and loan risk assignment, credit frauddetection, forecasting sales, general business forecasting, investmentrisk, medical diagnosis, research in scientific fields, and control

    systems.NeuralTools includes the latest neural network algorithms to makethe best predictions on both classification problems (called categoryprediction in NeuralTools) and numeric problems.

    NeuralTools Data ManagementNeuralTools provides a comprehensive dataset and variable managerright in Excel, similar to that provided with StatTools, Palisade'sstatistics add-in for Excel. You can define any number of datasets,each with the variables you want to analyze, directly from your datain Excel. NeuralTools intelligently assesses your blocks of data,

    suggesting variable names and types as well as data locations. Yourdatasets and variables can reside in different workbooks andworksheets, allowing you to organize your data as you see fit. Then,you train neural networks that refer to your variables, instead of re-selecting your data over and over again in Excel. And NeuralTools'variables aren't limited in size to a single column of data in an Excelworksheet you can use the same column across up to 255worksheets for a single variable!

  • 8/11/2019 NeuralTools6 En

    5/104

    Welcome to NeuralTools for Excel iii

    NeuralTools ReportingExcel is great for reports and graphs, and NeuralTools makes themost of this. NeuralTools uses Excel-format graphs, which can beeasily customized for new colors, fonts and added text. Report titles,number formats and text can be changed just as in any standard Excelworksheet. Drag and drop tables and charts from NeuralToolsreports straight into your own documents in other applications.

    NeuralTools Industrial also includes Live Prediction, where predictedvalues are calculated as new data is entered into your Excelworksheet. This live calculation happens automatically, just like otherExcel recalculations.

    Data Access and SharingExcel has great data import features, so bringing your existing datainto NeuralTools is easy! Use standard Excel capabilities to read indata from Microsoft SQL Server, Oracle, Microsoft Access, or anyother ODBC compliant database. Load data from text files or otherapplications if you can read it into Excel, you can use it with

    NeuralTools!NeuralTools saves all its results and data in Excel workbooks. Justlike any other Excel file, you can send your NeuralTools results andnetworks to colleagues anywhere. Sharing couldn't be easier!

  • 8/11/2019 NeuralTools6 En

    6/104

  • 8/11/2019 NeuralTools6 En

    7/104

    Table of Contents v

    Table of ContentsChapter 1: Getting Started 1

    Introduction ......................................................................................... 3

    About This Version ................................................................................. 3 NeuralTools Professional and Industrial Versions ........................... 3

    Working with your Operating Environment ...................................... 3 If You Need Help ..................................................................................... 4 NeuralTools System Requirements ...................................................... 6

    Installation Instructions ..................................................................... 7

    General Installation Instructions .......................................................... 7 Setting Up the NeuralTools Icons or Shortcuts .................................. 7 The DecisionTools Suite ......................................................................... 8 Software Activation ................................................................................. 8

    Chapter 2: An Overview of NeuralTools 9

    Overview ............................................................................................ 11

    Why Neural Networks? ......................................................................... 11 NeuralTools and Neural Networks .................................................... 12 NeuralTools Menu and Toolbar .......................................................... 13 Data Sets and the Data Set Manager .................................................. 13 Training a Neural Network .................................................................. 15 Testing a Network .................................................................................. 20 Prediction ................................................................................................. 21 NeuralTools Reports and Charts ......................................................... 23 NeuralTools Utilities ............................................................................. 23 Using NeuralTools with StatTools, Solver and Evolver ................. 24

    Chapter 3: NeuralTools Reference Guide 25

    Introduction ....................................................................................... 27

    Reference: NeuralTools Icons 29

    NeuralTools Toolbar ......................................................................... 29

  • 8/11/2019 NeuralTools6 En

    8/104

    vi Welcome

    Reference: NeuralTools Menu Commands 31

    Introduction ...................................................................................... 31

    Icons in Dialog Boxes ........................................................................... 32

    Command Reference ....................................................................... 33

    Data Set Manager Command .............................................................. 33 Train Command ..................................................................................... 38 Test Command ....................................................................................... 50 Predict Command .................................................................................. 58

    Utilities .............................................................................................. 63

    Application Settings Command ......................................................... 63 Neural Net Manager Command ......................................................... 66 Missing Data Utilities Command ...................................................... 68 Testing Sensitivity Command ............................................................ 71

    More on Neural Networks 73

    Neural Network Basics .................................................................... 73

    Neural Nets vs. Statistical Methods ................................................... 73

    The Structure of a Neural Net ............................................................. 74 Numeric and Category Prediction ...................................................... 74 Training a Net ........................................................................................ 74 Computer Processing of Neural Nets ................................................ 75 Types of Neural Networks .................................................................. 75

    Multi-Layer Feedforward Nets ......................................................... 77

    MLF Architecture .................................................................................. 77 MLF Net Training ................................................................................. 79

    Generalized Regression Neural Nets and Probabilistic NeuralNets ................................................................................................ 83

    Generalized Regression Neural Nets ................................................ 83

    Probabilistic Neural Nets .................................................................... 85 Comparison of MLF Nets to PN/GRN Nets .................................... 89

    Input Transformation ....................................................................... 91

    Recommended Readings ................................................................ 93

    Index 95

  • 8/11/2019 NeuralTools6 En

    9/104

    Chapter 1: Getting Started 1

    Chapter 1: Getting Started

    Introduction ......................................................................................... 3

    About This Version ................................................................................. 3 NeuralTools Professional and Industrial Versions ........................... 3 Working with your Operating Environment ...................................... 3 If You Need Help ..................................................................................... 4 NeuralTools System Requirements ...................................................... 6

    Installation Instructions ..................................................................... 7

    General Installation Instructions .......................................................... 7 Setting Up the NeuralTools Icons or Shortcuts .................................. 7

    @RISK 4.5 Help System Palisade Corporation, 1999

  • 8/11/2019 NeuralTools6 En

    10/104

    2

  • 8/11/2019 NeuralTools6 En

    11/104

    Chapter 1: Getting Started 3

    IntroductionThis introduction describes the contents of the NeuralTools packageand shows how to install NeuralTools and attach it to your copy ofMicrosoft Excel 2003 or higher.

    About This Version

    This version of NeuralTools can be used with Microsoft Excel 2003 orhigher.

    NeuralTools Professional and Industrial VersionsNeuralTools is available in two versions Professional and Industrial.The differences are as follows:

    Datasets in NeuralTools Professional are limited to 1000 cases,while with NeuralTools Industrial their size is only limited bythe available memory.

    Live Prediction, where predicted values are calculated as newdata is entered into your Excel worksheet, is provided inNeuralTools Industrial only. This live calculation happensautomatically, just like other Excel recalculations.

    Working with your Operating EnvironmentThis Users Guide assumes that you have a general knowledge of theWindows operating system and Excel. In particular:

    You are familiar with your computer and using the mouse.

    You are familiar with terms such as icons, click, double-click,menu, window, command and object.

    You understand basic concepts such as directory structures

    and file naming.

  • 8/11/2019 NeuralTools6 En

    12/104

    4 Introduction

    If You Need HelpTechnical support is provided free of charge for all registered users ofNeuralTools with a current maintenance plan, or is available on a perincident charge. To ensure that you are a registered user ofNeuralTools, please register online athttp://www.palisade.com/support/register.asp .

    If you contact us by telephone, please have your serial number andUsers Guide ready. We can offer better technical support if you arein front of your computer and ready to work.

    Before contacting technical support, please review the followingchecklist:

    Have you referred to the on-line help? Have you checked this User's Guide and reviewed the on-line

    multimedia tutorial? Have you read the README.WRI file? It contains current

    information on NeuralTools that may not be included in themanual.

    Can you duplicate the problem consistently? Can youduplicate the problem on a different computer or with adifferent model?

    Have you looked at our site on the World Wide Web? It canbe found at http://www.palisade.com . Our Web site alsocontains the latest FAQ (a searchable database of tech supportquestions and answers) and NeuralTools patches in ourTechnical Support section. We recommend visiting our Website regularly for all the latest information on NeuralTools andother Palisade software.

    Before Calling

  • 8/11/2019 NeuralTools6 En

    13/104

    Chapter 1: Getting Started 5

    Palisade Corporation welcomes your questions, comments orsuggestions regarding NeuralTools. Contact our technical supportstaff using any of the following methods:

    Email us at [email protected]. Telephone us at (607) 277-8000 any weekday from 9:00 AM to 5:00

    PM, EST. Follow the prompt to reach technical support. Fax us at (607) 277-8001. Mail us a letter at:

    Technical SupportPalisade Corporation798 Cascadilla St.Ithaca, NY 14850 USA

    If you want to contact Palisade Europe: Email us at [email protected]. Telephone us at +44 1895 425050 (UK). Fax us at +44 1895 425051 (UK). Mail us a letter at:

    Palisade Europe

    31 The Green West DraytonMiddlesexUB7 7PNUnited Kingdom

    If you want to contact Palisade Asia-Pacific: Email us at [email protected]. Telephone us at +61 2 9252 5922 (AU). Fax us at +61 2 9252 2820 (AU). Mail us a letter at:

    Palisade Asia-Pacific Pty Limited

    Suite 404, Level 420 Loftus StreetSydney NSW 2000Australia

    Regardless of how you contact us, please include the product name,version and serial number. The exact version can be found byselecting the Help About command on the NeuralTools menu inExcel.

    ContactingPalisade

  • 8/11/2019 NeuralTools6 En

    14/104

    6 Introduction

    Telephone support is not available with the student version ofNeuralTools. If you need help, we recommend the followingalternatives:

    Consult with your professor or teaching assistant. Log on to http://www.palisade.com for answers to frequently asked

    questions. Contact our technical support department via e-mail or fax.

    NeuralTools System RequirementsSystem requirements for NeuralTools 6 for Microsoft Excel forWindows include:

    Microsoft Windows XP or higher. Microsoft Excel 2003 or higher.

    StudentVersions

  • 8/11/2019 NeuralTools6 En

    15/104

    Chapter 1: Getting Started 7

    Installation Instructions

    General Installation InstructionsThe Setup program copies the NeuralTools system files into adirectory you specify on your hard disk. To run the Setup program inWindows XP or higher:

    1) Double-click the NeuralTools Setup.exe from your download orinstallation CD and follow the Setup instructions on the screen

    If you encounter problems while installing NeuralTools, verify thatthere is adequate space on the drive to which youre trying to install.After youve freed up adequate space, try rerunning the installation.

    If you wish to remove NeuralTools from your computer, use theControl Panels Add/Remove Programs utility and select the entryfor NeuralTools.

    Setting Up the NeuralTools Icons or ShortcutsIn Windows, setup automatically creates a NeuralTools command in

    the Programs\Palisade DecisionTools menu of the Taskbar.However, if problems are encountered during Setup, or if you wish todo this manually another time, follow these directions. Note that thedirections given below are for Windows XP Professional. Instructionsfor other operating systems may vary.

    1) Click the Start button, and then point to Settings.

    2) Click Taskbar and Start Menu, and then click the Start Menu tab.

    3) Click Customize, click Add, and then click Browse.

    4) Locate the file NeuralTools.EXE, click it and then click OK.

    5) Click Next, and then double-click the menu on which you want the

    program to appear.6) Type the name NeuralTools, and then click Finish.

    7) Click OK on all opened dialogs.

    RemovingNeuralToolsfrom YourComputer

    Creating the

    Shortcut in theWindowsTaskbar

  • 8/11/2019 NeuralTools6 En

    16/104

    8 Installation Instructions

    The DecisionTools SuiteNeuralTools is part of the DecisionTools Suite, a set of products forrisk and decision analysis available from Palisade Corporation. Thedefault installation procedure of NeuralTools puts NeuralTools in asubdirectory of a main Program Files\Palisade directory. This isquite similar to how Excel is often installed into a subdirectory of aMicrosoft Office directory.

    One subdirectory of the Program Files\Palisade directory will be theNeuralTools directory (by default called NeuralTools5). Thisdirectory contains the NeuralTools add-in program file(NEURALTOOLS.XLA) plus example models and other filesnecessary for NeuralTools to run. Another subdirectory of ProgramFiles\Palisade is the SYSTEM directory which contains files neededby every program in the DecisionTools Suite, including common helpfiles and program libraries.

    Software ActivationActivation is a one time license verification process that is required in

    order for your Palisade software to run as a fully licensed product.An activation ID is on your printed/emailed invoice and mayresemble a dash separated sequence like "DNA-6438907-651282-CDM". If you enter your Activation ID during installation, then yoursoftware is activated at the end of the installation process and nofurther user action is required. If you wish to activate your softwareafter installation, select the Help menu License Manager command .

    The License Manager can be used to activate, deactivate and movesoftware licenses. It is also used to manage licenses for networkinstallations. Follow the prompts and dialogs in the License Managerto perform the desired licensing operation.

  • 8/11/2019 NeuralTools6 En

    17/104

    Chapter 2: An Overview of NeuralTools 9

    Chapter 2: An Overview ofNeuralTools

    Overview ............................................................................................ 11

    Why Neural Networks? ......................................................................... 11 NeuralTools and Neural Networks .................................................... 12 NeuralTools Menu and Toolbar .......................................................... 13 Data Sets and the Data Set Manager .................................................. 13

    Variable Types .......................................................................... 15 Multi-Range Data ..................................................................... 15

    Training a Neural Network .................................................................. 15 Combining Training, Testing and Prediction ..................... 16 Net Configurations ................................................................... 16 Training Preview ...................................................................... 17

    Training Process ....................................................................... 18

    Training Reports ....................................................................... 19 Testing a Network .................................................................................. 20

    Testing Reports ......................................................................... 20 Prediction ................................................................................................. 21

    Prediction Results ..................................................................... 22 Live Prediction .......................................................................... 22

    NeuralTools Reports and Charts ......................................................... 23

  • 8/11/2019 NeuralTools6 En

    18/104

    10

  • 8/11/2019 NeuralTools6 En

    19/104

    Chapter 2: An Overview of NeuralTools 11

    OverviewNeuralTools provides you with powerful neural network capabilitiesin an environment that you are familiar with - Microsoft Excel.NeuralTools procedures - such as defining data sets, training andtesting neural networks and predicting values using trained networks- can be run on your data in Excel and the reports and charts fromyour analyses are created in Excel.

    Why Neural Networks?Neural Networks are capable of learning complex relationships indata. By mimicking the functions of the brain, they can discernpatterns in data, and then extrapolate predictions when given newdata. The problems Neural Networks are used for can be divided intwo general groups:

    Classification Problems: Problems in which you are trying todetermine what type of category an unknown item falls into.Examples include medical diagnoses and prediction of creditrepayment ability.

    Numeric Problems: Situations where you need to predict aspecific numeric outcome. Examples include stock priceforecasting and predicting the level of sales during a futuretime period.

    NeuralTools comes with examples that show how to apply neuralnetworks to different prediction problems. TheNeuralTools\Examples folder contains the provided examples asExcel workbooks.

  • 8/11/2019 NeuralTools6 En

    20/104

    12 Overview

    NeuralTools and Neural NetworksWhen using NeuralTools, neural networks are developed and used infour steps:

    Data Preparation - The data you use in NeuralTools isdefined in data sets. A Data Set Manager is used to set updata sets so they can be used over and over again with yourneural networks.

    Training With training, a neural network is generated froma data set comprised of cases with known output values. Thisdata often consists of historical cases for which you know thevalues of output/dependent variable.

    Testing With testing, a trained neural network is tested tosee how well it does at predicting known output values. Thedata used for testing is usually a subset of your historicaldata. This subset was not used in training the network. Aftertesting, the performance of the network is measured bystatistics such as the % of the known answers it correctlypredicted.

    Prediction - A trained neural network is used to predictunknown output values. Once trained and tested, thenetwork can be used as needed to predict outputs for newcase data.

    Training and testing are an iterative, sometimes time-intensiveprocess. Typically, you may train several different times withdifferent settings in order to generate a neural network that tests best.Once you have your "best net" you can quickly use it for predicting.

    Now, let's look at how NeuralTools works in Excel and how youdefine data sets and train and test neural networks using those datasets. Then we will predict unknown output values using trainednetworks.

  • 8/11/2019 NeuralTools6 En

    21/104

    Chapter 2: An Overview of NeuralTools 13

    NeuralTools Menu and ToolbarOnce you have installed NeuralTools, its menu and commands will beincluded as part of the Excel menu bar in Excel 2003 and earlier.There will also be a NeuralTools toolbar displayed. The menu showscommands for 1) defining your data in data sets, 2) training andtesting neural networks and 3) predicting values using trained neuralnetworks. In Excel 2007, all commands are available via the

    NeuralTools ribbon bar.

    Data Sets and the Data Set ManagerData in NeuralTools is structured around cases and variables . Youwork with a data set, or a set of statistical variables, located incontiguous columns with variable names in the first row of the data

    set. Each row in the data set is a case . Each case has a set ofindependent variable values and either a known or missing value forthe dependent output variable. It is the job of NeuralTools to predictoutput variable values for cases where they are not known.

  • 8/11/2019 NeuralTools6 En

    22/104

    14 Overview

    The NeuralTools Data Set Manager allows you to define your datasets, variables and cases. You can then use these predefined variablesfor training and testing neural networks, without re-selecting the datayou wish to analyze over and over. You may place all known,historical cases in one data set and cases you wish to predict resultsfor in a different data set. You can also combine all your data known historical data and data you wish to predict - in a single dataset.

    Each variable in a data set has a name and a range of Excel cellsassociated with it. Each column within the range contains data for adifferent variable. A data set can include multiple blocks of cells,allowing you to put data on different sheets in the same workbook.

    When you are defining a data set, NeuralTools attempts to identifythe variables in a block of cells surrounding the current selection inExcel. This makes it quick and easy to set up a data set with variablenames in the top row and variables laid out by column.

  • 8/11/2019 NeuralTools6 En

    23/104

    Chapter 2: An Overview of NeuralTools 15

    In NeuralTools, variables can be independent or dependent andnumeric or categorical (for example Yes or No, or Red, Green or Blue.)The Data Set Manager attempts to identify the type of each variable inyour data set, but you can override this with your own selections.

    A single column in an Excel 2003 or earlier worksheet can hold up to65,536 data points for a variable. If your variables have more valuesthan this and you choose not to adopt Excel 2007, NeuralTools allowsmultiple cell ranges to be assigned to a single data set. In other

    words, you can "repeat" a data set across multiple sheets, assigningthe same columns in different worksheets to hold all the values for adata set.

    Training a Neural NetworkAfter you have defined a data set that contains cases with knownhistorical values, you can train a neural network using that data.There are different options that determine the type of network thatwill be generated by NeuralTools. Depending on the nature of yourdata, different network options will generate better performingtrained networks (i.e. networks that do a better job predictinganswers). The testing process done following training gives

    precise measurements of how well your trained network does atpredicting output values.

    Training a neural network, along with testing and prediction, requiresthat you specify a data set that contains the data to be used duringtraining. NeuralTools will either save your trained network directlyin your workbook or optionally, to a file on disk.

    Variable Types

    Multi-Range Data

  • 8/11/2019 NeuralTools6 En

    24/104

    16 Overview

    If all your data is in a single data set (including both known historicaldata and new data where you do not know output values),NeuralTools allows you to train and test a network, then predictoutput values, all in a single step. You select to withhold a certainpercentage of the historical data for testing (20% is shown on the priorpage) and then select to automatically predict output values for caseswith missing dependent values. By doing this you quickly can get theanswers you need in one operation.

    NeuralTools supports different neural network configurations to givethe best possible predictions. For classification/category prediction(where the dependent variable is a category type), two types ofnetworks are available: Probabilistic Neural Networks (PNN) andMulti-Layer Feedforward Networks (MLF) . Numeric prediction canbe performed using MLF networks, as well as GeneralizedRegression Neural Networks (GRNN) , which are closely related toPNN networks.

    NeuralTools makes selecting a network configuration easy by offeringa Best Net search. When selected, NeuralTools will train and test avariety of neural network configurations to generate the one the givesthe best predictions for your data. The best configuration isdetermined based on testing data, so for Best Net search the"Automatically Test" option needs to be selected in the Train tab.

    CombiningTraining,Testing andPrediction

    NetConfigurations

  • 8/11/2019 NeuralTools6 En

    25/104

    Chapter 2: An Overview of NeuralTools 17

    Once training and network configuration options are selected,NeuralTools previews what it will perform during network training.Since training is the most time-intensive process in neural networkmodeling, it helps to review the training setup prior to proceeding.NeuralTools will try to identify any problems it has found in yourdata so you can correct them prior to proceeding with training.

    TrainingPreview

  • 8/11/2019 NeuralTools6 En

    26/104

    18 Overview

    As NeuralTools proceeds with training a neural net on your data, itreports how well it is doing. Typically the net gets better and betteras training proceeds, as NeuralTools generates networks that makebetter predictions on your data with fewer errors. Graphs update toshow NeuralTools' progress during training.

    Training stops when any of the stopping conditions you have set such as maximum training time are reached. If you have selected toautomatically test the net or predict missing output values in yourdata set, this will be performed after training.

    TrainingProcess

  • 8/11/2019 NeuralTools6 En

    27/104

    Chapter 2: An Overview of NeuralTools 19

    Training reports show how well your trained net performed.Statistics such as % Bad Predictions show the number of cases in thetraining set for which the network predicted an output value that didnot agree with the actual known value.

    TrainingReports

  • 8/11/2019 NeuralTools6 En

    28/104

    20 Overview

    Testing a NetworkDuring testing, a trained neural network is tested to see how well itdoes at predicting known output values. The testing data is usually asubset of your historical data with known output values. This subsetwas not used in training the network.

    When testing data is in a separate data set, NeuralTools will matchthe variables in the testing data set with those in the training data. Aswith training, NeuralTools will preview your testing setup prior torunning.

    Testing (along with prediction) runs much faster than training.NeuralTools reports how well it did in predicting the known answersin the testing data. This helps you see if the network will be a goodpredictor when applied to cases with unknown output values.

    Testing Reports

  • 8/11/2019 NeuralTools6 En

    29/104

    Chapter 2: An Overview of NeuralTools 21

    PredictionThe end use of a neural network is prediction. You will apply atrained network to new cases where you do not know output valuesbut you want to predict them. NeuralTools offers two methods forprediction 1) a command-driven method for predicting values forcases in a data set, and 2) Live Prediction (Industrial version only) ,where the independent variable values for a case in your worksheet

    can be entered and NeuralTools will automatically calculate thepredicted output value.

    When you wish to predict values for a group of cases in a data set, thePrediction dialog helps you set up the prediction process. You canpredict for just cases with missing output values and optionallyenable Live Prediction so you can make modifications to your data tosee how it will affect predictions. Different trained nets can be usedto see how predicted values will differ.

    As with training and testing, NeuralTools first previews the data andsetup it will use for prediction. Then, predictions are reported to yourworksheet in Excel.

  • 8/11/2019 NeuralTools6 En

    30/104

    22 Overview

    Predicted output values are shown next to the cases for whichprediction is performed. In the screen here, predicted values are inpurple.

    When Live Prediction is enabled, NeuralTools automatically adds anExcel formula to the cell where the predicted value is shown. Thisformula generates the predicted value, so if you change independentvariable values for a case, the predicted value will be automaticallyrecalculated. Using Live Prediction you can simply type data for newcases directly in Excel and automatically generate a new prediction,without going through the Prediction dialog. For example, if theindependent variable values for the case in row 7 in the aboveworksheet are changed as shown, the predicted value automaticallyupdates. As with any worksheet cell, you can reference aLivePrediction cell in any Excel formula.

    (Note: Live Prediction is available in the Industrial version only.)

    PredictionResults

    Live Prediction

  • 8/11/2019 NeuralTools6 En

    31/104

    Chapter 2: An Overview of NeuralTools 23

    NeuralTools Reports and ChartsNeuralTools creates both Summary and Detailed Reports fromtraining, testing and prediction. Summary Reports are shown ontheir own worksheet and have overall information on Testing orTraining. A Detailed Report gives information on a case-by-casebasis and is shown next to the data being reported on. In addition,most of the Summary Report information can be found inside the

    Detailed Report as a comment added to the title cell; that version ofthe Summary Report is referred to as the Quick Summary .

    Whenever NeuralTools creates one or more charts, it places them withthe reports. Charts are created in Excel format and may becustomized using standard Excel chart commands.

    NeuralTools UtilitiesThree utilities are provided to help you to manage neural networkmodeling in NeuralTools. A Neural Net Manager allows you to copyor move trained neural networks between workbooks and files. AMissing Data utility helps identify and correct cases with missingdata in your data sets. Testing Sensitivity analysis helps to determineif the testing results are stable under different random selections oftesting cases.

  • 8/11/2019 NeuralTools6 En

    32/104

    24 Overview

    Using NeuralTools with StatTools, Solver andEvolverNeuralTools is designed to be used with StatTools , the statistics add-

    in for Excel from Palisade. Both products share the same Data SetManager; data sets defined in NeuralTools can be analyzed inStatTools and vice versa. Using StatTools, you can calculate statisticson variables in data sets defined in NeuralTools along with statistics

    on predictions generated by NeuralTools.Detailed Reports generated in NeuralTools are immediately availablefor analysis in StatTools; they automatically show on the list of datasets in StatTools Data Set Manager. This facilitates the use ofStatTools to obtain statistical results beyond those contained inNeuralTools' Summary Reports. For example, a testing SummaryReport includes a histogram of residuals (defined as differencesbetween actual and predicted values). Based on the histogram, theresiduals may appear to be approximately normally distributed. Totest the hypothesis of normal distribution, one of StatTools' normalitytests can be applied to the Residuals variable in the Detailed Report.An example is provided in the file "Abalone Age Prediction withStatTools Analysis.xls".NeuralTools' Live Prediction feature makes it easy to see how changesto independent values affect the prediction. With Live Prediction,other tools available in Excel can be used to explore the relationshipbetween independent variables and the dependent one.

    Solver Excel's built in optimizer can be used with NeuralTools' LivePrediction capability to calculate optimal decision values forpredictions made in NeuralTools. The file "Auto Loans withSolver.xls" provides an example. In the example, a neural net is usedto predict whether a borrower will be making timely payments.However, the network may only be 60% confident of the answer.

    Excel's Solver could then be used to determine a loan amount wherethe network would be 90% sure that the individual will be makingtimely payments. In this case, the optimizer would try different loanamounts while NeuralTools automatically updated the probabilityvalue. Evolver , Palisades genetic algorithm based optimizer, can beused instead of Solver to find the answer. Unlike Solver, Evolver canhandle optimization problems in which there is more than one localoptimum.

  • 8/11/2019 NeuralTools6 En

    33/104

    Chapter 3: NeuralTools Reference Guide 25

    Chapter 3: NeuralToolsReference Guide

    Introduction ....................................................................................... 27

    Reference: NeuralTools Icons 29

    NeuralTools Toolbar ......................................................................... 29

    Reference: NeuralTools Menu Commands 31

    Introduction ....................................................................................... 31

    Icons in Dialog Boxes ............................................................................ 32

    Command Reference ........................................................................ 33

    Data Set Manager Command ............................................................... 33 Train Command ..................................................................................... 38 Test Command ........................................................................................ 50 Predict Command ................................................................................... 58

    Utilities ............................................................................................... 63

    Application Settings Command .......................................................... 63 Neural Net Manager Command .......................................................... 66 Missing Data Utilities Command ....................................................... 68 Testing Sensitivity Command ............................................................. 71

    More on Neural Networks 73

  • 8/11/2019 NeuralTools6 En

    34/104

    26

  • 8/11/2019 NeuralTools6 En

    35/104

    Chapter 3: NeuralTools Reference Guide 27

    IntroductionThe NeuralTools Reference Guide chapter describes the icons,commands, and statistics functions used by NeuralTools. Thischapter is divided into two sections:

    1) Reference: NeuralTools Icons

    2) Reference: NeuralTools Menu Commands

  • 8/11/2019 NeuralTools6 En

    36/104

    28 Introduction

  • 8/11/2019 NeuralTools6 En

    37/104

    Reference: NeuralTools Icons 29

    Reference: NeuralTools IconsNeuralTools Toolbar

    NeuralTools icons are used to define data sets with cases and

    variables and then create and use neural networks on that data.NeuralTools icons appear on the Excel toolbar (i.e., as a customtoolbar in Excel) in Excel 2003 and earlier and on a ribbon in Excel2007. This section briefly describes each icon, outlining the functionsthey perform and the menu command equivalents associated withthem. In Excel 2007, all commands are available via the NeuralToolsribbon bar.

    The following icons are shown on the NeuralTools toolbar in Excel2003 and earlier and/or in NeuralTools dialog boxes.

    Icon Function Performed and Command Equivalent

    Define a data set and variables, or edit or delete anexisting data set and variables

    Command equivalent: Data Set Manager command

    Train a neural network

    Command equivalent: Train command

    Test a neural network

    Command equivalent: Test command

    Predict values using a trained network

    Command equivalent: Predict command

    Run neural network utilities

    Command equivalent: Utilities command

    Display NeuralTools help file

    Command equivalent: Help command

  • 8/11/2019 NeuralTools6 En

    38/104

    30

    The following icons are shown on the NeuralTools ribbon in Excel2007.

    Icon Function Performed and Command Equivalent

    Define a data set and variables, or edit or delete anexisting data set and variables

    Command equivalent: Data Set Manager command

    Train a neural network

    Command equivalent: Train command

    Test a neural network

    Command equivalent: Test command

    Predict values using a trained network

    Command equivalent: Predict command

    Run neural network utilities

    Command equivalent: Utilities command

    Display NeuralTools help file

    Command equivalent: Help command

  • 8/11/2019 NeuralTools6 En

    39/104

    Reference: NeuralTools Menu Commands 31

    Reference: NeuralToolsMenu Commands

    IntroductionThis section of the Reference Guide details the available NeuralToolscommands as they appear on the NeuralTools menu or ribbon inExcel. Commands are discussed as they appear on the menu, startingwith the Data Set Manager command and subsequently movingdown. NeuralTools icons can be used to perform many of theavailable commands. The Reference: NeuralTools Icons section ofthis chapter gives the command equivalents for each NeuralTools

    icon.

  • 8/11/2019 NeuralTools6 En

    40/104

    32 Introduction

    Icons in Dialog BoxesUp to two icons the Help icon and the Application Settings icon may appear in individual NeuralTools dialog boxes. The Help iconallows you to quickly access the help topic on the relevant dialog.The Application Settings icon displays the Application Settings dialogwhere you can enter or edit settings for NeuralTools reports, as wellas default settings for Training, Prediction and Runtime.

  • 8/11/2019 NeuralTools6 En

    41/104

    Reference: NeuralTools Menu Commands 33

    Command Reference

    Data Set Manager Command

    Defines NeuralTools data sets and variables, or edits or deletesan existing data set and variables

    The Data Set Manager command allows you to define your data setswith cases and variables. Once data sets are defined, they may beused for neural network training, testing and prediction. The Data SetManager dialog box allows you to add or remove data sets, name adata set, specify the layout of the variables in a data set, and name thevariables in a data set.

    NeuralTools is structured around variables and cases. You work witha data set, or a set of statistical variables, located in contiguouscolumns in an Excel worksheet with variable names in the first row ofthe data set. Each row in the data set is a case . Each case has a set ofindependent variable values and either a known or missing value forthe dependent output variable.

    Each variable in a data set has a name and a range of Excel cellsassociated with it. A data set can include multiple blocks of cells,allowing you to put data on different sheets in the same workbook.

    When you are defining a data set, NeuralTools attempts to identifythe variables in a block of cells surrounding the current selection inExcel. This can make it quick and easy to set up a data set withvariable names in the top row and variables laid out by column.

    What Are DataSets andVariables?

  • 8/11/2019 NeuralTools6 En

    42/104

    34 Command Reference

    The Data Set options in the Data Set Manager dialog box include: New, Delete adds a new data set, or deletes an existing one.

    Name specifies the name of the data set. Excel Range specifies the Excel Range associated with a data set.

    If multiple cell ranges have been assigned to a data set this entrywill be prefaced by the label Multiple .

    Apply Cell Formatting - adds a grid and colors that identify yourdata sets.

    Multiple - clicking the Multiple button in the Data Set Managerdialog box displays the Multiple Range Selector dialog. Thisdialog allows the entry of individual cell ranges that comprise themultiple cell range data set.

    Import allows variable types to be copied to this data set fromanother data set or trained neural net. The Import Variable Typesdialog allows you to select the location and net to use for variabledefinitions.

    Data Set ManagerDialog Box

  • 8/11/2019 NeuralTools6 En

    43/104

    Reference: NeuralTools Menu Commands 35

    NeuralTools allows multiple cell ranges on different worksheets to beassigned to a single data set. A multiple range data set can be usedwhen:

    1) Each variable in a data set has more than 65,536 data points in anExcel 2003 or earlier worksheet, requiring the data set to extendacross multiple worksheets in the same workbook,

    2) The data for a variable is located in multiple blocks scatteredthroughout the worksheets in a workbook.

    Note: A multiple range data set cannot be defined within a singleworksheet. They can be defined on multiple worksheets within thesame workbook.

    The options in the Multiple Range Selector dialog include:

    Clear All Clears all entered ranges.

    Auto Fill Applies the first range entered (in row 1) to all visible

    worksheets in the active workbook, and enters theseSheetName!CellRange references in the grid

    Select Displays a selector for highlighting a block of cells to beused as a Data Set Range.

    Secondary Ranges Have Variable Names in the First Column(Row) - Multiple range data sets can have variable names labelingeach column in each range listed in the dialog, or variable nameslabeling columns in just the first selected range. The first selectedrange is the range entered in row 1 of the Multiple Range Selectordialog.

    Multiple RangeDat a Sets

  • 8/11/2019 NeuralTools6 En

    44/104

    36 Command Reference

    Each row in the grid in the Data Set Manager dialog box lists thevariables in a data set, including the Excel Data Range that holds thedata points for a variable, the Variable Name , and the Variable Type .

    The Variable Type options include: Dependent Category dependent or output variable whose

    possible values are taken from a set of possible categories; forexample Yes or No, or Red, Green or Blue.

    Dependent Numeric dependent or output variable whosepossible values are numeric.

    Independent Category an independent variable whose possiblevalues are taken from a set of possible categories; for example Yes or No, or Red, Green or Blue.

    Independent Numeric an independent variable whose possiblevalues are numeric.

    Tag a variable that takes the possible values "train", "test" or"predict". This type of variable is used to identify cases in a dataset that will be used for training, testing and prediction.

    Unused a variable in a data set that will not be used in a neuralnet.

    Variable Options

  • 8/11/2019 NeuralTools6 En

    45/104

    Reference: NeuralTools Menu Commands 37

    Tag variables are a special type of variable in a NeuralTools datasetthat are used to identify cases in a data set that will be used fortraining, testing and prediction. They are especially useful when youwant to include all data (to be used for network training, testing andprediction) in a single dataset. When you have a tag variable,NeuralTools selects the cases to use for training, testing or predictionbased on the tag variable's value. By changing tag variable values,you can retrain a network using different cases and see how networkperformance changes. You can also add new cases with unknowndependent variable values to a dataset and assign them to bepredicted using the "Predict" tag. A Tag variable can take only threepossible values:

    Train specifies that the case will be used for training

    Test - specifies that the case will be used for testing Predict - specifies that the case will be used for prediction

    Note: If you have a tag variable in your data set, options in theTraining dialog will change. See the Train command for moreinformation.

    In a single session, NeuralTools allows:

    Up to 256 data sets, located in a single workbook. Up to 16384 variables per data set in Excel 2007 (256 variables in

    earlier versions of Excel). All the data for a single data set mustbe located in the same workbook.

    Number of data points per variable and cases per data set limitedonly by available memory in Excel 2007 (16,777,216 data points inearlier versions of Excel).

    Actual data capacities may be less than shown above depending on

    the system configuration and version of Excel in use. Memorylimitations of Excel itself may also affect data capacities.

    Note: the Data Set Manager dialog box lists all data sets andvariables in the active workbook (this is the workbook listed in thecaption of the Data Set Manager dialog). To list data sets in otherworkbooks, activate the desired workbook in Excel and display the

    Data Set Manager dialog.

    More on TagVariables

    Data Set andVariableCapacities

  • 8/11/2019 NeuralTools6 En

    46/104

    38 Command Reference

    Train Command

    Specifies settings for training a neural network and runs thetraining

    The Train command allows you to 1) specify settings to be used fortraining a neural network in NeuralTools and 2) start training a net.

    The Train tab in the Training dialog box specifies general options fortraining a neural network. It includes the following:

    Data Set shows the data set to be used when training the neuralnetwork. This data set needs to be defined using the Data SetManager and present in the active sheet.

    Save Net As specifies the name and location for the neural netto be trained. Neural networks may be saved to an Excelworkbook or to a file on disk. Click Browse to change the nameor location shown.

    Train Tab

  • 8/11/2019 NeuralTools6 En

    47/104

    Reference: NeuralTools Menu Commands 39

    You may also enter a name and description for the neural network tobe saved.

    The When Training is Completed options allow you to automaticallytest and predict using the trained net following training. This can bedone when the data to test and predict is located in the same data setwith the training data.

    Automatically Test on specifies that either:

    1)

    a % of the cases in the data set will automatically be "heldout" from training to be used for testing

    2) cases where the Tag variable = "test" will be used fortesting. A tag variable is a variable type as specified inthe Data Set Manager.

    Select Same Cases as Long As This Number Is the Same -specifies that the same cases should be used for testing whentraining multiple times. This can be useful when comparingdifferent types of neural nets, to remove the variabilityrelated to the different selections of testing cases. If thecheck box is selected, a number can be entered, and a fixednumber implies a fixed selection of testing cases (the numberis a "random number generator seed").

    Note that Best Net Search uses a fixed selection of testingcases to compare multiple nets, regardless of whether thischeck box is selected.

    Automatically Predict Missing Dependent Values specifiesthat the trained net will be used to predict dependent variablevalues for either:

    1) cases where the dependent variable value is missing, or

    2) cases where the Tag variable = "predict". A tag variable isa variable type as specified in the Data Set Manager.

    Enable Live Prediction specifies that NeuralTools will placeformulas in the cells where the predicted dependent variablevalues are shown to calculate the predicted values. For moreon Live Prediction, see the Predict command in this chapter.

    Calculate Variable Impacts specifies that NeuralTools willcalculate the relative impact of each independent variable inthe training data set in determining the predictions calculatedby the net.

  • 8/11/2019 NeuralTools6 En

    48/104

    40 Command Reference

    The purpose of Variable Impact analysis is to measure the sensitivityof net predictions to changes in independent variables. This analysisis only done on training data. As a result of the analysis, everyindependent variable is assigned a "Relative Variable Impact" value;these are percent values and add to 100%. The lower the percentvalue for a given variable, the less that variable affects the predictions.The results of the analysis can help in the selection of a new set ofindependent variables, one that will allow more accurate predictions.For example, a variable with a low impact value can be eliminated infavor of some new variable. However, one needs to keep in mind thatthe results of the Impact Analysis are relative to a given net. The factthat one net "learned" to disregard a given variable makes it likelythat another net will also "learn" to disregard it; but then again,another training session with a different type of net might "discover"how the variable can make a significant contribution to accuratepredictions. In data sets with smaller numbers of cases and/or largernumbers of variables, the differences in the relative impact of thevariables between trained nets may be more pronounced. Also, it isimportant to remember that these values are "relative". Suppose thatwith two independent variables one is assigned 99%, and the other1%. This means that the latter is much less important than the former,

    but does not mean that it is unimportant, particularly if high accuracyof predictions is desired.

    Some additional points to note about Variable Impact Analysisinclude:

    1) Only the training data set is included in the analysis. (If Auto-Testing or Auto-Prediction are used, those cases are not included.The reason is that they might have numeric values outside thetraining range, which could make analysis results moreunpredictable.)

    2) For a given category independent variable, for every case theanalysis steps through all the valid categories for that variable,

    and measures the change to the predicted value. (With categoryprediction there is no numeric predicted value, but there are rawnumeric net outputs on which the category prediction is based;those numeric outputs are used by the analysis.)

    3) For a given numeric independent variable, for every case theanalysis steps through the range from the minimum to themaximum training value for that variable, measuring the changeto the predicted value (or, in the case of category prediction,change to the raw numeric outputs).

    What is VariableImpact

    Analysis ?

  • 8/11/2019 NeuralTools6 En

    49/104

    Reference: NeuralTools Menu Commands 41

    The purpose of the Variable Impact Analysis is not meant to supportfirm conclusions, like stating with high confidence that a givenvariable is irrelevant. Instead, it's meant to help in a search for the bestset of independent variables: the results of the analysis may be tellingus that a given variable looks irrelevant, sufficiently so that it's worthtrying to train a net without this variable.

    The results of a Variable Impact analysis are displayed in the TrainingSummary report:

  • 8/11/2019 NeuralTools6 En

    50/104

    42 Command Reference

    The Net Configuration tab in the Training dialog allows you to selectthe type of neural network that will be trained on your data. Youmay select a specific net configuration or select a Best Net searchwhere NeuralTools will test a variety of possible configurations toidentify the best performing one for you.

    NeuralTools supports different neural network configurations to givethe best possible predictions. For classification/category prediction,two types of networks are available: Probabilistic Neural Networks

    (PNN) and Multi-Layer Feedforward Networks (MLF) . Numericprediction can be performed using MLF networks, as well asGeneralized Regression Neural Networks (GRNN) , which areclosely related to PNN networks. For more information on thetechnical aspects of the available network configurations, see theMore on Neural Networks section.

    The Net Configuration tab includes the following:

    Type of Net Selects the type of net to be used in training, oralternatively, selects a Best Net search. The Net Configurationtab Options change depending on the type of net selected.Available net types are:

    1) Best Net Search . In a Best Net Search, NeuralTools testsall checked net configurations, including PNN/GRNNand MLFN nets with node counts in the enteredminimum-maximum range. The best performingconfiguration for your data is identified based on theerror obtained on the testing data. If Store All Trial Netsin New Workbook is selected, you will be able toindividually load each tested net (regardless if it was thebest performing network) from a workbook and use it forprediction after training is done; a full testing SummaryReport for each net will also be available.

    NetCon figurationTab

  • 8/11/2019 NeuralTools6 En

    51/104

    Reference: NeuralTools Menu Commands 43

    2) PNN/GRNN Net. These net types require no additionaloptions to be selected for training; for this reason thissetting is the default when NeuralTools is installed. Ifyour data has numeric output values a GRNN networkwill be trained and if your data has categorical outputvalues a PNN network will be trained.

    3) MLFN Net. A Multi-Layer Feedforward Network(MLFN) has one or two hidden layers of nodes.

    By selecting zero nodes for the second layer it will beeliminated. The most reliable way to find the bestconfiguration of an MLFN net is to use the Best NetSearch option instead of the option to train a singleMLFN net. If there is not enough time for Best NetSearch, it is recommended that the "Number of Nodes"values be left as "Automatic".

  • 8/11/2019 NeuralTools6 En

    52/104

    44 Command Reference

    The Runtime tab in the Training dialog allows you to enter stoppingconditions for training. If no stopping conditions are selected,training will stop eventually; that period will be relatively short forPNN/GRNN nets, and much longer with MLF nets. One possibleapproach is to select no stopping conditions and click the Stop buttonin the training progress dialog when no more time is available fortraining. With Best Net Search a time limit for training a single netmust be defined, to ensure that the search algorithm does not devotetoo much time to one particular configuration. The three availablestopping conditions can be combined, specifying that NeuralToolswill stop when any of the conditions are reached.

    The Training Runtime options include: Time Specifies a fixed time limit for training a single network.

    Training may stop before the specified period elapses, as soonas the algorithm determines that it is unlikely that furtherprogress will be made. If a Best Net search is used, each testednet configuration will train for the entered time.

    Trials Specifies that NeuralTools will execute no more thanthe specified number of trials before stopping. With Multi-Layer Feedforward Networks, a "trial" is a single assignment of"weights" to connections between neurons; training consists ofan intelligent search for weights that will generate bestpredictions. With Probabilistic Neural Nets and GeneralizedRegression Neural Nets, a trial is an assignment of "smoothingfactors" to variables. Training consists of a search for bestsmoothing factors.

    Runtime Tab

  • 8/11/2019 NeuralTools6 En

    53/104

    Reference: NeuralTools Menu Commands 45

    Progress Specifies that NeuralTools will stop if it cannotimprove the error statistic at least the entered % with thespecified timeframe.

    The Training Preview dialog shows the setup of the current networktraining along with any errors detected in your data, prior to startingtraining. By examining the contents of this dialog, you can see allyour selected training assumptions as reported by NeuralTools. TheErrors and Warnings section gives a description of any problems

    NeuralTools has detected in your data or settings which you cancorrect if necessary prior to spending time training.

    TrainingPreview Dialog

  • 8/11/2019 NeuralTools6 En

    54/104

    46 Command Reference

    The Training Progress window reports on the status of networktraining as it runs. Graphs detail how NeuralTools is doing atimproving the network and reducing the reported error.

    The Training Progress Window reports error on the training data.Observing changes in this value should not lead to any directconclusions about the quality of predictions the net will make forcases not used in training. Such conclusions should be based on theerror obtained on the testing data. Also note that with numeric

    prediction the error reported in the Progress Window is the RootMean Square error based on scaled data (see information aboutscaling in "Inputs Transformation" section). For category predictionthe reported error is based on numeric representation of categorydata.

    TrainingProgressWindow

  • 8/11/2019 NeuralTools6 En

    55/104

    Reference: NeuralTools Menu Commands 47

    Both summary and detail reports can be created after training. Thesereports detail the performance of the trained neural network. Theactual contents of the generated reports are specified in theApplication Settings dialog, under the Reports to Generate andColumns in Detailed Reports settings.

    Training Summary Report The training summary reportgives statistics and graphs on the performance of the trainedneural net.

    For Category Prediction/ Classification, key statistics and graphs inthe training summary report include:

    1) % Bad Predictions - the percent of cases for which thepredicted category does not agree with the actual category.

    TrainingReports

  • 8/11/2019 NeuralTools6 En

    56/104

    48 Command Reference

    2) Mean Incorrect Probability (available with PNN nets only) -for every case, NeuralTools computes Probability of IncorrectCategories, which is the sum of probabilities assigned byPNN net to incorrect categories. For example, if for a givencase a net assigns 30% probability to red, 20% to yellow, and50% to green, and we know that the correct answer is red,then the value for that case is 20% + 50% = 70%. This valueprovides a case-by-case error measure for category prediction,

    corresponding to the Residual Error for numeric prediction."Mean Incorrect Probability" is the average error value for allthe cases.

    Detailed Reports show Incorrect Probability on a case-by-casebasis, and to better understand the concept it may be helpfulto change Detailed Report settings to show the probabilitiesassigned by a Probabilistic Neural Net to every possiblecategory for the dependent variable. To do that, selectApplication Settings in the Utilities menu, and click thedrop-down menu at the right of the Columns in DetailedReports row. The NeuralTools - Columns to Display inDetailed Reports dialog will display. In that dialog select

    Probabilities of All Categories (for PNN) for Testing. Thentrain a PN Net on a data set with at least 3 categories in thedependent variable (the Auto Loans.xls example file can beused) with Automatically Test selected. In the resultingDetailed Report, observe how the values in the Incorrect% column relate to the probabilities assigned to each possiblecategory; the Incorrect% is the sum of the probabilities for allincorrect categories.

    3) Classification Matrix - compares actual to predictedcategories on category-by-category basis. For example,classification matrix may reveal that a net correctly detects amedical condition in patients that have it, but has some

    tendency to raise false alarms for healthy patients.4) Variable Impacts (if selected) displays the relative impact of

    independent variables on predicted answers.

    5) Histogram of Probability of Incorrect Categories (availablewith PNN nets only) - see "Mean Incorrect Probability" abovefor explanation of "Probability of Incorrect Categories".

  • 8/11/2019 NeuralTools6 En

    57/104

    Reference: NeuralTools Menu Commands 49

    For Numeric Prediction, key statistics and graphs in the trainingsummary report include:

    1) % Bad Predictions - a prediction counts as "bad" if it fallsoutside the defined margin around the actual value; thewidth of the margin is defined as Good/Bad Tolerance(Training) setting in the Application Settings dialog.

    2) Root Mean Square Error - a measure of deviation ofpredictions from actual value (calculated as square root ofaverage square deviation).

    3) Mean Absolute Error - average deviation of predictions fromactual values.

    4) Variable Impacts (if selected) displays the relative impact ofindependent variables on predicted answers.

    5) Histogram of Residuals - "residual" is the difference betweenthe actual and the predicted value.

    6) Scatter plots showing relationships between actual values,predicted values, and residuals.

  • 8/11/2019 NeuralTools6 En

    58/104

    50 Command Reference

    Test Command

    Specifies settings for testing a trained neural network and runsthe testing

    The Test command allows you to 1) specify settings to be used fortesting a trained neural network and then 2) start the testing.

    Testing data usually is data with known output values that were notused in training the net. Options in the Testing dialog include:

    Data Set shows the data set to be used when testing thetrained neural network. This data set needs to be defined

    using the Data Set Manager and be present in the activeworksheet. Net to Use specifies the name and location for the neural net

    to be tested. Neural networks may be saved to an Excelworkbook or to a file on disk. Click Browse to change thename or location shown.

  • 8/11/2019 NeuralTools6 En

    59/104

    Reference: NeuralTools Menu Commands 51

    Variable Matching specifies how variables in the data set to be testedwill be matched with variables in the data set that was used to trainthe net.

    Two options are possible for variable matching:1) Automatic Matching . Variable Names in the testing data

    set are matched by name with those in the trained net'sdata set, and variable types are set based on this matching

    2) Custom Matching . Custom matching allows you toindividually assign the matching of variables in thetesting data set with those in the trained net's data set.This is done when variable names are different in the twodata sets or different assignments are desired.

    The Variable Matching dialog lists the names of variables in each dataset so they can be matched. Only variables with the same type maybe matched. Each time you do a matching, the assignments made arestored with the data set. By clicking Load Prior Matching , you cancycle through previously-made matchings to access a set of previousassignments for the data set.

    VariableM atching

  • 8/11/2019 NeuralTools6 En

    60/104

    52 Command Reference

    The Testing Preview dialog shows the setup of the current networktesting along with any errors detected in your data, prior to startingtesting. By examining the contents of this dialog, you can see all yourselected testing assumptions as reported by NeuralTools. The Errorsand Warnings section gives a description of any problemsNeuralTools has detected in your data which you can correct ifnecessary prior to testing.

    Testing PreviewDialog

  • 8/11/2019 NeuralTools6 En

    61/104

    Reference: NeuralTools Menu Commands 53

    Both summary and detail reports can be created after testing. Thesereports detail the performance of the trained neural network on thetest data set. The actual contents of the generated reports arespecified in the Application Settings dialog, under the Reports toGenerate and Columns in Detailed Reports settings. The detailedreport is especially useful when testing as it shows how the trainednet did at predicting individual output values in the test data set.

    Testing Summary Report The testing summary report gives

    statistics and graphs on the performance of the trained neuralnet on the test data set.

    For Category Prediction, key statistics and graphs in the testingsummary report include:

    1) % Bad Predictions - the percent of cases for which thepredicted category does not agree with the actualcategory.

    2) Mean Incorrect Probability (available with PNN netsonly) - for every case, NeuralTools computes Probabilityof Incorrect Categories, which is the sum of probabilitiesassigned by PNN net to incorrect categories. Forexample, if for a given case a net assigns 30% probabilityto red, 20% to yellow, and 50% to green, and we know

    Testing Reports

  • 8/11/2019 NeuralTools6 En

    62/104

    54 Command Reference

    that the correct answer is red, then the value for that caseis 20% + 50% = 70%. This value provides a case by caseerror measure for category prediction, corresponding tothe Residual Error for numeric prediction. "MeanIncorrect Probability" is the average error value for all thecases.

    Detailed Reports show Incorrect Probability on a case-by-case basis, and to better understand the concept it may be

    helpful to change Detailed Report settings to show theprobabilities assigned by a Probabilistic Neural Net toevery possible category for the dependent variable. To dothat, select Application Settings in the Utilities menu,and click the drop-down menu at the right of theColumns in Detailed Reports row. The NeuralTools -Columns to Display in Detailed Reports dialog willdisplay. In that dialog select Probabilities of AllCategories (for PNN) for Testing. Then train a PN Neton a data set with at least 3 categories in the dependentvariable (the Auto Loans.xls example file can be used)with Automatically Test selected. In the resultingDetailed Report, observe how the values in theIncorrect% column relate to the probabilities assigned toeach possible category; the Incorrect% is the sum of theprobabilities for all incorrect categories.

    3) Classification Matrix - compares actual to predictedcategories on category-by-category basis. For example,classification matrix may reveal that a net correctlydetects a medical condition in patients that have it, buthas some tendency to raise false alarms for healthypatients.

    4) Histogram of Probability of Incorrect Categories (available with PNN nets only) - see "Mean Incorrect

    Probability" above for explanation of "Probability ofIncorrect Categories".

  • 8/11/2019 NeuralTools6 En

    63/104

    Reference: NeuralTools Menu Commands 55

    For Numeric Prediction, key statistics and graphs in the testingsummary report include:

    1) % Bad Predictions - a prediction counts as "bad" if it fallsoutside the defined margin around the actual value; thewidth of the margin is defined in the Good/BadTolerance (Testing) setting in the Application Settingsdialog.

    2) Root Mean Square Error - a measure of deviation ofpredictions from actual value (calculated as square root ofaverage square deviation).

    3) Mean Absolute Error - average deviation of predictionsfrom actual values.

    4) Histogram of Residuals - "residual" is the differencebetween the actual and the predicted value.

    5) Scatter plots showing relationships between actualvalues, predicted values, and residuals.

  • 8/11/2019 NeuralTools6 En

    64/104

    56 Command Reference

    Testing Detailed Report. This report is placed next to thetesting data set and shows how the trained net did atpredicting individual output values in the test data set.

    In the Testing Detailed Report, predictions are marked as "Good" or"Bad" based on the tolerance level set in the Application Settingsdialog. If you are running multiple tests, Detailed Reports may beadded in new columns to the right of the test data set, so you can seehow predictions change for individual cases as new trained nets aretested.

  • 8/11/2019 NeuralTools6 En

    65/104

    Reference: NeuralTools Menu Commands 57

    A popup comment in Excel provides quick access to Summary Reportinformation while examining a Detailed Report. Simply roll themouse over the report header and the popup comment will bedisplayed. Note Comments must be enabled in the Excel ToolsCommand Options Dialog View tab for popup comments to bedisplayed.

    QuickSummaries inDetailedReports

  • 8/11/2019 NeuralTools6 En

    66/104

    58 Command Reference

    Predict Command

    Specifies settings for predicting values using a trained neuralnetwork and runs the prediction

    The Predict command allows you to 1) specify settings to be used forpredicting values using a trained neural network and then 2) run theprediction.

    Data to predict typically are cases with unknown dependent variablevalues. Options in the Testing dialog include:

    Data Set shows the data set to be used for prediction. Thisdata set needs to be defined using the Data Set Manager andbe present in the active worksheet.

    Net to Use specifies the name and location for the neural net

    to be used for prediction. Neural networks may be saved toan Excel workbook or to a file on disk. Click Browse tochange the name or location shown.

    Variable Matching Specifies how variables in the data setwith the prediction data will be matched with variables in thedata set that was used to train the net. Click Edit to changevariable matching. For more information on VariableMatching , see the Test command in this chapter.

  • 8/11/2019 NeuralTools6 En

    67/104

    Reference: NeuralTools Menu Commands 59

    Predict For Selects the cases for which predictions will bemade. Typically you will select to predict cases with MissingDependent Variable Values , but you can make prediction forAll Cases (even those where the dependent variable value isknown) if desired. If you have a Tag variable in the data set,dependent variable values will be predicted only for casesmarked with the Tag "predict".

    Options sets predicted value location and Live Predictionoptions.1) Place Predicted Values Directly in Data Set. This

    option specifies that predicted values will be placeddirectly in the dependent variable location in the data setfor each predicted case, possibly in addition to beingplaced in the Detailed Report (depending on whetherDetailed Reports are selected in the Reports to Generatesetting in Application Settings). This overwrites anycurrent contents of the cell so should be used withcaution. You will be able to identify predicted values bycolor in the data set.

    2) Enable Live Prediction - specifies that NeuralTools willplace formulas in the cells where the predicteddependent variable values are shown. These formulasallow NeuralTools to calculate the predicted values asindependent values change.

    3) Exclude Live Prediction for Cases with Missing orInvalid Values specifies that a live prediction formulawill not be added where input variable values for a caseare missing. Missing input values cause live predictionformulas to return an error value. However, it may beuseful to allow NeuralTools to enter formulas in caseswhere independent values are missing, because as soon

    as missing values are filled, the prediction willautomatically show.

  • 8/11/2019 NeuralTools6 En

    68/104

    60 Command Reference

    Live Prediction is a powerful capability of NeuralTools(Industrial version only) that allows you to performpredictions automatically in Excel without going through aspecific Predict operation. With Live Prediction, NeuralToolsplaces formulas in the cells where the predicted dependentvariable values are shown. These formulas use a customNeuralTools function to calculate the predicted values, suchas:

    = NetOutputPrediction (_PALDS_DG25B8C82B!$A$140,"DG25B8C82B", "VG1DD83AF2", 'Prediction Data'!$A$6:$I$6, A7:I7)

    The actual formula is added to your worksheet byNeuralTools and does not need to be entered by you. Thearguments let NeuralTools identify the trained network inuse, along with the location of the independent values in theworksheet. When the input independent variable values for acase are added or changed, NeuralTools will automaticallyreturn a new predicted value. This makes it simple to addand generate predictions for new cases using an existingtrained net.

    Note: if the prediction will be based on cell values that are notexpected to change, then de-selecting Live Prediction intraining or prediction dialog is recommended; this willminimize the time it takes for Excel to recalculate theworkbook.

    Live Prediction

  • 8/11/2019 NeuralTools6 En

    69/104

    Reference: NeuralTools Menu Commands 61

    The Prediction Preview dialog shows the prediction setup for theselected data set along with any errors detected in your data orsettings, prior to starting prediction. By examining the contents ofthis dialog, you can see all your selected prediction assumptions asreported by NeuralTools. The Errors and Warnings section gives adescription of any problems NeuralTools has detected in your datawhich you can correct if necessary prior to prediction.

    PredictionPreview Dialog

  • 8/11/2019 NeuralTools6 En

    70/104

    62 Command Reference

    Both summary and detail reports can be created after prediction.These reports detail the performance of the trained neural network onthe test data set. The actual contents of the generated reports arespecified in the Application Settings dialog, under the Reports toGenerate and Columns in Detailed Reports settings.

    Prediction Detailed Report. This report is placed next to theprediction data set. It provides a location for predictions whenthe user does not want to place them inside the dependent

    variable in the data set itself; if the dependent variable containshistorical data for some cases, it may be safer to not mix thosehistorical cases with net predictions.

    If you are running multiple predictions, Detailed Reports may beadded in new columns to the right of the data set, so you can see howpredictions change for individual cases as new trained nets are used.

    PredictionReports

  • 8/11/2019 NeuralTools6 En

    71/104

    Reference: NeuralTools Menu Commands 63

    Utilities

    Application Settings Command

    Specifies default settings for training, testing and prediction

    The Application Settings command allows you to select 1) whichreports to generate for training, testing and prediction, 2) whatTraining defaults to use and 3) what Prediction and Runtime defaultsto use. Many Application Settings are defaults to be used in theTraining, Testing or Prediction dialogs. See the description of thosedialogs to get more information on those settings. Other settings arecovered here.

    Reports settings include:

    Reports to Generate - Each operation in NeuralTools canproduce both a summary and a detailed report. However,typically you will want to use the default report setting ascertain reports add little value to certain operations. For

    Reports

  • 8/11/2019 NeuralTools6 En

    72/104

    64 Utilities

    example, the Detailed report is the standard report fromPrediction, and a Summary report in this case adds little value.

    Summary Reports are placed on their own worksheet, whileDetailed Reports are placed in columns to the right of a data set,in the same worksheet as the data set.

    Place Summary Reports in options include:

    - New Worksheet where a new worksheet is created foreach report. You may reuse the same workbook for yourreports or always create a new workbook.

    Detailed Report Location options include:

    - Overwrite Existing Reports , where columns with datafrom prior Detailed Reports in a data set are overwrittenwith new Detailed Reports (to delete a Detailed Reportmanually, select entire columns containing it by clickingand dragging over column headers, then select Deletefrom the Edit menu).

    - Right of the Data Set where new columns are inserted tothe right of the data set to hold the new Detailed Reports.

    - Right of Existing Reports where columns to the right ofthe data set and existing reports are used to hold the newDetailed Reports.

    Columns to Display in Detailed Reports. For every selectedrow, a column will be displayed in the Detailed Report to the

    right of the data set that will show the information for everycase.

  • 8/11/2019 NeuralTools6 En

    73/104

    Reference: NeuralTools Menu Commands 65

    The following columns can be displayed:

    1) Tag Used - "train", "test" and "predict" tags show forevery case if it was used as part of the training or testingset, or whether a prediction was made for a given case.

    2) Prediction Obtained Using Net - Number or categorypredicted by net.

    3) Probability of Predicted Category (for PNN) -Probabilistic Neural Nets not only predict an unknowncategory, but also assign a probability of that category.Not available when categories are predicted using Multi-Layer Feedforward Networks. Does not apply to numericprediction.

    4) Probability of Incorrect Categories (for PNN) - Sum ofprobabilities assigned by a PNN net to incorrectcategories. For example, if for a given case a net assigns30% probability to red, 20% to yellow, and 50% to green,and we know that the correct answer is red, then thevalue for that case is 20% + 50% = 70%. This columnprovides a case-by-case error measure for categoryprediction, corresponding to the "Residual Error" column

    for numeric prediction.5) Residual Error - The difference between the actual and

    the predicted dependent value. Does not apply tocategory prediction.

    6) Good/Bad Evaluation - For numeric prediction, thecolumn states if the prediction for a given case fallswithin the defined margin around the actual value; thewidth of the margin is defined as "Tolerance forGood/Bad Evaluation". For category prediction, thecolumn simply states if the predicted category agreeswith the actual one.

    7) Probabilities of All Categories (for PNN) - If this optionis selected and a Probabilistic Neural Net is trained, onecolumn will be inserted for every dependent category.For example, if the net is used to predict a color, theremay be "red%", "yellow%", and "green%" columns,containing probabilities assigned to each color.

    Tolerance for Good/ Bad Evaluation. For testing and training,if a numeric prediction is within the entered % of the actualdependent variable value, it will be labeled as "Good".

  • 8/11/2019 NeuralTools6 En

    74/104

    66 Utilities

    Neural Net Manager Command

    Allows the copying, moving and deletion of trained neuralnetworks

    The Neural Net Manager Command allows you to manage trainedneural networks, moving them between workbooks and files as wellas adding descriptive information to them.

    Neural networks can be stored in an Excel workbook or in a file ondisk. Any number of networks may be placed in a single Excelworkbook. Using the Neural Net Manager you can move networks tonew workbooks or files, or delete or replace them. This allows you to

    easily analyze data sets in other workbooks using existing trainednetworks, without the workbook with the training data present.

  • 8/11/2019 NeuralTools6 En

    75/104

    Reference: NeuralTools Menu Commands 67

    Neural Net Manager options include: Copy This copies a trained neural network to a different

    location. Simply select the workbook or file where you wish toplace the network.

    Remove Deletes a trained neural network. Replace Overwrites a trained neural network with a new one.

    This feature is available with nets that are used for LivePrediction. After replacement, live predictions that werepreviously made using the old net will be made using the new

    one. This, however, does not apply to Detailed Reports. If adetailed report contains live prediction cells where the net to bereplaced is used, then after replacement those cells will containfixed values.

    Net Information Allows descriptive information to be addedto a network. This helps identify the trained network and theconditions it was trained under.

  • 8/11/2019 NeuralTools6 En

    76/104

    68 Utilities

    Missing Data Utilities Command

    Allows the replacement of missing data and error values in adata set with artificial values

    The Missing Data Utilities command allows you to replace missing orother unwanted data in your data set with artificial values. Caseswith missing variable values are disregarded by NeuralTools duringtraining, testing and prediction, so it is often useful to correct theseprior to processing.

    The Training Preview dialog box will give you a warning when youhave cases with missing values in a data set. If this happens, thesecases can be fixed using the Missing Data Utilities command.

    The Missing Data Utilities dialog has the following options:

    Variables to Modify Provides a list of the variables used in thedata set in the current worksheet and displays the numbermissing, error and (for numeric variables) non-numeric data.Checking a variable selects it to have missing or other unwanteddata replaced.

    The variable list provides a right-click menu with commands forselecting and deselecting groups of variables.

  • 8/11/2019 NeuralTools6 En

    77/104

  • 8/11/2019 NeuralTools6 En

    78/104

    70 Utilities

    The Missing Data Utilities dialog provides one possible approach tomissing data: it generates artificial data where actual data is missing.It may often be better to simply leave missing data as blank cells, andlet NeuralTools disregard cases with missing data. Note thatNeuralTools will not recognize special symbols like "?" as missingdata; question marks need to be cleared, and this can be accomplishedwith the Missing Data Utilities, by selecting "Specific Value" in Valuesto Replace section, and "Clear Cells" in "What to Use for Replacement"section.

    It may also be possible to use NeuralTools to predict missing values inone independent variable fro