Coffalyser v9 Quick Analysis Guide (PRE_RELEASE).pdf

25
Coffalyser v8 This quick analysis manual explains how to perform a simple analysis using the Coffalyser VBA analysis software V8. This manual is an extension on the normal complete manual from v7 and only explains how to perform a quick analysis. A complete overview on support documents can be found at http://www.mlpa.com/coffalyser and in the document available online How_to_use_the_Coffalyser_Support_Page_v1. CONTENT 1. Downloading and installing the Coffalyser 2. Starting the Coffalyser and updating a MLPA mix 3. Loading your MLPA mix 4. Changing the Actual length of the control fragments for automated quality control 5. Importing and visualizing your fragment data files a. Supported sample types b. Adjusting File names 6. Adjusting the actual probes lengths for automated data filtering 7. Changing the metric for normalization 8. Filtering my imported data files. 9. Filtering data for a methylations status analysis 10. Checking my imported signals. 11. Settings your reference data. 12. Setting analysis method. 13. Exploring results (sample reports enabled ) a. Bar charts b. Technical data figures c. MLPA Results Reports 14. Exploring results 2 (No sample reports enabled ) a. Ratio result sheet b. PPMC c. MAD VALUE d. Reference results sheet e. Statistics sheet f. Predictions sheet g. Saving Results Summaries h. HEATMAP images

Transcript of Coffalyser v9 Quick Analysis Guide (PRE_RELEASE).pdf

  • Coffalyser v8

    This quick analysis manual explains how to perform a simple analysis using the Coffalyser VBA analysis

    software V8. This manual is an extension on the normal complete manual from v7 and only explains how

    to perform a quick analysis. A complete overview on support documents can be found at

    http://www.mlpa.com/coffalyser and in the document available online

    How_to_use_the_Coffalyser_Support_Page_v1.

    CONTENT

    1. Downloading and installing the Coffalyser

    2. Starting the Coffalyser and updating a MLPA mix

    3. Loading your MLPA mix

    4. Changing the Actual length of the control fragments for automated quality control

    5. Importing and visualizing your fragment data files

    a. Supported sample types

    b. Adjusting File names

    6. Adjusting the actual probes lengths for automated data filtering

    7. Changing the metric for normalization

    8. Filtering my imported data files.

    9. Filtering data for a methylations status analysis

    10. Checking my imported signals.

    11. Settings your reference data.

    12. Setting analysis method.

    13. Exploring results (sample reports enabled)

    a. Bar charts

    b. Technical data figures

    c. MLPA Results Reports

    14. Exploring results 2 (No sample reports enabled)

    a. Ratio result sheet

    b. PPMC

    c. MAD VALUE

    d. Reference results sheet

    e. Statistics sheet

    f. Predictions sheet

    g. Saving Results Summaries

    h. HEATMAP images

  • 1. Downloading and installing the Coffalyser

    First download the latest install version from our website (http://www.mlpa.com/coffalyser).

    Run the Coffalyser v8.exe file and install the program to an easy to find location (e.g. directly on

    C: or D:). After installation, you first need to make a few minor adjustments to prepare Excel for

    running the Coffalyser VBA program. These changes can be found in the Excel Preparation

    manual for 2003 or 2007 available online. The Coffalyser V8 may work in older version of Excel

    although this is untested and may give errors.

    2. Starting the Coffalyser and updating a MLPA mix.

    The Coffalyser comes prepared for all available MLPA mixes. The contents and sheets of all

    MLPA mixes added can be found in the folder called MLPA mixes. These sheets are needed to

    run the Coffalyser. If your MLPA mix is not added to the program you may find your sheets

    online or it can be requested through email ([email protected]). Copy the sheet into the

    folder MLPA mixes and the program will automatically update after restarting.

    Double click on the Coffalyser.xls file to start the program. In office 2003 you will receive the

    question to enable or disable Macro usage (if security has been set to medium), choose to

    enable the Macros. In Office 2007, you need to click on the button Options next to the security

    warning. A popup window will appear next click on Enable this content (figure 1)

    Figure 1 Enabling of macros in Office 2007

  • 3. Loading your MLPA mix

    After enabling the macros a window will pop-up where you can load your MLPA mix, enter your

    name, kit number and project name. First select your MLPA mix and lot number from the box

    underneath Load your MLPA mix and click on Load mix. When you MLPA mix has been

    loaded a message will appear.

    Now you may fill in your user name and kit number (not required). You may furthermore create

    a new database. Click on new, to create a new database, or select a database from the box

    underneath User Database. Raw data will be stored in a folder named after the database in

    the folder called Raw data database while results will be stored in the in a folder named after

    the database in the folder called MLPA results.

    Change the project name if you start a new project. Your results summary files will be names

    after your project name, so create logical names such as: P002 breast cancer samples 0409 or

    p301 lot 0207 QT-A.

    By selecting Search Saved Project you may reopen an old project containing the raw data files

    and results (if stored during analysis).

    Click on Start MLPA analysis to open the Coffalyser userform.

    Figure 2 Coffalyser mix loading screen.

  • 4. Changing the Actual length of the control fragments for automated quality control

    A deviation between the cloned real fragment lengths and the actually detected fragment

    lengths always exist. To enable the automated quality control of the Coffalyser the actual

    lengths of the control fragments need to be adapted before data filtering. After opening the

    Coffalyser user form, click in the All MLPA mixes screen on control and mut probes on the left

    side of the screen. (You may navigate between the different screens by clicking on the tabs on

    the topside of the program. The tab surrounded with the pink edge is the active screen. ) A list

    with all control fragments will appear as well as (if present) mutation specific probes (figure 4).

    You need to note the actual lengths of the control fragments and mutation specific probes in the

    size called electropherogram (figure 3) and then adjust these lengths in the coffalyser before

    data filtering. The concentration control fragments (Q-fragments) are expected at 61, 68, 74

    and 80 bp and may only be visible in a NO-DNA run. Double click on each of the listed lengths

    and change these to the actually detected length. Repeat this procedure for the denaturation

    fragements (D-fragments) which are expected at 88 and 96 bp and for the Y and X specific

    fragments which are expected at resp. 100 and 105 bp. After finishing click on file and select

    Save all mix changes, overwrite the old file.

    If these fragments are not in your kit, then you dont have to change the lengths and you may

    ignore the possible warnings (e.g. incomplete denaturation).

    Figure 3 MLPA run with DQX fragments.

    0

    10000

    20000

    30000

    40000

    50000

    60000

    70000

    80000

    90000

    50 100 150 200 250 300 350 400 450

    P275 new DQX 20 ng male.C04_08031113LE

    Size (nt)

    Dye

    Si

    gnal

    73.3079. 49

    85.55

    91.17

    95.98

    100. 80

    105. 66

    135.01

    141. 28

    153.37158.42

    163. 87

    177.59183.22

    193.05

    201.60209. 72

    221.23

    229.02

    245. 29

    253.82

    264.28

    282.57

    291. 34

    301. 47

    310.89

    316.97

    336.20

    346.33354.06

    374. 74

    383.75390.98

    397.22

    409. 00

    415. 62

    426.74

    436.64

    445.17

    455.58466. 57

    475.54

    482.77

  • Figure 4 MLPA mixes screens at the control/mutation fragments page.

    5. Importing and visualizing your fragment data files.

    Size called fragment data files as well as fsa files from the ABI-310/ABI-3100/ABI-3700 series can

    be imported in the Coffalyser for data normalization or visualization. Please note that all

    visualization of electropherograms are rebuilds and that it is always recommended to use the

    software you used for size calling for proper troubleshooting.

    Go to the Raw MLPA data screen and click on file. Next select the proper data format that

    matches your raw files. If you are unsure if you file format is correct, navigate to the folder

    called Manuals and open the folder called Possible file formats. This folder contains example

    files of all useable files; open them in Excel to explore the content.

    After importing the separated run names will appear in the left column. You may double click on

    them to visualize the run. By changing the combobox on the bottom left side to show/print or

    show/save image, you may choose not only to show the run in the window but also print or

    save an image of the run (figure 5). Using the Play button will start a slideshow, showing all

    images one by one.

    Supported sample types (example files can be found in the folder called manuals > possible file formats)

    "Import CEQ CSV FILE (single file)"

    Single sample files created in the CEQ fragment analysis software by the export files option.

    "Import CEQ CSV FILE (fragments genotypes)"

    Multiple runs file created in the CEQ fragment analysis software by the option export fragments

    / genotypes

    "Import GENEMAPPER TXT FILE (single)"

  • Single sample files created in Genemapper software by export of a single size called sample file

    "Import GENEMAPPER TXT FILE (multiple runs)"

    Mulitple runs files created in genemapper software exported in the plot view when displaying

    multiple runs

    "Import PEAK SCANNER FILE (combined table)"

    Multiple runs file created in Peak Scanner Software after size calling

    "Import SPECTROPHOTOMETRIX FILE (combined table)"

    Multiple runs file created using the export txt file option the spectrometrix software data

    "Import Megabace file (combined table)"

    Multiple runs file created using the export txt file option after size calling in the Megabace

    software

    "Import LICOR file (GeneTools)"

    Multiple runs file created using by Genetools scanning a slabgel image of a licor device

    "Import GENESCAN TXT FILE (single)"

    Single sample files created in Genescan software by export of a single size called sample file

    "Import GENESCAN PROJECT FILE (multiple runs)"

    Mulitple runs files created in Genescan software exported in the plot view when displaying

    multiple runs

    "Import ABI -310(0)/3700 FSA FILES"

    Size caling of raw fsa files coming from the older series of ABI may be performed directly. These

    files will be size called and transformed to genescan txt files during import.

    Figure 5 Raw data datascreen.

  • Adjusting File names.

    In some cases you may want to adjust the filenames your import, this may for instance be

    practical with a methylation status analysis, where cut and uncut run need to have the same

    names. The Coffalyser allows a few different possibilities to do this.

    Click on Settings and then go to the tab called Sample Names. Cut Sample Names is activated

    on default and cuts each file name down to a maximum of 31 characters (no more is allowed for

    import). Left and right thus indicate the position of characters counting from the left side of the

    file name where the name that will be imported from the filename will start (left) and where it

    will be cut down (Right).

    Alternatively you may enable the options to resize the filenames using SIGNS. Setting a sign will

    cut the filename down to everything before the set sign. For instance if you filename is called

    Sample1_CUT_1909234_A09_Row8, then set your sign to be: CUT. After importing your sample

    name will be Sample1_CUT.

    Finally you may enable to remove track ends; this will cut everything in your file name after the

    _ underscore sign. This may not always be recommended because you may cut too much!

    Figure 6 Sample names setting tab

    6. Adjusting the actual probes lengths for automated data filtering.

    A deviation between the cloned probe lengths and the actually detected probe lengths also

    exists. To filter the correct probe signals we first need to adapt the actual lengths as found after

    size calling. The easiest way to do this may be by using the autobin function. Navigate to the

    data filtering page (figure 7). Next select a number of runs (preferably runs performed on

    normal DNA, with MS-MLPA UNCUT runs) with overall good signals with equal intensities and a

  • low background. Next click on AddA (>>) while the combo box on the left side is set to

    add/remove reference runs. The names will move underneath the header Reference Runs,

    now change the combo box on the left side to perform autobin on references (P) and click on

    the left play button.

    The program will now try to find the actual length belonging to the probes in the mix. After

    finishing the actual lengths will be displayed underneath SET BIN while the cloned lengths of

    the probes can be seen underneath CLONED LENGTH. Check the estimated lengths underneath

    SET BIN with for instance an image of an size called electropherogram and make sure that they

    all contain valid values. Alternatively you may double click on a run name in the left column. All

    fragments lengths and intensities of the set metric will appear in the most right columns. You

    can use this list to define the actual length if they were not estimated correct. Probe signals will

    always be close to the cloned length and will have high signals intensities as compared to

    background signals. Each set bin value can be adapted manually by clicking on the fragment, and

    adjusting the value using the pop up box.

    When you are sure all actual set lengths in the SET BIN list are correct, change the left combo

    box to Save all mix changes and click on the left Play button. Overwrite the old files. The actual

    lengths are now set and saved and can be used every time you use this MLPA mix as long as the

    capillary device, size calling method, gel, size standard etc remain the same.

    To remove imported runs, change the combo box on the left side to Delete imported Files (P)

    and click on the left play button.

  • Figure 7 Data filtering page

    7. Changing the metric for normalization.

    The Coffalyser on default imports the peak heights as the basis for normalization. This is set so,

    because the estimation of the peak area is often determined wrong when peaks are quite wide.

    You may however choose to change the metric to the total peak area found within a bin or the

    highest peak area signal in a bin. A bin is made up out of the actual set probe length in the set

    bin +/- 2 bp (on default).

    Click on Setting and navigate to the Filter options tab (Figure 8). To change the metric to peak

    height, select Use peak areas next to Peak areas or Peak heights. To use the summation of all

    signals in a bin (not recommended, because background will be taken into the probe

    fluorescence) enable the option box Use summation of peaks next to the text Multiple signals

    in a bin.

    This page will also allow you to change the minimal set signal intensity (arbitrary number) and

    the width of each bin.

    Figure 8 Filter options tab

    8. Filtering my imported data files.

    Now the Coffalyser is prepared for automated filtering, you can start filtering and checking your

    probe signals.

    Navigate to the data filtering page (figure 7). Next select the runs which you want to use as

    reference runs. In most cases this will be MLPA runs performed on normal DNA. Note that

    reference and sample runs which are isolated in the same way and are from the same tissue

    type will be easier to compare and provide better results.

    Next click on AddA (>>) while the combobox on the left side is set to add/remove reference

    runs. The names will move underneath the header Reference Runs. Next change the

    combobox on the left side to add/remove sample runs. Select the runs which you want to use

    as sample runs and click on click on AddA (>>). Your reference runs and sample runs should now

    be right underneath the headers REFERENCE RUNS and resp. SAMPLE RUNS.

  • Click on the Play button on the right side to start data filtering. The program will work of both

    lists, when data filtering is finished the quality overview screen should appear.

    9. Filtering data for a methylations status analysis

    Separate rules apply when you want to perform a methylation status analysis. First of all, you

    need to make sure that your cut and uncut runs have equal names to make automatic

    comparison possible. A cut run may for instance be named: Sample1-cut, while the uncut run

    should then be named: Sample1-uncut. Also see the section on Adjusting File names for this.

    Navigate to the data filtering page (figure 7). Next select the all uncut runs in the left column

    and click on AddA (>>) while the combobox on the left side is set to add/remove reference

    runs. The names will move underneath the header Reference Runs. Next change the

    combobox on the left side to add/remove sample runs. Select the CUT runs belonging to your

    previously imported uncut runs and click on click on AddA (>>). Your reference and sample data

    should now be right underneath the headers REFERENCE RUNS and resp. SAMPLE RUNS. Click

    on the play button on the right side to start filtering your data.

    10. Checking my imported signals.

    Direct after analysis the quality control is opened giving an overview of the found signals and

    results of the quality control steps of all imported runs (figure 9). If this menu doesnt open, you

    may click on All run QC in the left window. The displayed parameters are: Run name, DNA

    concentration estimation, Number of probe signals found, Number of reference probes found,

    Ligation fragment found, Male/Female determination (if Y fragment present), Denaturation

    check on fragments 88 bp and denaturation check on fragments 96 bp. Please note that these

    tests are dependent on the correct settings of the actual lengths before data filtering! When the

    probe or control fragments lengths are not set correct, probe signals may be missing or runs

    may seem to be denatured incomplete while this is not the case. In doubt always check your

    runs on the RAW electropherogram.

    To separately remove bad reference or sample runs, first click on Reference runs QC or Sample

    runs QC. The same check appears but now only for the reference or sample runs. To prevent

    errors during analysis, remove reference runs missing any probe signal and remove sample runs

    missing reference probe signals. You may do this by double clicking on the run name in the

    center windows while the Reference runs QC or Sample runs QC is displayed.

    If you think that some probes were not filtered correct, possible because they are missing in

    several runs, click on Reference runs or Sample runs in the left column in the bottom. This will

    display the imported signals of each reference or sample run, allowing you to recognize the

    probes which has been filtered wrong. You need to go one step back and manually adjust the

    set bin for this fragment by double clicking on it.

  • Figure 9 Data analysis page, all runs quality check point.

    11. Settings your reference data.

    Depending on the MLPA mix you used and the samples you have used a normalization method

    needs to be chosen. The coffalyser allows many different normalization methods to fit many

    different situations. All settings can be found by clicking on settings. For you convenience a

    number of presets has been created which can be chosen from the combo box on the right side.

    First you need to consider your reference runs. If you have performed a number of reference

    runs, this data will be used to correct and normalize your sample runs and it will be used to

    calculate the reproducibility of the MLPA probes in this experiment.

    If you do not have reference runs you may set a median of all samples as reference data. Please

    note that this is only recommended if the coincidence of aberrations for each probe is less than

    30% over all samples. Estimated standard deviations and predictions are unreliable when you

    are comparing your samples against themselves and these results should thus be ignored. Click

    on settings, to change your reference data type and navigate to the tab called Analysis Options

    (figure 10). Change the option box at Reference data TYPE to median of all imported samples

    to normalize samples without reference runs. Alternative method allow the average or

    minimum of all samples, these are only to be used by advanced users in special situations.

  • Figure 10 Basic analysis settings.

    12. Setting analysis method.

    Now that you reference data is set you need to choose an analysis method. To best meet the

    requirements of the context of your experimental design a number of pre-made settings are

    available. Each of these methods may work well in some circumstances but may before poorer

    in other. Depending on expected aberrations and availability of reference probes and reference

    runs and quality of the runs a method needs to be chosen by the user. The section here under

    will discuss each of the pre-defined methods; also see a quick overview in table 1.

    After choosing a method in the Combo box, click on the right play button to start the analysis. In

    case missing signals were found in reference runs or reference probes were missing in sample

    runs, you may get warning messages. Always delete the runs that are noted as not good, unless

    you are sure that the missing signals are really in your sample and thus not caused by bad

    filtering or other artifacts.

    Table 1 Quick overview of the pre-set analysis methods and features

  • "Directanalysis"

    This method is the most straight forward method to normalize your data. It doesnt correct for

    sloping effects. This method should thus only be used when size to signal sloping is more or less

    the same between your reference and sample runs. You can always check if there is a sloping

    artifact in your results by ordering your ratio results on probe length (after analysis, using the

    normal XLS order function) and this may thus be the best method to start with. When the

    average ratio of the shorter probes is higher or lower than the average ratio of the longer probes

    (not taking into account ratio caused by aberrant genes), you may need to re-perform the

    analysis using an alternative analysis method.

    This method assumes that the set reference probes remain normal between reference and

    sample runs. Each reference probes will be used to separately determine the ratio of a target

    probe. The median of all these calculated ratios will estimate the final ratio. The median is the

    center value, and this method is thus robust for aberrations in the reference probes as long as

    more than 51% of all reference probes remain normal.

    "Directanalysis (all probes)"

    This is almost equal to the directanalysis method in that it doesnt correct for sloping effects. This

    method assumes the number of changes in the target sequences of the probes is minimal. Each

    target probe will be used to separately determine the ratio of each target probe. The median of

    all these calculated ratios will estimate the final ratio. The median is the center value, and this

    method is thus robust as long as more than 51% of all probes remain normal.

    This method will usually be used for MLPA kits where no reference probes are present and the

    number of aberrant probes is limited to no more than 5 probes although this method may also

    provide good (or sometimes even better) results with MLPA kits having only a few reference

    probes (5 or less).

    "Directmethylation status"

    This is almost equal to the directanalysis method in that it doesnt correct for sloping effects. This

    will usually not be necessary to perform the methylation status analysis because the same DNA is

    being compared to each other (Cut & Uncut). This method assumes the number of changes in the

    target sequences of the reference probes is minimal. Each reference probe will be used to

    separately determine the ratio of each target probe. The median of all these calculated ratios will

    estimate the final ratio. The median is the center value, and this method is thus robust for

    aberrations in the reference probes as long as more than 51% of these probes remain normal.

    Please note that in many cases some reference probes may give higher or lower signals in the cut

    sample. This is not due to aberrations but rather because of the structural changes present after

    cutting the DNA or because of the reduced presence of amplifiable targets during the PCR

    reaction.

    This method calculates the relative ratio of each probe signal between the cut and uncut

    reaction. When a target is completely methylated, no target is cut and the signal of this probe is

    expected to be ratio 1 or 100%. When a target is hemi-methylated, a single allele is expected to

    be completely cut and the expected ratio between the cut and uncut single is 0.5 or 50%.

    "Control Probe Analysis (LS)"

    The control probe analysis (LS) method is only recommended for MLPA kits with at least 7-8

    reference probes which are targeted to areas of which the user is sure that no changes occur in

  • these regions in the used samples (please check the probe mix description for the targets of the

    reference probes in your kit).

    The control probes method (LS) first defines if there are outliers between the set reference

    probes in each sample assuming that a median of all reference probes is normal. The reference

    probes signals that are not outliers are then used to calculate the amount of regression in that

    run using an adapted the least of squares regression method. The size to signal drop is thus

    expected to be more or less linear, if this is not the case then use one of the LMS methods. Next

    to this, the least of squares method is sensitive with a low number of signals. It may thus be

    recommended to use the Tumor analysis (LS) method instead which is more robust and also

    normalizes against the reference probes. After regression correction the method assumes that

    the set reference probes remain normal between reference and sample runs. Each reference

    probes will be used to separately determine the ratio of a target probe. The median of all these

    calculated ratios will estimate the final ratio. The median is the center value, and this method is

    thus robust for aberrations in the reference probes as long as more than 51% of all reference

    probes remain normal.

    "Tumor analysis (LS)"

    The tumor analysis (LS) method is the recommended method for users who wish to normalized

    their data against their reference probes, but the number of available reference probes is either

    too low, or some samples may have aberrations in the regions to which the reference probes are

    targeted.

    The tumor analysis (LS) first defines if there are outliers over all probes in each sample assuming

    that a median of all probes is the normal status. It should be noted that it doesnt matter if a

    median of all signals falls on an aberrant probe. If most probes are targeted to a region, or

    regions that have the same copy number status, size to signal drop is still estimated correct. The

    probes signals that are not defined as outliers are then used to calculate the amount of

    regression in that run using an adapted the least of squares regression method. The size to signal

    drop is thus expected to be more or less linear, if this is not the case then use one of the LMS

    methods.

    After regression correction the method assumes that the set reference probes remain normal

    between reference and sample runs. Each reference probes will be used to separately determine

    the ratio of a target probe. The median of all these calculated ratios will estimate the final ratio.

    The median is the center value, and this method is thus robust for aberrations in the reference

    probes as long as more than 51% of all reference probes remain normal.

    "Population Analysis (LS)"

    The population analysis (LS) method is recommended for MLPA kits without any reference

    probes and the number of expected changed in the target regions of the probes is less than 40%.

    This method has also been proven to be useful for MLPA kits having a limited number of

    reference probes and the number of expected changed in the target regions of all probes in the

    samples is less than 40%.

    The population analysis (LS) first defines if there are outliers over all probes in each sample

    assuming that a median of all probes is the normal status. The probes signals that are not defined

    as outliers are then used to calculate the amount of regression in that run using an adapted the

    least of squares regression method. The size to signal drop is thus expected to be more or less

    linear, if this is not the case then use one of the LMS methods.

  • Each target probe will be used to separately determine the ratio of each target probe. The

    median of all these calculated ratios will estimate the final ratio. The median is the center value,

    and this method is thus robust as long as more than 51% of all probes remain normal.

    "Control Probe Analysis (LMS)"

    The control probe analysis (LMS) method is equal to the reference probe (LS) method except that

    the regression is determined directly on all control probe signals (without outlier detection) using

    a least of median squares method. This method is very robust in the presence of outliers,

    although it is less precise. This is also the recommended method when size to drop is non-linear,

    this problem usually has to be solved in the experimental procedure itself although sometimes

    LMS correction may give good results.

    After regression correction the method assumes that the set reference probes remain normal

    between reference and sample runs. Each reference probes will be used to separately determine

    the ratio of a target probe. The median of all these calculated ratios will estimate the final ratio.

    The median is the center value, and this method is thus robust for aberrations in the reference

    probes as long as more than 51% of all reference probes remain normal.

    "Tumor analysis (LMS)"

    The Tumor analysis (LMS) method is equal to the Tumor analysis (LS) method except that the

    regression is determined directly on all probe signals (without outlier detection) using a least of

    median squares method. This method is very robust in the presence of outliers, although it may

    be less precise. This is also the recommended method when size to drop is non-linear; this

    problem usually has to be solved in the experimental procedure itself although sometimes LMS

    correction may give good results.

    After regression correction the method assumes that the set reference probes remain normal

    between reference and sample runs. Each reference probes will be used to separately determine

    the ratio of a target probe. The median of all these calculated ratios will estimate the final ratio.

    The median is the center value, and this method is thus robust for aberrations in the reference

    probes as long as more than 51% of all reference probes remain normal.

    "Population Analysis (LMS)"

    The Population Analysis (LMS) method is equal to the Population Analysis (LS) method except

    that the regression is determined directly on all probe signals (without outlier detection) using a

    least of median squares method. This method is very robust in the presence of outliers, although

    it is less precise. This is also the recommended method when size to drop is non-linear, this

    problem usually has to be solved in the experimental procedure itself although sometimes LMS

    correction may give good results.

    Each target probe will be used to separately determine the ratio of each target probe. The

    median of all these calculated ratios will estimate the final ratio. The median is the center value,

    and this method is thus robust as long as more than 51% of all probes remain normal.

    "Methylation Status Analysis (P)"

    Please note that it is recommended to use the direct methylations status analysis method, unless

    there seems to be a sloping difference between your cut and uncut samples. This method

    assumes the number of changes in the target sequences of the reference probes is minimal.

  • The regression is determined directly on all reference probe signals (without outlier detection)

    using a least of median squares method. This method is very robust in the presence of outliers,

    After correction each reference probe will then be used to separately determine the ratio of each

    target probe. The median of all these calculated ratios will estimate the final ratio. The median is

    the center value, and this method is thus robust for aberrations in the reference probes as long

    as more than 51% of these probes remain normal. Please note that in many cases some

    reference probes may give higher or lower signals in the cut sample. This is not due to

    aberrations but rather because of the structural changes present after cutting the DNA or

    because of the reduced presence of amplifiable targets during the PCR reaction.

    This method calculates the relative ratio of each probe signal between the cut and uncut

    reaction. When a target is completely methylated, no target is cut and the signal of this probe is

    expected to be ratio 1 or 100%. When a target is hemi-methylated, a single allele is expected to

    be completely cut and the expected ratio between the cut and uncut single is 0.5 or 50%.

    "RNA Intra Normalization"

    The RNA intra normalization method is different from all methods in that it doesnt compare

    reference and sample run but only normalized the Sample run signals to the internal reference

    probe signals. This is thus only recommended if you are in the internal target concentration in a

    sample as compared to an internal control (such as B2M). If you want to compare reference and

    sample run performed with RNA MLPA kits, you may also use the directanalysis based on the

    control probes.

    "USER DEFINED"

    Putting the combobox on user defined means that you may define all steps yourself. You can do

    this by clicking on settings and making adjustments at the different tabs. This is only

    recommended for the advanced user. More info on advanced settings can be found in the

    complete coffalyser manual, appendix Advanced analysis settings.

    13. Exploring results (sample reports enabled)

    After the analysis is finished, the results page will automatically open. If you have selected to create separate

    reports (enable option: create and save all sample reports), you can explore chart and summaries from this

    page. When this option is enabled for each samples a separate results report will be generated. This costs

    more time and analyzing in this way is thus much slower. In practice it is more useful to first empirically define

    the optimal analysis method by performing your normalization multiple times using different normalization

    methods. By evaluating the different results an optimal analysis method can be defined with which you may

    want to create the separate results report sheets and charts.

    On the MLPA results page you have a number of options to continue.

    - Save your result sheets / results

    - Explore results reports

    - Explore project results.

    - Open the quick explorer

    - Open the advanced explorer

  • Bar charts

    Click on Result Reports to view the bar charts of your sample runs. In the MLPA results window use the

    right combobox to choose the way you want to view your sample results. This is either in a bar ratio chart,

    result report method or technical chart. Double click in the left column on a sample name to view the bar

    chart. The probes in this chart are ordered in the recommended order, showing the Map view locations

    on the X-axis and the found probe ratio on the Y-axis. The reference probes are usually ordered to the

    most right position. The error bars coming with the chart are the calculated standard deviations (1x)

    found for each probes, assumed you have used reference runs which were performed on normal human

    DNA. These standard deviations are calculated from the reproducibility of reference runs and the

    confidence of the normalization factor. High standard deviations are usually found when the used

    reference samples were not reproducible as compared to each other.

    Figure 11 Results screen

    Technical data figures

    The technical figures can be visualized by changing the right combo box to choose technical chart and

    double clicking on a sample name to open it. The figure that appears contains the relative MLPA signals

    after correction for probe specific bias and log transformation. In the default LS methods outliers are first

    determined where after a sloping correction is performed on the signals that were determined as not

    being outliers. Depending on the method a normal status will be declined on the reference probe signals

    (control probe method) or on all probe signals (tumor analysis method / population method). Please note

  • that the normal status during outlier detection may also fall on aberrant probes, e.g. when applying the

    tumor analysis method, sloping may be determined on aberrant probes even though the normalization

    may normalize the data afterwards to the reference probes.

    In figure 12 a sample is visualized analyzed in the Population method. Most probes, including the

    reference probes (green dots) give equal signal intensities except for 9 aberrant probes. These 9 aberrant

    probes were defined as outliers by dynamic correlation checks (see four outlier lines) and will not be used

    for slope correction. The remaining signals will be used for regression correction by calculating a expected

    regression line through the remaining point and using the distance of each point to this line for the

    coming normalization.

    This figure should thus only be used to determine if your sloping correction was ok, and how your

    reference probes behave according to each other (in figure 12 they all have the same chromosomal

    status, and thus give equal signal intensities, although more or less effected by the sloping artifact).

    Figure 12 Technical MLPA run figure

  • MLPA Results Reports

    Results reports images (figure 13) can be generated for each run separately by changing the right combo

    box to show results report and the left combobox to save as image and then double clicking on a

    sample name to open it. The practical information will also be displayed in the results window but doesnt

    have any color casing, which is available in the report.

    The report contains all important run information and analysis settings in the top. Next to this it contains

    all information on the probes of the used mix and results are ordered in the MRC recommended ordering,

    placing the important genes in the top and reference probes more to the bottom.

    The calculated ratios are those of the sample as compared to the noted Reference runs using the set

    normalization method. The added standard deviations are calculated from a combination of the

    reproducibility of the reference runs and the confidence of the normalization factor, giving an indication

    how much variation this probe had in the used experimental setting. The 95% confidence range is then

    furthermore basically the range in which the ratio may have fallen assuming two standard deviations.

    Finally the probability calculation can aid in results interpretation, it uses the estimated ratio and standard

    deviation to questimate if the estimated ratio is more probably to be a gain (ratio 1.5) or a loss (ratio 0.5)

    assuming that the variation of each probes follows a Gauss distribution. Ratio that are red are confident

    to be gained or lost for more than 95%. Please note that these are questimations and that for real statistic

    calculation on losses and gains several positive control samples are necessary. Questimation also assumes

    that the variation distribution of a gain is equal to that of a normal probe results.

  • Figure 13 MLPA results report image, created by saving result report as image.

    sample namesample namesample namesample name D1.03.01831DMD__MLPA-P034^GSsTXT.txt OperatorOperatorOperatorOperator User1

    Analysis Date 5/2/2008 17:16 Kit numberKit numberKit numberKit number

    normalisat ion methodnormalisat ion methodnormalisat ion methodnormalisat ion method Tumor analysis (LS)

    Average / Median Median on Normalisation Factor Refe rence runs (

  • 14. Exploring results 2 (No sample reports enabled)

    After the analysis is finished, the results page will automatically open (also when no reports are generated). Click

    in the right column on Open the quick explorer, the main user form will minimize and the quick results explorer

    will open together with the sheets containing the results. You can now use the results explorer to guide you to the

    calculated ratio results or just navigate to the different sheets. You can also use the minimize button in the right

    top, the coffalyser user form will then close and you can explore your results on the sheets.

    To minimize the number of available sheets, most calculation sheet will be hidden if you use the minimize button

    or quick explorer option. To view all sheets you can use the right mouse click button and click on Show all Sheets,

    you can also use the right mouse button to reopen the coffalyser user form by clicking on Open coffalyser.

    Figure 14 Quick results explorer form. Change the combo box to the results type you wish to view and

    then select the sample name in the main window. By clicking on Save results summary you can save a

    file containing all relevant result sheets.

    Ratio result sheet

    The most important sheet is probably the ratio results sheets or RESULTS (1), this sheet contains the calculated

    ratio results of all analyzed sample runs. Above the calculated ratios sample information and quality check points

    are displayed, in order they are: Sample name, analysis data, chosen normalization method, if an average or

    median over the reference probes was used as a normalization factor, the number of found probes, the number of

    found reference probes, if the ligation control peak (92 bp was found), if the sample was male or female (if a Y

    probe was present), the PPMC and MAD values (which will be explained more down, if there was enough DNA

    during the MLPA reaction (by estimation of the relative signal of the Q-fragments (60, 68, 74, 80 bp)), if the DNA

    was denatured completely (by estimation of the relative signals of the DD fragments (88, 96 bp)).

    PPMC

  • The PPMC value, or Pearson product moment correlation value is the correlation value found when slope

    correction is applied. For every MLPA run, first the probe specific biases are corrected; hereafter the signals which

    originate from targets with a normal status are expected to have a linear relationship with each other. These

    signals either originate from the reference probes (control probe methods) or all probes (tumor analysis /

    population method). Whether the signal are normal or not is defined by a outlier detection system which follows a

    Monte Carlo like simulation assuming that the median of all signals is the normal status, or can be used to find the

    sloping relation. Sloping correction can thus also be done on signals that are actually aberrant, but do show a linear

    relationship with all, cause the majority contains the same chromosomal status. In most cases correction of sloping

    can best be performed using all signals, this being much more robust that correcting on the reference probes only.

    Correction on the reference probes (control probes methods) is thus only recommended when the number of

    reference probes is 10 or more.

    MAD VALUE

    The MAD value or median of absolute deviation is the value which arises when you subtract the final estimated

    ratio (displayed in RESULTS (1)) from each separately calculated probe ratio (using each reference probes

    separately) and then take the median value of these. This value is thus very small (or zero) if each reference probes

    creates the same result, if this is not the case the target sequences to which the reference probes hybridize may be

    aberrant. The median estimator is however quite robust though a higher MAD value may sincerely compromise

    the confidence of the calculated ratios. In normal cases the color on the MAD value will be green or orange (with

    more aberrations), when the MAD value becomes red you may need to try a different normalization factor or

    adjust the reference probes for a more optimized normalization.

    Figure 15 Sample result information displayed above each sample ratios

  • Figure 16 Different results sheets can also be used to explore your results. The colored sheets contain all important

    results and data.

    The sheet called RESULT (2) basically contains the same results but now with the 95% confidence range followed

    after the estimated ratio. This 95% confidence range is the same as discussed earlier at the results reports. The

    sheets called Reference runs and Sample runs contain the filtered raw peak height or areas (this can be set at the

    settings option, in the tab called data filtering). Since 2008 on default the Coffalyser uses peak height even though

    peak areas are theoretically more correct. This is done so, because several program often calculate the areas

    underneath the peak incorrect, especially when the peak has a large base. Peak areas are thus only recommended

    to use when the peaks contain little or no shoulders and when background signals are low.

    Reference results sheet

    Another important sheet is the reference results sheet. During the normalization a single reference signal is

    created for each probe from all your reference runs. Before actual normalization of the samples, each separate

    reference runs is also normalized against this signal. This not only allows the program to calculate the

    reproducibility of each probe but can also inform you which reference runs failed. The results are displayed as

    normal sample runs and each run should be completely blue in case your reference runs are performed on normal

    DNA. The standard deviation over the reference runs should be low (it will be bold if the value is too high). If there

    are reference runs that seem to contain aberrant probes, it is good to try to reanalyze your samples are deleting

    the aberrant reference runs to see if results improve. This may also work when you set a median of your samples

    runs as reference data (settings Tab, reference data). After the initial analysis where all your samples were set as

    reference runs, you can explore your reference run results, recognize the samples that are most aberrant, then

    reopen the coffalyser and delete these samples from the reference list. In a reanalysis the most normal looking

    samples are now that as reference data which can improve your results.

    Statistics sheet

    The statistics sheet contains basic statistical information calculated over your reference runs. This sheet thus

    doesnt give information about the reproducibility of your probes (unless all samples were normal runs), but rather

    informs you on the probes that are changed most or least in the sample group used.

    Predictions sheet

    The predictions sheet can aid in results interpretation, although it should be used solely. The predictions are

    furthermore only useful when at least 3 reference samples are used which were performed on normal human

    DNA. The predictions are based on the same calculations as discussed earlier in the results report section. Each

    prediction checks if the calculated ratio is more likely to be a gain (ratio 1,5) or loss (ratio 0.5). Signals that were

    not found are automatically set as a homozygous deletion and ratios that were estimated higher than ratio 2.5 are

    set as amplifications. Please note that most MLPA probes will not reach a ratio 1.5 in the case of a gain but rather

    have a ratio between the 1.3-1.5. This makes the prediction less accurate and a gain may thus seem to be

    ambiguous (or undetermined) while it is truly gained. True statistics are thus dependent on positive controls and

    information about the MLPA probes signals in the case of a gain and also the reproducibility and spread of the

    MLPA probes in the case of a gain (or loss).

    A basic rule for MLPA probes ratios is that each aberrant probe signal should be confirmed by a second probe. This

    can be a probe within the same sample results which has its target sequence close to that of the first probes or in a

    second run or duplo.

  • Saving Results Summaries

    To save a results summary of your current analyzed results you can either click on the save results summary

    button in the quick explorer or click on File at the results page and then click on File > Save last analyzed results

    summary. Both options will create a new Excel file containing data of the sheets: reference runs, sample runs,

    results (1), results (2), predictions, reference runs and statistics.

    HEATMAP images

    A heatmap image can also be generated from the last analyzed results by clicking on Create heatmap image

    under File at the results page. Heatmap images have the advantage that the set color borders are more gradual

    which may make data interpretation easier, especially in the case of heterogenous cell populations (figure 17)

    when ratios of gained sequences tend to be a bit lower.

    Ratio result sheet

    PPMC

    MAD VALUE

    Reference results sheet

    Statistics sheet

    Predictions sheet

    Saving Results Summaries

    HEATMAP images

  • Figure 17 Heatmap image of some samples with the P034 MLPA mix.

    P034 MLPA probemix lot 1105, 0505,

    1004.txt D1.03

    .01

    831D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.04

    .04

    280D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.05

    .05

    541D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.05

    .05

    580D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.05

    .05

    626D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.05

    .05

    692D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.05

    .05

    825D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.05

    .05

    830D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    D1.05

    .06

    014D

    MD_

    _M

    LPA-

    P034

    ^G

    SsTX

    T.tx

    t

    DMD probe 1353-L1001 1.01 1.03 1.03 1.09 1.04 1.08 1.14 1.04 1.08DMD probe 1357-L1005 1.45 1.02 1.05 1.00 1.02 1.01 1.01 1.03 0.99DMD probe 1361-L1009 1.45 1.05 1.06 1.03 0.99 1.07 1.04 1.02 1.02DMD probe 1365-L1013 1.34 0.95 0.99 0.99 0.93 1.05 0.96 1.02 1.00DMD probe 1954-L1574 1.42 0.89 0.94 0.93 0.88 0.92 0.93 0.94 0.90DMD probe 1373-L1021 1.41 0.99 1.04 1.03 1.03 1.06 1.00 1.00 1.00DMD probe 1713-L1281 1.50 1.05 0.97 1.04 1.04 0.99 0.95 1.02 0.99DMD probe 1715-L1283 1.41 0.47 0.95 0.98 0.95 0.94 1.04 1.04 0.94DMD probe 1385-L1033 1.45 0.51 1.03 1.00 1.04 1.03 0.99 1.00 1.00DMD probe 1718-L1286 1.45 0.44 0.89 0.90 0.86 0.92 1.06 0.97 0.91DMD probe 1355-L1615 0.97 0.99 1.12 1.09 1.03 1.06 1.06 1.01 1.05DMD probe 1359-L1007 0.97 1.03 1.04 1.08 1.03 1.04 1.08 1.05 1.06DMD probe 1363-L1011 0.90 0.93 0.99 1.01 0.94 1.04 1.04 1.02 1.00DMD probe 1958-L1518 0.93 0.98 1.04 1.09 1.01 1.03 0.99 0.96 1.06DMD probe 1371-L1019 0.96 1.06 0.99 1.06 1.01 1.05 1.03 0.99 0.99DMD probe 1375-L1023 1.00 1.00 1.06 1.05 1.02 1.03 1.04 0.96 0.98DMD probe 1379-L1616 0.96 0.98 0.99 1.02 0.96 1.06 0.99 1.02 0.99DMD probe 1716-L1284 0.96 0.99 0.99 1.07 0.95 0.90 0.96 0.93 1.01DMD probe 1387-L1035 1.04 1.11 1.07 1.06 1.05 1.00 1.02 0.99 1.05DMD probe 1391-L1039 0.97 1.05 0.96 0.94 0.89 0.95 0.86 0.95 0.90DMD probe 1354-L1002 0.96 0.96 1.00 0.98 0.97 0.98 1.00 0.97 1.00DMD probe 1711-L1279 0.96 1.04 1.02 1.12 1.03 1.08 1.07 1.01 1.03DMD probe 1362-L1010 0.98 0.97 0.99 0.99 1.01 0.98 0.98 1.02 0.99DMD probe 1366-L1014 1.00 1.02 1.01 1.03 0.97 1.02 1.00 1.03 1.03DMD probe 1370-L1287 1.01 0.96 0.00 0.51 0.97 0.00 1.00 0.97 0.97DMD probe 1374-L1288 1.01 1.11 0.00 0.52 1.01 0.00 1.06 1.01 1.02DMD probe 1378-L1026 1.00 1.00 0.00 0.49 0.96 0.00 1.00 1.01 1.00DMD probe 1382-L1030 0.98 1.11 1.03 0.53 1.03 0.00 0.00 1.01 1.06DMD probe 1717-L1285 1.01 0.92 0.98 0.50 0.85 0.00 0.00 0.95 0.93DMD probe 1390-L1038 1.06 0.96 0.97 0.47 0.91 0.00 0.00 0.90 0.97DMD probe 1356-L1004 0.98 0.97 1.06 0.97 1.00 1.04 1.04 1.03 1.02DMD probe 1897-L1008 1.03 1.02 1.06 1.04 1.00 1.06 1.01 1.01 1.03DMD probe 1364-L1012 0.98 1.02 1.07 1.00 1.03 1.06 1.03 1.04 1.07DMD probe 1368-L1016 0.99 1.00 1.00 0.96 1.00 0.97 0.97 1.00 0.98DMD probe 1372-L1020 1.00 1.05 1.01 1.01 1.02 1.03 1.01 1.00 1.03DMD probe 1376-L1024 1.03 1.03 1.08 0.90 0.97 1.03 0.95 0.99 1.04DMD probe 1960-L1520 1.02 1.05 1.00 1.03 1.06 1.01 0.98 0.99 1.03DMD probe 2482-L2711 1.04 1.06 1.02 1.08 1.04 1.00 1.03 1.00 1.05DMD probe 1388-L1036 0.95 1.03 0.98 0.97 0.98 0.99 0.99 0.98 0.98DMD probe 1392-L1040 0.99 1.04 1.01 0.96 0.95 0.97 0.95 0.99 1.04Control probe 1691-L0465 0.98 0.92 1.05 0.97 0.99 0.99 1.02 0.99 1.00Control probe 1690-L0423 1.03 0.90 0.92 0.96 1.02 1.01 0.93 0.95 0.94Control probe 1692-L1531 1.03 1.06 0.88 0.97 0.95 1.00 0.92 0.94 0.48Control probe 1768-L1617 0.99 0.95 1.01 1.02 0.98 0.98 1.01 1.02 0.46Control probe 1770-L1334 0.99 0.94 0.89 0.95 0.93 0.93 0.82 0.96 0.44