
Jay Stokes, Microsoft Research
John Platt, Microsoft Research
Joseph Kravis, Microsoft Network Security
Michael Shilman, ChatterPop, Inc.

ALADIN: Active Learning for Statistical Intrusion Detection

NIPS Workshop 2007 – Machine Learning in Adversarial Environments for Computer Security 12/8/2007

Motivation

Metadata of Microsoft’s external internet traffic is logged using ISA Server Firewall (ISA: Internet Security and Acceleration)

Up to 35 million log entries per day

Security analysts must search for and identify new anomalies

Looking for new malware, bad PTP, etc.

Can machine learning help?

Active Learning

[Figure: system diagram showing User 1, User 2, ISA Server, ALADIN (RankSamples, EvaluateSamples), and the Security Analyst]

Human interactively provides labels for new samples

Network traffic metadata logged to SQL

ALADIN evaluates and ranks samples

Security Analyst labels samples

ALADIN reranks samples and repeats
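The four steps above can be sketched as a generic loop. This is only an illustration under stated assumptions, not the ALADIN implementation: `active_learning_loop`, `rank_samples`, and `ask_analyst` are hypothetical names, and the ranking function is left as a pluggable callback because the slides define the actual scores later.

```python
from typing import Callable, Dict, List, Sequence


def active_learning_loop(
    samples: Sequence,
    labels: Dict[int, str],
    rank_samples: Callable[[Sequence, Dict[int, str]], List[int]],
    ask_analyst: Callable[[int], str],
    per_iteration: int,
    iterations: int,
) -> Dict[int, str]:
    """Repeat: rank the unlabeled samples, have the analyst label the top few."""
    labels = dict(labels)  # work on a copy so the caller's dict is untouched
    for _ in range(iterations):
        # Evaluate and rank; keep only samples that still lack a label.
        ranked = [i for i in rank_samples(samples, labels) if i not in labels]
        if not ranked:
            break  # nothing left to label
        # The analyst labels the highest-ranked samples of this round.
        for i in ranked[:per_iteration]:
            labels[i] = ask_analyst(i)
    return labels


# Toy usage: rank by descending value, with a stand-in "analyst" (hypothetical).
toy_samples = list(range(10))
toy_result = active_learning_loop(
    toy_samples,
    labels={0: "even"},
    rank_samples=lambda s, known: sorted(range(len(s)), key=lambda i: -s[i]),
    ask_analyst=lambda i: "odd" if i % 2 else "even",
    per_iteration=3,
    iterations=2,
)
```

In a real deployment the ranking callback would retrain the models on the current labels before scoring, which is what makes each new round of labels change the next ranking.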

ALADIN

Multiclass classifier for monitoring network traffic

Goal: Minimize analyst labeling time

Weights can be adaptively improved at user’s site


Choosing Samples for Labeling – Active Anomaly Detection

Label only anomalies (Pelleg, Moore, NIPS04)

Discover rare and interesting classes

Multiclass model

Avoid “Normal” vs. “Not Normal” problem

Leads to high error rates

Choosing Samples for Labeling – Active Learning

Label only samples closest to the decision boundary (Almgren, Jonsson, CSFW04)

RBF SVM

Ignore samples located away from the decision boundaries

May not find new classes

ALADIN: Combines Active Anomaly Detection and Active Learning

Unlabeled items

Anomalies (potential malware): ask analyst for labels

Samples closest to the hyperplanes
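A minimal sketch of how the two selection criteria could be merged into one labeling queue. The slides do not say how many samples of each kind are requested or how ties are handled, so the function name `select_for_labeling`, the two quotas, and the convention that lower scores mean "more uncertain" and "more anomalous" are all assumptions made for illustration.

```python
def select_for_labeling(uncertainty, log_likelihood, n_uncertain, n_anomalous):
    """Return indices of samples to show the analyst.

    uncertainty[i]    : small value = close to a decision boundary (assumed)
    log_likelihood[i] : small value = poorly explained by its class model (assumed)
    """
    indices = range(len(uncertainty))
    # Lowest likelihood first: candidate anomalies.
    most_anomalous = sorted(indices, key=lambda i: log_likelihood[i])[:n_anomalous]
    # Smallest margin first: samples nearest the hyperplanes.
    most_uncertain = sorted(indices, key=lambda i: uncertainty[i])[:n_uncertain]
    chosen = []
    for i in most_anomalous + most_uncertain:
        if i not in chosen:  # a sample may qualify under both criteria
            chosen.append(i)
    return chosen


queue = select_for_labeling(
    uncertainty=[0.9, 0.05, 0.6, 0.1, 0.8],
    log_likelihood=[-2.0, -3.0, -40.0, -1.0, -25.0],
    n_uncertain=2,
    n_anomalous=2,
)
```

Here samples 2 and 4 are picked as anomalies and samples 1 and 3 as boundary cases, so one round of labeling serves both goals: finding new classes and refining the classifier.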

Classification Stage

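Discriminative Learning, Logistic Regression

$$P(\text{class}_i \mid x) = \frac{1}{1 + \exp\left(-\left(\sum_j w_{ij} x_j + b_i\right)\right)}$$

Minimize cross entropy function

$$E = -\sum_n \sum_i \left[ t_{in} \log P(i \mid x_n) + (1 - t_{in}) \log\left(1 - P(i \mid x_n)\right) \right]$$

Uncertainty Score

$$\min_{j \neq i} \left| P(\text{class}_i \mid x_n) - P(\text{class}_j \mid x_n) \right|$$

Fast computation for interactive labeling

Scales well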
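The per-class probability and the uncertainty score can be computed in a few lines. The sketch below assumes that i in the uncertainty score is the class with the highest probability for the sample, which the slide does not state explicitly; the weights and inputs are made-up toy values, and the function names are hypothetical.

```python
import math


def sigmoid(z):
    """Numerically safe logistic function 1 / (1 + exp(-z))."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def class_probabilities(x, weights, biases):
    """One logistic output per class i: sigmoid(sum_j w_ij * x_j + b_i)."""
    return [
        sigmoid(sum(w * v for w, v in zip(w_i, x)) + b_i)
        for w_i, b_i in zip(weights, biases)
    ]


def uncertainty_score(probs):
    """Gap between the predicted class and its closest rival.

    Assumes at least two classes; a small gap means the sample
    lies near a decision boundary.
    """
    i = max(range(len(probs)), key=probs.__getitem__)  # predicted class (assumption)
    return min(abs(probs[i] - p) for j, p in enumerate(probs) if j != i)


# Toy model with three classes and two features (illustrative values only).
toy_weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
toy_biases = [0.0, 0.0, 0.0]
near_boundary = uncertainty_score(class_probabilities([2.0, 1.9], toy_weights, toy_biases))
far_from_boundary = uncertainty_score(class_probabilities([5.0, -5.0], toy_weights, toy_biases))
```

The first toy sample scores almost equally for two classes and gets a small gap, so it would rank high for labeling; the second is confidently assigned and gets a large gap.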

Modeling Stage

naïve Bayes Model

Training Data: labeled data and predicted labels of the unlabeled data

Anomaly Score

$$\log P(x \mid \text{class}_c) = \sum_j \log P(x_j \mid \text{class}_c)$$

Fast computation for interactive labeling

Scales well
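A minimal sketch of the anomaly score as a sum of per-feature log-likelihoods under the class model. The slides do not say how individual features are modeled, so this example assumes categorical features with add-one (Laplace) smoothing; `fit_naive_bayes`, `log_likelihood`, and the toy rows are hypothetical.

```python
import math
from collections import Counter, defaultdict


def fit_naive_bayes(rows, labels, alpha=1.0):
    """Count feature values per class; each row is a tuple of categorical values."""
    counts = defaultdict(lambda: defaultdict(Counter))  # class -> feature -> value counts
    class_sizes = Counter(labels)
    vocab = defaultdict(set)  # feature index -> values seen in training
    for row, c in zip(rows, labels):
        for j, v in enumerate(row):
            counts[c][j][v] += 1
            vocab[j].add(v)
    return counts, class_sizes, vocab, alpha


def log_likelihood(row, c, model):
    """log P(x | class_c) = sum_j log P(x_j | class_c); low values flag anomalies."""
    counts, class_sizes, vocab, alpha = model
    total = 0.0
    for j, v in enumerate(row):
        numerator = counts[c][j][v] + alpha
        # The extra +1 reserves probability mass for values never seen in training.
        denominator = class_sizes[c] + alpha * (len(vocab[j]) + 1)
        total += math.log(numerator / denominator)
    return total


# Toy data: (protocol, service) pairs, all labeled or predicted "normal".
toy_rows = [("tcp", "http")] * 8 + [("tcp", "smtp")] * 2
toy_model = fit_naive_bayes(toy_rows, ["normal"] * 10)
typical = log_likelihood(("tcp", "http"), "normal", toy_model)
unusual = log_likelihood(("udp", "irc"), "normal", toy_model)
```

The unseen combination receives a much lower log-likelihood than the common one, so it would be ranked ahead of it when asking the analyst for labels.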

Network Intrusion Detection Results

KDD-Cup 99 Data Set

Provides Oracle Labels

100K Samples

Use All Features in the Data

Label 10 Initial Samples Randomly

100 Samples Labeled per Iteration

Results – Anomaly Detection

[Plot: Iteration on the x-axis; curves for ALADIN, Logistic Regression, and SVM]

Results – Prediction Accuracy

[Plot: Iteration on the x-axis; curves for ALADIN, Logistic Regression, and SVM]

FP/FN Per Class

True Label   Num Labeled Samples   True Predicted Label   TP Count   Incorrectly Predicted Label   FN Count   FP Rate   FN Rate
normal       551                   normal                 55715      satan                         3          4.12%     0.20%
                                                                     guess_passwd                  10
                                                                     ipsweep                       67
                                                                     back                          2
neptune      57                    neptune                20425                                               0.00%     0.00%
smurf        82                    smurf                  18904      normal                        7          0.00%     0.04%
back         36                    back                   5          normal                        1961       0.00%     99.75%
ipsweep      58                    ipsweep                675        normal                        27         0.07%     3.85%
satan        49                    satan                  470        normal                        20         0.00%     4.08%
portsweep    54                    portsweep              223        normal                        1          0.00%     0.45%
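As a quick arithmetic check, the FN rates listed for the attack classes with a single error row are consistent with FN / (TP + FN). The slides do not spell out this definition, so treating it as the one used is an assumption; the counts below are copied from the table and `fn_rate` is a hypothetical helper.

```python
def fn_rate(tp_count, fn_count):
    """False-negative rate in percent: missed samples over all samples of the class."""
    return 100.0 * fn_count / (tp_count + fn_count)


# (TP count, FN count) per true label, taken from the table.
table_counts = {
    "smurf": (18904, 7),
    "back": (5, 1961),
    "ipsweep": (675, 27),
    "satan": (470, 20),
    "portsweep": (223, 1),
}

for label, (tp, fn) in table_counts.items():
    print(f"{label}: {fn_rate(tp, fn):.2f}%")
```

For example, "back" gives 1961 / (5 + 1961), which rounds to 99.75%: nearly every unlabeled "back" sample was predicted as normal.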

Malware Detection on Microsoft Network Logs

Analyzed several daily log files.

Identified “5.exe” on the corporate network, which had not previously been identified

Trojan.Esteems.D

5.exe monitors user Internet activity and private information. It sends stolen data to a hacker site.

Identified several other worms (NewApt Worm, Win32.Bropia.T, W32.MyDoom.B) and keyloggers (svchqs.exe)

All of which were currently logged

Some waiting to be labeled

All currently blocked by ISA firewall rules

Conclusions

ALADIN discovers rare and interesting classes

ALADIN maintains low classification error

Scales due to fast learning with logistic regression and naïve Bayes

Identifies network intrusion attacks

Identifies malware via network traffic patterns

Tech Report: http://research.microsoft.com/~jstokes
