Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al.,...

19
Investigating the Parameter Space of A T-Cell Cross Regulation Binary Classifier Ian Wood 4/25/13 I690, Prof. Flammini

Transcript of Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al.,...

Page 1: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Investigating the Parameter Space of A T-Cell Cross Regulation Binary

ClassifierIan Wood

4/25/13I690, Prof. Flammini

Page 2: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

T-Cell Cross Regulation

Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of the dynamics and repertoire selection of regulatory CD4+ T cells.,” Immunological Reviews, vol. 216, pp. 48–68, 2007.

𝑑𝐸𝑑𝑡

=𝑝𝐸 𝐸𝐴−𝑑𝐸 𝐸

𝑑𝑅𝑑𝑡

=𝑝𝑅 𝑅𝐴−𝑑𝑅 𝑅

Page 3: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

T-Cell Cross Regulation for Machine Classification

Image From: A. Abi-Haidar and L. M. Rocha, “Collective Classification of Textual Documents by Guided Self-Organization in T-Cell Cross-Regulation Dynamics,” Evolutionary Intelligence, p. In press, 2011.

Page 4: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Machine Classification IssuesBenefits:

Temporal dynamics could allow the system to adapt to changes over time (concept drift)

Possibly useful for classifying unbalanced setsProblems:

Agent-based models take timeLarge parameter space is difficult to explore

Page 5: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

A Large Parameter SpaceNslot – The number of antigens to produce for each feature

DE – Death rate for unbound effectors

DR – Death rate for unbound regulators

E0- - Initial effector population for Nonself documents

E0+ - Initial effector population for Self documents

E0u - Initial effector population for Unlabeled documents

R0- - Initial regulator population for Nonself documents

R0+ - Initial regulator population for Self documents

R0u - Initial regulator population for Unlabeled documents

This doesn’t include variations in the algorithm!

Page 6: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Finished Work

Page 7: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Top Parameter Configurations So Far

nslot eself rself

enself

rnself eunlab runlab

edrate

rdrate cond condi precision

accuracy recall

mcc f1 tpos tneg

fpos fneg

12 3 9 8 2 6 3 1 1 2 1 0.74 0.8 0.930.62

0.82 28 20 10 2

13 3 9 8 2 5 3 1 1 2 1 0.95 0.78 0.60.61

0.73 18 29 1 12

12 3 9 8 2 5 3 2 2 2 1 0.84 0.78 0.70.57

0.76 21 26 4 9

20 8 12 12 8 8 8 25 25 2 2 1 0.57 0.130.2

7 0.24 4 30 0 26

20 12 24 12 10 12 10 2 2 1 2 0.58 0.63 10.3

9 0.73 30 8 22 0

20 8 12 12 8 8 8 25 25 5 2 0.58 0.63 10.3

9 0.73 30 8 22 0

Page 8: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Features Over Time

nslot eself rself

enself

rnself

eunself runself

edrate

rdrate cond

condi

precision

accuracy

recall mcc f1 tpos tneg

fpos fneg

12 3 9 8 2 6 3 1 1 2 1 0.74 0.80.9

30.620.8

2 28 20 10 2

Page 9: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

ApproachSee how distributions of cosine scores

correspond to parametersThe system should be able to correct itself, so

I want to see how parameters allow sensitivity to changes in co-occurrence frequencyInvestigate artificial datasets for simple casesInvestigate mathematical relationships in

simple cases

Page 10: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Distribution of TCells

nslot eself rself

enself

rnself

eunself runself

edrate

rdrate cond

condi

precision

accuracy

recall mcc f1 tpos tneg

fpos fneg

12 3 9 8 2 6 3 1 1 2 1 0.74 0.80.9

30.620.8

2 28 20 10 2

nslot eself rself

enself

rnself

eunself runself

edrate

rdrate cond

condi

precision

accuracy

recall mcc f1 tpos tneg

fpos fneg

12 3 9 8 2 6 3 1 1 2 1 0.74 0.80.9

30.620.8

2 28 20 10 2

Page 11: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Distribution of Tcells cont.

nslot eself rself

enself

rnself

eunself runself

edrate

rdrate cond

condi

precision

accuracy

recall mcc f1 tpos tneg

fpos fneg

14 3 8 3 7 3 7 1 2 5 1 0 0.38 0 -0.36 -1 0 23 7 30

20 4 6 6 4 4 4 1 1 6 1 0 0.5 0 0 0 0 30 0 30

Page 12: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Artificial Datasets10 documents of 100 words eachWords are randomly generated and unique to

each documentOne word, “lambda”, is present in every

document, but initially biased incorrectlySet 1 – First document is labeled Self, the rest

NonselfSet 2 – First document is labeled Nonself, the

rest SelfSet 3 – First 5 = Self, Last 5 = NonselfSet 4 – First 5 = Nonself, Last 5 = Self

Page 13: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Parameter ConfigurationsParameter Values, Step

Nslot [10, 13], 1

DE 0.1

DR 0.1

E0- =E0

+

E0+ [5, 14], 1

E0u =E0

+

R0- [1, 6], 1

R0+ [6, 16], 1

R0u =R0

-

Page 14: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Set1Appropriate Behavior Inappropriate Behavior

Page 15: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Set2Appropriate Behavior Inappropriate Behavior

Page 16: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Appropriate Behavior in Sets 1 & 2

Page 17: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Appropriate ConfigurationsNslot E0

+ R0+ E0

- R0- E0

u R0u DE DR

10 5 12 5 3 5 3 .1 .1

10 6 12 6 2 6 2 .1 .1

10 7 13 7 1 7 1 .1 .1

10 8 10 8 5 8 5 .1 .1

11 11 11 11 2 11 2 .1 .1

12 6 12 6 3 6 3 .1 .1

12 7 10 7 4 7 4 .1 .1

12 9 11 9 3 9 3 .1 .1

13 5 14 5 4 5 4 .1 .1

13 6 15 6 3 6 3 .1 .1

Page 18: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

Future DirectionsMathematical Analysis

I tried to write equations for the expected change in the lambda population between the first and second documents, but I either assumed too much or made errors.

Larger SearchSimple artificial dataset runs much faster than an actual

corpusRun on Sets 3 and 4More variation in the artificial data (lambda should not

be the only common feature)More precision in distribution data (only looks at

mean, over-emphasizes features that appear only once)

Page 19: Ian Wood 4/25/13 I690, Prof. Flammini. T-Cell Cross Regulation Image From: J. Carneiro, et al., “When three is not a crowd: a Crossregulation model of.

References J. Carneiro, et al., “When three is not a crowd: a Crossregulation

model of the dynamics and repertoire selection of regulatory CD4+ T cells.,” Immunological Reviews, vol. 216, pp. 48–68, 2007.

A. Abi-Haidar and L. M. Rocha, “Collective Classification of Textual Documents by Guided Self-Organization in T-Cell Cross-Regulation Dynamics,” Evolutionary Intelligence, p. In press, 2011.