Auditory Coding

download Auditory Coding

of 30

Transcript of Auditory Coding

  • 8/8/2019 Auditory Coding

    1/30

    > What Determine's a Neuron's Tuning? The

    Efficient Coding of Sensory Information

    4 January 2011

    [email protected]

  • 8/8/2019 Auditory Coding

    2/30

    18/01/2011 Eff ic ient Coding of Sensory Information2

    Summary

    1. Introduction

    2. The model

    3. Results

    4. Conclusion

    5. Bibliography

  • 8/8/2019 Auditory Coding

    3/30

    18/01/2011 Efficient Coding of Sensory Information3

    1. Introduction

    The context :

    Problem of finding an efficient coding

  • 8/8/2019 Auditory Coding

    4/30

    18/01/2011 Efficient Coding of Sensory Information4

    1. Introduction

    What makes a coding efficient ?

    Preserves the underlying sound features

    Lowest size possible for a given quality

    Easy to encode and decode

  • 8/8/2019 Auditory Coding

    5/30

    18/01/2011 Efficient Coding of Sensory Information5

    1. Introduction

    Our problematic here :

    Time-Relative

    Spikes in a

    population

    Given an

    Input Waveform

    = Reconstructed

    Waveform

    With lowdifferences with

    the original

  • 8/8/2019 Auditory Coding

    6/30

    18/01/2011 Efficient Coding of Sensory Information6

    1. Introduction

    What was done until then :

    Reverse Correlation (RevCor) :

    Given an input

    waveform

    We insert different

    white noises

    We use filters to find

    most probable spikes

    Then we use

    functions to

    reconstruct a

    waveform

  • 8/8/2019 Auditory Coding

    7/30

    18/01/2011 Efficient Coding of Sensory Information7

    1. Introduction

    What was done until then :

    Reverse Correlation (RevCor) :

    Given an input

    waveform

    We insert different

    white noises

    We use filters to

    find most probablespikes

    Then we use

    functions to

    reconstruct a

    waveform

  • 8/8/2019 Auditory Coding

    8/30

    18/01/2011 Efficient Coding of Sensory Information8

    1. Introduction

    The originality of this model:

    Theoretical Code ( Black box model) vs

    physiological revcor filters

    Spikes are chosen to maximize efficiency of the code

    (non-redudancy)

    The algorithm is trained with specific datasets

  • 8/8/2019 Auditory Coding

    9/30

    18/01/2011 Eff ic ient Coding of Sensory Information9

    Summary

    1. Introduction

    2. The model

    3. Results

    4. Conclusion

    5. Bibliography

  • 8/8/2019 Auditory Coding

    10/30

    18/01/2011 Efficient Coding of Sensory Information10

    2. The model

    What do we need for encoding accoustic signal :

    Suppressing useless information or noise

    Efficient for a wide range of signals (both transient

    and harmonic)

    Time-relative

    Event-based

  • 8/8/2019 Auditory Coding

    11/30

    18/01/2011 Efficient Coding of Sensory Information11

    2. The model

    An efficient way of coding would be a kernel take :

    The signal x(t) is encoded with a set a kernel functions 1, ,

    M that can be positioned arbitrarily and independently in

    time.

    Assuming that the kernel functions exist at all time points

    during the signal t :

    sm() = coefficient at time forM(t) = additive noise

    (1)

  • 8/8/2019 Auditory Coding

    12/30

    18/01/2011 Efficient Coding of Sensory Information12

    2. The model

    By using a sparse coefficient signal sm() composed only ofDirac delta functions (our event-based condition), this

    equation reduces to :

    sim

    = coefficient of the ith

    instance ofmi

    m = temporal position of the ith instance ofmnm = number of instance ofm (can be different for each m)

    (t) = additive noise

    (2)

  • 8/8/2019 Auditory Coding

    13/30

    18/01/2011 Efficient Coding of Sensory Information13

    2. The model

  • 8/8/2019 Auditory Coding

    14/30

    18/01/2011 Efficient Coding of Sensory Information14

    2. The model

    This is just a way to code sounds, we need to find values to these

    distincts parematers, in two (linked) steps :

    1. Encoding: Determining the optimal temporal positions and

    coefficients of kernels functions

    2. Learning : Determining the optimal kernel functions

    We repeat these steps until we find a treshold value (here 0,1 for the

    coefficient s)

    At the beginning, kernel functions are initialized as standard gammatone

    functions

  • 8/8/2019 Auditory Coding

    15/30

    18/01/2011 Efficient Coding of Sensory Information15

    2. The model

    Encoding : Matching-base pursuit

    The general idea is to iteratively approximate the input signal with successive

    orthogonal projections onto the unit-normed gammatone kernels.

    As such, we decompose the signal as

    : inner product between signal and m , equivalent to smRx(t) = Residual signal after approximating x(t) in the direction of m

    (1)

  • 8/8/2019 Auditory Coding

    16/30

    18/01/2011 Efficient Coding of Sensory Information16

    2. The model

    The projection with the biggest inner product will minimize the power of

    Rx(t), thus yielding the best approximation of x(t) with a single kernel. We

    want to record the coefficient for this approximation.

    Iteratively, (1) becomes

    Rx0 = x(t) on initialization.

    We then substract the best fitting projection, and record it, leaving

    orthogonal to

    On each iteration, the power of Rxn is thus bound to disminish. We

    put up a treshold to stop the algorithm

  • 8/8/2019 Auditory Coding

    17/30

    18/01/2011 Efficient Coding of Sensory Information17

    2. The model

    Learning: Probabilistic form

    We rewrite our main equation as :

    Where s^ is the approximation of the maximum and thegradient of

    1

    , , M

    (1)

  • 8/8/2019 Auditory Coding

    18/30

    18/01/2011 Efficient Coding of Sensory Information18

    2. The model

    Learning: Probabilistic form

    We then have, for each m :

    = Residual error at position tim of kernel M

    We know and x^, so with additional computations we can deduce anoptimal value forM

  • 8/8/2019 Auditory Coding

    19/30

    18/01/2011 Eff ic ient Coding of Sensory Information19

    Summary

    1. Introduction

    2. The model

    3. Results

    4. Conclusion

    5. Bibliography

  • 8/8/2019 Auditory Coding

    20/30

    18/01/2011 Efficient Coding of Sensory Information20

    3. Results

    Because one of the condition was that the model had to be

    robust to a large range of accoustic signals, the training dataset

    was composed by :

    1. Mammalian vocalizations2. Nature sounds

    a) Ambient (rain, wind)

    b) Transient (crunching leaves, impact of wood)

  • 8/8/2019 Auditory Coding

    21/30

    18/01/2011 Efficient Coding of Sensory Information21

    3. Results

    Red : ModelBlue : Physiological cat

    data

    The model predictsrevcor (physiological)

    shapes!

  • 8/8/2019 Auditory Coding

    22/30

    18/01/2011 Efficient Coding of Sensory Information22

    3. Results

    Red : Classic model Blue : Physiological Cat Data

    Black : Environmental SoundsInitialization

    Green : Animal Vocalization

    Initialization

    With speech initialization, the

    model yields similar results aswith the classic dataset.

    Comparison of the model initialized with different datasets :

  • 8/8/2019 Auditory Coding

    23/30

    18/01/2011 Efficient Coding of Sensory Information23

    3. Results

    Comparison of the model initialized with different datasets :

    Environmental Sound :

    Very brief

    Vocalizations :

    Longer

    Reserved Speech :

    Reverse of classic model

    (Grey bars = 5 ms)

  • 8/8/2019 Auditory Coding

    24/30

    18/01/2011 Efficient Coding of Sensory Information24

    3. Results

    Red : Classic model Light Blue : Not learning

    Model

    Black : Fourier Transform

    Blue : Daubechies wavelettransform

    For fidelity under 35db

    (treshold beyond which the

    difference between the original

    signal and the computed signalis untellable), the spike-coding

    model performs better thanclassic Fourier or wavelet

    transforms.

    Efficiency of the code :

  • 8/8/2019 Auditory Coding

    25/30

    18/01/2011 Eff ic ient Coding of Sensory Information25

    Summary

    1. Introduction

    2. The model

    3. Results

    4. Conclusion

    5. Bibliography

  • 8/8/2019 Auditory Coding

    26/30

    18/01/2011 Efficient Coding of Sensory Information26

    4. Conclusion

    General conclusions :

    This model yields results strikingly similar to those recorded

    physiologically in auditory nerves of a cat.

    The kernel functions we obtained with the right initialization

    (mixed natural sounds) should be good approximations of what

    happens in a neuron black box .

    The good results with speech initialization could prove that

    evolution had adapted in the direction of speech.

  • 8/8/2019 Auditory Coding

    27/30

    18/01/2011 Efficient Coding of Sensory Information27

    4. Conclusion

    Limits and extensions :

    Depends heavily on the datasets used (right one?)

    Optimizing this system is NP-Hard

    Doesnt describe the underlying sound features

    Doesnt take into account changes in response with signal

    intensity

  • 8/8/2019 Auditory Coding

    28/30

    18/01/2011 Eff ic ient Coding of Sensory Information28

    Summary

    1. Introduction

    2. The model

    3. Results

    4. Conclusion

    5. Bibliography

  • 8/8/2019 Auditory Coding

    29/30

    18/01/2011 Efficient Coding of Sensory Information29

    5. Bibliography

    Bibliography :

    Evan Smith, Michael S. Lewicki, Efficient coding of time-relative structure usingspikes, Neural Computation January 2005, Vol. 17, No. 1: 1945.

    Evan Smith, Michael S. Lewicki, Efficient auditory coding,Nature 439

    , 978-982(23 February 2006)

    Dario Ringach, Robert Shapley, Reverse correlation in neurophysiology (2003),Cognitive Science

    Mallat, S. G. & Zhang, Z. Matching pursuits with time-frequency dictionaries.IEEE Trans. Signal Process. 41, 3397-3415 (1993).

  • 8/8/2019 Auditory Coding

    30/30

    18/01/2011 Efficient Coding of Sensory Information30

    Thank you for your attention !