Interrupted speech perception


  • Date posted: 12-Jan-2016

  • Interrupted speech perception

    Su-Hyun Jin, Ph.D., University of Texas

    &

    Peggy B. Nelson, Ph.D., University of Minnesota

  • Introduction

    In everyday life, speech is not equally intelligible over time because of background noise, which tends to fluctuate rather than stay steady in time, amplitude, and frequency.

    Normal-hearing (NH) listeners can take advantage of the fluctuating nature of noise.

    Masking release: the improvement in speech recognition in fluctuating noise compared with performance in steady noise. It reflects listeners' ability to use the momentary decreases (dips) in fluctuating noise, resolving the background fluctuations in order to extract speech information (Dubno et al., 2002; Nelson et al., 2003; Jin & Nelson, 2006).
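The definition of masking release above amounts to a simple difference between two scores. The following is a minimal sketch of that arithmetic, assuming scores are expressed as percent correct; the function name `masking_release` and the listener scores are hypothetical and are not taken from the study.

```python
def masking_release(fluctuating_pct: float, steady_pct: float) -> float:
    """Masking release in percentage points.

    Computed as the score in fluctuating noise minus the score in
    steady noise, both measured at the same signal-to-noise ratio.
    """
    return fluctuating_pct - steady_pct


# Hypothetical percent-correct scores, for illustration only
# (these are NOT data from the presentation).
scores = {
    "listener_a": {"steady": 40.0, "fluctuating": 90.0},
    "listener_b": {"steady": 45.0, "fluctuating": 50.0},
}

release = {
    name: masking_release(s["fluctuating"], s["steady"])
    for name, s in scores.items()
}

for name, mr in release.items():
    print(f"{name}: masking release = {mr:+.1f} percentage points")
```

A large positive value means the listener benefits from the dips in the noise; a value near zero (or negative) means little or no release from masking.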

  • Speech and Noise (NH)

    Quiet speech

    Speech in noise

    Speech in fluctuating noise

  • Introduction

    Effect of noise on speech perception in hearing-impaired (HI) listeners

    HI listeners show little or no release from masking in fluctuating noise.

    This holds even for listeners with mild hearing loss (Bacon et al., 1998; Dubno et al., 2002): their speech recognition in steady noise was close to normal, but some performed significantly worse than normal in fluctuating noise.

    Speech perception in fluctuating noise may provide a more sensitive measure of impairment due to hearing loss.

  • Jin & Nelson (2006)

    Investigated the relationship between the amount of masking release (MR) and hearing sensitivity and temporal resolution in NH and HI listeners.

    Sentence and consonant recognition in quiet, steady noise, and fluctuating noise

    Hearing sensitivity and forward masking

  • Jin & Nelson (06): Amplification and shaping

    A two-stage process was used to amplify speech and noise for HI listeners.

    Shaping was applied based on the half-gain rule to compensate for the configuration of the hearing loss.

    Overall amplification was then added to bring each listener to maximum sentence recognition (90% or better) in quiet.

    The process was applied to speech and noise for each HI listener individually.

    [Figure 1. Long-term spectra for IEEE sentences and speech-shaped noise, along with the average thresholds in dB SPL for the NH group. Series: NH threshold, speech, noise. Axes: frequency; dB SPL.]

    [Figure 2. Long-term spectra for IEEE sentences and speech-shaped noise, along with hearing thresholds in dB SPL for HI1. Series: HI1 threshold, HI1 speech, HI1 noise. Axes: frequency; dB SPL.]

    [Figure 3. Long-term spectra for IEEE sentences and speech-shaped noise, along with hearing thresholds in dB SPL for HI3. Series: HI3 threshold, HI3 speech, HI3 noise. Axes: frequency; dB SPL.]

    Table 1. Audiometric hearing thresholds in dB HL as a function of frequency in Hz

    Listener     500   1000   2000   3000   4000   6000   8000
    AVG NH        16   11.5   13.5   12.5     14     13   14.5
    HI1           30     45     50     55     55     45     40
    HI2           55     70     55     45     50     60     55
    HI3           15     15     20     45     65     55     55
    HI4            5     15     45     45     55     60     55
    HI5           25     25     25     35     45     50     55
    HI6           20     40     40     45     50     55     60
    HI7           35     55     60     65     70     75     75
    HI8           20     25     25     50     40     50     45
    HI9           15     15     15     35     50     50     55

    Table 1. Absolute thresholds in dB SPL by one-third-octave band (Hz)

    Listener   500   630   800  1000  1250  1600  2000  2500  3150  4000  5000  6300  8000
    NH          16    16  11.5  11.5  11.5  13.5  13.5  12.5  12.5    14    13  14.5  14.5
    HI1         41    42  51.5  51.5    53    55  58.5    60    62    64    58    55  54.5
    HI3         26    26  21.5  21.5    23    25  28.5  52.5    60    70    68    58  54.5
    HI4         16    18    20  21.5    30    40  53.5  52.5    57    60    53    53  54.5
    HI2         66    68    72  76.5    70    67  63.5    60    59    59    58  54.5  54.5
    HI5         36    36  31.5  31.5  31.5  33.5  33.5  42.5    50    54    53    50  44.5


    Speech (MAX): spectral level by one-third-octave band

    Freq (Hz)    HI1    HI2    HI4    HI3    HI5     NH
    500         61.5   88.7   73.3   76.5     75   54.1
    630         63.5   92.8   73.4     78   73.4     61
    800         64.5   89.9   71.2     74   71.1     60
    1000        66.4   90.6   73.8   72.7   68.8   56.7
    1250          68   85.7   74.9   72.5   69.5     56
    1600        69.1   84.6   74.1   69.9   71.3   52.6
    2000        70.2   79.7   74.9   75.5     70   52.6
    2500        73.2   78.3   72.8   76.3     72   51.5
    3150        75.7   78.9   74.7     79   75.3   49.5
    4000          72   71.8   72.4   76.4     73   45.6
    5000        65.6   67.6   72.2   75.9   72.6     43
    6300        62.7   65.8   68.4     74   70.3     45
    8000        56.3   63.7   60.1   68.4   61.5     42

    Table 1 (continued). Spectral level of speech and noise by one-third-octave band

    Speech (LEQ)

    Freq (Hz)        HI1    HI2    HI4    HI3    HI5     NH
    500             61.5     80   59.5     64     60   61.1
    630             63.5     82   57.3   60.5     60   60.4
    800             64.5     78   58.6   58.7     58     58
    1000            66.4     79     61     58   55.8   55.7
    1250              68     76   60.5     60     55     55
    1600            69.1     72   60.5     63   54.5   53.5
    2000            70.2     67     63     63     56   52.6
    2500            73.2   64.5     63     66     59   50.7
    3150            75.7     63   64.5   69.5   61.5   50.5
    4000              72     60     62   72.5   59.5     50
    5000            65.6     56     56     69   57.5     46
    6300            62.7     56     56   61.5     56     46
    8000            56.3     48     45     55     47   42.5
    dBA (overall)     77     76   72.5   74.5   68.5   65.7

    Noise (-10 SNR)

    Freq (Hz)        HI1    HI2    HI4    HI3    HI5     NH
    500             64.5   84.5   65.3   72.5     65     67
    630               67   87.5   69.7     73     64   68.3
    800               72   83.5   69.2     73     66     67
    1000            76.5   87.4     68     72     64   65.7
    1250              78     87   68.4     71     63   64.8
    1600              78   84.5     70   71.5   63.3     64
    2000              79     81     71     73   66.5     64
    2500              81     79     72     76   69.5   63.7
    3150              84     78     75   80.1   73.7   64.5
    4000              83     74     73   80.5   72.5   60.6
    5000              72     69   68.7     77   70.5     57
    6300              70     68   65.6     73     69   56.5
    8000              61     63   60.5     68     62   52.5
    dBA (overall)     87     89   81.6     85     79     75
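The two-stage amplification described on the "Amplification and shaping" slide can be illustrated with a short sketch. It assumes the simplest reading of the half-gain rule (gain in dB equals half the audiometric threshold in dB HL at each frequency); the slides do not say which variant was used. The function names and the 10 dB overall gain are hypothetical. The HI1 thresholds are taken from Table 1.

```python
# Audiometric thresholds for listener HI1 in dB HL, as listed in Table 1.
HI1_THRESHOLDS_DB_HL = {
    500: 30, 1000: 45, 2000: 50, 3000: 55, 4000: 55, 6000: 45, 8000: 40,
}


def half_gain_shaping(thresholds_db_hl):
    """Stage 1: frequency shaping.

    Assumed rule: gain (dB) = 0.5 * hearing loss (dB HL) per frequency.
    """
    return {freq: 0.5 * loss for freq, loss in thresholds_db_hl.items()}


def add_overall_gain(shaping_db, overall_db):
    """Stage 2: add a flat overall gain on top of the shaping."""
    return {freq: gain + overall_db for freq, gain in shaping_db.items()}


shaping = half_gain_shaping(HI1_THRESHOLDS_DB_HL)

# Hypothetical overall gain. In the study this stage was adjusted per
# listener until sentence recognition in quiet reached 90% or better.
gains = add_overall_gain(shaping, overall_db=10.0)

for freq in sorted(gains):
    print(f"{freq:>5} Hz: shaping {shaping[freq]:5.1f} dB, total {gains[freq]:5.1f} dB")
```

Because the same gains are applied to both speech and noise, the signal-to-noise ratio is unchanged by this processing; only audibility changes.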

  • Jin & Nelson (06)

    Role of hearing sensitivity

    Listeners with sensorineural hearing loss show reduced hearing sensitivity.

    Noise is more detrimental to HI listeners than to NH listeners because HI listeners already have reduced redundancy of the speech signal in quiet (Van Tasell, 1993).

    Role of temporal resolution

    Compared with NH listeners, HI listeners are more affected by non-simultaneous maskers.

    There is a strong correlation between masking release and forward-masked threshold (Dubno et al., 2002).

    It was observed that syllable recognition in fluctuating noise might be associated with age-related increases in forward-masked thresholds.

  • Result 1: sentence recognition

    Percent correct keyword identification at -5 dB SNR

    [Figure 4. Percent correct keyword identification of HI listeners and the NH group for sentence recognition at -5 SNR. Series: NH mean, HI8, HI9, HI5, HI1, HI4, HI3, HI6, HI7, HI2. Axes: condition; percent correct keywords.]

    [Figure 5. Percent correct keyword identification of HI listeners and the NH group for sentence recognition at -5 SNR. Series: NH mean, HI8, HI9, HI5, HI1, HI4, HI3, HI6, HI7, HI2. Axes: condition; percent correct keyword identification.]

    [Figure 6. The amount of masking release (MR) of HI listeners and the NH group for sentence recognition at -5 SNR. Series: NH average, HI8, HI9, HI5, HI1, HI4, HI3, HI6, HI7, HI2. Axes: gate frequency; percent difference (MR).]

    [Figure 7. The amount of masking release (MR) of HI listeners and the NH group for sentence recognition at -10 SNR. Series: NH average, HI8, HI9, HI5, HI1, HI4, HI3, HI6, HI7, HI2. Axes: gate frequency; percent difference.]

    [Figure 8. The normalized masking release (NMR) of HI listeners and the NH group for sentence recognition at -5 SNR. Series: NH average, HI8, HI9, HI5, HI1, HI4, HI3, HI6, HI7, HI2. Axes: gate frequency; normalized masking release.]

    Figure 9. The normalized masking release (NMR) of HI listeners and the NH group for sentence recognition at -10 SNR.

    NHAVG

    HI8

    HI9

    HI5

    HI1

    HI4

    HI3

    HI6