Simultaneous acquisition of high-rate early, middle, and late auditory evoked potentials.

Simultaneous acquisition of high-rate early, middle, and late auditory evoked potentials 2 ¨ ¨ Fred D. Holt IV1 and Ozcan Ozdamar

Abstract— Auditory evoked potentials (AEPs) are typically acquired at rates that facilitate their study as segregated by epochs relative to stimulus onset: early (ABR, 1.5-15 ms), middle (MLR, 15-60 ms), and late (LAEP, ≥60 ms) potentials. In particular, late AEPs are often acquired with stimulus repetition rates between 0.1 Hz and 1 Hz, and are bandpass filtered to contain information only within 1-30 Hz. These low repetition rates, filtering and low SNRs eliminate much of the potential contributions of the early and middle-latency responses in AEP recordings. This study aims to demonstrate a method for acquiring whole-AEP responses at higher stimulus repetition rates of 0.5 Hz to 10 Hz, by utilizing the Continuous Loop Averaging Deconvolution (CLAD) method, increasing the bandwidth of the recordings to 1-300 Hz to include early components, and using short-duration chirps to increase synchronous firing of the cochlear and auditory pathway neurons. Such a method may facilitate diagnostic or functional assessment of single AEP recordings for detection, identification, or evaluation of early, middle and late components of auditory responses.

I. INTRODUCTION Auditory evoked potentials (AEPs) have historically been studied according to three latency classifications: The auditory brainstem responses (ABR), composed of strongly neuroanatomically-correlated peaks occurring within the first 15 ms after the onset of an auditory stimulus. The middlelatency response (MLR) represents activation of the auditory thalamocortical pathways and related areas within 15-60 ms after stimulus presentation. Both ABRs and MLRs are predominantly exogenic and depend highly on the physical characteristics of the acoustic stimulus. Late-latency AEPs (LAEP, or Cortical AEP, CAEP) are typically comprised of several characteristic peaks occurring at least 60 ms after a stimulus. Later components of the LAEP are commonly observed under different experimental conditions, some aspects of which are strongly related to exogenous conditions, and some are more endogenous in nature ([1], Ch. 11-12). AEPs are commonly exploited for a number of diagnostic purposes, for example, the presence of peak V in the ABR can be used as diagnostic of proper cochlear and early auditory pathway function ([1], Ch. 8). The ABR and MLR may also be used for non-behavioral auditory threshold detection and states of wakefulness ([1], Ch. 2), and LAEP has been utilized to diagnose schizophrenia and central auditory processing disorder, among others [1,2]. Some studies 1 F. D. Holt IV is with the Department of Biomedical Engineering, University of Miami, Coral Gables, FL 33214 USA (email: [email protected]) 2 O. ¨ Ozdamar ¨ is professor and chair of the Department of Biomedical Engineering, University of Miami, Coral Gables, FL 33214 USA. (email: [email protected])

978-1-4244-7929-0/14/$26.00 ©2014 IEEE

have utilized independent recordings of the ABR, MLR, and LAEP in order to compare inter-peak magnitudes and intervals between epochs for sex and gender differences [3]. The functional significance of LAEPs are still poorly understood, but the N1 (negative peak, 80-120 ms) is generally thought to be a pre-attentive response within the primary and secondary association cortices, and P2 (positive peak, 140180 ms) may be related to stimulus classification, see [1,4,5] for reviews. Late AEPs can be highly endogenous and less dependent on the physical characteristics of the stimulus than its contents and relationship to its current meaning. Generally, the primary stimuli for studying all components of AEPs have been gated pure tone bursts [1], where the recorded AEPs (for LAEPs in particular) are shown to be sensitive to parameters such as the gating function and duration, tone frequency, inter-stimulus interval (ISI), and intensity. A few studies have utilized noise, complex tones, speech syllabi, swept-tones or chirps on the ABR and MLR [1,2,6]. The effects of these variables on the LAEP have been shown to be non-linear and interactive, which has limited many studies to assessing a single variable [4]. Typical reports of the LAEP utilize ISIs of between 0.5 and 10 seconds (repetition rates of 0.1-2.0 Hz) ([1], Ch. 11). Among the reasons for a lack of studies for LAEPs at higher repetition rates is partly due to either an assumption that late potentials are adapted out prior to rates at which overlapping of adjacent stimuli begins to occur, or due to an inability to unwrap overlapping responses. Several studies have used subtractive methods to trains of two or more stimuli to observe the effects of stimulus duration, onset and/or offset ramping functions, ISI, intensity or adaptation of the stimulus [1,7]. The Continuous Loop Averaging Deconvolution (CLAD) method utilizes designed sequences with non-uniform ISIs, known as stimulus onsetasynchrony (SOA), to recover an estimate of the transient response (to a single stimulus) from recorded overlapping responses [8,9]. This method will allow us to observe late regions of the AEP when stimulating at rates faster than has been typical to LAEP studies. However, LAEPs have been difficult to study due to large inter-subject variability, long testing durations, and lack of substantial normative data [10]. This study aims to demonstrate a method for acquiring a whole-AEP recording that exhibit characteristic waves of the ABR, MLR, and LAEP within a single recording. This is accomplished by acquiring AEP recordings at low (non-overlapping) and high rates (overlapping) and utilizing deconvolution to unwrap the overlapping components of the responses. Such a method

1481

Fig. 1. Stimulus sequences designed for this study and their characteristics. The left column displays the stimulus presentation trigger raster for sequences with mean rates of 5.1 Hz (top), 6.8 Hz (middle), and 10.3 Hz (bottom) are shown with black stems. An isochronic stimulation sequence in red shown for comparison. The middle column shows the instantaneous rate histogram for each sequence, with a red line indicating the mean rate. The right column shows the deconvolution noise-amplification profile for each CLAD sequence, with NAF=1 shown as a red line (see text for details).

also allows us to retain a broadband recording of the entire AEP waveform, which may facilitate the study of interrelation between components on a per-subject basis. II. METHODS A. Subjects Data were acquired from 7 young volunteer subjects (6 male, ages 20-29), with mean age of 23. All subjects were assessed to have normal hearing thresholds (thresholds ≤25 dB at .25, .5, 1, 2, 4 and 8 kHz) and no history of neurological problems. All experiments and procedures were conducted in accordance within an IRB-approved protocol.

C. Stimuli The stimulus chosen for this study is a short duration swept tone or chirp specially designed to compensate for the basilar membrane forward-delay [11]. Such chirps have been shown to elicit larger steady-state responses (ASSRs), ABR and MLR component waves [6,11,12]. The chirps were generated using MATLAB (Natick, MA, USA) for an intensity of 50 dB HL over a range of 0.2-8 kHz, and calibrated using peak-equivalent reference equivalent threshold SPL (peRETSPL) in a custom 2cc coupler made of acrylic. The electroacoustic transfer function of the ER3A was mathematically compensated for in order to be acoustically flat.

B. Recording

D. Averaging, Filtering and Deconvolution

A 2-channel bioamplifier system (Intelligent Hearing Systems, Miami, FL, USA) was used to acquire continuous EEG data. Electrodes were placed on the subjects scalp and mastoids (Ch. 1: Cz −A2 and Ch. 2: Cz −A1 ), with center of the forehead as ground. Continuous EEG data were acquired at a rate of 5000 samples/sec., with a gain of 100,000× and band-pass filtered from 1-1500 Hz (6 dB/oct) prior to analogto-digital conversion, and stored for offline analysis. Only ipsilaterally collected (Ch. 1) data will be reported in this study. Stimuli were presented using insert earphones (ER3A with 0.89 ms acoustic delay line, Etymotic Research, Elk Grove Village, IL) to the subject’s right ear, and in alternating polarity. Testing was done in an acoustically and electrically treated booth with the subject lying on a bed comfortably and watching a silent movie with captions.

AEP recordings were obtained for several isochronic rates at 0.49, 0.98, 1.9, 4.9, 7.2, and 9.8 Hz (ISIs between 0.1 s and 2.0 s), as well as 3 CLAD optimized sequences with mean rates of 5.1, 6.8, and 10.3 Hz (mean SOA of 0.1, 0.15, and 0.2 s). Figure 1 shows the CLAD sequences designed for this study. The left column shows a stem plot for each of the stimulus triggers in black. For comparison, an isochronic stimulation pattern is shown with smaller red stems for the corresponding isochronic rates. The middle column shows the instantaneous rate histogram for the CLAD sequences in the left column, where the mean rate is indicated by a thin red line. These histograms show the rate dispersion associated with the sequence jitter required for the CLAD method. The associated deconvolution filter characteristics are shown in the right column, and are expressed in terms of

TABLE I S TIMULUS SEQUENCE PARAMETERS USED IN THIS STUDY. Mean Rate, Hz 5.1 6.8 10.3

Stimulus onset − asynchrony, ms 202, 221, 161, 248, 170, 175 182, 143, 161, 125, 169, 120, 135 114, 92, 79, 84, 79, 128, 88, 128, 101, 79, 97

No. of stimuli 6 7

SOA min 161 119

max 245 182

Mean NAF 0.56 0.52

Duration 1 sweep, sec. 1.18 1.03

No. of sweeps 450 450

Duration recording, min. 8.8 7.8

11

79

128

0.46

1.07

450

8.0

1482

a noise amplification factor (NAF) profile that is unique to each sequence. The NAF essentially describes the ability of the deconvolution process to attenuate (NAF1) noise present in the convoluted signal on a per-bin basis in the frequency domain [9]. A NAF of 2 denotes a 6 dB increase in noise. CLAD sequences selected for this study have been optimized for frequencies between 2.5 Hz and 300 Hz. Isolated points where NAF>1 can be combatted with judiciously chosen digital filters, if necessary. Table 1 provides additional details pertaining to the CLAD stimulation sequences used in this study. For each CLAD sequence, a list of the consecutive SOA comprising one sweep is provided, as well as the duration and number of sweeps acquired for each rate. The particular recording parameters, rates, and sequences were chosen as a compromise between recording duration and SNR. Averaging was performed off-line with a rejection threshold of ±50 µV to eliminate noisy recording artifacts, and deconvolution was performed in the frequency domain according to [9], when necessary. E. Analysis 1) Comparing to normative data: In order to compare the recorded LAEP data with established normative data (e.g.: Sussman et al. [13]), data from each subject for the iso-chronic rates of 0.49, 0.98, and 1.9 Hz were band-pass filtered between 1-30 Hz with a 2nd order, zero-phase Butterworth filter to remove high-frequency information. Then, the presence of, latency, and amplitude of characteristic peaks for LAEPs were tabulated or marked as indiscernible if they were not apparent. 2) Comparing isochronic and CLAD recordings: Isochronic data recorded at 4.9, 7.2, and 9.8 Hz, and the deconvolved CLAD recordings with rates of 5.1, 6.8, and 10.3 Hz were low-pass filtered at 300 Hz with a 2nd order zero-phase Butterworth filter in order to match the optimized deconvolution filter bandwidth (see right column of Fig. 1). For the purposes of this paper, the similarity between isochronic and deconvolved recordings at comparable rates (e.g.: 5.1 Hz CLAD and 4.9 Hz isochronic) is reduced to a simple R value representing the zeroth-lag normalized covariance function. III. RESULTS Figure 2 shows an example of an isochronic stimulated sequence at 0.98 Hz for subject S05 with detectable peaks

Fig. 2. Digitally filtered vs. raw EEG averaged waveforms for subject S05 at 0.98 Hz. The two traces represent the same 67 sweeps of recorded EEG (70 sec. recording duration). The upper trace has been band-pass filtered from 1-30 Hz (see text), while the lower trace has only the analog acquisition filter (1-1500 Hz).

labeled. The top trace is the output of applying a 1-30 Hz band-pass filter to the data represented in the bottom trace. The character of the latter portion of the MLR and LAEP is not diminished, however, peaks V and Pa are abolished or diminished significantly as a result of filtering, and the inter-peak amplitudes prior to P1 are distorted. Figure 3 shows responses recorded for the CLAD sequences (upper three traces) and isochronic recordings (lower three traces), each labeled according to mean rate of stimulation to the left. The left panel of Fig. 3 shows responses from subject S01 (with peak scoring indicated by labels or arrows), and the right panel shows the population averaged data (N =7). All data has been low-pass filtered at 300 Hz. Table 2 presents a partial summary of the mean and standard deviation for the latency of the prominent AEP peaks: V , Na , P1 , N1 , and P2 as well as an average %detected for the each peak for the 0.49, 0.98, and 1.98 Hz isochronic recordings, and the 5.1, 6.8, and 10.3 Hz CLAD recordings. The primary characteristic peaks for the ABR (V ), MLR (Na , Pa ), and at least the N1 of the LAEP are consistently identifiable in all averaged recordings. Initial comparisons between deconvolved CLAD and similar-rate isochronic recordings show high degree of correlation (Rvalues ≥0.67), but are suggestive of multiple-wrapping (data not shown), not resolvable with current methods. We found the peak amplitude and latencies for isochronic and CLAD recordings were consistent with established literature for peak V , and MLR components [1]. For lower rate 0.49 and 0.98 Hz recordings, MLR and LAEP components

TABLE II PARTIAL PEAK LATENCY DATA AVERAGED OVER THE POPULATION (N=7). * DENOTES CLAD SEQUENCE . Rate, Hz 10.3∗ 9.8 7.2 6.8∗ 5.1∗ 4.9 1.98 0.98 0.49

SNR, dB 4.9 7.3 5.5 3.5 5.8 12.7 9.5 7.1 4.5

R, Corr. coef 0.82 0.67 0.89 − − −

V 14.7 14.9 14.7 14.2 14.7 13.9 14.8 14.8 14.2

σ 0.7 0.5 0.4 0.9 0.5 2.1 0.5 0.5 0.4

% 100 100 100 100 100 100 100 100 100

Na 27.3 27.7 28 28 27 26.8 26.7 26 26

σ 1.3 1.8 1.4 2.1 1.7 1.6 3.8 3 4

1483

% 100 100 100 100 86 100 100 86 57

P1 63.8 60.9 65.9 61.6 66.4 64.6 71.5 71.7 69

σ 4.5 2.4 6.5 7.2 9.6 4.1 8 7.3 8.4

% 71 57 86 86 71 57 86 100 100

N1 90.8 − 96 91.6 97.9 94.4 105 108 108

σ 15.9 − 20.6 15.1 13.2 19.5 17.3 17.6 17.4

% 71 − 43 71 57 43 86 71 100

P2 111 − − 106 121 131 155 158 165

σ 12.8 − − 11.2 7.2 25.4 23.3 21.5 18.5

% 71 − − 71 71 43 86 71 100

Fig. 3. Results from whole-AEP recordings, arranged according to mean stimulation rate (left of figure). The left panel shows results from subject S01, and the right panel shows the population average. All data subjected to a 300 Hz low-pass filter. Note differing amplitude scales in the left and right panels.

of Na , P1 , N1 , P2 , and N2 are also consistent with published normative data acquired from non-attending adults [5,13]. As rate increases, there is a clear trend for the N1 amplitude to decrease, and a shallow negativity persists at the same latency in the 5.1 Hz and 10.3 Hz CLAD traces. For rates above 5.1 Hz, the deconvolution procedure reveals a region of the AEP that would otherwise be lost to overlap with the subsequent stimulus. Thus, it is possible to observe that the morphology of the middle and late-latency region of the AEP is considerably modified at higher rates. For example, the negativity occurring at about 250 ms at 0.49 Hz becomes a broad positivity by the rate of 10.3 Hz. It is apparent that the brain continues to process auditory stimuli well into the late-latency epoch when stimulated at these high rates. However, the mechanisms responsible for this morphology transition are unclear. IV. DISCUSSION AND CONCLUSION This study has been a demonstration of acquiring wholeAEP recordings at high-rates of stimulation and increased bandwidth. Results suggest that high quality recordings can be obtained that contains characteristic ABR, MLR, and LAEP peaks within a single recording. Additionally, rates higher than established LAEP literature appear to exhibit rate-dependence, and merits further investigation. The cochlear delay compensated chirp stimulus appears to be capable of eliciting conventional AEP components. However, it was not within the scope of this study to differentiate between AEP morphology elicited by a chirp vs. gated pure-tone stimuli. LAEP components are known to be especially sensitive to ISI (and SOA) [1,7], therefore additional work on the effects of jittered sequence stimulation is needed to investigate response fidelity. Also, the bandwidth limit imposed by the deconvolution filter was chosen somewhat arbitrarily, and may be adapted with additional sequence design. However, a compromise must be made between stimulation rate, recording duration and fidelity of the earlier AEP components. Acquiring high-fidelity whole-epoch multi-component recordings at high rates is possible using deconvolution. The

advantage of this method is that it may provide a reasonable compromise between early-AEP fidelity and potential diagnostic value of LAEP recordings in situations where recording duration or subject compliance is limited. ACKNOWLEDGMENT The authors would like to thank undergraduate research assistants Alise Senderak and Diego Vidal for their contributions to data acquisition and analysis. R EFERENCES [1] T. W. Picton, Human Auditory Evoked Potentials. Plural Publ., 2011. [2] J. Katz, R. Burkard, and L. Medwetsky, Clinical Audiology. Williams & Wilkins, 2002. [3] D. McFadden, M. D. Hsieh, A. Garcia-Sierra, and C. A. Champlin, “Differences by sex, ear, and sexual orientation in the time intervals between successive peaks in auditory evoked potentials.” Hear Res, vol. 270, pp. 56–64, 2010. [4] K. E. Crowley and I. M. Colrain, “A review of the evidence for P2 being an independent component process: age, sleep and modality,” Clin. Neurophysiol., vol. 115, pp. 732 – 744, 2004. [5] R. Näa¨ tänen, Attention and Brain Function. Lawrence Erlbaum Assoc., Hillside, NJ, USA, 1992. [6] T. Dau, O. Wegner, V. Mellert, and B. Kollmeier, “Auditory brainstem responses with optimized chirp signals compensating basilarmembrane dispersion,” J. Acoust. Soc. Am., vol. 107, pp. 1530–1540, 2000. [7] T. W. Budd and P. T. Michie, “Facilitation of the N1 peak of the auditory ERP at short stimulus intervals.” Neuroreport, vol. 5, pp. 2513–2516, 1994. ¨ Ozdamar, ¨ [8] R. E. Delgado and O. “Deconvolution of evoked responses obtained at high stimulus rates.” J. Acoust. Soc. Am., vol. 115, pp. 1242–1251, 2004. ¨ Ozdamar ¨ [9] O. and J. Bohórquez, “Signal-to-noise ratio and frequency analysis of continuous loop averaging deconvolution (CLAD) of overlapping evoked potentials.” J. Acoust. Soc. Am., vol. 119, pp. 429– 438, 2006. [10] Picton, “Hearing in time: Evoked potential studies of temporal processing.” Ear and Hear., vol. 34, pp. 385–401, 2013. [11] C. Elberling and M. Don, “A direct approach for the design of chirp stimuli used for the recording of auditory brainstem responses,” J. Acoust. Soc. Am., vol. 128, pp. 2955–2964, 2010. [12] C. Elberling, M. Don, M. Cebulla, and E. Sturzebecher, “Auditory steady-state responses to chirp stimuli based on cochlear traveling wave delay,” J. Acoust. Soc. Am., vol. 122, pp. 2772–2785, 2007. [13] E. Sussman, M. Steinschneider, V. Gumenyuk, J. Grushko, and K. Lawson, “The maturation of human evoked brain potentials to sounds presented at different stimulus rates,” Hear. Res., vol. 236, pp. 61 – 79, 2008.

1484

Preliminary report of real simultaneous record of auditory early, middle-latency and late potentials in man.

Middle latency auditory evoked potentials in temporal lobe disorders.

Spatial auditory regularity encoding and prediction: Human middle-latency and long-latency auditory evoked potentials.

Early development of brainstem auditory evoked potentials in Down's syndrome.

Audiometric comparison of the middle and late components of the adult auditory evoked potentials awake and asleep.

Development of auditory evoked potentials.

[Auditory evoked potentials: basics and clinical applications].

Auditory evoked potentials and transcendental meditation.

Diagnostic specificity and familiality of early versus late evoked potentials to auditory paired stimuli across the schizophrenia-bipolar psychosis spectrum.

Pattern analysis of auditory-evoked EEG potentials.

Lateralization and Binaural Interaction of Middle-Latency and Late-Brainstem Components of the Auditory Evoked Response.

Early evoked potentials.

Auditory evoked potentials in anxiety disorder.

The human auditory steady-state evoked potentials.

Auditory evoked potentials in panic disorder.

Long Latency Auditory Evoked Potentials during Meditation.

Brainstem auditory evoked potentials in veterinary medicine.

Auditory acclimatization and hearing aids: late auditory evoked potentials and speech recognition following unilateral and bilateral amplification.

The functional anatomy of middle-latency auditory evoked potentials: thalamocortical connections.

Effects of postinatal stress on visual and auditory evoked potentials.

[The latency behavior of early auditory evoked potentials in high frequency hearing loss].

Temporary threshold shift, loudness, and auditory evoked potentials.

A beamformer for the acquisition of evoked potentials.

Early and late lower limb motor evoked potentials elicited by transcranial magnetic motor cortex stimulation.