Sept 2020 The Hearing Journal
Existing guidelines for hearing aid (HA) fitting predominantly focus on frequency-specific audibility compensation for hearing loss. Current HA fitting practices for advanced features such as compression speeds, noise reduction, and frequency lowering rely on manufacturer defaults, individual expertise, or subjective patient feedback. These HA features may interact with each other as well as with the listener's environment, resulting in cumulative changes to the fidelity of the signal (e.g., speech envelope distortions) beyond the expected improvements in audibility. Without established guidelines for selecting and fine-tuning these features, there may be a risk of suboptimal HA outcomes, especially for hard-of-hearing listeners who are disadvantaged (e.g., older adults, poorer cognitive abilities) by more advanced settings of these features (e.g., faster compression speeds or greater noise reduction), particularly in challenging situations such as high background noise. Other than the Speech Intelligibility Index, which is a measure of audibility available on some clinical HA verification equipment, no objective metric is readily accessible to audiologists to measure overall HA signal fidelity. To facilitate evidence-based HA fitting, beyond achieving adequate gain, audiologists need tools to capture the acoustic effects of combined HA feature adjustments in the context of the HA user's listening environment. Moreover, for a truly individualized fitting, these tools should account for individual ear acoustics and vent effects that, in turn, influence the effectiveness of HA processing.
Figure 1: Air conduction thresholds (panel A) and match-to-prescriptive targets (panels B-E) for the right ears of the study participants (S1-S4)
Recent work has shown that it is possible to quantify signal fidelity from clinically fit HAs using cepstral correlation, which measures changes to the speech envelope of a processed signal and is a reliable predictor of speech intelligibility and quality across several forms of HA processing. Using this metric, Rallapalli, et al., measured signal fidelity across 96 clinically prescribed adult HA fittings by recording HA output in response to different levels of speech and background noise on an acoustic manikin. The study also demonstrated the advantages of an acoustic manikin over a test box to capture the open-ear effects and vent acoustics associated with commonly prescribed HAs, including receiver-in-the-canal type.
Although an acoustic manikin is representative of the average human ear, it may under- or overestimate the ear canal and pinna acoustics for some individuals. Moreover, an acoustic manikin may not be a readily accessible and cost-effective option in the clinic. An ideal clinical tool is one that can measure the acoustic effects of HA feature adjustments directly in a patient's ears. It is also important that such a tool is convenient and time-efficient to be successfully implemented in the clinic. This study takes a step in this direction and proposes the use of a commonly available clinical equipment, Audioscan Verifit 2 (VF-2), to overcome some of these issues. This study aims to determine the feasibility of measuring HA signal fidelity at user settings of recordings obtained from individual ears, with the VF-2.
STUDY METHODS Participants & Hearing Aids: Participants were four adults (69-80 years old; three men) with bilateral sensorineural hearing loss. Figure 1 (A) shows air conduction thresholds for the test ears. None of the participants reported having otologic problems or occluding cerumen at the time of testing. The participants’ right HAs were used in the study. Three participants wore receiver-in-the-canal HAs, two with open domes, one with a closed dome, and one participant wore a completely-in-the-canal HA. Before the recording, fresh batteries were placed in the HAs, and a listening check was performed. Real-ear aid responses (REAR) were measured at 55, 65, and 75 dB SPL input levels against NAL-NL2 prescriptive targets using a male talker signal (“carrot passage”) from the VF-2. Figure 1 (B-E) shows the match-to-targets. No adjustments were made to the participant's HAs because the study focused on determining the feasibility of the recording methodology at user settings. The match-to-NAL-NL2 targets in this study were consistent with other reports of real-world clinical fittings. All participants completed an informed consent process approved by the Institutional Review Board at Northwestern University.
Stimuli & Listening Conditions: Similar to Rallapalli, et al., stimuli were two sentences, spoken by one male and one female talker. To capture the HA response at realistic background noise levels for hard-of-hearing listeners, the researchers mixed sentences with six-talker babble at three signal-to-noise ratios (SNRs; 5 dB, 10 dB, and quiet). To capture the effects of soft, average, and loud speech, noisy and quiet stimuli were presented at three input levels (55 dB, 65 dB, 75 dB SPL), measured at two feet from the VF-2 loudspeaker.
Figure 2: Schematic of the recording setup
Figure 3: Boxplots illustrating measured signal fidelity (y-axis; 1 = maximum) between the present study (VF-2) and a previous study (manikin) as a function of input level (x-axis). Each panel represents a different SNR
Recording Setup: Recordings were completed in a double-walled sound-treated booth (Fig. 2), representative of a typical clinical environment. The calibrated probe microphone was placed in the right ear at depths (from inter-tragal notch) of 28 mm and 30 mm for female and male participants, respectively. Stimuli were presented using the USB connection of the VF-2, with each sound file consisting of a single repetition of the stimulus at each SNR. The stimuli and the corresponding input level were selected using the dropdown menu in the on-ear Speechmap mode. The output from the probe microphones was routed (via VF-2’s headphone monitoring jack) through a digital-to-analog converter (M-Audio 8) and saved in an external PC. The researcher was seated outside the booth and used a custom Matlab program to capture 90-second recordings (about six repetitions) in each listening condition (three SNRs X three input levels). The repetitions were later averaged to obtain the final recording that was subsequently used to calculate the signal fidelity in each condition, as described below.
Procedure: Participants were instructed according to standard real-ear measurement procedures. They were also instructed to minimize movements for the duration of the recording. First, unaided recordings were obtained in quiet at each input level to serve as an individual participant's reference signal for the metric. This would ideally allow us to account for individual ear acoustics. Next, aided recordings were obtained at user settings for all listening conditions. The researcher monitored the recording process and inspected each recorded waveform. If a given recording was deemed noisy (e.g., sneeze or sudden movement), it was re-recorded (and the participant reinstructed as needed). The total duration of the recording was approximately 18 minutes per participant: 4.5 minutes (90 s X three levels; unaided) + 13.5 minutes (90 s X three levels X three SNRs; aided). The entire session, including rest breaks, lasted about one hour.
Signal Fidelity Metric: Cepstral correlation from the Hearing Aid Speech Quality Index (HASQI) was used to calculate signal fidelity by comparing short-term envelope modulations of the reference signal with the signal processed through the HA. The reference signal was the recording for speech in quiet obtained at each input level. Both the reference and HA processed signals were passed through a peripheral-impaired auditory model that estimated loss of audibility and broadened auditory filters. NAL-R linear gain was applied to the reference signal. As such, the metric captures cumulative signal fidelity as a result of HA processing and the listening condition (background noise and input level), after accounting for individual hearing impairment and HA gain. Any condition that distorts the speech envelope will result in a mismatch between the processed and reference signals, thereby decreasing the cepstral correlation/signal fidelity. Cepstral correlation values range from 0 (no match between envelope modulations of processed and reference signals; least signal fidelity) to 1 (perfect match between the signals; maximum signal fidelity). Full details regarding the metric are available in Kates and Arehart.
STUDY RESULTS Comparison of Recording Methods: Manikin v. Individual Ears: First, a subset of four participants with matched audiometric thresholds (within +/- 10 dB between 0.25-6 kHz) was selected from the Rallapalli, et al.,6 study to determine whether the metric would be affected by recording method (individual ears+VF-2 vs. manikin). A one-way ANOVA showed no significant effect of the recording methodology on the measured signal fidelity (F[1,70]=1.30, p=0.258). This result suggested that signal fidelity could be measured reliably on HA recordings obtained directly from the patient's ears using the VF-2.
Signal Fidelity Across Listening Conditions: The distribution of signal fidelity across SNRs and input levels is shown in Figure 3. Data from the study by Rallapalli, et al., (manikin) are shown for comparison. As expected, signal fidelity decreased as SNR decreased from quiet to 5 dB and as input level increased from 55 dB to 75 dB SPL. The following analysis pertains to signal fidelity measured on individual ears in the present study. A statistical model (linear mixed-effects) showed significant main effects of SNR (F[2,24]=10.53, p<0.001) and input level (F[2,24]=24.13, p<0.001) on signal fidelity. Because the interaction between SNR and input level was also significant (F[4,24]=3.31, p=0.027), paired t-tests were conducted to determine the effect of input level at each SNR. After correcting for multiple comparisons,20 results showed that increasing the input level resulted in decreasing signal fidelity only when noise was present. For example, at 10 dB SNR, increasing the input level from soft (55 dB) to loud (75 dB SPL) resulted in a decrease in signal fidelity by 0.126 (p<0.001). The effect was greater at 5 dB SNR, such that the same increase in input level resulted in a decrease in signal fidelity by 0.217 (p<0.001). This worsening of signal fidelity is likely a result of the HA compressor engaging to a greater extent at high speech and noise levels, causing further distortion of the processed speech signal (envelope) compared with engagement at softer levels. On the other hand, no significant effect of input level was found on signal fidelity in quiet (p>0.05). The overall pattern of results indicates that the methods used in this study captured the behavior of HA processing in the context of clinically relevant speech and noise conditions.
DISCUSSION This study established the feasibility of measuring HA signal fidelity under clinical conditions. Consistent with previous work using an acoustic manikin, the methods presented in this study captured the pattern of HA response (at user settings) across a realistic range of speech and background noise levels. Although the present study only included a small number of participants and different HAs and HA parameters, the signal fidelity from individual ears in this study was generally comparable to the signal fidelity range noted by Rallapalli, et al.,using the acoustic manikin (Fig. 3). Thus, the new recording method using easily accessible clinical equipment, such as the VF-2, enables reliable measurement of combined acoustic effects of HA processing and listening conditions while allowing for individual ear acoustics and vent effects. While this study focused on measuring HA signal fidelity using the cepstral correlation metric, the recording methods introduced here can also be used in other types of analyses.
Several steps are being taken to make the recording methods and signal fidelity metric more clinic-friendly. Work is also underway to reduce potential sources of noise during recording to ensure sufficient fidelity of the recorded signal so it can be used not only in a sound-treated booth but also in a relatively noisier fitting room. The ultimate goal is to provide audiologists with an objective tool for guiding HA adjustments.