Comparative Analysis of HRTFs Measurement Using In-Ear Microphones

Bruschi, Valeria; Terenzi, Alessandro; Dourou, Nefeli A.; Spinsante, Susanna; Cecchi, Stefania

doi:10.3390/s23136016

Open AccessArticle

Comparative Analysis of HRTFs Measurement Using In-Ear Microphones

by

Valeria Bruschi

,

Alessandro Terenzi

,

Nefeli A. Dourou

,

Susanna Spinsante

and

Stefania Cecchi

^*

Department of Information Engineering, Università Politecnica delle Marche, 60131 Ancona, Italy

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(13), 6016; https://doi.org/10.3390/s23136016

Submission received: 17 May 2023 / Revised: 13 June 2023 / Accepted: 27 June 2023 / Published: 29 June 2023

(This article belongs to the Collection Advanced Techniques for Acquisition and Sensing)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The head-related transfer functions (HRTFs) describe the acoustic path transfer functions between sound sources in the free-field and the listener’s ear canal. They enable the evaluation of the sound perception of a human being and the creation of immersive virtual acoustic environments that can be reproduced over headphones or loudspeakers. HRTFs are strongly individual and they can be measured by in-ear microphones worn by real subjects. However, standardized HRTFs can also be measured using artificial head simulators which standardize the body dimensions. In this paper, a comparative analysis of HRTF measurement using in-ear microphones is presented. The results obtained with in-ear microphones are compared with the HRTFs measured with a standard head and torso simulator, investigating different positions of the microphones and of the sound source and employing two different types of microphones. Finally, the HRTFs of five real subjects are measured and compared with the ones measured by the microphones in the ear of a standard mannequin.

Keywords:

1. Introduction

The head-related transfer functions (HRTFs) are mathematical functions that represent the acoustic path between a sound source and the ears of a listener in the frequency domain. These functions can be expressed also in the time domain under the name of head-related impulse responses (HRIRs). Humans can perceive the direction and the distance of a sound source by evaluating the differences in sound between the ears. These differences are represented by the localization cues, such as interaural time difference (ITD), interaural level difference (ILD), and spectral cues, that are contained in the HRTFs [1]. For this reason, HRTFs are essential to understand the human being sound perception and they are used for binaural reproduction to enrich the acoustic signals with directional cues. Spatial audio systems can be obtained by binaural recordings or by processing the recorded signal with HRTFs [2]. In the second case, the binaural synthesis is achieved by the real-time convolution of the input signal with the respective HRIRs. Moreover, the increasing interest in deep learning has recently paved the way for machine learning (ML) methods for spatial audio processing [3]. ML algorithms can be applied for HRTF individualization to enhance the binaural rendering [4].

The HRTFs are strongly individual because they depends on the shape of head, pinnae, and torso that are different for each human being. However, mannequins with standardized dimensions can be used to generate standardized HRTFs [5]. Standard head and torso simulators are widely used in the literature to create HRTFs databases [6] and to investigate measurement limits, such as the directional resolution [7,8] and the distance between the sound source and the head [9,10]. However, there are individual differences in HRTFs that cannot be identified by the standardization. This problem can be solved by using in-ear microphones fitted to real subjects in order to measure individual (or personalized) HRTFs [11,12,13,14,15,16,17].

Several perceptual studies have aimed at evaluating how the use of individual HRTFs enables a high fidelity of the binaural rendering [18,19,20], resulting in contrasting conclusions. In fact, the analysis conducted in [18] shows that the personalized HRTFs are not always the preferred ones, while the studies presented in [19,20] prove that the source localization improves when individual HRTFs are involved. Although the use of individual HRTFs may represent the best solution in binaural reproduction, the HRTFs measurement could be affected by several errors. In [11], the effect of head movements during the HRTFs measurement has been evaluated, proving that the pitch movement, i.e., up or down movement of the head, is the largest among roll, pitch, and yaw, and could cause spectral differences of the HRTFs up to 6 dB. To manage this drawback, a neck support could be applied to the listener under measurement [12], or databases of individual HRTFs may be created including information about the azimuth and the elevation of the sound source and the orientation of the listener’s head, captured by a tracking system [13,14].

Another important aspect of individual HRTFs measurements is the microphone position [15,21]. Actually, the sound pressure distribution along the ear canal is non-uniform [22,23,24,25,26], so different microphone positions produce different measurement results that are not directly comparable. Depending on the type of study, the microphone could be placed at the entrance of the ear canal [5,27,28], inside the ear canal [22,29] or close to the eardrum [23,30,31]. Several researches have proven the necessity to record the signal near the eardrum, because in this way the HRTF measurements include most of the localization cues and the ear canal resonance [32,33,34]. However, positioning miniature microphones in a fixed and replicable position inside the ear canal could be very difficult. In [32], the ear canal is modeled as a one-dimensional transmission line, considering it as the direction-dependent part. Moreover, the psychoacoustic analyses carried out in [33] have proven that the sound direction depends only on the acoustic path between the sound source and the ear canal entrance. In [33], the acoustic transfer path from the sound source and the eardrum is divided into three parts, one direction-dependent part (from the source to the ear canal entrance), two direction-independent parts (from the blocked to open entrance of the ear canal), and the transmission along the ear canal.

In this context, this paper presents a new comparative analysis of two different in-ear miniature microphone systems positioned inside the ear canal for the HRTF measurement. Taking into account different positions of a sound source (i.e., varying the azimuth and the elevation), two analyses have been carried out. A comparison between the HRTFs obtained with the proposed systems and the HRTFs measured by a standard binaural mannequin is performed first, then an analysis of several HRTFs measured considering real subjects is reported. The former is performed to study the performance of the acquisition systems and the effect of the microphones positions with respect to the ear canal. The latter is performed to evaluate the performance of these two different systems considering real subject ear’s shape. All these analyses have been performed in terms of objective evaluations.

The paper is organized as follows. Section 2 offers an overview of the most used measurement algorithms for the impulse response acquisition. Section 3 widely explains the hardware used for the experiments. Section 4 shows the experimental results and a detailed analysis of the measurements. Finally, Section 5 reports the conclusions.

2. Measurement Techniques

The head-related impulse responses (HRIRs) can be measured using several approaches that can be divided into two groups, deconvolution methods and adaptive filtering techniques [21]. The deconvolution methods are the most popular and can be, in turn, classified depending on the input signals, as pseudo random sequences [17,35,36,37,38,39,40,41,42,43] or sweep signals [44,45,46,47,48]. The pseudo random sequences include the maximum length sequence (MLS) [36,37,38,39,40], the inverse repeated sequence (IRS) [41,42] and the Golay codes [43]. In [49], a comparison of the most used deconvolution methods can be found, and in [50], the MLS method is compared with the sweep. The other measurement approach is based on the adaptive filtering technique, employed in HRTFs measurements for the first time in [51] and then applied also in [10,52]. In most of the cases, the adaptation procedure is performed using the normalized least mean square (NLMS) algorithm, thanks to its simplicity and high performance [53].

HRTF measurements could be affected by several problems, such as non-linear distortions of the electro-acoustic systems, environmental noises, reflections from the environments, sound source characteristics, and temperature variations [21,54]. The measurement inside a controlled environment (e.g., anechoic chamber) can solve the problems derived by the environment, while non-linear distortions can be avoided by choosing the appropriate procedure and stimuli [55,56,57]. In [58], perfect periodic sequences (PPSs) and orthogonal periodic sequences (OPSs) are applied for HRTFs measurement in a real car environment, proving robustness towards non-linearities. In particular, PPSs are periodic sequences that require the perfect orthogonality of the basis functions over a period, so they can be used for the identification of Legendre non-linear (LN) filters [56,59] or Wiener non-linear (WN) filters [60,61]. Similarly, the OPSs is a periodic sequence that can identify functional link polynomial (FLiP) filters, i.e., a wide class of non-linear filters that includes LN and WN filters [62,63,64].

For the comparative analysis proposed in this paper, the measurements have been carried out in a semi-anechoic environment with professional equipment and low levels of input signals to reduce possible distortions introduced by the system. The impulse responses have been measured using the sweep signal. Further details on the hardware setup and the acquisition chain are reported in the next section.

3. Hardware Setup

Focusing on in-ear microphones, the sensors used must be as small as possible, indeed this is important for two main reasons; first, since the sensor is placed inside the ear canal, a small device could be installed without being too much annoying for the subject, then, a small form factor is also important in order to minimize any modification in the ear form which can degrade the quality of the measured responses. Another important aspect of the microphone is its frequency response, which should be as flat as possible at least until 10 kHz. For this analysis, the HRTF measurements have been carried out with two different microphones, i.e., the Knowles FG-23329-D65 and the Sennheiser MKE2-EW Gold. Figure 1a,b shows photos of the microphones and Figure 2a,b show their frequency responses. The Knowles FG-23329-D65 [65] is an electret condenser omnidirectional microphone. Its dimensions are about a few millimeters in diameter, as shown in Figure 1a, enabling an easy placement inside the ear canal. The Knowles microphone has a very flat frequency response in a reasonable band between 100 Hz and 10 kHz, as visible in Figure 2a. It has also a very limited power consumption (50 μA), so just two AA batteries are needed to power the microphone, avoiding noise problems. The Sennheiser MKE2-EW [66] Gold is a condenser Lavalier omnidirectional microphone. It features a wide frequency range, from 20 Hz to 20 kHz, as reported in Figure 2b, and an almost flat frequency response below 5 kHz. Figure 1b shows the microphone with its power supply/signal conditioner Sennheiser MZA-900P. The microphone is powered by a 48 V phantom line and generates a low-impedance balanced output. In comparison with the Knowles, the Sennheiser features a slightly bigger capsule with a thicker wire which makes the placement more difficult, on the other side the Sennheiser has a wider frequency response and more robust construction and it can be easily powered by any modern soundcard. On the other hand, the Knowles microphone price is one order of magnitude lower than Sennheiser microphone.

The HRTFs measured with the two microphones are compared with the ones measured with the Brüel & Kjær head and torso simulator (HATS) Type 4128C, i.e., a binaural mannequin used as a reference and shown in Figure 1c. The frequency response of the mannequin is reported in Figure 2c. In this case, the magnitude response is not flat due to the effect of the ear of the dummy head. For the measurements with the B&K simulator, the mannequin is connected to its power supply B&K PS 2829. Moreover, the microphones and a Genelec 8020 A are connected to the Scarlett Focusrite 2i2 soundcard, managed by a computer that uses the NU-Tech software (version 2.0) [68] for the acquisitions. The frequency response of the Genelec loudspeaker is declared flat (

\pm 2.5

dB) in the frequency range of 66 Hz–20 kHz by the manufacturer. The measurements have been carried out inside a semi-anechoic chamber and taking into account only the left ear. The scheme of the acquisition chain used for HRTF measurements is shown in Figure 3. A photo of the experimental setup is shown in Figure 4.

The non-linearities that the measurement system may introduce have been evaluated by calculating the total harmonic distortion (THD) on a tone at 1 kHz. The THD is defined as the percentage ratio between the power of all harmonics and the power of the fundamental frequency. The THD values obtained with the three microphones are reported in the following:

For the Brüel & Kjær HATS simulator, $THD = 0.9 %$ ;
For the Knowles microphone, $THD = 1.8 %$ ;
For the Sennheiser microphone, $THD = 0.4 %$ .

The highest distortion is obtained with the Knowles microphone and it is lower than

2 %

, so the system can be assumed linear. In addition, the impulse response measurements have been performed using a logarithmic sweep signal thanks to its rejection of the harmonic distortions, as declared in [49]. The sweep used for the experiments has a length of 32,768 samples (i.e., 682.7 ms) and is repeated three times. The sampling frequency is

F_{s} = 48

kHz and the final impulse responses have a length of 4096 samples.

4. Experimental Results

Two types of experiments have been carried out, in particular:

A comparison between the HRTFs measured with the in-ear miniature microphones placed in different points on the B&K mannequin ear canal and the HRTFs measured by the internal microphone of mannequin considering different positions of the sound source (see Figure 5 and Figure 6);
A comparison of individual HRTFs measured on five real subjects with the two in-ear microphones for different positions of the sound source (see Figure 7).

The two microphones have been settled on the left ear of the mannequin and of the subjects by means of a hook fixed on earplugs, as shown in Figure 5 and Figure 7. The measurements carried out with the in-ear microphones have been compared with the ones executed with the mannequin. It must be underlined that the comparison has been performed taking into consideration that the placement of the in-ear microphone occludes the ear canal, which is open when the HATS simulator is used.

For an objective evaluation, the HRTFs measured with the in-ear microphones are evaluated in terms of frequency magnitude response and log-spectral distance (LSD) [58] considering the B&K mannequin as reference. The LSD quantifies the distance between two spectra and, in this case, it is used to evaluate how much the HRTFs measured by the in-ear microphones differ from the ones measured with the mannequin. In particular, the LSD is calculated between the reference HRTF of the dummy ear

H_{HATS} (k)

and the one measured with the in-ear microphone

H_{MIC} (k)

as follows

LSD = \sqrt{\frac{1}{k_{2} - k_{1} + 1} \sum_{k = k_{1}}^{k_{2}} {[10 {log}_{10} \frac{| H_{HATS} (k) |^{2}}{| H_{MIC} (k) |^{2}}]}^{2}},

(1)

where

k_{1}

and

k_{2}

delimit the frequency range within which the LSD is estimated, defined as

B = [k_{1} \frac{F_{s}}{K}, k_{2} \frac{F_{s}}{K}] = [100 Hz, 10 kHz]

, with

K = 4096

the number of frequency bins for the FFT computation, and

F_{s} = 48

kHz the sampling frequency.

4.1. Experiment 1

The first experiment aims at analyzing the differences between the HRTFs measured with the mannequin and the HRTFs measured with the in-ear microphones settled on the dummy ear. The in-ear microphones have been placed in the mannequin ear considering four different positions, as shown in Figure 6c. Figure 5 shows the real microphone placement for position P1. Furthermore, four different positions of the sound source have been taken into account varying the azimuth

ϑ

and the elevation

φ

, as shown in Figure 6a,b, i.e.,

ϑ = 0^{\circ}, 45^{\circ}

, and

φ = 0^{\circ}, 15^{\circ}

.

The aim of this experiment is to investigate how much the microphone position influences the HRTF measurement and the results are shown in Figure 8 for both the Knowles (in the first column) and the Sennheiser (in the second column) microphones, considering the four different positions of the sound source. The LSD values calculated for the first experiment are reported in Table 1. For the source position with

ϑ = 0^{\circ}

and

φ = 0^{\circ}

, the HRTFs measured by the Sennheiser microphone are more similar to the HRTF measured with the dummy head, especially at the low frequencies up to 2 kHz (cf. Figure 8a,b). In fact, the Sennheiser reaches the lowest values of the LSD for the first three positions of the microphone. Moreover, the microphone position with the lowest LSD is P2 for both Knowles and Sennheiser microphones, exhibiting values of 1.4 dB and 1.1 dB, respectively, (cf. Table 1). In this case, also the HRIRs in the time domain measured by the HATS and by the two microphones placed at different points are shown in Figure 9. The comparison among the time-domain impulse responses is more difficult than in the frequency domain and the differences are not so clear. However, in all the HRIRs, can be identified the direct pulse and the first reflection caused by the ear’s pinna, followed by the head and torso reflections up to 7 ms. Finally, the small late reflections above 7 ms are created by the measurement devices and the ground. Regarding the source position with

ϑ = 45^{\circ}

and

φ = 0^{\circ}

, the HRTFs measured with the Sennheiser are closer to the one measured with the mannequin at the low frequencies (cf. Figure 8c,d). However, the differences at higher frequencies produce higher LSD values with the Sennheiser, when the microphone is in positions P2 and P3. In this case, the best position is P3 for the Knowles with an LSD of 1.3 dB, and P1 for the Sennheiser with an LSD of 1.1 dB (cf. Table 1). For the source position with

ϑ = 0^{\circ}

and

φ = 15^{\circ}

, the Knowles microphone introduces a notch around 6 kHz for P1 and P3, and around 7.5 kHz for P2 and P4 (cf. Figure 8e,f), resulting in LSD values lower than the ones obtained with the Sennheiser microphone. The lowest LSD reached by the Knowles microphone is 1.5 dB in positions P2 and P3, while the lowest LSD for the Sennheiser is 0.9 dB in position P2 (cf. Table 1). Finally, for the source position with

ϑ = 45^{\circ}

and

φ = 15^{\circ}

, the HRTFs measured with the Sennheiser produce the best results both in terms of frequency response and LSD values (cf. Figure 8g,h). For the last source position, the lowest LSD with the Knowles microphone is 1.9 dB at point P1, and with the Sennheiser microphone is 1.3 dB at point P3 (cf. Table 1). These results prove that the position of the microphone affects the frequency response only at frequencies higher than 4 kHz and the performance of the Sennheiser microphone reaches the lowest LSD values in comparison with the Knowles microphone in most of the cases.

4.2. Experiment 2

The second experiment involves five real subjects wearing alternatively the two in-ear microphones, as shown in Figure 7. Results of experiment 2 are reported for each subject in Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14, considering two azimuth angles of the sound source, i.e.,

ϑ = 0^{\circ}

and

ϑ = 45^{\circ}

and an elevation of

φ = 0^{\circ}

, as shown in Figure 6a. The measurements on subjects are compared with the HRTFs measured with the same microphone fixed on the dummy ear. In this case, the central position P1 of Figure 6c has been chosen for the acquisitions. As expected, each subject has a different frequency response due to the ear’s shape but a comparison between the two microphones and the dummy ear can be performed. In particular, the Sennheiser microphone exhibits frequency responses more similar to the dummy ear, while the Knowles microphone seems to have slight variations in comparison with the dummy ear. Figure 15 shows the HRIRs in the time domain for the two microphones and for the five real subjects in comparison with the microphones placed on the dummy head, considering the sound source in front of the listener, i.e.,

ϑ = 0^{\circ}

and

φ = 0^{\circ}

. Table 2 reports the LSD values comparing the HRTFs of the real subject with the HRTF measured with the mannequin at position P1. These results objectively confirm the observations already derived from the HRTFs analysis. In more detail, the worst performance is achieved by the first subject, shown in Figure 10, with the Knowles microphone. Differently, the lowest LSD value of 1.1 dB is reached by the fourth subject with the Sennheiser microphone and with the source positioned at

ϑ = 45^{\circ}

and

φ = 0^{\circ}

, reported in Figure 13d. Moreover, for every case, the Sennheiser microphone shows the lowest LSD value in comparison with the Knowles one and, thus, better performance.

5. Conclusions

In this paper, a comparative analysis of HRTFs measurement procedures is presented. In more detail, the HRTFs measured with two different in-ear microphones (i.e., Knowles and Sennheiser) are analyzed and compared with the HRTFs measured with a standard binaural mannequin. As first step, the influence of the microphone position on the frequency responses is investigated using the mannequin’s ear. Then, individual HRTFs of five real subjects are measured and compared among them. The experimental results have proven that the HRTFs are similar at low frequencies when different types of microphones are involved. In addition, the position of the microphone influences the HRTFs above 4 kHz. The experiments have shown that it is difficult to define the best position for the in-ear microphone. In fact, the analysis of the LSD values has reported that the best location of the microphone varies with the loudspeaker position and is not always the same. However, the Sennheiser microphone enables the obtaining of frequency responses more similar to the ones measured by the dummy head. Finally, the individual HRTFs measured on real subjects have shown how the frequency responses change with different ears. Furthermore, in this case, the Sennheiser microphone has produced HRTFs more similar to the mannequin and more similar among the five subjects. However, the worst performance of Knowles are compensated by the price that is one order of magnitude lower than Sennheiser microphone. Future works will investigate the effectiveness of the different HRTF measurements through subjective tests, evaluating the immersive perception. The subjective tests will examine the influence of the microphone position on the listening experience and will subjectively investigate the difference between personalized and standardized HRTFs.

Author Contributions

Conceptualization, S.S. and S.C.; methodology, S.C.; software, V.B. and A.T.; validation, V.B., A.T., N.A.D. and S.C.; formal analysis, V.B. and S.C.; investigation, V.B., A.T. and S.C.; resources, V.B. and A.T.; data curation, V.B. and A.T.; writing—original draft preparation, V.B. and A.T.; writing—review and editing, S.C.; visualization, S.C. and S.S.; supervision, S.C.; project administration, S.C.; funding acquisition, S.C. and S.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the financial program DM MiSE 5 Marzo 2018, project “ChAALenge”—F/180016/01-05/X43.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Not applicable.

Acknowledgments

The authors are grateful to the student Francisco Messina for their help during the experimental measurements.

Conflicts of Interest

The authors declare no conflict of interest.

References

Iida, K. Head-Related Transfer Function and Acoustic Virtual Reality; Springer: Singapore, 2019. [Google Scholar]
Rafaely, B.; Tourbabin, V.; Habets, E.; Ben-Hur, Z.; Lee, H.; Gamper, H.; Arbel, L.; Birnie, L.; Abhayapala, T.; Samarasinghe, P. Spatial audio signal processing for binaural reproduction of recorded acoustic scenes–review and challenges. Acta Acust. 2022, 6, 47. [Google Scholar] [CrossRef]
Cobos, M.; Ahrens, J.; Kowalczyk, K.; Politis, A. An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction. EURASIP J. Audio Speech Music Process. 2022, 2022, 1–21. [Google Scholar] [CrossRef]
Miccini, R.; Spagnol, S. HRTF individualization using deep learning. In Proceedings of the 2020 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW), Atlanta, GA, USA, 22–26 March 2020; pp. 390–395. [Google Scholar]
Burkhard, M.; Sachs, R. Anthropometric manikin for acoustic research. J. Acoust. Soc. Am. 1975, 58, 214–222. [Google Scholar] [CrossRef] [PubMed]
Gardner, B.; Martin, K. HRTF Measurements of a KEMAR Dummy-Head Microphone. In MIT Media Lab Perceptual Computing Technical Report 280; Massachusetts Institute of Technology: Cambridge, MA, USA, 1994. [Google Scholar]
Bovbjerg, B.P.; Christensen, F.; Minnaar, P.; Chen, X. Measuring the Head-Related Transfer Functions of an Artificial Head with a High-Directional Resolution. In Proceedings of the 109th Convention of the Audio Engineering Society, Los Angeles, CA, USA, 22–25 September 2000. [Google Scholar]
Richter, J.G.; Fels, J. On the Influence of Continuous Subject Rotation During High-Resolution Head-Related Transfer Function Measurements. IEEE/ACM Trans. Audio Speech Lang. Process. 2019, 27, 730–741. [Google Scholar] [CrossRef]
Wierstorf, H.; Geier, M.; Spors, S. A Free Database of Head Related Impulse Response Measurements in the Horizontal Plane with Multiple Distances. In Proceedings of the 130th Convention of the Audio Engineering Society, London, UK, 13–16 May 2011. [Google Scholar]
Li, Y.; Preihs, S.; Peissig, J. Acquisition of Continuous-Distance Near-Field Head-Related Transfer Functions on KEMAR Using Adaptive Filtering. In Proceedings of the 152nd Convention of the Audio Engineering Society, The Hague, The Netherlands, 7–8 May 2022. [Google Scholar]
Hirahara, T.; Sagara, H.; Toshima, I.; Otani, M. Head Movement During Head-Related Transfer Function Measurements. Acoust. Sci. Technol. 2010, 31, 165–171. [Google Scholar] [CrossRef] [Green Version]
Masiero, B.; Pollow, M.; Fels, J. Design of a fast broadband individual head-related transfer function measurement system. In Proceedings of the Forum Acusticum, Aalborg, Denmark, 27 June–1 July 2011; pp. 2197–2202. [Google Scholar]
Bolaños, J.G.; Pulkki, V. HRIR Database with Measured Actual Source Direction Data. In Proceedings of the 133rd Convention of the Audio Engineering Society, San Francisco, CA, USA, 26–29 October 2012. [Google Scholar]
Carpentier, T.; Bahu, H.; Noisternig, M.; Warusfel, O. Measurement of a head-related transfer function database with high spatial resolution. In Proceedings of the 7th Forum Acusticum (EAA), Krakow, Poland, 7–12 September 2014. [Google Scholar]
Møller, H.; Sørensen, M.F.; Hammershøi, D.; Jensen, C.B. Head-Related Transfer Functions of Human Subjects. J. Audio Eng. Soc. 1995, 43, 300–321. [Google Scholar]
Yu, G.; Wu, R.; Liu, Y.; Xie, B. Near-Field Head-Related Transfer-Function Measurement and Database of Human Subjects. J. Acoust. Soc. Am. 2018, 143, EL194–EL198. [Google Scholar] [CrossRef] [Green Version]
Ye, Q.; Dong, Q.; Zhang, Y.; Li, X. Fast head-related transfer function measurement in complex environments. In Proceedings of the 20th International Congress on Acoustics, Sydney, Australia, 23–27 August 2010; pp. 23–27. [Google Scholar]
Usher, J.; Martens, W.L. Perceived Naturalness of Speech Sounds Presented Using Personalized versus Non-Personalized HRTFs; Georgia Institute of Technology: Atlanta, GA, USA, 2007. [Google Scholar]
Møller, H.; Sørensen, M.F.; Jensen, C.B.; Hammershøi, D. Binaural technique: Do we need individual recordings? J. Audio Eng. Soc. 1996, 44, 451–469. [Google Scholar]
Armstrong, C.; Thresh, L.; Murphy, D.; Kearney, G. A Perceptual Evaluation of Individual and Non-Individual HRTFs: A Case Study of the SADIE II Database. Appl. Sci. 2018, 8, 2029. [Google Scholar] [CrossRef] [Green Version]
Li, S.; Peissig, J. Measurement of head-related transfer functions: A review. Appl. Sci. 2020, 10, 5014. [Google Scholar] [CrossRef]
Mehrgardt, S.; Mellert, V. Transformation characteristics of the external human ear. J. Acoust. Soc. Am. 1977, 61, 1567–1576. [Google Scholar] [CrossRef] [PubMed]
Wiener, F.M.; Ross, D.A. The pressure distribution in the auditory canal in a progressive sound field. J. Acoust. Soc. Am. 1946, 18, 401–408. [Google Scholar] [CrossRef]
Searle, C.; Braida, L.; Cuddy, D.; Davis, M. Binaural pinna disparity: Another auditory localization cue. J. Acoust. Soc. Am. 1975, 57, 448–455. [Google Scholar] [CrossRef]
Middlebrooks, J.C.; Makous, J.C.; Green, D.M. Directional sensitivity of sound-pressure levels in the human ear canal. J. Acoust. Soc. Am. 1989, 86, 89–108. [Google Scholar] [CrossRef]
Djupesland, G.; Zwislocki, J. Sound pressure distribution in the outer ear. Acta Oto-Laryngol. 1973, 75, 350–352. [Google Scholar] [CrossRef]
Wiener, F.M. On the diffraction of a progressive sound wave by the human head. J. Acoust. Soc. Am. 1947, 19, 143–146. [Google Scholar] [CrossRef]
Shaw, E.A. Earcanal pressure generated by a free sound field. J. Acoust. Soc. Am. 1966, 39, 465–470. [Google Scholar] [CrossRef]
Middlebrooks, J.C. Narrow-band sound localization related to external ear acoustics. J. Acoust. Soc. Am. 1992, 92, 2607–2624. [Google Scholar] [CrossRef] [PubMed]
Wightman, F.L.; Kistler, D.J. Headphone simulation of free-field listening. I: Stimulus synthesis. J. Acoust. Soc. Am. 1989, 85, 858–867. [Google Scholar] [CrossRef]
Hellstrom, P.A.; Axelsson, A. Miniature microphone probe tube measurements in the external auditory canal. J. Acoust. Soc. Am. 1993, 93, 907–919. [Google Scholar] [CrossRef]
Møller, H. Fundamentals of binaural technology. Appl. Acoust. 1992, 36, 171–218. [Google Scholar] [CrossRef] [Green Version]
Hammershøi, D.; Møller, H. Sound transmission to and within the human ear canal. J. Acoust. Soc. Am. 1996, 100, 408–427. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Algazi, V.R.; Avendano, C.; Thompson, D. Dependence of subject and measurement position in binaural signal acquisition. J. Audio Eng. Soc. 1999, 47, 937–947. [Google Scholar]
Gardner, B.; Martin, K. HRTF measurements of a KEMAR. J. Acoust. Soc. Am. 1995, 97, 3907–3908. [Google Scholar] [CrossRef]
Briggs, P.; Godfrey, K. Pseudorandom Signals for the Dynamic Analysis of Multivariable Systems. In Proceedings of the Institution of Electrical Engineers; IET: Stevenage, Hertfordshire, UK, 1966; Volume 113, pp. 1259–1267. [Google Scholar]
MacWilliams, F.; Sloane, N. Pseudo-random Sequences and Arrays. Proc. IEEE 1976, 64, 1715–1729. [Google Scholar] [CrossRef]
Schroeder, M.R. Integrated-Impulse Method Measuring Sound Decay Without Using Impulses. J. Acoust. Soc. Am. 1979, 66, 497–500. [Google Scholar] [CrossRef]
Borish, J.; Angell, J.B. An Efficient Algorithm for Measuring the Impulse Response Using Pseudorandom Noise. J. Audio Eng. Soc. 1983, 31, 478–488. [Google Scholar]
Mommertz, E.; Müller, S. Measuring impulse responses with digitally pre-emphasized pseudorandom noise derived from maximum-length sequences. Appl. Acoust. 1995, 44, 195–214. [Google Scholar] [CrossRef]
Ream, N. Nonlinear identification using inverse-repeatm sequences. In Proceedings of the Institution of Electrical Engineers; IET: Stevenage, Hertfordshire, UK, 1970; Volume 117, pp. 213–218. [Google Scholar]
Dunn, C.; Hawksford, M.J. Distortion Immunity of MLS-derived Impulse Response Measurements. J. Audio Eng. Soc. 1993, 41, 314–335. [Google Scholar]
Golay, M. Complementary series. IRE Trans. Inf. Theory 1961, 7, 82–87. [Google Scholar] [CrossRef]
Müller, S.; Massarani, P. Transfer-Function Measurement with Sweeps. J. Audio Eng. Soc. 2001, 49, 443–471. [Google Scholar]
Müller, S. Measuring Transfer-Functions and Impulse Responses. In Handbook of Signal Processing in Acoustics; Havelock, D., Kuwano, S., Vorländer, M., Eds.; Springer: New York, NY, USA, 2008; pp. 65–85. [Google Scholar] [CrossRef]
Rothbucher, M.; Veprek, K.; Paukner, P.; Habigt, T.; Diepold, K. Comparison of head-related impulse response measurement approaches. J. Acoust. Soc. Am. 2013, 134, EL223–EL229. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Heyser, R.C. Acoustical Measurements by Time Delay Spectrometry. J. Audio Eng. Soc. 1967, 15, 370–382. [Google Scholar]
Farina, A. Simultaneous Measurement of Impulse Response and Distortion with a Swept-Sine Technique. In Proceedings of the 108th Convention of the Audio Engineering Society, Paris, France, 19–22 February 2000. [Google Scholar]
Stan, G.B.; Embrechts, J.J.; Archambeau, D. Comparison of Different Impulse Response Measurement Techniques. J. Audio Eng. Soc. 2002, 50, 249–262. [Google Scholar]
Pulkki, V.; Laitinen, M.V.; Sivonen, V. HRTF measurements with a continuously moving loudspeaker and swept sines. In Proceedings of the 128th Convention of the Audio Engineering Society, London, UK, 22–25 May 2010. [Google Scholar]
Enzner, G. Analysis and optimal control of LMS-type adaptive filtering for continuous-azimuth acquisition of head related impulse responses. In Proceedings of the 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, USA, 31 March–4 April 2008; pp. 393–396. [Google Scholar]
Correa, C.K.; Li, S.; Peissig, J. Analysis and Comparison of different Adaptive Filtering Algorithms for Fast Continuous HRTF Measurement. In Proceedings of the Tagungsband Fortschritte der Akustik—DAGA, Kiel, Germany, 6–9 March 2017; pp. 6–9. [Google Scholar]
Haykin, S.S. Adaptive Filter Theory; Pearson Education India: Chennai, India, 2002. [Google Scholar]
Xie, B. Head-Related Transfer Function and Virtual Auditory Display; J. Ross Publishing: Plantation, FL, USA, 2013. [Google Scholar]
Carini, A.; Cecchi, S.; Romoli, L.; Sicuranza, G.L. Perfect Periodic Sequences for Legendre Nonlinear Filters. In Proceedings of the 22nd European Signal Processing Conference, Lisbon, Portugal, 1–5 September 2014; pp. 2400–2404. [Google Scholar]
Carini, A.; Cecchi, S.; Romoli, L. Room Impulse Response Estimation using Perfect sequences for Legendre Nonlinear filters. In Proceedings of the 23nd European Signal Processing Conference, Nice, France, 31 August–4 September 2015. [Google Scholar]
Carini, A.; Romoli, L.; Cecchi, S.; Orcioni, S. Perfect Periodic Sequences for Nonlinear Wiener Filters. In Proceedings of the 24th European Signal Processing Conference, Budapest, Hungary, 28 August–2 September 2016. [Google Scholar]
Cecchi, S.; Bruschi, V.; Nobili, S.; Terenzi, A.; Carini, A. Using Periodic Sequences for HRTFs Measurement Robust Towards Nonlinearities in Automotive Audio Applications. In Proceedings of the 2022 IEEE International Workshop on Metrology for Automotive (MetroAutomotive), Modena, Italy, 4–6 July 2022; pp. 99–104. [Google Scholar] [CrossRef]
Carini, A.; Cecchi, S.; Romoli, L. Robust room impulse response measurement using perfect sequences for Legendre nonlinear filters. IEEE/ACM Trans. Audio Speech Lang. Process. 2016, 24, 1969–1982. [Google Scholar] [CrossRef]
Carini, A.; Cecchi, S.; Terenzi, A.; Orcioni, S. On room impulse response measurement using perfect sequences for Wiener nonlinear filters. In Proceedings of the 2018 26th European Signal Processing Conference (EUSIPCO), Rome, Italy, 3–7 September 2018; pp. 982–986. [Google Scholar] [CrossRef]
Carini, A.; Cecchi, S.; Orcioni, S. Robust Room Impulse Response Measurement Using Perfect Periodic Sequences for Wiener Nonlinear Filters. Electronics 2020, 9, 1793. [Google Scholar] [CrossRef]
Carini, A.; Orcioni, S.; Terenzi, A.; Cecchi, S. Orthogonal periodic sequences for the identification of functional link polynomial filters. IEEE Trans. Signal Process. 2020, 68, 5308–5321. [Google Scholar] [CrossRef]
Carini, A.; Orcioni, S.; Cecchi, S. On Room Impulse Response Measurement Using Orthogonal Periodic Sequences. In Proceedings of the 2019 27th European Signal Processing Conference (EUSIPCO), A Coruna, Spain, 2–6 September 2019; pp. 1–5. [Google Scholar]
Carini, A.; Cecchi, S.; Terenzi, A.; Orcioni, S. A Room Impulse Response Measurement Method Robust Towards Nonlinearities Based on Orthogonal Periodic Sequences. IEEE/ACM Trans. Audio Speech Lang. Process. 2021, 29, 3104–3117. [Google Scholar] [CrossRef]
Knowles. FG-23329-D65 Microphones. 2009. Available online: https://www.mouser.it/datasheet/2/218/knowles_corporation_04272020_FG-23329-D65-1840470.pdf (accessed on 17 May 2023).
Sennheiser. MKE 2-EW GOLD Microphone. Available online: https://assets.sennheiser.com/global-downloads/file/5712/MKE2_Gold_Manual_07_2015_EN.pdf (accessed on 17 May 2023).
Bruel & Kjaer Head and Torso Simulator (HATS) Type 4128C. 2010. Available online: https://www.bksv.com/en/transducers/simulators/head-and-torso/hats-type-4128c (accessed on 17 May 2023).
Leaff Engineering. NU-Tech. Available online: https://www.nu-tech-dsp.com/content.php (accessed on 17 May 2023).

Figure 1. (a) Knowles FG-23329-D65 with its battery power supply, (b) Sennheiser MKE2-EW Gold microphone with its power supply and signal conditioner MZA900P, and (c) Brüel & Kjær head and torso simulator (HATS) Type 4128C.

Figure 2. Frequency responses of (a) Knowles FG-23329-D65 [65], (b) Sennheiser MKE2 EW Gold [66], and (c) Brüel & Kjær head and torso simulator (HATS) Type 4128C [67], provided by the manufacturers.

Figure 3. Scheme of the acquisition chain used for HRTF measurements.

Figure 4. Photo of the experimental setup used for HRTF measurements.

Figure 5. Photos of the (a) Knowles and (b) Sennheiser microphones on the mannequin ear.

Figure 6. Position of the sound source in terms of (a) azimuth and (b,c) positions of the in-ear microphone considered in experiment 1. The microphone’s positions are identified with different colors (i.e., red for P1, yellow for P2, violet for P3, and green for P4).

Figure 7. Photos of the (a) Knowles and (b) Sennheiser microphones on the ear of a real subject.

Figure 8. Experiment 1: HRTFs comparison considering the different in-ear microphone positions of Figure 6c for sound source positions (a,b) with

ϑ = 0^{\circ}

and

φ = 0^{\circ}

, (c,d) with

ϑ = 45^{\circ}

and

φ = 0^{\circ}

, (e,f) with

ϑ = 0^{\circ}

and

φ = 15^{\circ}

, and (g,h) with

ϑ = 45^{\circ}

and

φ = 15^{\circ}

, using (a,c,e,g) the Knowles microphone, and (b,d,f,h) the Sennheiser microphone.

Figure 8. Experiment 1: HRTFs comparison considering the different in-ear microphone positions of Figure 6c for sound source positions (a,b) with

ϑ = 0^{\circ}

and

φ = 0^{\circ}

, (c,d) with

ϑ = 45^{\circ}

and

φ = 0^{\circ}

, (e,f) with

ϑ = 0^{\circ}

and

φ = 15^{\circ}

, and (g,h) with

ϑ = 45^{\circ}

and

φ = 15^{\circ}

, using (a,c,e,g) the Knowles microphone, and (b,d,f,h) the Sennheiser microphone.

Figure 9. Experiment 1: Impulse responses comparison considering the different in-ear microphone positions of Figure 6c for the sound source position with

ϑ = 0^{\circ}

and

φ = 0^{\circ}

, using (a) the HATS, (b,d,f,h) the Knowles microphone, and (c,e,g,i) the Sennheiser microphone.

Figure 9. Experiment 1: Impulse responses comparison considering the different in-ear microphone positions of Figure 6c for the sound source position with

ϑ = 0^{\circ}

and

φ = 0^{\circ}

, using (a) the HATS, (b,d,f,h) the Knowles microphone, and (c,e,g,i) the Sennheiser microphone.

Figure 10. Experiment 2: HRTFs comparison of subject 1 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 10. Experiment 2: HRTFs comparison of subject 1 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 11. Experiment 2: HRTFs comparison of subject 2 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 11. Experiment 2: HRTFs comparison of subject 2 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 12. Experiment 2: HRTFs comparison of subject 3 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 12. Experiment 2: HRTFs comparison of subject 3 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 13. Experiment 2: HRTFs comparison of subject 4 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 13. Experiment 2: HRTFs comparison of subject 4 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 14. Experiment 2: HRTFs comparison of subject 5 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 14. Experiment 2: HRTFs comparison of subject 5 with (a,b) the Knowles microphone and (c,d) the Sennheiser microphone for (a,c)

ϑ = 0^{\circ}

, and (b,d)

ϑ = 45^{\circ}

and a fixed elevation of

φ = 0^{\circ}

. The measurements are compared with the same microphone on the dummy ear.

Figure 15. Experiment 2: Impulse responses measured with the Knowles microphone (first column) and the Sennheiser microphone (second column) on (a,b) the dummy head, and (c–l) on five real subjects for the sound source position with

ϑ = 0^{\circ}

and

φ = 0^{\circ}

.

Figure 15. Experiment 2: Impulse responses measured with the Knowles microphone (first column) and the Sennheiser microphone (second column) on (a,b) the dummy head, and (c–l) on five real subjects for the sound source position with

ϑ = 0^{\circ}

and

φ = 0^{\circ}

.

Table 1. LSD values (in dB) obtained in experiment 1. For each source position and microphone position, the lowest value of the LSD is bold.

Source Pos.	Mic. Pos.	Know.	Senn.
$ϑ = 0^{\circ}$ , $φ = 0^{\circ}$	P1	1.9	1.4
	P2	1.4	1.1
	P3	2.0	1.4
	P4	1.5	1.7
$ϑ = 45^{\circ}$ , $φ = 0^{\circ}$	P1	1.5	1.1
	P2	1.6	2.3
	P3	1.3	1.4
	P4	3.4	1.5
$ϑ = 0^{\circ}$ , $φ = 15^{\circ}$	P1	1.9	1.3
	P2	1.5	0.9
	P3	1.5	1.3
	P4	1.6	1.7
$ϑ = 45^{\circ}$ , $φ = 15^{\circ}$	P1	1.9	1.7
	P2	2.6	1.9
	P3	2.1	1.3
	P4	3.2	1.4

Table 2. LSD values (in dB) obtained in experiment 2. For each source position and subject, the lowest value of the LSD is in bold.

Source Position	Subject	Knowles	Sennheiser
$ϑ = 0^{\circ}$ , $φ = 0^{\circ}$	Subj1	2.3	1.9
	Subj2	1.8	1.3
	Subj3	2.0	1.2
	Subj4	2.1	1.3
	Subj5	1.8	1.5
$ϑ = 45^{\circ}$ , $φ = 0^{\circ}$	Subj1	2.7	1.3
	Subj2	2.0	1.3
	Subj3	2.2	1.3
	Subj4	2.1	1.1
	Subj4	1.6	1.5

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bruschi, V.; Terenzi, A.; Dourou, N.A.; Spinsante, S.; Cecchi, S. Comparative Analysis of HRTFs Measurement Using In-Ear Microphones. Sensors 2023, 23, 6016. https://doi.org/10.3390/s23136016

AMA Style

Bruschi V, Terenzi A, Dourou NA, Spinsante S, Cecchi S. Comparative Analysis of HRTFs Measurement Using In-Ear Microphones. Sensors. 2023; 23(13):6016. https://doi.org/10.3390/s23136016

Chicago/Turabian Style

Bruschi, Valeria, Alessandro Terenzi, Nefeli A. Dourou, Susanna Spinsante, and Stefania Cecchi. 2023. "Comparative Analysis of HRTFs Measurement Using In-Ear Microphones" Sensors 23, no. 13: 6016. https://doi.org/10.3390/s23136016

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Comparative Analysis of HRTFs Measurement Using In-Ear Microphones

Abstract

1. Introduction

2. Measurement Techniques

3. Hardware Setup

4. Experimental Results

4.1. Experiment 1

4.2. Experiment 2

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI