Virtual Acoustics and Audio Engineering: Numerical Modelling

VIRTUAL ACOUSTICS AND AUDIO ENGINEERING Fluid Dynamics and Acoustics Group

Home Page

Introduction

Stereo Dipole

Optimal Source Distribution

Adaptive Virtual Sound Imaging

Multi-Channel Systems

Numerical Modelling
  Baffled Pinnae
  HRTF's
  References
  Presentation

Auditory Process Model

Loudspeaker Design

Facilities

List of Publications

Download

Animations

Contact Us

ISVR Home Page

Numerical Modelling (1997-2001)

Head Related Transfer Functions (HRTFs)

A large body of research has been carried out on HRTF measurement (see Hebrank and Wright, 1974, Mehrgardt and Mellert, 1977, Butler and Belendiuk, 1977, Shaw and Teranishi, 1968, Shaw, 1974, Shaw, 1975, Shaw and Vaillancourt, 1985, Gardner and Martin, 1994, 1995, Møller et al, 1995, Hammershøi and Møller, 1996, Carlile and Pralong, 1994, Pralong and Carlile, 1994 and Blauert, 1997).

The external ear transfer functions are measured in the above publications at various places along the ear canal, which gives rise to different responses. However, since up to 12-14 kHz only the longitudinal mode is present in the ear canal, the variation of the position of the microphone does not distort the directionally dependent HRTFs.

Mesh decimation and manipulation

The initial assumption made during this work was that the highest resolution possible is required for the mesh models of the head and pinnae. It was not clear at that stage what accuracy is required and how sensitive will be the modelling of the acoustical response to geometry approximations. There are currently a few techniques available to obtain a computer model by scanning a physical model. These include: Computed Tomography (CT), Magnetic Resonance Imaging (MRI), 3-D ultrasonic imaging, etc. These are generally used for internal scanning for medical purposes. The main advantage of the 3-D laser scanner technique used in this research is that it can produce fairly quickly an accurate mesh of the surface made out of triangles

In the initial work, the Cyberware 'Head and Face' colour 3-D scanner had been used. This scanner includes the 3030/RGB digitising head and the PS motion system. It is mainly used for medical applications, anthropometry, human interface, and in the film industry. The geometry is captured by means of an optical range-finding system that produces around a half of a million interconnected triangles and their vertices in approximately 15 seconds. We refer to this scanner as having a 'low-resolution'.

To begin with, it was found in the scanning stage, that the accuracy of the pinna, which is our most important part of the head, was poor, since during the motion of the scanner the laser beam detects only unhidden parts. In the initial trials using the KEMAR head, the rear part of the pinnae were significantly distorted, and more importantly, the resulting concha was much shallower then the original rubber ear of KEMAR, and without the details of the cavum and cymba concha, helix, antihelix and fossa of helix. However, only after a set of simulations, it was found that another scanning technique was required for high accuracy modelling of the pinna.

Although still based on the technology of laser scanning, the Cyberware 'Mini model' 3-D scanner is based on the high-resolution 3030RGB/HIREZ scan head with a mid-size high-resolution motion system. It moves slowly from side to side in the horizontal plane in a straight line, parallel to the object. The principle of operation is similar to the 'head and face' scanner, but with software controlled, the data is accumulated through repeated scans at different angles of the pinna. In this way, almost every curvature can be captured correctly. Even the ear canal geometry can be obtained, by providing an additional model with the original's cross-section. Then, the software matches the 'internal' data with its own 'external' data by matching overlapping sections. The duration of this scanning procedure is much longer, of the order of a few hours; before any post-processing is applied to fix connectivity, rough surface, holes, etc.

A few tools were developed and used to integrate the two scans into a single mesh model for both KEMAR head and YK head. Two types of decimated KEMAR models are presented below. In both cases the original data included more than 400000 triangles (around 200000 vertices). The target was to obtain a suitable BEM mesh that could be used to modelling at the maximum frequency possible with the IBEM in-core solver . The mesh on the right was decimated using our proposed algorithm that produces homogeneously distributed vertices, thus optimising its size, geometry, and maximum frequency. This resulted in approximately 23000 elements that can be used up to 15 kHz if four elements per wavelength are assumed (for the ipsilateral ear), and 10 kHz if six elements per wavelength are assumed (for the contralateral ear).

When a conventional mesh decimation algorithm was used (the mesh in the middle, optimised for computer graphics applications), and the number of elements was limited to 23000 elements, a mesh with non-uniform distribution of vertices was obtained, where planar areas were described with less triangles, and complex areas retained a higher density of triangles to preserve the geometry. Note that with both decimation algorithms the rendered image (on the left) is very similar. This mesh in the middle could be investigated using the BEM reliably only up to 1 kHz (!). This emphasises the significance of optimising the mesh distribution while retaining the accuracy of complex shapes such as the pinna.

BEM simulation and measurement comparison

The following figures demonstrate the accuracy of the simulation results when compared to measurements at various angles in the horizontal plane.

The same simulated and measured responses are shown at discrete angles. These show that errors of <1 dB appear up to 15 kHz. At other angles where sharper notches are found errors increase locally at the position of the notch. It should be noted that median plane simulation and measurement do not impose great difficulty from the SNR point of view, since there are no effects of the shadowing due to the head, which have strong attenuation at high frequencies.

-40 deg.	0 deg.
40 deg.	90 deg.
130 deg.	180 deg.