Genelec Frequency Response Optimisation, Frequency Response Optimisatio User Manual

Statistical Analysis of an Automated
In-Situ Frequency Response Optimisation
Algorithm for Active Loudspeakers
Andrew Goldberg1 and Aki Mäkivirta1
1
Genelec Oy, Olvitie 5, 74100 Iisalmi, Finland.
ABSTRACT
This paper presents a novel method for automatically selecting the optimal in-situ acoustical frequency response of active loudspeakers within a discrete-valued set of responses offered by room response controls on active loudspeakers. The rationale of the room response controls for the active loudspeakers is explained. The frequency response, calculated from the acquired impulse response, is used as the input for the optimisation algorithm to select the most favourable combination of room response controls. The optimisation algorithm is described. The perform­ance of the algorithm is analysed and discussed. This algorithm has been implemented and is currently in active use by specialist loudspeaker system calibrators who set up and tune studios and listening rooms.
1. INTRODUCTION
This paper presents a system to optimally set the room response controls currently found on full-range active loudspeakers to achieve a desired in-room frequency response. The active loudspeakers [1] to be optimised are individually calibrated in anechoic conditions to have a flat frequency response magnitude within de­sign limits of ±2.5 dB.
When a loudspeaker is placed into the listening envi­ronment the frequency response changes due to loud­speaker-room interaction. To help alleviate this, the active loudspeakers incorporate a pragmatic set of room response controls, which account for common acoustic issues found in professional listening rooms.
Although many users have the facility to measure loudspeaker in-situ frequency responses, they often do not have the experience of calibrating active loud­speakers. Even with experienced system calibrators, significant variance between calibrations can be seen. Furthermore, with a number of different people cali­brating loudspeaker systems, additional variance in results will occur. For these reasons an automated calibration method was developed to ensure consis­tency of calibrations.
Presented first in this paper is the discrete-valued room response equaliser employed in the active loud­speakers. Then, the algorithm for automated value se­lection is explained including the software structure, algorithm, features and operation. The performance of the optimisation algorithm is then investigated by
studying the statistical properties of frequency re­sponses before and after equalisation.
2. IN-SITU EQUALISATION AND ROOM RESPONSE CONTROLS
2.1. Equalisation Techniques
The purpose of room equalisation is to improve the perceived quality of sound reproduction in a listening environment. The goal of in-room equalisation is usu­ally not to convert the listening room to anechoic. In fact, listeners prefer to hear some room response in the form of liveliness that can create a spatial impres­sion and some envelopment [2].
An approach to improve the performance of a loud­speaker in a room is to choose an optimal location for the loudspeaker. Cox and D’Antonio [3] (Room Opti­miser) use a computer model of the room to find op­timal loudspeaker positions and acoustical treatment location to give an optimally flat in-situ frequency re­sponse magnitude. Positional areas for the loud­speaker and listening locations can be given as con­straints to limit the final solution. Problems with this approach are that an optimisation may not be practi­cally possible in all cases and that this is only half of the installation process, as the loudspeaker should be corrected for problems caused by the loudspeaker­room interaction too.
Electronic equalisation to improve the subjective sound quality has been widespread for at least 40 years; see Boner & Boner [4] for an early example.
AES 23rd International Conference, Copenhagen, Denmark, 2003 May 23-25 1
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
Equalisation is particularly prevalent in professional sound reproduction applications such as recording stu­dios, mixing rooms and sound reinforcement.
In-situ response equalisation is typically implemented using a separate equaliser, although equalisers are in­creasingly built into active loudspeakers. Some equal­isers on the market play a test signal and then alter their response according to the in-situ transfer func­tion measured in this way [5] but the process can be so sensitive that a simple ‘press the button and every­thing will be OK’ approach proves hard to achieve with reliability, consistency and robustness.
It is possible that equalisation becomes skewed if it is based only on a single point measurement. The fre­quency response in nearby positions can actually be­come worse after applying an equalisation designed using only a single point measurement. A classical method to avoid this is to use a weighted average of responses measured within the listening area. Such spatial averaging is often required when the listening area is large. Examples of spatial averaging have been described in the automotive industry [6] and cinema in the SMPTE Standard 202M [7]. Spatial averaging can reduce local variance in midrange to high frequencies and can also reduce problems caused by the fact that a listener perceives sound differently to a microphone, but typically reduces the accuracy of equalisation ob­tained at the primary listening location.
The room transfer function is position dependent, and this poses major problems for all equalisation tech­niques. For a single loudspeaker in diffuse field no correction filter is capable of removing differences between responses measured at two separate receiver points. At high frequencies a required high-resolution correction can become very position sensitive. Fre­quency dependent resolution change is then preferable and is typically applied [8,9] but with the expense of reduced equalisation accuracy. Perfect equalisation able to achieve precisely flat frequency response in a listening room, even within a reasonably small listen­ing area, appears not to be possible. An acceptable equalisation is typically a compromise to minimise the subjective coloration in audio due to room effects.
Typically electronic equalisation in active loudspeak­ers uses low order analogue minimum phase filters [10-12]. Since the loudspeaker-room transfer function is of substantially higher order than such equalisation filters, the effect of filtering is to gently shape the re­sponse. Even with this limitation, in-situ equalisers have the potential to significantly improve perceived sound quality. The practical challenge is the selection of the best settings for the low-order in-situ equaliser.
Despite advances in psychoacoustics, it is difficult to quantify what the listener actually perceives the sound
quality to be, or to optimise equalisation based on that evaluation [13-15]. Because of this, in-situ equalisa­tion typically attempts to obtain the best fit to some objectively measurable target, such as a flat third­octave smoothed response, known to have a link to the perception of sound being free from coloration. Also, despite the widespread use of equalisation, it is still hard to provide exact timbre matching between differ­ent environments.
Several methods have been proposed for more exact inversion of the frequency response to achieve a close approximation of unity transfer function (no change to magnitude or phase) within a certain bandwidth of in­terest [16-24]. Some researchers have also shown an interest to control selectively the temporal decay char­acteristics of a listening space by active absorption or modification of the primary sound [25-30]. If realis­able, these are extremely attractive ideas because they imply that the perceived sound could be modified with precision, to different target responses. Then, spatial variations in the frequency response can become far more difficult to handle than with low-order methods because the correction depends strongly on an exact match between the acoustic and equalisation transfer functions, and can therefore be highly local in space [25].
2.2. Room Acoustic Considerations
In small to medium sized listening environments, the sound field in the frequency range up to a critical fre-
quency f
, (typically 70…200 Hz in small spaces) is
c
often dominated by room modes and comb filtering caused by low-order discrete reflections from room boundaries. Sound reproduction can be problematic because of this. For a room with a reverberation time
of 0.3 s the room mode bandwidth is approxi-
T
60
mately 2.2/T
= 7.3 Hz [23]. However, this does not
60
predict accurately what the decay rate of an individual mode is as reverberation time represents the total de­cay rate in diffuse field whereas modal decay rate may vary.
Above f
modal density becomes sufficiently high to
c
be described statistically. An unsmoothed room trans­fer function shows a large number of high Q notches. When frequency smoothing due to human hearing is taken into account [31], the resulting sensation is a rather smooth room transfer function causing timber changes in the perceived audio.
In the time domain, early reflections before about 25 ms combine with the direct sound to produce tone colouration (comb filtering effect). Reflections arriv­ing later than about 25 ms are less problematic as they typically combine to produce the reverberation of the room and are perceived as separate sound events (ech-
AES 23RD CONFERENCE, May 23-25, 2003 2
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
(
oes and reverberation) rather than tone colouration. This part of the time domain response contributes to the sensations of envelopment and spaciousness.
2.3. Room Response Controls
The loudspeakers to be optimised have room response controls [1,32]. The smaller loudspeakers have sim­pler controls than the larger systems but the philoso­phy of filtering is consistent across the range (Tables 1-4).
Table 1. Small two way room response controls.
Control type Room response control settings, dB Treble tilt 0, –2 Bass tilt 0, –2, –4, –6 Bass roll-off 0, –2
Table 2. Two way room response controls.
Control type Room response control settings, dB Treble tilt +2, 0, –2, –4, driver mute Bass tilt 0, –2, –4, –6, driver mute Bass roll-off 0, –2, –4, –6, –8
Table 3. Three way room response controls.
Control type Room response control settings, dB Treble level 0, –1, –2, –3, –4, –5, –6, driver mute Midrange level 0, –1, –2, –3, –4, –5, –6, driver mute Bass level 0, –1, –2, –3, –4, –5, –6, driver mute Bass tilt 0, –2, –4, –6, –8 Bass roll-off 0, –2, –4, –6, –8
Table 4. Large system room response controls.
Control type Room response control settings, dB Treble tilt +1, 0, –1, –2, –3 Treble level 0, –1, –2, –3, –4, –5, –6, driver mute Midrange level 0, –1, –2, –3, –4, –5, –6, driver mute Bass level 0, –1, –2, –3, –4, –5, –6, driver mute Bass tilt 0, –2, –4, –6, –8 Bass roll-off 0, –2, –4, –6, –8
The treble tilt control is used to reduce the high fre- quency energy. In the small two-way systems and two-way systems it is a level control of the treble driver and has an effect down to about 4 kHz. In large systems it has a noticeable effect only above 10 kHz and has a roll-off character.
The driver level controls can be used to shape the broadband response of a loudspeaker. They control the output level of each driver with frequency ranges that are determined by the crossover filters.
The bass tilt control compensates for a bass boost seen when the loudspeaker is loaded by large nearby boundaries [33-36]. This typically happens when a loudspeaker is placed next to, or mounted into, an acoustically hard wall. This filter is a first
order shelv-
ing filter. The bass roll-off control compensates for a bass
boost often seen at the very lowest frequencies the loudspeaker can reproduce. This typically happens when the loudspeaker is mounted in the corner of a room where the loudspeaker is able to couple very ef­ficiently to the room thereby exacerbating room mode effects that dominate this region of the frequency re­sponse. It is a notch filter with a centre frequency set close to the low frequency cut-off of the loudspeaker.
3. ROOM EQUALISATION OPTIMISER
Optimisation involves the minimisation or maximisa­tion of a scalar-valued objective function E(x),
)
where, x is the vector of design parameters, x Multi-objective optimisation is concerned with the minimisation of a vector of objectives E(x) that may be subject to constraints or bounds. Several robust methods exist for optimising functions with design parameters x having a continuous value range [37].
3.1. Efficiency of Direct Search
The room response controls of an active loudspeaker form a discrete-valued set of frequency responses. If the optimum is found by trying every possible combi­nation of room response controls then the number of processing steps becomes prohibitively high (Table 5).
Table 5. Number of setting combinations.
Type of loudspeaker Room Response
Control Treble tilt 5 - 4 2 Treble level 7 7 - ­Midrange level 7 7 - ­Bass level 7 7 - ­Bass tilt 5 5 4 4 Bass roll-off 5 5 5 2 Total 42875 8575 80 16
3.2. The Algorithm
The algorithm [38] exploits the heuristics of experi­enced system calibration engineers by dividing the optimisation into five main stages (Table 6), which will be described in detail. The optimiser considers
xEmin (1)
n
∈ℜ
.
Large 3-way 2-way
Small
2-way
AES 23RD CONFERENCE, May 23-25, 2003 3
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
certain frequency ranges in each stage (Table 7). Figure 5 in Appendix A shows a flow chart of the software. A screenshot of the software graphic user interface can be seen in Appendix B.
roll-off setting m currently being tested, x target response, f (Table 7) and f
defines the ‘bass roll-off region’
a
defines the ‘bass region’ (Table 7).
b
User selected frequency ranges are not permitted. The reason for this arrangement rather than using a
Table 6. Optimisation stages.
Type of loudspeaker Optimisation stage Large 3-way 2-way Small
2-way Preset bass roll-off Find midrange/
treble ratio Set bass tilt and
level Reset bass roll-off Set treble tilt
9 9 9 9
9 9
9 9
- -
- -
9 9 9 9 9
-
9 9
Table 7. Optimiser frequency ranges; fHF = 15 kHz; fLF is the frequency of the lower –3 dB limit of the fre­quency range.
Low High Loudspeaker pass band
Midrange and treble driver band 500 Hz Bass roll-off region Bass region
Frequency Range
Limit
f
fHF
LF
f
1.5 fLF
LF
1.5
f
6 fLF
LF
f
HF
3.2.1. Pre-set Bass Roll-off
In this stage, the bass roll-off control is set to keep the maximum level found in the ‘bass roll-off region’ as close to the maximum level found in the ‘bass region’. Once found the bass roll-off control is reset to one po­sition higher, for example, –4 dB is changed to –2 dB. The reason for this is to leave some very low bass en­ergy for the bass tilt to filter. It is possible that the bass tilt alone is sufficient to optimise the response and less or no bass roll-off is eventually required. The min-max type objective function to be minimised is given by Equation 2,
m
max
f
min
m
a
E
=
max
f
b
0
m
 
0
[] []
==
ba
)()(
fxfa
 
)(
fx
,
)()(
fxfa
 
)(
fx
(2)
,,,
ffffff
3221
least squares type objective function is that the bass roll-off tends to assume maximum attenuation to minimise the RMS deviation. This type of objective function does not yield the best setting, as subjectively a loss of bass extension is perceived. This stage of the optimiser algorithm takes six filtering steps (three for small two-way models).
3.2.2. Midrange Level to Treble Level Ratio
The aim of this stage is to find the relative levels of the midrange level and treble level controls required to get closest to the target response. The least squares type objective function to be minimised is given in Equation 3,
f
2
min
m
E
=
ff
=
1
where x(f) is the smoothed magnitude of the in-situ frequency response of the system, a range and treble level control combination m currently being tested, x
(f) is the target response, f1 and f2 de-
0
fine the ‘midrange and treble driver band’ The lower frequency bound is fixed at 500 Hz but a user selectable high frequency value is permitted. The default value is 15 kHz.
The midrange-to-treble level ratio is saved for per­forming the third stage of the optimisation process. The reason for this is to reduce the number of room response control combinations to be tested in the next stage. This stage of the optimisation algorithm takes 49 filtering steps and is not required for two-way models or small two-way models.
3.2.3. Bass Tilt and Bass Level
This stage of the optimiser algorithm filters using all possible combinations of bass tilt and bass level con­trols for a given midrange/treble level difference. By fixing this difference the total number of filter combi­nations can be reduced substantially.
A constraint imposed in this stage is that only two of the driver level controls can be set at any one time. If three of the level controls are simultaneously set the net effect is a loss of overall system sensitivity. Table 8 shows an example of incorrect and correct setting of the driver level controls.
(f) is the
0
2
fxfa
m
0
)()(
(3)
fx
df
)(
(f) is the mid-
m
(Table 7).
where x(f) is the smoothed magnitude of the in-situ frequency response of the system, a
AES 23RD CONFERENCE, May 23-25, 2003 4
(f) is the bass
m
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
Table 8. Driver level control settings.
Control Incorrect Set-
ting Bass level –4 dB –2 dB Midrange level –3 dB –1 dB Treble level –2 dB 0 dB Input sensitivity –6 dBu –4 dBu
Correct Set-
ting
The least squares type objective function to be mini­mised is the same as shown in Equation 3. However,
(f) is the bass tilt and bass level combination m cur-
a
m
rently being tested together with the fixed midrange and treble level ratio setting found in the previous stage. Also, f
(Table 7). High and low user selected frequency
band’
and f2 now define the ‘loudspeaker pass
1
values are permitted. The default values are the –3 dB lower cut-off frequency of the loudspeaker and 15 kHz.
This part of the optimisation algorithm takes 35 filter­ing steps. There are no driver level controls in two­way or small two way systems so these virtual con­trols are set to 0 dB. The bass tilt control can then be optimised using the same objective function. Only five filtering steps are required for two-way and small two-way systems.
3.2.4. Reset Bass Roll-off
Firstly, the bass roll-off control is reset to 0 dB. Then the same method used to set the bass roll-off earlier is repeated, but without modifying upwards the final set­ting. The same objective function is used as presented in Section 3.2.1.
3.2.5. Set Treble Tilt
The least squares type objective function to be mini­mised is the same as shown in Equation 3. However,
and f2 now define the ‘loudspeaker pass band’
f
1
(Table 7). High and low user selected frequency val­ues are permitted. The default values are the –3 dB lower cut-off frequency of the loudspeaker and 15 kHz. This part of the algorithm requires five filtering steps for two way and large models (three for small two way models) and is skipped for three ways be­cause they do not have this control.
3.3. Reduction of Computational Load
The optimiser algorithm has been designed to reduce the computational load by exploiting the heuristics of experienced calibration engineers. The resulting num­ber of filtering steps has been dramatically reduced for the larger systems (Table 9) and even the relatively simple two-way systems show a substantial improve­ment when compared to the number of filtering steps
needed by direct search method as summarised in Table 5. There are two main reasons for the improve­ment; the constraint of not allowing the setting of all three of the driver level settings simultaneously and the breaking up of the optimisation into stages.
Table 9. Number of filter evaluations needed by the optimisation algorithm.
Type of loudspeaker Optimisation
stage Preset bass roll-
off Find midrange/
treble ratio Set bass tilt and
level Reset bass roll-off 6 6 6 3 Set treble tilt 5 - 4 2 Total 101 96 21 13 Total re. direct
search
Large 3-way 2-way
6 6 6 3
49 49 - -
35 35 5 5
0.2% 1.1% 26% 81%
Small
2-way
The run time on a PII 366 MHz computer for a three­way system is about 15 s (direct search 3 minutes). Large systems now take about the same time as a three-way system (predicted direct search time was 15 minutes). The processing time is directly proportional to the processor speed as a PIII 1200 MHz based computer takes about 4 s to perform the same optimi­sation. Further changes in the software have improved these run times by about 30%.
3.4. Algorithm Features
3.4.1. Frequency Range of Equalisation
The default frequency range of equalisation is from the low frequency
3 dB cut-off of the loudspeaker f
LF
to 15 kHz. If there is a strong cancellation in the fre­quency response around f
, or the high frequency
LF
level is decreased significantly due to an off-axis loca­tion or the loudspeaker is positioned behind a screen or due to very long measuring distance, manual read­justment of the design frequency range (indicated on the graphical output by the blue crosses, Figure 1) is needed. Naturally it is preferable to remove the causes of such problems, if possible.
3.4.2. Target for Optimisation
There are five target curves from which to select:
1. ‘Flat’ is the default setting for a studio monitor. The tolerance lines are set to +/–2.5 dB.
2. ‘Slope’ gives a user defined sloping target re­sponse. There are two user defined knee frequen-
AES 23RD CONFERENCE, May 23-25, 2003 5
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
x
y
x
cies and a dB drop/lift value. A positive slope can also be set but is generally not desirable. The tol­erance lines are set to ±2.5 dB. Some relevant slope settings include:
–2 dB slope from low frequency –3 dB cut-off
to 15 kHz for the large systems to reduce the aggressiveness of sound at very high output levels
–2 dB slope from 4 kHz to 15 kHz to reduce
long-term usage listening fatigue
–3 dB slope from 100 Hz to 200 Hz for Home
Theatre installations to increase low frequency impact without affecting midrange intelligibil­ity
3. ‘Another Measurement’ allows the user to opti­mise a loudspeaker’s frequency response magni­tude to that of another loudspeaker. For example, measure the left loudspeaker and optimise it, then measure the right loudspeaker and optimise this to the optimised left loudspeaker response. The result will be the closest match possible between the left and right loudspeaker pair ensuring a good stereo pair match and phantom imaging. Tolerance lines are set at ±2.5 dB.
4. ‘X Curve – Small Room’ will give the closest ap­proximation to the X Curve for a small room as defined in ANSI/SMPTE 202M-1998 [7]. This is a target response commonly used in the movie in­dustry. A small room is defined as having a vol­ume less than 5300 cubic feet or 150 cubic meters. The curve is flat up to 2 kHz and rolls off 1.5 dB per octave above 2 kHz. Tolerance lines are set to
1
±3 dB.
5. ‘X Curve – Large Room’ will give the closest ap­proximation to the X Curve for a large room as de­fined in ANSI/SMPTE 202M-1998 [7]. The curve is flat from 63 Hz to 2 kHz and then rolls off at 3 dB per octave above 2 kHz. Below 63 Hz there is also a 3 dB roll off, with 50 Hz being down by 1 dB and 40 Hz by 2 dB. Tolerance lines are set to ±3 dB with additional leeway at low and high fre­quencies.
1
An example of the room equaliser settings output for the large system optimised in Figure 1 is shown in Figure 2. The optimised result is displayed in green and dark grey boxes. The green boxes are room re­sponse controls that should be set on the loudspeaker. The light grey boxes are room response controls that
1
The room response controls do not directly support the X Curves but it may be possible to achieve X Curves in a room due to particular acoustic circum­stances. This is also a good way to check how close the response is to the selected X Curve.
are not present on the loudspeaker. Also displayed in this area is the error function, which is an RMS of the optimised frequency response pass band.
(f)
(f)
Figure 1. Typical graphical output of the optimiser software. Original response x(f), target response x and final response y(f). Also, –3 dB cut-off frequen­cies (triangles), optimisation range (crosses) and target tolerance (dotted).
Figure 2. Output section displays all settings and val­ues to be changed (green background) as well as the value of the error function and processing time.
4. PERFORMANCE OF THE OPTIMISATION ALGORITHM
To assess the performance of the combination of optimisation algorithm and equalisation in the loudspeakers, the analysis compares the unequalised in-situ frequency response to the response after equalisation.
The MLS measurement technique was used to meas­ure the in-situ acoustical frequency responses. The acquisition system parameters are shown in Table 10. The values in parentheses are the parameters used for acquiring the impulse response for models that have a bass extension below 30 Hz.
The room response control settings were calculated for each loudspeaker response according to the algo-
0
(f)
(f)
0
AES 23RD CONFERENCE, May 23-25, 2003 6
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
rithm discussed in Section 3 and statistical data for each measurement before and after equalisation was recorded. The statistical data is analysed to study how the objective quality of the system magnitude re­sponse has been improved by using the proposed algo­rithm for setting the room response controls.
Table 10. Acoustic measurement system parameters.
Parameter Equipment / Setting Measurement System WinMLS2000 [39] Microphone Neutrik 3382 [40] Sample rate, fs 48 kHz MLS sequence order 14 (16) Averages 1 Impulse response length 0.341 s (1.36 s) Time window Half-cosine FFT size 16384 (65536) Frequency resolution 2.93 Hz (0.733 Hz)
4.1. Statistical Data Analysis
A further statistical analysis was conducted on all of the loudspeakers in the study. The bandwidths of the frequency bands used are shown in Table 11. The bandwidths ‘LF’, ‘MF’ and ‘HF’ are later referred to collectively as the ‘subbands’ and correspond roughly to the bandwidths for each driver in the three-way sys­tems.
Table 11. Frequency band definitions the statistical data analysis; f
is the frequency of the lower –3 dB
LF
limit of the frequency range.
Bandwidth Name Low High Broadband fLF 15 kHz LF fLF 400 Hz MF 400 Hz 3.5 kHz HF 3.5 kHz 15 kHz
Frequency Range Limit
For each loudspeaker, the broadband (Table 11) mag­nitude response data median value is standardised to 0 dB.
The statistical descriptors recorded before and after equalisation for each loudspeaker and in each fre­quency band defined in Table 11 are the minimum, maximum and range of the magnitude dB values. Also for the magnitude pressure values in each bandwidth (Table 11), the median, 5% & 95% percentiles and quartiles are recorded. In addition, the root-mean­square (RMS) deviation of the pressure from the me­dian in each bandwidth is calculated: the value is ex­pressed in dB.
These statistical descriptors are compared for each subband to study the in-band flatness improvement due to equalisation. The median values for each sub­band are compared to study the broadband tonal bal­ance improvement. This is indicated by a reduction of the median value differences.
4.2. Example of Statistical Data Analysis
Figure 7 in Appendix C shows a case example where room response control settings are calculated accord­ing to the optimisation algorithm. The equalisation target is a flat magnitude response (straight line at 0 dB level). The in-situ frequency response of the loudspeaker was recorded before equalisation, i.e. when all the room response controls were set to their default position, which has no effect to the response. The appropriate room response control settings were calculated using the optimisation algorithm, applied to the loudspeaker and the corrected in-situ frequency response plotted. The loudspeaker’s passband (trian­gles) and the frequency band of equalisation (crosses) are indicated on the graphical output. The proposed room response control settings are shown and the ef­fect of these settings is visualised in the response plot. The treble tilt, midrange level and bass tilt controls have been set. The equalisation corrects the low fre­quency alignment and improves the linearity across the whole passband.
Figure 8 in Appendix C shows a statistical analysis of the same loudspeaker presented in graphical form. The upper three plots were calculated before equalisa­tion and the lower three plots after equalisation. The plots display the values of percentiles in the magni­tude value distribution (box plot), the histogram of values and the fit of the magnitude values to normal distribution before and after equalisation. These plots clearly show that the distribution in magnitude data has been reduced. This is illustrated by the reduced range in the box plot and the value histogram, as well as a better fit to a normal distribution in the normal probability plot.
4.3. Results
A total of 63 loudspeakers were measured before and after equalisation. Of these, 12 were small two-way systems, 22 were two-way systems, 30 were three­way systems and three were large systems.
Depending on the product type, not all of the room response controls are available (Tables 1–4). Table 12 shows the number times the controls were used when available on the loudspeaker. The midrange level con­trol is used most frequently and the bass roll-off the least.
AES 23RD CONFERENCE, May 23-25, 2003 7
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
Table 12. Use of available room response controls.
Room Response Control Usage vs.
availability Midrange Level 27/33 82% Treble Level 22/33 67% Bass Tilt 37/67 55% Treble Tilt 11/37 30% Bass Level 8/33 24% Bass Roll-off 10/67 15%
% Usage
Appendix D gives the quartile difference and RMS deviations for each loudspeaker in the study, for the broadband and each subband. The quartile difference or RMS deviation after equalisation is subtracted from the same before equalisation. An improvement will produce a negative value of difference. Both the quar­tile difference and RMS deviation values represent two slightly different ways to look at the deviation from the median value of the distribution. The quartile limits are more robust to outlier values while the RMS values include these effects.
For small two-way systems (Figure 9-10), the main improvement is seen at low frequencies in four out of 12 cases. In only one case is there is a significant im­provement in the broadband flatness.
The broadband flatness of the two-way systems is im­proved in four (quartile data, Figure 11) or eight
Level, dB
5
4
3
2
1
0
-1
-2
-3
Subband M edia n Levels - All Models
LF MF HF LF MF HF
Original Equalised
(RMS data, Figure 12) cases out of 22. An equal number of reductions and increases of low frequency quartile values can be seen. MF subband quartile val­ues improve in one case and deteriorate in 5 cases and there are no changes in the HF subband. The flatness in the broadband and LF subband of the RMS devia­tion data has improved indicating a reduction of out­lier values. The MF and HF subbands show no changes or a slight increase of the RMS deviation.
Three-way systems show a clear reduction in most cases of both the quartile difference (Figure 13) and RMS deviation (Figure 14) for the broadband and LF subband. Slight, and equal numbers of, increases and reductions are seen for MF and HF subbands.
A similar trend is seen for the three large systems in­cluded in this study (Figure 15-16). Mainly the LF subband flatness is improved and this is also reflected in the broadband improvement.
Some of the responses appeared to become worse in the quartile difference and RMS deviation in the sub­band analysis. This was not reflected in the broadband metrics, which indicates that the arbitrary subband fre­quency division introduced some of the error. Also, the cases where this happened suffered from severe anomalies within the pre-equalisation response due to extremely bad room acoustic conditions. The equalisa­tion was not designed to compensate for this.
Level, dB
5
4
3
2
1
0
-1
-2
-3
Subband Media n Levels - S ma ll models
LF MF HF LF MF HF
Original Equalised
Level, dB
5
4
3
2
1
0
-1
-2
-3
Subband Me dian Levels - 2-w ay models
LF MF HF LF MF HF
Original Equalised
Figure 3. Mean and standard deviation of subband median levels before and after equalisation.
The subband median levels (Figure 3) illustrate the broadband frequency balance between the subbands. Loudspeaker loading from nearby boundaries is re-
AES 23RD CONFERENCE, May 23-25, 2003 8
Level, dB
5
4
3
2
1
0
-1
-2
-3
Subband Median Le vels - 3-w ay m odels
LF MF HF LF MF HF
Original Equalised
flected in the LF subband median level before equali­sation, especially in the often flush mounted three­way models. The median level in the LF subband is
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
reduced after equalisation, which indicates that equali­sation compensates well for the loudspeaker loading. A better match across subbands of the average sub­band median level demonstrates that equalisation has improved the broadband flatness. The largest im­provement is seen in the three-way loudspeakers. In the two-way systems the equalisation has improved broadband flatness only marginally as the subband median levels do not show major signs of change. The broadband flatness improvement is mainly the result of better alignment of the LF subband with the MF
Level, dB
1
0
Broadband LF MF HF
-1
-2
-3
Level, dB
1
0
Broadband LF MF HF
-1
-2
-3
25% to 75% Percentile Difference
Change due to Equa lisation - All models
25% to 75% Percentile Difference
Change due to Equalisa tion - Sma ll mode ls
and HF subbands. This indicates that the equalisation has not only reduced the variation inside individual subbands but also improved the broadband flatness of the acoustical response, translating to a reduced colouration of the audio at the listening position.
For all loudspeakers pooled together (Figure 3), the equalisation reduces the variance in the median level for the LF subband. A similar outcome is noted sepa­rately for each loudspeaker type. However, only in the three-way systems is an improvement seen also in the MF and HF subband variance.
Level, dB
1
0
-1
-2
-3
-4
-5
Level, dB
1
0
-1
-2
-3
-4
-5
Change due to Equalis ation - All mode ls
Broadband LF MF HF
Change due to Equalisa tion - Small m odels
Broadband LF MF HF
RMS Devi at ion
RMS Devi atio n
Level, dB
1
0
Broadband LF MF HF
-1
-2
-3
Level, dB
1
0
Broadband LF MF HF
-1
-2
-3
25% to 75% Percentile Difference
Change due to Equalis ation - 2-w ay models
25% to 75% Percentile Difference
Change due to Equalis ation - 3-w ay models
Level, dB
1
0
-1
-2
-3
-4
-5
Level, dB
1
0
-1
-2
-3
-4
-5
Change due to Equalisa tion - 2 -w ay models
Broadband LF MF HF
Change due to Equalisation - 3-wa y m odels
Broadband LF MF HF
RMS Devi at ion
RMS Devi atio n
Figure 4. Change in sound level deviation due to equalisation. For each subband, quartile difference and RMS de­viation from the median. The error bar indicates the standard deviation.
AES 23RD CONFERENCE, May 23-25, 2003 9
GOLDBERG AND MÄKIVIRTA AUTOMATED IN-SITU EQUALISATION
In Figure 4 the results are pooled for all products and for each product type, excluding the main monitors where there were only three cases. The change in quartile difference and RMS deviation for the broad­band and the subbands is illustrated. For all models, the broadband flatness is improved by 0.4 dB and the mean reduction in the LF subband RMS deviation is
1.4 dB. The RMS deviation for all models pooled to­gether has been reduced by equalisation and the larg­est reduction is seen for the three-way systems. To some extent this is similar for the quartile difference but the small two-way and two-way systems do not see such large improvements due to equalisation. This indicates that the improvement is mainly a reduction of extreme magnitude values (heights of peaks and notches) in the low frequency response.
5. DISCUSSION
The objective of this paper is to present an automated system for choosing appropriate room response con­trol settings once an in-situ frequency response meas­urement has been made and to show that it is effec­tive.
The room response controls in active loudspeakers implement discrete filter parameter values rather than a continuous parameter value range. The number of possible filter parameter value combinations can be quite large and so even an experienced operator can find it difficult to choose the optimal settings.
The task of the automated optimiser is to find the op­timal combination from the possible combinations of discrete filter parameter values. The cost of perform­ing a brute force search of all value combinations and then choosing the best among them is prohibitive in terms of computer processing time. The approach cho­sen is to exploit the heuristics of experienced calibra­tion engineers and to reduce the number of alterna­tives by dividing the task into subsections that can re­liably be solved independently. A significant part of the heuristics is the order in which these choices should be taken. A considerable improvement in the speed of optimisation was achieved relative to a full exhaustive search.
The optimisation algorithm is relatively robust to a wide variety of situations, such as varying room acoustics, different sized loudspeakers with differing anechoic responses and varying in-situ responses [41]. The optimisation is efficient and so the software is fast enough to be used routinely at in-situ loudspeaker calibrations.
A case study demonstrates the statistical changes due to the optimisation algorithm’s recommended room response control settings. The settings achieve im­proved equalisation in the form of a smaller RMS de-
viation from the target response. The improvement is not limited by the optimisation method but by the room response controls which are not intended to cor­rect for narrow-band deviations in the frequency re­sponse. Examples of these are response variations re­sulting from acoustic issues such as cancellations as­sociated comb filtering due to reflections. These should be solved acoustically rather than electroni­cally.
The statistical analysis of 63 loudspeakers shows that the automated equalisation is able to systematically reduce the variability in the equalised responses and to improve the frequency response flatness relative to the target response. It achieves this by improving the broadband frequency balance relative to the target re­sponse and by reducing the variability in the response, particularly in the low frequencies. Across all loud­speaker groups the main improvement is in the reduc­tion of extreme (outlier) values in the low frequency band of the response.
It is interesting to note that the most commonly used room response control was the midrange level, fol­lowed closely by the treble level and bass tilt control. This is explained by the fact that the algorithm in most stages minimises the RMS deviation, and in so doing affects most efficiently the extreme deviations from the median level.
For all models pooled together, the broadband flatness was improved by equalisation. This improvement is mainly due to a reduction of the extreme magnitude values (heights of peaks and notches) in the low fre­quency response (LF subband).
The lack of improvement in the quartile values and RMS deviation in the midrange and high frequencies (MF and HF subbands) is because the room related response variation becomes narrow band. Some im­provement in the equalisation could be obtained with room response controls offering a tilting or shaping of the response within the mid-to-high frequency range.
The largest variability of the improvement in the low frequency range can be explained by the acoustics found in listening rooms [42]. At low frequencies the radiation from the loudspeaker can be considered om­nidirectional and the sound field in the room is usually not very diffuse. This results in strong room effects and hence large variations in the magnitude response at these frequencies.
The largest improvement is seen for the three-way systems and can be explained by two main factors. Firstly, the rooms in which this type of loudspeaker is typically installed are of a higher quality acoustical design, so the sound field in them is well controlled. Conversely, smaller loudspeakers are often installed in rooms with little or no acoustical design, making cor-
AES 23RD CONFERENCE, May 23-25, 2003 10
Loading...
+ 22 hidden pages