Predict airfoil self-noise using machine learning

In this example, we use machine learning to predict airfoil self-noise using data from a series of aerodynamic and acoustic tests.

The noise generated by an aircraft is a significant environmental concern for the aerospace industry. A vital component of the total airframe noise is the airfoil self-noise resulting from the interaction between an airfoil blade and the turbulence generated in its boundary layer and near wake. Performance optimization can be applied to understand the behavior of airfoils and make designs with reduced noise.

Application type.
Data set.
Neural network.
Training strategy.
Model selection.
Testing analysis.
Model deployment.
Tutorial video.

NASA processed the self-noise data set used in this example. It was obtained from a series of aerodynamic and acoustic tests of two- and three-dimensional airfoil blade sections conducted in an anechoic wind tunnel.

The NASA data set comprises NACA 0012 airfoils of different sizes at various wind tunnel speeds and angles of attack. The span of the airfoil and the observer position were the same in all experiments.

This example is solved with Neural Designer. To follow it step by step, you can use the free trial.

1. Application type

The variable to be predicted is continuous (sound pressure level). Therefore, this is an approximation project.

The primary objective is to model the sound pressure level as a function of the airfoil’s features and airspeed.

2. Data set

The first step is to prepare the data set, which is the source of information for the approximation problem. It consists of:

Data source.
Variables.
Instances.

The file airfoil_self_noise.csv contains the data for this example. Here, the number of variables (columns) is 6, and the number of instances (rows) is 1503.

The problem has the following six variables:

frequency, in Hertz, used as input.
angle_of_attack, in degrees, used as input.
chord_length, in meters, used as input.
free_stream_velocity, in meters per second, used as input.
suction_side_displacement_thickness, in meters, used as input.
scaled_sound_pressure_level, in decibels, used as the target.

The NASA data set contains 1,503 instances. They are divided randomly into training, selection, and testing subsets, containing 60%, 20%, and 20% of the instances, respectively. More specifically, 753 samples are used here for training, 375 for selection, and 375 for testing.
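
The random division into subsets can be sketched in a few lines of Python. This is only an illustration, not the procedure Neural Designer runs internally: the function name split_indices and the seed are assumptions, and the subset sizes are passed as explicit counts (753 and 375, as quoted above, with the remaining instances going to testing).

```python
import random


def split_indices(n_instances, n_training, n_selection, seed=0):
    """Randomly assign instance indices to training, selection and testing."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)  # seeded so the split is reproducible
    training = indices[:n_training]
    selection = indices[n_training:n_training + n_selection]
    testing = indices[n_training + n_selection:]  # whatever is left
    return training, selection, testing


training, selection, testing = split_indices(1503, 753, 375)
print(len(training), len(selection), len(testing))  # 753 375 375
```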

Once all the data set information has been set, we will perform some analytics to check the quality of the data.

For instance, we can calculate the data distribution. The following figure depicts the histogram for the target variable.

As we can see, the scaled sound pressure level has an approximately normal distribution.

The following figure depicts the correlations between the inputs and the target. This might help us understand the influence of the different inputs on the sound level. As the chart shows, the frequency has the most significant impact on the noise.
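
As a rough illustration of how an input-target correlation can be computed, the sketch below implements the linear (Pearson) correlation coefficient. Whether the chart uses exactly this type of correlation is an assumption, and the sample values are made up for the example; they are not taken from the data set.

```python
import math


def pearson_correlation(x, y):
    """Linear correlation coefficient between two equally long sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    variance_x = sum((a - mean_x) ** 2 for a in x)
    variance_y = sum((b - mean_y) ** 2 for b in y)
    return covariance / math.sqrt(variance_x * variance_y)


# Made-up values, for illustration only (not rows of airfoil_self_noise.csv).
frequency = [500.0, 1000.0, 2000.0, 4000.0, 8000.0]
sound_level = [127.0, 126.5, 124.0, 121.0, 112.0]

correlation = pearson_correlation(frequency, sound_level)
print(correlation)  # negative: in this toy sample the level drops as frequency rises
```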

We can also plot a scatter chart with the scaled sound pressure level versus the frequency.

In general, the higher the frequency, the lower the scaled sound pressure level. However, the scaled sound pressure level depends on all the inputs simultaneously.

3. Neural network

The neural network will output the scaled sound pressure level as a function of the frequency, angle of attack, chord length, free stream velocity, and suction side displacement thickness.

For this approximation example, the neural network comprises:

Scaling layer.
Perceptron layers.
Unscaling layer.

The scaling layer transforms the original inputs to normalized values. Here, the mean and standard deviation scaling method is set so that the input values have a mean of 0 and a standard deviation of 1.

Here, two perceptron layers are added to the neural network. This number of layers is enough for most applications. The first layer has five inputs and three neurons. The second layer has three inputs and one neuron.

The unscaling layer transforms the normalized values from the neural network into the original outputs. Here, the mean and standard deviation unscaling method will also be used.
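
The scaling and unscaling operations described above can be sketched as follows. This is a minimal illustration and not the software's own code: the function names are assumptions, the standard deviation is computed with the population formula (dividing by n), and the sample values are illustrative.

```python
import math


def mean_and_std(values):
    """Mean and (population) standard deviation of a sequence."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    return mean, math.sqrt(variance)


def scale(values, mean, std):
    """Mean and standard deviation scaling: result has mean 0 and std 1."""
    return [(v - mean) / std for v in values]


def unscale(values, mean, std):
    """Inverse transformation, back to the original units."""
    return [v * std + mean for v in values]


# Illustrative free-stream velocities in meters per second.
velocities = [31.7, 39.6, 55.5, 71.3]
mean, std = mean_and_std(velocities)
scaled = scale(velocities, mean, std)
restored = unscale(scaled, mean, std)
```

In the network, the scaling statistics come from the inputs and the unscaling statistics from the target variable, so each variable is transformed with its own mean and standard deviation.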

The following figure shows the resulting network architecture.

This neural network represents a function containing 22 adjustable parameters.
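
The parameter count follows from the layer sizes: each neuron has one weight per input plus one bias, so the first layer contributes (5 + 1) x 3 = 18 parameters and the second (3 + 1) x 1 = 4. The sketch below checks this arithmetic and shows a forward pass through such a network. The hyperbolic tangent activation in the hidden layer and the linear output are assumptions, since the text does not state the activation functions.

```python
import math


def count_parameters(layer_sizes):
    """layer_sizes = [number of inputs, neurons in layer 1, neurons in layer 2, ...]."""
    return sum((n_in + 1) * n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))


def forward(inputs, layers):
    """Propagate scaled inputs through a list of (weights, biases) layers.

    Hidden layers use tanh and the last layer is linear (assumed here).
    """
    activations = list(inputs)
    for index, (weights, biases) in enumerate(layers):
        sums = [sum(w * a for w, a in zip(row, activations)) + b
                for row, b in zip(weights, biases)]
        is_last = index == len(layers) - 1
        activations = sums if is_last else [math.tanh(s) for s in sums]
    return activations


print(count_parameters([5, 3, 1]))  # 22
```

The same function gives 92 parameters for a network with 13 neurons in the first layer.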

4. Training strategy

The next step is to select an appropriate training strategy, which defines what the neural network will learn. A general training strategy is composed of two concepts:

A loss index.
An optimization algorithm.

The loss index chosen is the normalized squared error with L2 regularization. This loss index is the default in approximation applications.
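
To make the loss index concrete, the sketch below computes a normalized squared error plus an L2 term. It assumes that the normalization coefficient is the sum of squared deviations of the targets from their mean, and that the regularization term is the sum of squared parameters multiplied by a weight; the weight value of 0.01 is hypothetical.

```python
def normalized_squared_error(outputs, targets):
    """Squared error divided by the squared deviation of the targets from their mean."""
    mean_target = sum(targets) / len(targets)
    squared_error = sum((o - t) ** 2 for o, t in zip(outputs, targets))
    normalization = sum((t - mean_target) ** 2 for t in targets)
    return squared_error / normalization


def loss(outputs, targets, parameters, regularization_weight=0.01):
    """Normalized squared error plus an L2 penalty (weight is a hypothetical value)."""
    l2_term = sum(p * p for p in parameters)
    return normalized_squared_error(outputs, targets) + regularization_weight * l2_term
```

Under this definition, a model that always predicts the mean of the targets has an error of 1, and a perfect model has an error of 0, which gives a scale for reading values such as 0.112.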

The optimization algorithm chosen is the quasi-Newton method. This optimization algorithm is the default for medium-sized applications like this one.
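
For readers unfamiliar with quasi-Newton methods, the following is a generic BFGS sketch in plain Python. It is not the implementation used by Neural Designer: the finite-difference gradient, the backtracking line search, and all function names are assumptions made to keep the example short. The idea is to build an approximation of the inverse Hessian from successive gradient differences instead of computing second derivatives.

```python
def numerical_gradient(f, x, h=1e-6):
    """Central-difference gradient of f at x."""
    gradient = []
    for i in range(len(x)):
        forward = x[:i] + [x[i] + h] + x[i + 1:]
        backward = x[:i] + [x[i] - h] + x[i + 1:]
        gradient.append((f(forward) - f(backward)) / (2 * h))
    return gradient


def bfgs(f, x0, max_iterations=100, tolerance=1e-6):
    """Minimize f starting from x0 with the BFGS quasi-Newton update."""
    n = len(x0)
    x = list(x0)
    # Inverse Hessian approximation, initialized to the identity matrix.
    H = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    g = numerical_gradient(f, x)
    for _ in range(max_iterations):
        if max(abs(gi) for gi in g) < tolerance:
            break
        # Search direction d = -H g.
        d = [-sum(H[i][j] * g[j] for j in range(n)) for i in range(n)]
        # Backtracking line search with the Armijo condition.
        step, fx = 1.0, f(x)
        slope = sum(gi * di for gi, di in zip(g, d))
        while (f([xi + step * di for xi, di in zip(x, d)])
               > fx + 1e-4 * step * slope and step > 1e-12):
            step *= 0.5
        x_new = [xi + step * di for xi, di in zip(x, d)]
        g_new = numerical_gradient(f, x_new)
        s = [a - b for a, b in zip(x_new, x)]
        y = [a - b for a, b in zip(g_new, g)]
        sy = sum(si * yi for si, yi in zip(s, y))
        if sy > 1e-12:  # update only when the curvature condition holds
            Hy = [sum(H[i][j] * y[j] for j in range(n)) for i in range(n)]
            yHy = sum(yi * hyi for yi, hyi in zip(y, Hy))
            for i in range(n):
                for j in range(n):
                    H[i][j] += ((sy + yHy) * s[i] * s[j] / (sy * sy)
                                - (Hy[i] * s[j] + s[i] * Hy[j]) / sy)
        x, g = x_new, g_new
    return x


# Toy quadratic with its minimum at (1, -2); a stand-in for the loss index.
minimum = bfgs(lambda p: (p[0] - 1) ** 2 + 10 * (p[1] + 2) ** 2, [0.0, 0.0])
```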

Once we have established the strategy, we can train the neural network. The following chart shows how the training (blue) and selection (orange) errors decrease over the epochs of the training process.

The most crucial training result is the final selection error. Indeed, this is a measure of the neural network’s generalization capabilities. Here, the final selection error is 0.112 NSE.

5. Model selection

The objective of model selection is to find the network architecture with the best generalization properties. We aim to reduce the final selection error obtained previously (0.112 NSE).

The best selection error is achieved using a model whose complexity is most appropriate for producing an adequate data fit. Order selection algorithms are responsible for finding the optimal number of perceptrons in the neural network.

The following chart shows the results of the incremental order algorithm. The blue line plots the final training error as a function of the number of neurons, and the orange line plots the final selection error as a function of the number of neurons.

As we can see, the final training error continuously decreases with the number of neurons. However, the final selection error takes a minimum value at some point. Here, the optimal number of neurons is 13, corresponding to a selection error of 0.100 NSE.
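
The logic of an incremental order search can be sketched as a simple loop: train one network per candidate size and keep the size with the lowest selection error. The function train_and_evaluate is a hypothetical callback standing in for a full training run, and the toy error curves below are invented to mimic the typical shape; they are not the results of this example.

```python
def incremental_order_selection(train_and_evaluate, min_neurons=1, max_neurons=20):
    """Try increasing numbers of neurons and keep the best selection error."""
    best_neurons, best_error = None, float("inf")
    history = []
    for neurons in range(min_neurons, max_neurons + 1):
        training_error, selection_error = train_and_evaluate(neurons)
        history.append((neurons, training_error, selection_error))
        if selection_error < best_error:
            best_neurons, best_error = neurons, selection_error
    return best_neurons, best_error, history


def toy_errors(neurons):
    """Invented curves: training error keeps falling, selection error has a minimum."""
    training_error = 1.0 / (1 + neurons)
    selection_error = training_error + 0.01 * neurons  # overfitting penalty
    return training_error, selection_error


best_neurons, best_error, history = incremental_order_selection(toy_errors)
print(best_neurons)  # 9 for these invented curves
```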

The following figure shows the optimal network architecture for this application.

6. Testing analysis

The objective of the testing analysis is to validate the generalization performance of the trained neural network. It compares the values predicted by the neural network with the observed values.

A standard testing technique in approximation problems is to perform a linear regression analysis between the predicted and the real values, using an independent testing set. The following figure illustrates a graphical output provided by this testing analysis.

From the above chart, we can see that the neural network accurately predicts the entire range of sound level data. The coefficient of determination is R² = 0.952, which is close to 1.
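
A linear regression analysis of this kind can be sketched as follows. The function fits outputs = intercept + slope x targets by least squares and returns the coefficient of determination, which for a simple linear fit equals the squared correlation. The function name is an assumption, and the sample values are illustrative rather than actual testing results.

```python
def linear_regression(targets, outputs):
    """Least-squares line of outputs versus targets, with its R squared."""
    n = len(targets)
    mean_t = sum(targets) / n
    mean_o = sum(outputs) / n
    s_tt = sum((t - mean_t) ** 2 for t in targets)
    s_oo = sum((o - mean_o) ** 2 for o in outputs)
    s_to = sum((t - mean_t) * (o - mean_o) for t, o in zip(targets, outputs))
    slope = s_to / s_tt
    intercept = mean_o - slope * mean_t
    r_squared = s_to ** 2 / (s_tt * s_oo)
    return slope, intercept, r_squared


# Illustrative sound levels in decibels (not actual testing results).
observed = [110.0, 118.0, 124.0, 130.0, 136.0]
predicted = [111.5, 117.0, 125.0, 129.0, 135.5]
slope, intercept, r_squared = linear_regression(observed, predicted)
```

For a perfect fit, the slope would be 1, the intercept 0, and R² equal to 1.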

7. Model deployment

The model is ready to estimate the self-noise of new airfoils with satisfactory quality over the same data range.

We can now use Response Optimization. The objective of the response optimization algorithm is to utilize the mathematical model to identify optimal operating conditions. Indeed, the predictive model enables us to simulate various operating scenarios and adjust the control variables to enhance efficiency.

An example is maximizing the free stream velocity while keeping the scaled sound pressure level at or below a desired value.

The following table summarizes the conditions for this problem.

Variable name	Condition
Frequency	None
Angle of attack	None
Chord length	None
Free stream velocity	Maximize
Suction side displacement thickness	None
Scaled sound pressure level	Less than or equal to 115

The following list presents the optimal values for the specified conditions.

frequency: 7087.22 Hertz.
angle_of_attack: 9.9381 degrees.
chord_length: 0.206181 meters.
free_stream_velocity: 71.1912 meters per second.
suction_side_displacement_thickness: 0.015893 meters.
scaled_sound_pressure_level: 108.015 decibels.
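
One simple way to approach such a constrained problem is a random search: sample candidate inputs, discard those that violate the constraint, and keep the feasible candidate with the highest velocity. The sketch below illustrates this idea only. It is not the algorithm behind Response Optimization, the input bounds are assumed for illustration, and toy_model is an invented stand-in for the trained neural network.

```python
import random


def optimize_response(model, bounds, objective_index, is_feasible,
                      n_samples=10000, seed=0):
    """Random search: maximize one input subject to a constraint on the output."""
    rng = random.Random(seed)
    best_inputs, best_output = None, None
    for _ in range(n_samples):
        inputs = [rng.uniform(low, high) for low, high in bounds]
        output = model(inputs)
        if not is_feasible(output):
            continue  # constraint violated, discard this candidate
        if best_inputs is None or inputs[objective_index] > best_inputs[objective_index]:
            best_inputs, best_output = inputs, output
    return best_inputs, best_output


def toy_model(inputs):
    """Invented linear stand-in for the trained network (not the real model)."""
    frequency, angle, chord, velocity, thickness = inputs
    return 130.0 - 0.001 * frequency + 0.05 * velocity - 0.2 * angle


# Assumed bounds, in the order: frequency, angle_of_attack, chord_length,
# free_stream_velocity, suction_side_displacement_thickness.
bounds = [(200.0, 20000.0), (0.0, 22.0), (0.025, 0.305), (31.7, 71.3), (0.0004, 0.058)]

best_inputs, best_output = optimize_response(
    toy_model, bounds, objective_index=3, is_feasible=lambda level: level <= 115.0)
```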

It is also useful to observe how the outputs vary as a single input changes while all the others are fixed. Directional outputs plot the neural network outputs through some reference points.

The following list shows the reference points for the plots.

angle_of_attack: 6.782 degrees.
chord_length: 0.136 meters.
free_stream_velocity: 50.860 meters per second.
suction_side_displacement_thickness: 0.011 meters.
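
A directional output can be computed by sweeping one input over a range of values while holding the others at the reference point. In the sketch below, toy_model is an invented stand-in for the trained network, the function name directional_output is an assumption, and the frequency values are illustrative; the remaining entries of the reference point are the ones listed above.

```python
def directional_output(model, reference_point, input_index, values):
    """Model outputs when one input sweeps over values and the rest stay fixed."""
    outputs = []
    for value in values:
        point = list(reference_point)
        point[input_index] = value  # vary only the chosen input
        outputs.append(model(point))
    return outputs


def toy_model(inputs):
    """Invented linear stand-in for the trained network (not the real model)."""
    frequency, angle, chord, velocity, thickness = inputs
    return 130.0 - 0.001 * frequency + 0.05 * velocity - 0.2 * angle


# Order: frequency, angle_of_attack, chord_length, free_stream_velocity,
# suction_side_displacement_thickness. The frequency entry is a placeholder
# because it is the input being varied.
reference_point = [0.0, 6.782, 0.136, 50.860, 0.011]
frequencies = [500.0, 1000.0, 2000.0, 5000.0, 10000.0]
levels = directional_output(toy_model, reference_point, 0, frequencies)
```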

We can plot a directional output of the neural network to see how the sound level varies with a given input while all other inputs are fixed. The next plot shows the sound level as a function of the frequency through the reference point listed above.

The file airfoil_self_noise.py contains the Python code of the model that computes the scaled sound pressure level.

8. Tutorial video

You can watch the step-by-step tutorial video below to help you complete this machine learning example for free using the easy-to-use machine learning software Neural Designer.

References

UCI Machine Learning Repository. Airfoil Self-Noise Data Set.
T.F. Brooks, D.S. Pope, and A.M. Marcolini. Airfoil self-noise and prediction. Technical report, NASA RP-1218, July 1989.