Human activity recognition machine learning with smartphone data

Introduction

Human activity recognition (HAR) using machine learning can support healthcare applications by analyzing smartphone movement data to detect daily activities.

In this study, 30 volunteers (aged 19–48) performed six activities while a waist-worn smartphone recorded accelerometer and gyroscope data at 50 Hz, with video labeling for accuracy.

Healthcare professionals can test this approach with Neural Designer’s trial version.

The following index outlines the steps for performing the analysis.

1. Model type

Problem type: Multiclass classification (walking, walking_upstairs, walking_downstairs, sitting, standing, lying)
Goal: Model the probability of each activity based on input variables to support healthcare applications such as patient monitoring, rehabilitation, and lifestyle management.

2. Data set

Data source

The file activity_recognition.csv contains 10299 samples, each of them with 561 inputs and one categorical target.

Variables

The following list summarizes the variables’ information:

Sensor signals – Time and frequency domain

body_acceleration (x, y, z) – Linear acceleration of the body along 3 axes.
gravity_acceleration (x, y, z) – Gravity component of acceleration along 3 axes.
body_acceleration_jerk (x, y, z) – Rate of change of body acceleration.
body_angular_speed (x, y, z) – Angular velocity of the body from gyroscope measurements.
body_angular_acceleration (x, y, z) – Rate of change of angular velocity.
body_acceleration_magnitude – Magnitude of body acceleration vector.
gravity_acceleration_magnitude – Magnitude of gravity acceleration vector.
body_acceleration_jerk_magnitude – Magnitude of body acceleration jerk vector.
body_angular_speed_magnitude – Magnitude of angular speed vector.
body_angular_acceleration_magnitude – Magnitude of angular acceleration vector.

Target variable

activity – The human activity performed during data collection: Walking, Walking Upstairs, Walking Downstairs, Sitting, Standing, and Lying.

Variables distributions

Variable distributions can be calculated; the figure shows the number of samples for each activity in the dataset.

As we can see, the number of instances belonging to each category is similar. Therefore, this data set exhibits good balance.

Input-target correlations

The input-target correlations indicate which sensor signals most influence the classification of activities and, therefore, are more relevant to our analysis.

The most correlated variables with activity classification are tBodyAccJerk-mean-Z, fBodyAcc-kurtosis-X, and fBodyAcc-max-X.

3. Neural network

A neural network is an artificial intelligence model inspired by how the human brain processes information.

It is organized in layers: the input layer receives the variables, the hidden layers combine them to detect relevant patterns, and the output layer provides the probability of belonging to a given class.

Trained with historical data, the network learns to recognize patterns and distinguish between categories, offering objective support for decision-making.

The network processes the smartphone sensor signals, combines them in hidden layers, and outputs the probability for each activity class.

The connections illustrate how the different sensor variables contribute to the classification.

4. Training strategy

Training a neural network uses a loss function to measure errors and an optimization algorithm to adjust the model, ensuring it learns from data while avoiding overfitting for good performance on new cases.

The model was trained for accuracy and stability, with training and selection errors decreasing steadily (0.008 and 0.048 NSE), indicating effective learning and generalization to new activity instances.

5. Testing analysis

Once the model is trained, we perform a testing analysis to validate its prediction capacity.

In particular, we use a subset of data that has not been used before, the testing instances.

Confusion matrix

The confusion matrix shows the model’s performance by comparing predicted and actual activities. It includes:

True positives: activities correctly identified
False positives: activities incorrectly identified as another class
False negatives: activities that were missed or misclassified
True negatives: activities correctly recognized as not belonging to a given class

	Predicted STANDING	Predicted SITTING	Predicted LAYING	Predicted WALKING	Predicted WALKING DOWNSTAIRS	Predicted WALKING UPSTAIRS
Real STANDING	376	18	0	0	0	0
Real SITTING	18	330	1	0	0	0
Real LAYING	0	2	402	0	0	0
Real WALKING	1	0	0	307	0	0
Real WALKING DOWNSTAIRS	0	0	0	0	279	1
Real WALKING UPSTAIRS	1	0	0	0	5	318

In this example, 97.72% of cases were correctly classified and 2.28% were misclassified.

6. Model deployment

Once validated, the neural network can be saved for deployment, allowing predictions of human activities from new accelerometer and gyroscope data.

In deployment mode, researchers can use it as a real-time or offline recognition tool, with Neural Designer automatically exporting the model for easy integration into laboratory or application workflows.

Conclusions

The human activity recognition model achieved excellent performance, correctly classifying 2012 of 2059 instances (2.3% misclassification).

Key features—such as Z-axis body acceleration jerk, frequency-domain acceleration, and maximum X-axis acceleration—align with biomechanical principles.

With strong generalization, this neural network can support healthcare professionals in monitoring activity, developing health apps, and providing personalized lifestyle and rehabilitation recommendations.

References

UCI Machine Learning Repository Human Activity Recognition Using Smartphones Data Set.
Davide Anguita, Alessandro Ghio, Luca Oneto, Xavier Parra, and Jorge L. Reyes-Ortiz. A Public Domain Dataset for Human Activity Recognition Using Smartphones. 21st European Symposium on Artificial Neural Networks, Computational Intelligence, and Machine Learning, ESANN 2013. Bruges, Belgium 24-26 April 2013.