Группа авторов

Machine Learning for Healthcare Applications


Скачать книгу

age 21, height 176 cm, weight 63 kg with a less physical activity needs to consume 1,950 calories per day to maintain weight’. But the limitation of this approach is not considered some of the important parameters like (i) Different Health Parameters (Sleep Status, Calorie Status, etc.) have different consultants. (ii) They may not be very accurate in predicting them manually without any calculations. (iii) Consulting experts might be costly for a low-middle and middle-class family.

      Thus, it demands the need for designing a model that can predict their health status from daily life activities.

      A work by Tayeb et al. [12], proposed a method based on the popular machine learning algorithm KNN to predict heart disease and chronic kidney failure. Researchers [6] proposed an automated system for the prediction of stroke based on Electronic Medical Claims (EMCs), and they compared the Deep Neural Network (DNN) with the gradient boosting decision tree (GBDT), logistic regression (LR) and support vector machine (SVM) approach. Researchers [8] proposed the cloud-based smart clothing system for sustainable monitoring of human health. They also discussed the technologies and the implementation of methodologies. Reseachers Schmidt, Tittlbach, Bös & Woll [11], analyzed varieties of physical activity, fitness and health, they considered 18 years duration for study and identified interesting insights. In a recent work on Analyzing University Fitness Center data [14], the user’s fitness activity data is collected to predict the crowd at the fitness center. But the fitness activity data can be used to predict more than that.

      A lot of research was done on measuring health parameters numerically. Also, there are many works on calculating some health parameters from other parameters. A work by Harris-Benedict [4] calculates Basal Metabolic Rate from an individual’s physical measures. It is used to estimate the number of calories needed for an individual to maintain good health. Our work incorporated the effect of daily life activities on health status. But that data can be used to personalize health predictions and suggestions. This motivated to design a model that predicts health status from the daily life activities of individuals.

      Let At be the set of daily life activities done by an individual t day’s back. Thus, A0 is the set of activities done by an individual today, A1 be the set of activities done by an individual yesterday, and so on. A is the collection of the activities of an individual for many days. M be the set of physical measures of an individual. H be the health status matrix.

      Definition 1: Health Status Matrix: A health status matrix M describes the outcome of various parameters of health status. Each row of the matrix is considered as a vector of possible outcomes of the respective parameter of the health status. Examples of health status parameters are sleep status, smoke status, drink status, etc.

      Given a set of daily life activities and physical measures of users over a few days and their health status. The health status of a set of users already defined, known as labeled users UL. Whereas the health status of other sets of users is not defined, known as unlabeled users UV. The aim of the proposed model is to learn a function that uses the information of the labeled users’ UL and find the health status of the unlabeled users UV.

      Given a series of activities from last t days, the objective is to learn a function F,

      2.4.1 Pre-Processing

      The daily life activities of an individual that are mainly considered are screen time, sleep time, physical activity, number of cigarettes smoked, units of alcohol consumed. The measures that are mainly considered are age, gender, height, weight, calorie intake. Thus, there are ten features that are collected from an individual. Then, in the pre-processing step, the number of features is reduced by removing the activities and measures that do not have any direct effect on health status. This is achieved by using the Harris-Benedict Equation [4].

      The Harris–Benedict Equation [4] is a method used to estimate an individual’s basal metabolic rate (BMR). It says

For Men BMR = (10 × Weight in kg) + (6.25 × Height in cm) − (5 × Age in years) + 5
For Women BMR = (10 × Weight in kg) + (6.25 × Height in cm) − (5 × Age in years) − 161

      As per the Harris–Benedict Equation [4], the calories to be consumed is depending on the BMR value and the physical activity.

Schematic illustration of architecture of the model.

       Calories to be consumed = BMR * Physical Activity

       Calories Difference = (Calories Consumed) − (Calories to be consumed).

      In the proposed method the number of features is reduced to seven. They are age, gender, sleep time, screen time, number of cigarettes, units of alcohol consumed, and calorie intake.

      2.4.2 Phase-I

      The Phase-I of the model, process the data received from both the data sources and the user. In this phase, a decision tree classifier is used to estimate the health parameter of the user. Initially, the model is trained with the dataset received from the data sources. The Phase-I of the model