Week 1 – Friday

Today’s work was all about analysing the % Obesity and % Inactivity data in the CDC datasets. I went back through the same concepts to understand the common factors related to linear regression and how the plots behave for both datasets, obesity and inactivity.

Similarly, I have written code to compute the mean, median, standard deviation, skewness, and kurtosis across U.S. counties for both datasets. I have attached the PDF file to this post so that it is easy to refer to.
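As a rough illustration of the kind of summary described above, here is a minimal sketch using NumPy and SciPy. The function name `summarize` and the sample values are my own placeholders, not the actual CDC data or the code in the attached PDF.

```python
import numpy as np
from scipy import stats


def summarize(values):
    """Return basic descriptive statistics for one numeric column."""
    x = np.asarray(values, dtype=float)
    x = x[~np.isnan(x)]  # drop missing values before computing anything
    return {
        "mean": float(np.mean(x)),
        "median": float(np.median(x)),
        "std": float(np.std(x, ddof=1)),       # sample standard deviation
        "skewness": float(stats.skew(x)),      # 0 for a symmetric distribution
        "kurtosis": float(stats.kurtosis(x)),  # excess kurtosis, 0 for a normal
    }


# Made-up percentages standing in for a county-level % Obesity column.
sample_obesity = [17.8, 18.2, 18.9, 19.4, 20.1, 21.3]

for name, value in summarize(sample_obesity).items():
    print(f"{name}: {value:.3f}")
```

The same function can be called on the % Inactivity column, so the two summaries can be compared side by side.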

For further analysis, I will work on the R-squared value and the p-value to assess the relationship between diabetes, obesity, and inactivity, and observe what the results say about the datasets. Later on, I will visualise the datasets by plotting them with the matplotlib library and explore them using multiple regression.
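To make the R-squared and p-value step concrete, here is a small sketch of a simple linear regression with `scipy.stats.linregress`. The helper name `simple_regression` and the numbers are hypothetical placeholders, not values from the CDC datasets, and this covers only one predictor rather than the multiple regression planned for later.

```python
from scipy import stats


def simple_regression(x, y):
    """Fit y = slope * x + intercept and report R-squared and the p-value."""
    result = stats.linregress(x, y)
    return {
        "slope": result.slope,
        "intercept": result.intercept,
        "r_squared": result.rvalue ** 2,  # share of variance in y explained by x
        "p_value": result.pvalue,         # tests the null hypothesis slope == 0
    }


# Made-up percentages standing in for % Inactivity and % Diabetes by county.
inactivity = [14.0, 15.5, 16.2, 17.8, 19.1, 20.4]
diabetes = [6.1, 6.8, 7.0, 7.9, 8.3, 9.0]

fit = simple_regression(inactivity, diabetes)
for name, value in fit.items():
    print(f"{name}: {value:.4f}")
```

A small p-value suggests the slope is unlikely to be zero, while R-squared indicates how much of the variation in diabetes the single predictor accounts for.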

Project1Data_Analysis
