Final Project
Data Visualization (STAT 302)
Data source
I will be using a dataset from Kaggle to visualize trends between health indicators and diabetes. The dataset is listed below:
- health indicators: diabetes_binary_5050split_health_indicators_BRFSS2015.csv contains information about some potential health indicators of diabetes (source: Kaggle)
Additional information
In my project, I focus on some of the health indicators for diabetes. I ultimately decided to use a Shiny app to visualize the data. The dataset consists of survey data from 2015 that records several health indicators and whether each survey participant has diabetes or prediabetes. There is an equal 50-50 split between respondents without diabetes and those with either prediabetes or diabetes. Because the two groups are the same size, the counts in the visualization can be compared directly across groups.
In the Shiny app, the user chooses a health indicator to examine from a drop-down selector. A bar graph then shows the number of responses for the selected health indicator within each diabetic category (diabetic/prediabetic vs. not diabetic).
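The tabulation behind each bar graph can be sketched outside the app. The sketch below is in Python using only the standard library; it is not the Shiny app's own code. It counts the responses to one selected indicator within each diabetic category, which is the quantity the bars display. The column names (Diabetes_binary, HighBP, PhysActivity) are assumptions about the CSV header, the helper count_responses is hypothetical, and the six sample rows are made up for illustration rather than taken from the dataset.

```python
import csv
import io
from collections import Counter

# Made-up sample rows standing in for the Kaggle CSV (illustrative only).
# Column names are assumed, not confirmed by the report.
SAMPLE_CSV = """Diabetes_binary,HighBP,PhysActivity
0,0,1
0,0,1
0,1,0
1,1,0
1,1,1
1,0,0
"""


def count_responses(rows, indicator, outcome="Diabetes_binary"):
    """Count each response to `indicator` within each outcome group.

    Returns a Counter keyed by (group, response).
    """
    counts = Counter()
    for row in rows:
        # CSV fields are strings; going through float() also accepts "1.0".
        group = int(float(row[outcome]))
        response = int(float(row[indicator]))
        counts[(group, response)] += 1
    return counts


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

# "HighBP" plays the role of the indicator picked in the drop-down selector.
high_bp_counts = count_responses(rows, "HighBP")

for (group, response), n in sorted(high_bp_counts.items()):
    label = "diabetic/prediabetic" if group == 1 else "not diabetic"
    print(f"{label} | HighBP={response} | count={n}")
```

To run this against the real file, the rows could be read with csv.DictReader from the opened CSV instead of from the sample string, assuming the header matches the names used here.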
The data communicates some insights about health conditions that may be indicators for diabetes. For instance, on the general health indicator, people without diabetes tended to report better health than people with diabetes or prediabetes. A large number of diabetic/prediabetic respondents have high blood pressure and high cholesterol, which hints at an association between these two indicators and diabetes. I also observed that a large number of respondents who do physical activity and consume fruits and vegetables one or more times per day do not have diabetes. Thus, one insight users can glean from this visualization is that physical activity and daily consumption of fruits and vegetables may help prevent diabetes, although the survey data shows association rather than cause. Based on the trends and insights found in this visualization, users can hopefully adopt healthier habits.