# CS 1173 Data Analysis and Visualization in MATLAB Laboratory 1 Diabetes in the United States

### Objectives:

• Analyze a new data set applying tools of previous lessons and labs.
• Further develop critical thinking skills.
• Think about the issue of health disparities

Diabetes has become an epidemic with potentially catastrophic implications for human society. Diabetes disproportionately afflicts minority populations. Situations in which one or more subgroups of a population are affected disproportionately by a disease is known as a health disparity. UTSA is developing research emphases in the area of health disparities to help the San Antonio community. In this laboratory, you will look at some data about diabetes, one of the diseases that focused national attention on the issue of health disparities.

### Hand-in Requirements:

• The lab1Script.m script file containing the code used to read the data and generate the graphs. (Be sure your script displays each of the figures you want to have considered.)
• A Microsoft Word document with bullet points.

The lab should be submitted submitted electronically through Blackboard under the Assignments menu. Zip up your entire lab1 directory to submit. (Right click on the lab1 folder and follow the 7-Zip link.) Remember to put your Word document in the lab1 directory along with your script and the data.

### Overview:

In this laboratory you will analyze data on diabetes in the United States from the years 1980 to 2009. The data comes from the Centers for Disease Control and Prevention (CDC) website (http://apps.nccd.cdc.gov/DDTSTRS/default.aspx).

 File Description diabetics.mat The dataset contains four variables: overall holds the overall percentages of diabetics in the US population by age group for the years 1980 to 2009. blacks holds the percentages of diabetics in the US black population by age group for the years 1980 to 2009. whites holds the percentages of diabetics in the US white population by age group for the years 1980 to 2009. years holds the years 1980 to 2009. The percentage arrays have 5 columns, with the breakdown by age: Column 1 contains the percentage of the population 44 years or younger diagnosed as diabetic Column 2 contains the percentage of the population in the age group 45-64 diagnosed as diabetic Column 3 contains the percentage of the population in the age group 65-74 diagnosed as diabetic Column 4 contains the percentage of the population in the age group 75 or older diagnosed as diabetic Column 5 contains the overall percentage of the population Note: "diagnosed diabetic" does not refer just to those newly-diagnosed that year. In addition, the file years.mat contains a column vector with the corresponding years for the data.

### Part I:Setup

Use the standard setup by creating a lab1 in your V:\matlab subdirectory. Download the data set into this directory and also create a lab1Script.m file to hold all of the commands of Parts II, III, IV, V, and VI. Save this file in your lab1 directory. The video Setting up a project in MATLAB (4:47 mins) demonstrates the setup of lab 1. Be sure that the script runs and generates all of the required graphs before handing it in.

### Part II: Plot overall percentages for different ages

On a single axis, plot the overall percentages of diabetics in the US broken down by age. Your figure should have 5 different line graphs. You should include an appropriate title, axis labels, and a legend. DO NOT USE the MATLAB plottools. Your graph must be done entirely with a MATLAB script.

### Part III: Plot overall percentages for blacks

On a single axis, plot the percentages of diagnosed diabetics in the black population broken down by age. Your figure should have 5 different line graphs. You should include an appropriate title, axis labels, and a legend. DO NOT USE the MATLAB plottools. Your graph must be done entirely with a MATLAB script.

### Part IV: Plot overall percentages for whites

On a single axis, plot the percentages of diagnosed diabetics in the white population broken down by age. Your figure should have 5 different line graphs. You should include an appropriate title, axis labels, and a legend. DO NOT USE the MATLAB plottools. Your graph must be done entirely with a MATLAB script.

### Part V: Plot the percentages for age group 65-74

On a single axis, plot the percentages of diagnosed diabetics age 65-74 for blacks and for whites as well as the overall percentage in this age group. Your figure should have 3 different line graphs. You should include an appropriate title, axis labels, and a legend. DO NOT USE the MATLAB plottools. Your graph must be done entirely with a MATLAB script.

You should take care that your labels, titles and legends fit into an ordinary size figure. You can place legends with the Location and Orientation properties.

Example: The following places a legend for two data sets (Apples and Oranges) on a single line at the bottom of the figure window:
```    legend('Apples', 'Oranges', 'Location', 'SouthOutside', 'Orientation', 'horizontal')
```

### Other requirements:

Implement each part of the lab in a separate cell. Document what each cell does.

Publish the output. Include your published output in the zip file.

### Bullet point requirements:

Create a MicroSoft Word document containing the following:

• A set of 3 bullet points describing the characteristics of the diabetic population as a function of age.
• A set of 3 bullet points describing the detailed characteristics of the diabetic population for one ethnic group (your choice) compared with the overall population.
• A set of 3 bullet points describing the characteristics of the 65-74 age group.
• A short paragraph giving your opinion on the implications of this data for the United States.

Each bullet point should mention which figures (by lab part number) you used to do this. Focus on relevant observations such as about trends, magnitudes, and variation expressed in terms of the data. The bullet points and paragraph should be spell and grammar checked using Microsoft.

### Grading rubric for Lab (point values) 50 points total

 Criterion Performance indicator Missing Needs improvement (50%) Needs a little improvement (80%) Meets expectations (90%) Excellent (100%) Part II graph is correct and has appropriate labeling 0 2.75 4.125 5.225 5.5 Part III graph is correct and has appropriate labeling 0 2.75 4.125 5.225 5.5 Part IV graph is correct and has appropriate labeling 0 2.75 4.125 5.225 5.5 Part V graph is correct and has appropriate labeling 0 2.75 4.125 5.225 5.5 Script runs without error 0 2.75 4.125 5.225 5.5 3 bullet points discussing overall data by age are correct and relevant 0 2.75 4.125 5.225 5.5 3 bullet points comparing an ethnic group to overall population are correct and relevant 0 2.75 4.125 5.225 5.5 3 bullet points discussing the age group 64-75 are correct and relevant 0 2.75 4.125 5.225 5.5 Paragraph on the implications 0 3 4.5 5.7 6