R studio dateset assignment | Information Systems homework help

Unit 4 Assignment: Categorization Analytics

Outcomes addressed in this activity:

Unit Outcomes:

Analyze categorical situations in data analysis.

Apply statistical methods to data sets in order to create categories.

Interpret the results of categorical models.

Course Outcome: 

IT527-4: Construct useable and effective data analytics models incorporating industry-recognized software and standard algorithms.


This Assignment will enable you to practice categorization analytics in R Studio. Specifically, you will create and then interpret a k-Means model.

Assignment Instructions

The Unit 4 Assignment will give you an opportunity to practice some of the analytics skills you learned in your Reading this week, and also to reflect on that learning. To fulfill the Unit 4 Assignment, complete the following steps:

Download the Heart Disease Risk comma separated values (CSV) file from Course Documents. Import it into a data frame in R Studio named Patients. In a Word document, use a screenshot to document successful completion of this step and label the screenshot appropriately.

Data Understanding:

Age: Patient’s age in years

Marital_Status: 0 = Single, Never Married; 1 = Married; 2 = Divorced; 3 = Widowed

Gender: 0=Female; 1=Male

Weight_Category: 0=Normal; 1=Overweight; 2=Obese

Cholesterol: Total cholesterol as measured by milligrams per deciliter of blood

Stress_Management: 0=Did not attend classes; 1=Did attend classes

Trait_Anxiety: A score from 0 (never anxious or stressed) to 100 (always anxious or stressed)

Create a k-means clustering model on the Patients data frame with four clusters. Show the size and centers for each of your four clusters in a centroid table. Place a screenshot into your Word document and label it. Write an explanation explaining how this size and center table would be interpreted by a data analyst.

Create a new data frame called PatientClusters and put the cluster number together with the Patient attributes into it. View this data frame in the R Studio data viewer and take a screenshot. Place it into your Word document and label it. Explain how a data analyst might use the data in this data frame.

In your Word document, classify clusters one through four as “Low Risk”, “Moderate Risk”, “High Risk”, or “Critical Risk”. Justify your classifications using the data in your k-means analysis.

Based on the data, list at least one thing that you would recommend to patients who fall into the “Critical Risk” category. Defend your recommendation with data from the analysis results.

Explain the relationship between Gender and the risk categories you assigned in Step 4 above. Use data from the k-means analysis to defend your explanation.

Assignment Requirements

Prepare your Assignment submission in Microsoft Word following standard APA formatting guidelines: Double spaced, Times New Roman 12-point font, and one-inch margins on all sides. Include a title page, table of contents, and references page. You do not need to write an abstract. Label all tables and figures. Cite sources appropriately both in the text of your writing (parenthetical citations) and on your references page (full APA citation format).

For more information on APA style formatting, go to APA Style Central under Academic Tools of this course.

Also review the university policy on plagiarism. If you have any questions, please contact your professor. 

Calculate the price of your order

550 words
We'll send you the first draft for approval by September 11, 2018 at 10:52 AM
Total price:
The price is based on these factors:
Academic level
Number of pages
Basic features
  • Free title page and bibliography
  • Unlimited revisions
  • Plagiarism-free guarantee
  • Money-back guarantee
  • 24/7 support
On-demand options
  • Writer’s samples
  • Part-by-part delivery
  • Overnight delivery
  • Copies of used sources
  • Expert Proofreading
Paper format
  • 275 words per page
  • 12 pt Arial/Times New Roman
  • Double line spacing
  • Any citation style (APA, MLA, Chicago/Turabian, Harvard)

Our guarantees

Delivering a high-quality product at a reasonable price is not enough anymore.
That’s why we have developed 5 beneficial guarantees that will make your experience with our service enjoyable, easy, and safe.

Money-back guarantee

You have to be 100% sure of the quality of your product to give a money-back guarantee. This describes us perfectly. Make sure that this guarantee is totally transparent.

Read more

Zero-plagiarism guarantee

Each paper is composed from scratch, according to your instructions. It is then checked by our plagiarism-detection software. There is no gap where plagiarism could squeeze in.

Read more

Free-revision policy

Thanks to our free revisions, there is no way for you to be unsatisfied. We will work on your paper until you are completely happy with the result.

Read more

Privacy policy

Your email is safe, as we store it according to international data protection rules. Your bank details are secure, as we use only reliable payment systems.

Read more

Fair-cooperation guarantee

By sending us your money, you buy the service we provide. Check out our terms and conditions if you prefer business talks to be laid out in official language.

Read more