Explain why the data in its current form is not considered to be in ‘tidy’ format….

1.1. Explain why the data in its current form is not considered to be in ‘tidy’ format. (1 mark)

1.2. Write a function that takes a year and outputs a dataframe with one or more rows, where each row

shows the obesity rate of Male and Female in a country of that given year along with its 95%

confidence interval values. The returned dataframe should have 6 columns (Country, Year, Sex, Rate,

MinCI, MaxCI).

a) Load the data from the worksheet of the given year into a dataframe. (1 mark)

b) Drop the column ‘Both sexes’. (1 mark)

c) Add a new column named ‘Year’ filled all rows with the given year. (1 mark)

d) Use the gather() function to transform the data in the two columns Male and Female to become

rows. After this step, your dataframe now should have 4 columns: Country, Year, Sex, Rate.

(2 marks)

e) Split the Rate column into 3 columns named Rate, MinCI, MaxCI. Your dataframe should have 6

columns after this step: Country, Year, Sex, Rate, MinCI, MaxCI (2 marks)

f) Check and make necessary changes to make sure the data type of Rate, MinCI and MaxCI is

numeric. Print the summary of the dataframe. (1 mark)

g) Find and display rows with any invalid data, e.g. the rate value is not in the range of MinCI and

MaxCI, MinCI is larger than MaxCI, etc. If they exist, change the MinCI and MaxCI values in these

rows into NA. Print the summary of the dataframe.

Needs help with similar assignment?

We are available 24x7 to deliver the best services and assignment ready within 3-4 hours? Order a custom-written, plagiarism-free paper

Order Over WhatsApp Place an Order Online