ANL252 Python for Data Analytics Assignment (SUSS): ECA Dataset on Medical Costs

Section A (Total 100 marks)
Answer all questions in this section.

The dataset used in this paper contains information about medical costs, and its data dictionary is provided in the Appendix. Please refer to Canvas for details of this dataset.

Question 1
Based on the given ECA dataset on medical costs, use Python to conduct three (3) data pre-processing tasks to clean and prepare the dataset and provide relevant explanations.
[No more than 500 words]
(30 marks)
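
One possible shape for three pre-processing tasks is sketched below: removing duplicate rows, imputing missing numeric values, and standardising and encoding a categorical column. The toy `raw` table, the helper `preprocess`, and every column name other than `smoker` are assumptions made for illustration; the real columns are those listed in the data dictionary in the Appendix, and the tasks that are actually appropriate depend on what the ECA dataset contains.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the ECA dataset. Only 'smoker' is named in the
# assignment; the other columns and all values are invented for illustration.
raw = pd.DataFrame({
    "age": [19, 33, 33, np.nan, 46, 60],
    "bmi": [27.9, 22.7, 22.7, 28.9, 33.4, 25.8],
    "smoker": ["yes", "no", "no", "No ", "YES", "no"],
    "charges": [16884.92, 21984.47, 21984.47, 3866.86, 8240.59, 28923.14],
})


def preprocess(df):
    """Apply three illustrative cleaning steps and return a new DataFrame."""
    # Task 1: drop exact duplicate rows so no record is counted twice.
    out = df.drop_duplicates().reset_index(drop=True)

    # Task 2: fill missing numeric values with the column median, which is
    # less sensitive to extreme values than the mean.
    num_cols = out.select_dtypes(include="number").columns
    out[num_cols] = out[num_cols].fillna(out[num_cols].median())

    # Task 3: normalise inconsistent labels, then encode 'smoker' as 0/1.
    out["smoker"] = out["smoker"].str.strip().str.lower().map({"yes": 1, "no": 0})
    return out


clean = preprocess(raw)
print(clean)
```

Median imputation is only one option here; dropping incomplete rows or using a group-wise median may be better justified depending on how much data is missing.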

Question 2
Based on the ECA dataset on medical costs, use Python to plot three (3) figures (i.e., charts) and discuss the insights for each figure accordingly. Each figure, together with its corresponding Python code and insights, carries 10 marks. The figures and Python code are to be provided as part of the answer in the main report.
[No more than 500 words]
(30 marks)
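
As a rough illustration of the kind of charts that could be produced, the sketch below draws a histogram, a grouped box plot, and a scatter plot with matplotlib. The toy table and the column names `bmi` and `charges` are assumptions; only `smoker` is named in the assignment. The three charts are drawn as panels of one figure purely for compactness, and separate figures would work just as well.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; this line can be dropped in a notebook
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical stand-in for the cleaned ECA dataset (values are invented).
df = pd.DataFrame({
    "bmi": [27.9, 33.8, 22.7, 28.9, 36.1, 25.8, 30.2, 24.3],
    "smoker": ["yes", "no", "no", "no", "yes", "no", "yes", "no"],
    "charges": [16884.9, 1725.6, 4449.5, 3866.9, 42112.0, 8240.6, 23568.3, 6203.9],
})

fig, axes = plt.subplots(1, 3, figsize=(15, 4))

# Figure 1: distribution of medical charges.
axes[0].hist(df["charges"], bins=5)
axes[0].set_title("Distribution of charges")
axes[0].set_xlabel("charges")

# Figure 2: charges compared between non-smokers and smokers.
groups = [df.loc[df["smoker"] == label, "charges"] for label in ["no", "yes"]]
axes[1].boxplot(groups)
axes[1].set_xticks([1, 2])
axes[1].set_xticklabels(["no", "yes"])
axes[1].set_title("Charges by smoker status")
axes[1].set_xlabel("smoker")

# Figure 3: relationship between BMI and charges, coloured by smoker status.
colours = df["smoker"].map({"no": "tab:blue", "yes": "tab:red"}).tolist()
axes[2].scatter(df["bmi"], df["charges"], c=colours)
axes[2].set_title("BMI vs charges")
axes[2].set_xlabel("bmi")
axes[2].set_ylabel("charges")

fig.tight_layout()
```

Each panel would still need its own written insight in the report; the code only produces the picture.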

Question 3
Use a decision tree to further explore the dataset, where the dependent variable is ‘smoker’. Please explain the approach taken. [No more than 300 words]
(20 marks)
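
A minimal sketch of one such approach, assuming scikit-learn is available, is to hold out part of the data, fit a shallow `DecisionTreeClassifier` with `smoker` as the target, and inspect accuracy and feature importances. The toy table, the predictor names (`age`, `bmi`, `charges`), and the settings `max_depth=3` and `test_size=0.25` are illustrative assumptions, not values taken from the assignment.

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Hypothetical stand-in for the cleaned ECA dataset (values are invented).
df = pd.DataFrame({
    "age": [19, 23, 31, 35, 42, 48, 27, 30, 39, 45, 52, 60],
    "bmi": [27.9, 33.8, 25.7, 36.1, 30.2, 28.4, 22.7, 24.3, 29.8, 26.5, 31.0, 25.8],
    "charges": [16884.9, 34303.2, 19107.8, 42112.0, 23568.3, 21098.6,
                2721.3, 4149.7, 6203.9, 7740.3, 9625.9, 12629.2],
    "smoker": ["yes"] * 6 + ["no"] * 6,
})

X = df[["age", "bmi", "charges"]]   # assumed predictors
y = df["smoker"]                    # dependent variable from the question

# Stratify so both classes appear in the training and the test split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# A shallow tree is easier to read and less prone to overfitting.
tree = DecisionTreeClassifier(max_depth=3, random_state=0)
tree.fit(X_train, y_train)

accuracy = tree.score(X_test, y_test)
importances = dict(zip(X.columns, tree.feature_importances_))
print("Test accuracy:", accuracy)
print("Feature importances:", importances)
```

Categorical predictors in the real dataset would need to be encoded numerically before fitting, since scikit-learn trees expect numeric inputs.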

Question 4
Plot the decision tree obtained from Question 3, and discuss the relevant insights. [No more than 200 words]
(5 marks)
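
Assuming the tree was fitted with scikit-learn, `sklearn.tree.plot_tree` is one way to draw it. The sketch below refits a small tree on invented data so that it runs on its own; the column names other than `smoker` and the choice of `max_depth=2` are assumptions.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen; this line can be dropped in a notebook
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.tree import DecisionTreeClassifier, plot_tree

# Hypothetical stand-in for the cleaned ECA dataset (values are invented).
df = pd.DataFrame({
    "age": [19, 23, 31, 35, 42, 48, 27, 30, 39, 45, 52, 60],
    "bmi": [27.9, 33.8, 25.7, 36.1, 30.2, 28.4, 22.7, 24.3, 29.8, 26.5, 31.0, 25.8],
    "charges": [16884.9, 34303.2, 19107.8, 42112.0, 23568.3, 21098.6,
                2721.3, 4149.7, 6203.9, 7740.3, 9625.9, 12629.2],
    "smoker": ["yes"] * 6 + ["no"] * 6,
})

features = ["age", "bmi", "charges"]  # assumed predictors
clf = DecisionTreeClassifier(max_depth=2, random_state=0)
clf.fit(df[features], df["smoker"])

fig, ax = plt.subplots(figsize=(8, 5))
# Each node shows its split rule, impurity, sample count, and majority class.
annotations = plot_tree(
    clf,
    feature_names=features,
    class_names=list(clf.classes_),
    filled=True,
    ax=ax,
)
fig.tight_layout()
```

Reading the splits from the root downwards shows which variables separate smokers from non-smokers first, which is usually the starting point for the discussion of insights.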

Question 5
Can decision trees be effectively used for exploratory data analysis, moving beyond their traditional role in making predictions? Discuss. [No more than 300 words (including in-text citation)]