Assignment 3.1: Performing Operations Using Databricks Service The goal of this assignment is to get acquainted with performing operations using the Databricks service. For more information on Databricks and how to get started, please view to the Getting Started with Databricks tutorials before beginning your assignment.Assignment Instructions:For this assignment you will have to load the “Ecomm-Customers.csv” into a storage container and perform transformations on the data. Navigate to Github’s LInear Regression data zip – “Ecomm-Customers.csv” file and download the file. Sign-up for Databricks, and then select “Get Started for Free.” For tutorials on how to sign-up, view the Getting Started with Databricks tutorials part 1-3 in the Module 3 Presentations above. Upload sample data into the storage container. Extract data from the storage container. Transform data in Databricks. Visualize the dataset. Submission format: Take screenshots of the commands and outputs to show you’ve completed all the steps and add them into a PDF document. Include a few sentences describing each screenshot. Be sure to follow the formatting guidelines below. IMPORTANT: Clear out your Databricks/Synapse resources after every assignment is complete. If you do not clean out your assignment resources, you will deplete your $200 Azure free credits and incur a charge.Make sure to properly use APA formatting and cite external resources. For details of the APA guidelines and to use the SafeAssign Originality Checker to check your paper for possible plagiarism issues, please use the Course Resources link in the left navigation bar.
Posted inUncategorized