MACHINE LEARNING

Team Work Assessments

Background

Since 2008, guests and hosts have used Airbnb to expand on travelling possibilities and present more unique, personalised ways of experiencing the world. The Airbnb dataset (AB_NYC_2019.csv) along with its description can be accessed through Kaggle. This dataset describes the listing activity and metrics in NYC, NY for 2019, which includes all needed information about hosts, geographical availability, and necessary metrics to make predictions and draw conclusions. This assignment is aimed at assessing your ability to pose interesting questions relevant to Airbnb business, process the data using the key steps of big data analytics, such as, data pre-processing, analysing, and eventually preparing an analytical report. In order for your analysis to be compelling, it must address a substantive issue rather than a trivial one.

Tasks

  • Propose an interesting business analytic question that can be answered using the given Airbnb dataset. The proposed question should be useful for Airbnb. An example question could be: is there any noticeable difference in bookings among different areas and what could be the reasons for it?
  • In this task, use your data analytics skills to answer the question posed in the Task 1. Depending upon your chosen question, you will typically have to perform Exploratory Data Analysis (EDA), data pre-processing, statistics-based data analysis, data visualisation and use unsupervised machine learning algorithms (e.g., clustering).
  • In this task, you are expected to prepare a 1000 word analytical report, which can easily be interpreted by the executive board members of the Airbnb. There is no fixed specification for this report but in general, it must contain an adequate number of visualisation charts/graphs with a lucid description.

Results

PDF Icon Group Project - Report

PDF Icon Group Project - Code