Overview
Last Updated: 3/7/2017
The goal of the Business Analytics competition is to evaluate student teams’ ability to provide evidence-based business decision-making, while leveraging only those software resources widely available throughout the business world.
Contestants will work on a prediction problem. They will be given the context for the problem and a detailed description of the data, including a data dictionary.
Your team may be made up of 1-2 students.
Friday, April 7, 2017, 6:30-10:30 pm
Competitive teams should have the following skills.
- Some knowledge of predictive analytics, data mining, or machine learning techniques.
- The ability to use technology to read a relatively large dataset.
- The ability to transform variables as needed.
- The ability to select the appropriate technology to approach the problem.
Teams are also expected to have foundational knowledge of accounting, finance, economics, management, SQL, and quantitative methods.
Students may use any software, including access to the Internet, to develop and implement their model.
Contestants are expected to access the data via an MSSQL Server that hosts the database used for the event. An IP address, username, and password will be provided at the beginning of the contest. To access the data, contestants should pre-install SQL Server Management Studio (SSMS) (available for free at https://docs.microsoft.com/en-us/sql/ssms/download-sql-server-management-studio-ssms) or HeidiSQL (https://www.heidisql.com/download.php).
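Teams that prefer to pull the data into a script rather than a GUI client could do something along these lines. This is only a sketch under several assumptions: it uses the third-party `pyodbc` package, it assumes the "ODBC Driver 17 for SQL Server" driver is installed, and the host, credentials, database, and table names shown are hypothetical placeholders for the values handed out at the contest.

```python
def build_connection_string(host, user, password, database,
                            driver="ODBC Driver 17 for SQL Server"):
    """Assemble an ODBC connection string for a SQL Server instance.

    The driver name is an assumption; use whichever SQL Server ODBC
    driver is installed locally. Passwords containing characters such
    as ';' may need brace-quoting, which this sketch does not handle.
    """
    parts = [
        f"DRIVER={{{driver}}}",
        f"SERVER={host}",
        f"DATABASE={database}",
        f"UID={user}",
        f"PWD={password}",
    ]
    return ";".join(parts)


def load_table(conn_str, table):
    """Return (column_names, rows) for every row in a table.

    pyodbc is imported here so the helper above works without it.
    The table name is interpolated directly into the query, so pass
    only trusted names (for example, ones from the data dictionary).
    """
    import pyodbc  # third-party: pip install pyodbc

    conn = pyodbc.connect(conn_str)
    try:
        cursor = conn.cursor()
        cursor.execute(f"SELECT * FROM {table}")
        columns = [col[0] for col in cursor.description]
        rows = cursor.fetchall()
        return columns, rows
    finally:
        conn.close()


# Hypothetical values; the real ones are provided at the contest.
example = build_connection_string("10.0.0.5", "team01", "secret", "contest")
```

Calling `load_table(example, "training_data")` would then return the column names and rows, assuming a table with that (hypothetical) name exists.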
Contestants will be provided with the context of the problem and a detailed description of the data, including a data dictionary. The research question will require students to analyze the data using descriptive and inferential statistics, create a prediction model and submit its output as a column vector of predictions in .csv format, and provide a brief report submitted as a .pdf.
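As an illustration of the .csv deliverable, a column vector of predictions can be written with one value per row. The sketch below uses only the Python standard library. Whether a header row is expected is not stated here, so the example assumes none, and the prediction values are made up for the demonstration.

```python
import csv
import io


def write_predictions(predictions, out):
    """Write predictions as a single-column CSV, one value per row.

    `out` is any writable text file object. No header row is written;
    that is an assumption, since the required layout is not specified.
    """
    writer = csv.writer(out, lineterminator="\n")
    for value in predictions:
        writer.writerow([value])


# Demonstration with an in-memory buffer and made-up predictions.
buffer = io.StringIO()
write_predictions([0.12, 0.87, 0.45], buffer)
csv_text = buffer.getvalue()
```

To produce an actual file, the same function can be called inside `with open("predictions.csv", "w", newline="") as f:`, where the file name is again only a placeholder.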
Judges will rank team submissions based on the following deliverables, which together determine each team’s score:
- 30% A vector of predictions for the test set (to be used by the judges).
- 20% A justification explaining why the method/technique used is appropriate to the problem scenario.
- 20% A brief description of the method used.
- 20% Inferential statistics about the data.
- 10% Descriptive statistics about the data.
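To see how the weights above might combine, the following sketch computes a weighted total from per-deliverable scores. The weights come from the list above. The 0 to 100 scale for each component and the simple weighted sum are assumptions for illustration, not the judges' stated procedure.

```python
# Weights taken from the judging criteria listed above.
WEIGHTS = {
    "predictions": 0.30,
    "justification": 0.20,
    "method_description": 0.20,
    "inferential_statistics": 0.20,
    "descriptive_statistics": 0.10,
}


def weighted_score(component_scores):
    """Combine per-deliverable scores into one weighted total.

    Assumes each component is scored on the same scale (for example
    0 to 100); the scale itself is hypothetical.
    """
    missing = set(WEIGHTS) - set(component_scores)
    if missing:
        raise ValueError(f"missing scores for: {sorted(missing)}")
    return sum(WEIGHTS[name] * component_scores[name] for name in WEIGHTS)


# Made-up component scores for a hypothetical team.
example_total = weighted_score({
    "predictions": 80,
    "justification": 90,
    "method_description": 70,
    "inferential_statistics": 60,
    "descriptive_statistics": 100,
})
```

Under these assumptions, the prediction vector alone carries as much weight as the inferential and descriptive statistics combined.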