Decription and rubric for the course project for FOR 796.
The primary goal of this course is to prepare you to apply machine learning techniques to real-world problems. This project is intended to start you in that direction.
Working alone, you will identify a data set, use methods from the course to create models predicting an outcome within that data set, and write a report (in the format of a short journal article).
This report will identify the problem you are addressing, the approach you took to tune and fit models, and the relative strengths and weaknesses of your final models. You will hand in this report as well as all the code and data used to produce it.
For our final class, everyone will give a brief (5 minute maximum) presentation on what they did for their report, covering their objectives, methods, results, and a reflection on how well things went and where they had issues. This should not feel like a high stakes situation, but rather an opportunity to talk with the group about the process of actually implementing machine learning methods. The presentation is worth 5% of the final grade and is scored entirely on completion (“did you give a presentation”), not quality. Slides and other presentation aids are permitted, but not required.
You are allowed to get help with your code from the internet, other people in the class, or myself, but the final product – code and written report – should be your own work. It’s fine to copy code snippets from the lectures or StackOverflow, but the structure and organization of your program should be your own, as should all the written report.
An example report (without code) is available online at https://mlca.mm218.dev/project/project_example.pdf.
You may submit your project as a zip folder containing your data, report, and code files or as a link to a git repo of the same. The report can be a PDF, Word document, or R Markdown file (in which case the code can be either a separate R file or contained in the report itself). All projects should be submitted to mike.mahoney.218@gmail.com via an email with the subject line “FOR 796 Course Project” by the start of class on December 8th.
Formatting should be appropriate for submitting to a journal in your field.
setwd()
, no rm(list = ls())
, sets a seed, returns the same results as are reported (20 points)NOTE: You do not need to do a literature review for this project. You do not need to deeply cite your introduction or discussion, especially if this isn’t data from your own work (so you don’t have the citations on-hand already). While there is nothing wrong with citing relevant work, the only required citations are:
citation()
function) (5 points)Presentation included objectives, methods, results, and reflection.
Grades are assigned as follows:
< 80 points: F
80-90 points: B
> 90 points: A