Final Project¶
The final assignment for this class is a multi-week project. The project is self-driven but the expectation is that you will work in groups to demonstrate your ability to do something original with your newfound pythonic abilities.
It’s up to you, but some suggestions include:
- Testing ideas for your Capstone project
- Replicating/extending analysis done in a published paper
- Working with an existing codebase/model to apply an interesting ML method
- Performing a novel analysis on a dataset
Expectations¶
- This should involve original work from your team (group size: anywhere from 1 to 38)
- Level of effort: 2-3 weeks of your work
- Submission will be a repository including:
  - Code: your own and perhaps from an existing project
  - Documentation (any format: markdown, pdf, word, video, presentation)
    - Overview of the problem
    - Description of the dataset you used (input features, outcome, dimensions, etc.)
    - Tools/methods used
    - Decisions made along the way, including trade-offs (e.g., cut X for time, so our solution may lack Y)
    - Issues overcome along the way
    - How to run the code (dependencies, etc.)
    - Example output (what does it do?)
    - Citations (data, code, papers)
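
As one possible way to produce the dataset description asked for above, here is a minimal sketch that reports dimensions, input features, and the outcome distribution for a small CSV. The function name `describe_dataset`, the column names, and the sample rows are all made up for illustration; adapt them to your own data and tools.

```python
import csv
import io
from collections import Counter


def describe_dataset(csv_text, outcome_column):
    """Summarize a CSV dataset: dimensions, input features, outcome counts.

    Hypothetical helper for illustration only, not a required interface.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    columns = list(reader.fieldnames or [])
    # Every column except the outcome is treated as an input feature.
    features = [name for name in columns if name != outcome_column]
    return {
        "n_rows": len(rows),
        "n_columns": len(columns),
        "input_features": features,
        "outcome": outcome_column,
        "outcome_counts": dict(Counter(row[outcome_column] for row in rows)),
    }


# Made-up sample data, only to show the shape of the summary.
SAMPLE_CSV = """height_cm,weight_kg,species
10.2,1.5,cat
30.1,12.0,dog
11.0,1.8,cat
"""

summary = describe_dataset(SAMPLE_CSV, outcome_column="species")
for key, value in summary.items():
    print(f"{key}: {value}")
```

A summary like this can be pasted into your documentation next to a short prose explanation of where the data came from.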
Inspiration¶
Data & code¶
- List of papers, code, examples using PyTorch
- ML competition/collaboration site
- Official examples of implementation using Google’s TensorFlow Keras
- PLOS papers with available data: search PLOS for keywords likely to have available data, then refine further to find topics that interest you
- PLOS recommended repositories (data, code, and sometimes both): lots here, mostly data repositories
- Visual essays with data
Fun Examples¶
- SF Budget Visualization
- Tuesday Night Movie Night (movie recommendations)
- This course! Was there an assignment you didn't like? Make a better version of it! Include an example solution and, as a stretch goal, automated grading.