Skip to main content

The NCI Community Hub (NCI Hub) will be offline for maintenance December 2-3, 2022 as we migrate URLs from NCIhub.org to NCIhub.cancer.gov. The current address will automatically redirect to the new URL. The NCI Hub will continue to function as usual after the migration. Please be sure to update any bookmarks or favorites to the new URL. close

  • Discoverability Visible
  • Join Policy Open/Anyone
  • Created 08 Sep 2021

The presentation and video recording are now available.

Example Project

Overview:
Project organization is key for communication and reproducibility of data science projects. Dr. Fear will offer guidelines and examples from his personal experience, including 10 best practices, examples of do’s and don’ts – and useful tools of the trade to get you started!

Topics: 10 Best Practices for Organizing Data Science Projects

  1. Use the same structure and names across projects
  2. Separate original data, generated data, and scripts
  3. Use workflows to orchestrate
  4. Split out configuration for consistency
  5. Modularize reusable code
  6. Use a style guide and linters
  7. Use containers and environments
  8. Document as you go
  9. Document as you go
  10. Document as you go!

Date:               Thursday, December 12, 2019
Time:               9:00-10:00 a.m.
Location:         NCI Shady Grove, Seminar Room 406

Instructor: Justin Fear, PhD, Postdoctoral Researcher at the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK).

Questions? Contact the NCI Data Science Learning Exchange

Created by Clint Malone Last Modified Fri December 3, 2021 12:02 am by Clint Malone