The Data Science Pipeline

The Data Science Pipeline


  • Planning
  • Data preparation
  • Modeling
  • Followup

Planning 

  1. Define goals
  2. Organize resources 
  3. Coordinate people
  4. Schedule project

Data preparation

  1. Get the data
  2. Clean the data
  3. Explore the data
  4. Refine the data

Modeling

  1. Crate model
  2. Validate model
  3. Evaluate the model
  4. Refine model

Followup

  1. Present model
  2. Deploy model
  3. Revisit model
  4. Archive assets


Comments

Popular posts from this blog

Variables Types

Scientific Method

Confounder Variable