Skip to content

Latest commit

 

History

History
84 lines (69 loc) · 3.44 KB

README.rst

File metadata and controls

84 lines (69 loc) · 3.44 KB

Azure Machine Learning Cookiecutter Template

Setting up a project with Azure Machine Learning can be quite a lot of work some times. This template helps you set up a standardized project layout.

  • Python 3.8.6 or higher
  • Cookiecutter

We ran into the same issues over and over again with our machine learning projects. We do like creativity, but we think that it's good to have a solid base to start from. It saves you and your team a lot of time, because you don't have to spend time discussing the project layout and MLOps frameworks you're going to use. All we ask is that you pick the machine learning framework and data processing tool. The rest is all taken care of. You're welcome :-)

To use the template, follow these steps:

  • Open a new terminal window
  • Execute the command pip install cookiecutter
  • Execute the command cookiecutter https://github.com/infosupport/azureml-cookiecutter
  • Follow the on-screen instructions to set up your project

This template generates the following layout:

├─── README.md                  # Describes how to operate the project
├─── requirements.txt           # Dependencies used by the project
├─── .gitignore                 # Ignored files for GIT
├─── .amlignore                 # Ignored files for the Azure ML workspace
├─── setup.py                   # Setup file in case you want to package your project as a pip package.
├─── MANIFEST.in                # Manifest file for pip packaging.
│
├─── my_project
│    ├─── score.py              # The inference script
│    └─── train.py              # The training script
│
├─── data                       # Stores the datasets
├─── docs                       # Stores the project documentation
├─── reports                    # Stores generated reports
├─── notebooks                  # Stores Python notebooks
├─── tests                      # Stores the unit-tests for the project
└─── tasks
     ├─── deploy_model.py       # Deploys the model
     ├─── make_dataset.py       # Creates the dataset for training
     ├─── make_environment.py   # Creates a compute environment
     ├─── make_workspace.py     # Creates or updates the Azure Machine Learning Workspace
     └─── train_model.py        # Trains the model using on the compute environment

We don't prescribe any fixed number of machine learning tools to use with this template. Azure Machine Learning and this template support a number of machine learning tools:

  • Scikit-learn
  • Tensorflow
  • PyTorch
  • Spacy
  • NLTK

Aside from these tools, we have tested the template with the following data science libraries:

  • Scipy
  • Numpy
  • Pandas
  • Matplotlib
  • Jupyter notebooks

Can't find the tool you're using? Please let us know if it works for you and we'll add it to this list.

If you think that something doesn't work, please open a new issue in this repository.