Azure Databricks - Azure Machine Learning SDK Sample Notebooks
NOTE: With the latest version of Azure Machine Learning SDK, there are some API changes due to which previous version of notebooks will not work. Please remove the previous SDK version and install the latest SDK by installing azureml-sdk[databricks] as a PyPi library in Azure Databricks workspace.
NOTE: Please create your Azure Databricks cluster as v4.x (high concurrency preferred) with Python 3 (dropdown).
NOTE: Some packages like psutil upgrade libs that can cause a conflict, please install such packages by freezing lib version. Eg. "pstuil cryptography==1.5 pyopenssl==16.0.0 ipython=2.2.0" to avoid install error. This issue is related to Databricks and not related to AML SDK.
NOTE: You should at least have contributor access to your Azure subcription to run some of the notebooks.
The iPython Notebooks have to be run sequentially after making changes based on your subscription. The corresponding DBC archive contains all the notebooks and can be imported into your Databricks workspace. You can the run notebooks after importing .dbc instead of downloading individually.
This set of notebooks are related to Income prediction experiment based on this dataset and demonstrate how to data prep, train and operationalize a Spark ML model with Azure ML Python SDK from within Azure Databricks. For details on SDK concepts, please refer to notebooks
(Recommended) Azure Databricks AML SDK notebooks A single DBC package to import all notebooks in your Azure Databricks workspace.
- Installation and Configuration: Install the Azure ML Python SDK and Initialize an Azure ML Workspace and save the Workspace configuration file.
- Ingest data: Download the Adult Census Income dataset and split it into train and test sets.
- Build model: Train a binary classification model in Azure Databricks with a Spark ML Pipeline.
- Build model with Run History: Train model and also capture run history (tracking) with Azure ML Python SDK.
- Deploy to ACI: Deploy model to Azure Container Instance (ACI) with Azure ML Python SDK.
- Deploy to AKS: Deploy model to Azure Kubernetis Service (AKS) with Azure ML Python SDK from an existing Image with model, conda and score file.
Copyright (c) Microsoft Corporation. All rights reserved.
All notebooks in this folder are licensed under the MIT License.
Apache®, Apache Spark, and Spark® are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.