update

updated guidance
removed table
2025-12-20 09:37:04 -05:00 · 2018-12-04 15:06:30 -08:00 · 2018-11-30 12:38:37 -08:00 · 2018-11-30 12:37:02 -08:00 · 2018-11-30 11:14:49 -08:00 · 2018-11-29 23:18:30 -08:00
1 changed files with 49 additions and 0 deletions
--- a/databricks/automl_adb_readme.md
+++ b/databricks/automl_adb_readme.md
@@ -0,0 +1,49 @@
+**PREVIEW capability** 
+
+Automated ML now supports Azure Databricks as a local compute to perform training (**public preview**). Azure Databricks is a managed Spark offering on Azure and customers already use it for advanced analytics. It provides a collaborative Notebook based environment with CPU or GPU based compute cluster. 
+- Customers who use Azure Databricks for advanced analytics can now use the same cluster to run automated machine learning experiments. 
+- You can keep the data within the same cluster. 
+- You can leverage the local worker nodes with autoscale and auto termination capabilities. 
+- You can use multiple cores of your Azure Databricks cluster to perform simultenous training. 
+- You can further tune the model generated by automated machine learning if you chose to. 
+- Every run (including the best run) is available as a pipeline. 
+- The model from the pipeline can be registered in Azure ML SDK workspace and then deployed to Azure managed compute (ACI or AKS) using the Azure Machine learning SDK.
+
+**Create Azure Databricks Cluster:**
+
+Select New Cluster and fill in following detail:
+ - Cluster name: _yourclustername_
+ - Cluster Mode: Any. **High Concurrency** preferred
+ - Databricks Runtime: Any 4.x runtime.
+ - Python version: **3**
+ - Workers: 2 or higher.  
+ - Max. number of **concurrent iterations** in Automated ML settings is **<=** to the number of **worker nodes** in your Databricks cluster.
+ - Worker node VM types: **Memory optimized VM** preferred. 
+ - Uncheck _Enable Autoscaling_
+
+
+It will take few minutes to create the cluster. Please ensure that the cluster state is running before proceeding further.
+
+**Install Azure ML with Automated ML SDK on your Azure Databricks cluster**
+
+- Select Import library
+
+- Source: Upload Python Egg or PyPI
+
+- PyPi Name (_with_ Automated ML capability): **azureml-sdk[automl_databricks]**
+
+- PyPi Name (_without_ Automated ML capability): **azureml-sdk[databricks]**
+
+- Click Install Library
+
+- Do not select _Attach automatically to all clusters_. In case you have selected earlier then you can go to your Home folder and deselect it.
+
+- Select the check box _Attach_ next to your cluster name
+
+(More details on how to attach and detach libs are here - [https://docs.databricks.com/user-guide/libraries.html#attach-a-library-to-a-cluster](https://docs.databricks.com/user-guide/libraries.html#attach-a-library-to-a-cluster) )
+
+- Ensure that there are no errors until Status changes to _Attached_. It may take a couple of minutes.
+
+**Note** - If you have the old build the please deselect it from cluster’s installed libs > move to trash. Install the new build and restart the cluster. And if still there is an issue then detach and reattach your cluster.
+
+**Now you can run the Automated ML sample notebook on your Azure Databricks cluster. Please let us know your feedback.**
Author	SHA1	Message	Date
Parashar Shah	91aa7e04b0	update	2018-12-04 15:06:30 -08:00
Parashar Shah	2c3d3f446d	updated guidance	2018-11-30 12:38:37 -08:00
Parashar Shah	44c8a632bb	removed table	2018-11-30 12:37:02 -08:00
Parashar Shah	01d391f5c2	updated table	2018-11-30 11:14:49 -08:00
Parashar Shah	d4281967a2	added introduction.	2018-11-29 23:18:30 -08:00
Parashar Shah	b0ff1e1a5d	added table	2018-11-29 20:15:54 -08:00
Parashar Shah	df025e6a17	instructions for installing automl on adb	2018-11-29 19:57:34 -08:00