Compare commits

...

5 Commits

Author | SHA1 | Message | Date
amlrelsa-ms | 36d96f96ec | update samples from Release-114 as a part of SDK release | 2021-09-29 20:16:51 +00:00
Harneet Virk | 7ebcfea5a3 | Merge pull request #1600 from Azure/release_update/Release-113 (update samples from Release-113 as a part of SDK release) | 2021-09-28 12:53:57 -07:00
amlrelsa-ms | b20bfed33a | update samples from Release-113 as a part of SDK release | 2021-09-28 19:44:58 +00:00
Harneet Virk | a66a92e338 | Merge pull request #1597 from Azure/release_update/Release-112 (update samples from Release-112 as a part of SDK release) | 2021-09-24 14:44:53 -07:00
amlrelsa-ms | c56c2c3525 | update samples from Release-112 as a part of SDK release | 2021-09-24 21:40:44 +00:00
3 changed files with 14 additions and 33 deletions

View File

@@ -28,13 +28,21 @@ replaced_distance_vals_df = (replaced_stfor_vals_df.replace({"distance": ".00"},
 normalized_df = replaced_distance_vals_df.astype({"distance": 'float64'})
+def time_to_us(time_str):
+    hh, mm, ss = map(int, time_str.split(':'))
+    return (ss + 60 * (mm + 60 * hh)) * (10**6)
 temp = pd.DatetimeIndex(normalized_df["pickup_datetime"])
-normalized_df["pickup_date"] = temp.date
+normalized_df["pickup_date"] = pd.to_datetime(temp.date)
 normalized_df["pickup_time"] = temp.time
+normalized_df["pickup_time"] = normalized_df["pickup_time"].apply(lambda x: time_to_us(str(x)))
 temp = pd.DatetimeIndex(normalized_df["dropoff_datetime"])
-normalized_df["dropoff_date"] = temp.date
+normalized_df["dropoff_date"] = pd.to_datetime(temp.date)
 normalized_df["dropoff_time"] = temp.time
+normalized_df["dropoff_time"] = normalized_df["dropoff_time"].apply(lambda x: time_to_us(str(x)))
 del normalized_df["pickup_datetime"]
 del normalized_df["dropoff_datetime"]
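
For readers following along outside the diff: the updated cell splits each timestamp into a date column and a time-of-day column expressed as microseconds since midnight. Below is a minimal standalone sketch of the same transformation, assuming a small hand-made frame in place of the notebook's normalized_df (the sample values are hypothetical):

import pandas as pd

# Hypothetical sample frame; the real notebook builds normalized_df from the taxi data.
normalized_df = pd.DataFrame({
    "pickup_datetime": ["2021-09-01 08:15:30", "2021-09-01 17:45:05"],
    "dropoff_datetime": ["2021-09-01 08:40:10", "2021-09-01 18:02:55"],
})

def time_to_us(time_str):
    # Convert "HH:MM:SS" into microseconds since midnight.
    hh, mm, ss = map(int, time_str.split(':'))
    return (ss + 60 * (mm + 60 * hh)) * (10**6)

for col in ("pickup", "dropoff"):
    temp = pd.DatetimeIndex(normalized_df[col + "_datetime"])
    normalized_df[col + "_date"] = pd.to_datetime(temp.date)  # calendar date as datetime64
    normalized_df[col + "_time"] = temp.time                  # time-of-day objects
    normalized_df[col + "_time"] = normalized_df[col + "_time"].apply(lambda x: time_to_us(str(x)))
    del normalized_df[col + "_datetime"]

print(normalized_df.dtypes)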

View File

@@ -272,7 +272,8 @@
 "dependencies:\n",
 "- python=3.6.2\n",
 "- pip:\n",
-" - azureml-defaults\n",
+" - azureml-core\n",
+" - azureml-dataset-runtime\n",
 " - keras==2.4.3\n",
 " - tensorflow==2.4.3\n",
 " - numpy\n",

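The change swaps the broad azureml-defaults pip package for the narrower azureml-core and azureml-dataset-runtime packages in the pinned environment. The following is a sketch of how a conda spec like this is typically registered with azureml-core; the file name, the environment name, and the omission of a channels section are assumptions beyond what the hunk shows:

from azureml.core import Environment

# Write the dependency spec shown in the diff to a conda YAML file (hypothetical file name).
conda_yaml = """\
dependencies:
- python=3.6.2
- pip:
  - azureml-core
  - azureml-dataset-runtime
  - keras==2.4.3
  - tensorflow==2.4.3
  - numpy
"""
with open("keras_env.yml", "w") as f:
    f.write(conda_yaml)

# Turn the spec into an Azure ML Environment object for training or deployment runs.
keras_env = Environment.from_conda_specification(name="keras-env", file_path="keras_env.yml")
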
View File

@@ -101,9 +101,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"Now that the initial data is loaded, define a function to create various time-based features from the pickup datetime field. This will create new fields for the month number, day of month, day of week, and hour of day, and will allow the model to factor in time-based seasonality. \n",
"\n",
"Use the `apply()` function on the dataframe to iteratively apply the `build_time_features()` function to each row in the taxi data."
"Remove some of the columns that you won't need for training or additional feature building. Automate machine learning will automatically handle time-based features such as lpepPickupDatetime."
]
},
{
@@ -112,33 +110,7 @@
"metadata": {},
"outputs": [],
"source": [
"def build_time_features(vector):\n",
" pickup_datetime = vector[0]\n",
" month_num = pickup_datetime.month\n",
" day_of_month = pickup_datetime.day\n",
" day_of_week = pickup_datetime.weekday()\n",
" hour_of_day = pickup_datetime.hour\n",
" \n",
" return pd.Series((month_num, day_of_month, day_of_week, hour_of_day))\n",
"\n",
"green_taxi_df[[\"month_num\", \"day_of_month\",\"day_of_week\", \"hour_of_day\"]] = green_taxi_df[[\"lpepPickupDatetime\"]].apply(build_time_features, axis=1)\n",
"green_taxi_df.head(10)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Remove some of the columns that you won't need for training or additional feature building."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"columns_to_remove = [\"lpepPickupDatetime\", \"lpepDropoffDatetime\", \"puLocationId\", \"doLocationId\", \"extra\", \"mtaTax\",\n",
"columns_to_remove = [\"lpepDropoffDatetime\", \"puLocationId\", \"doLocationId\", \"extra\", \"mtaTax\",\n",
" \"improvementSurcharge\", \"tollsAmount\", \"ehailFee\", \"tripType\", \"rateCodeID\", \n",
" \"storeAndFwdFlag\", \"paymentType\", \"fareAmount\", \"tipAmount\"\n",
" ]\n",