Google PageSpeed Insights

This page guides you through the process of setting up the Google PageSpeed Insights source connector.

Sync overview

Prerequisites

Set up the Google PageSpeed Insights source connector

  1. Log in to your Airbyte Cloud or Airbyte Open Source account.
  2. Click Sources and then click + New source.
  3. On the Set up the source page, select Google PageSpeed Insights from the Source type dropdown.
  4. Enter a name for your source.
  5. For API Key, enter your Google PageSpeed API Key.
  6. For URLs to analyse, enter one or more URLs for which you want to generate PageSpeed Insights reports. Example: https://www.google.com.
  7. For Analyses Strategies, enter "desktop", "mobile", or both to define which analysis strategies to use.
  8. For Lighthouse Categories, select one or more of the provided options. Categories are also called "audits" in some of the Google Lighthouse documentation.
  9. Click Set up source.
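
The values entered in the steps above can be pictured as a small configuration object. The following is a minimal sketch, not the connector's actual spec: the field names `api_key`, `urls`, `strategies`, and `categories` are hypothetical, and it assumes one report is produced per combination of URL and strategy.

```python
from itertools import product

# Strategies named in the setup steps above.
VALID_STRATEGIES = {"desktop", "mobile"}


def expand_requests(config):
    """Return one (url, strategy) pair per analysis the config implies.

    Assumption: one report per URL and strategy. Field names are hypothetical.
    """
    urls = config["urls"]
    strategies = config["strategies"]
    if not urls:
        raise ValueError("at least one URL is required")
    unknown = set(strategies) - VALID_STRATEGIES
    if not strategies or unknown:
        raise ValueError(
            f"strategies must be 'desktop' and/or 'mobile', got {strategies!r}"
        )
    return list(product(urls, strategies))


example_config = {
    "api_key": "YOUR_API_KEY",  # placeholder, not a real key
    "urls": ["https://www.google.com", "https://www.example.com"],
    "strategies": ["desktop", "mobile"],
    "categories": ["performance", "seo"],
}

pairs = expand_requests(example_config)
```

With two URLs and both strategies, `pairs` holds four entries, which is a useful way to estimate how many API queries a sync will issue.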

IMPORTANT: As of 2022-12-13, the PageSpeed Insights API, as well as this Airbyte connector, lets you specify a URL with the prefix "origin:", for example origin:https://www.google.com. This returns condensed, aggregated reports about the specified origin (see this FAQ). However, this option no longer appears in any official documentation, so it may be deprecated at any time.
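
To illustrate how an "origin:" URL differs from a regular page URL at the request level, here is a minimal sketch that builds query strings for the public PageSpeed Insights v5 endpoint. The helper `build_request_url` is hypothetical, and the connector may assemble its requests differently. Category values are passed through as given.

```python
from urllib.parse import urlencode

# Public PageSpeed Insights v5 endpoint.
PSI_ENDPOINT = "https://www.googleapis.com/pagespeedonline/v5/runPagespeed"


def build_request_url(target, strategy, categories, api_key=None):
    """Build a request URL; `target` may carry the 'origin:' prefix unchanged."""
    params = [("url", target), ("strategy", strategy)]
    # The API accepts the category parameter more than once.
    params += [("category", category) for category in categories]
    if api_key:
        params.append(("key", api_key))
    return f"{PSI_ENDPOINT}?{urlencode(params)}"


page_request = build_request_url(
    "https://www.google.com", "mobile", ["performance"]
)
origin_request = build_request_url(
    "origin:https://www.google.com", "mobile", ["performance"]
)
```

The only difference between the two requests is the value of the `url` parameter; the prefix is simply percent-encoded along with the rest of the URL.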

Supported sync modes

The Google PageSpeed Insights source connector supports the following sync modes:

  • Full Refresh

Supported Streams

The Google PageSpeed Insights source connector supports the following stream:

  • pagespeed: Full PageSpeed report for the selected URLs, Lighthouse categories, and analysis strategies.

Features

| Feature | Supported? (Yes/No) | Notes |
| --- | --- | --- |
| Full Refresh Sync | Yes | |
| Incremental Sync | No | |

Performance considerations

When the connector is used without an API key, Google applies an undocumented but strict rate limit, which also depends on how many requests are being sent to the PageSpeed API globally. The connector retries using an exponential backoff interval.
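
For readers unfamiliar with exponential backoff, the following is a minimal sketch of the general technique, not the connector's actual retry code. The `RateLimited` exception, the helper name, and the default delays are all assumptions made for illustration.

```python
import time


class RateLimited(Exception):
    """Hypothetical signal that the API rejected a request due to rate limiting."""


def call_with_backoff(request, max_attempts=5, base_delay=1.0, factor=2.0,
                      sleep=time.sleep):
    """Call `request`, waiting base_delay * factor**attempt between retries."""
    for attempt in range(max_attempts):
        try:
            return request()
        except RateLimited:
            # Give up after the final attempt and surface the error.
            if attempt == max_attempts - 1:
                raise
            sleep(base_delay * factor ** attempt)
```

With the defaults, the waits between attempts are 1, 2, 4, and 8 seconds. The `sleep` parameter is injectable so the delay schedule can be checked without actually waiting.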

If the connector is used with an API key, Google allows 25,000 queries per day and 240 queries per minute. Under normal usage, the connector should therefore not trigger any rate limits. Create an issue if you see rate limit errors that are not automatically retried successfully.
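
As a rough way to check a planned setup against these quotas, the arithmetic can be sketched as follows. This assumes one API query per URL and strategy, which may not match the connector's exact request pattern. The function names are hypothetical.

```python
DAILY_QUOTA = 25_000      # queries per day with an API key, as stated above
PER_MINUTE_QUOTA = 240    # queries per minute with an API key, as stated above


def requests_per_sync(num_urls, num_strategies):
    """Queries issued by one sync, assuming one query per URL and strategy."""
    if num_urls < 1 or num_strategies < 1:
        raise ValueError("need at least one URL and one strategy")
    return num_urls * num_strategies


def max_syncs_per_day(num_urls, num_strategies):
    """Number of full syncs that fit into the daily quota."""
    return DAILY_QUOTA // requests_per_sync(num_urls, num_strategies)


def min_seconds_between_requests():
    """Average spacing that keeps a steady stream under the per-minute quota."""
    return 60 / PER_MINUTE_QUOTA
```

For example, 50 URLs analysed with both strategies come to 100 queries per sync, so up to 250 such syncs fit into the daily quota.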

Changelog

| Version | Date | Pull Request | Subject |
| --- | --- | --- | --- |
| 0.1.1 | 2023-05-25 | #22287 | 🐛 Fix URL pattern regex |
| 0.1.0 | 2022-11-26 | #19813 | 🎉 New Source: Google PageSpeed Insights [low-code CDK] |