1
0
mirror of synced 2026-01-04 09:04:47 -05:00
Files
airbyte/docs/integrations/sources/redshift.md
Liren Tu 888347a0d0 🎉 JDBC sources: store cursor record count in db state (#15535)
* Add cursor_record_count to db stream state

* Add cursor record count to cursor info

* Emit max cursor record count

* Add original cursor record count

* Unify logging format

* Add backward compatible methods

* Update unit tests for state decorating iterator

* Update test (not done yet)

* Fix one more unit test

* Change where clause operator according to record count

* Add branch for null cursor

* Skip saving record count when it is 0

* Fix log wording

* Set mock record count in test

* Check cursor value instead of cursor info

* Fix source jdbc test

* Read record count from state

* Fix tests

* Add an acceptance test case

* Fix npe

* Change record count from int to long to avoid type conversion

* Fix references

* Fix oracle container

* Use uppercase for snowflake

* Use uppercase for db2

* Fix and use uppercase

* Update test case to include the edge case

* Format code

* Remove extra assertion in clickhouse

* Merge ms sql incremental query method

* Log query for debugging

* Clean up name_and_timestamp table

* Fix db2 tests

* Fix mssql tests

* Fix oracle tests

* Fix oracle tests

* Fix cockroachdb tests

* Fix snowflake tests

* Add changelog

* Fix mssql tests

* Fix db2-strict-encrypt tests

* Fix oracle-strict-encrypt tests

* Bump postgres version

* Fix oracle-strict-encrypt tests

* auto-bump connector version [ci skip]

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-10-14 01:09:12 -07:00

5.3 KiB

Redshift

Overview

The Redshift source supports Full Refresh syncs. That is, every time a sync is run, Airbyte will copy all rows in the tables and columns you set up for replication into the destination in a new table.

This Redshift source connector is built on top of the source-jdbc code base and is configured to rely on JDBC 4.2 standard drivers provided by Amazon via Mulesoft here as described in Redshift documentation here.

Sync overview

Resulting schema

The Redshift source does not alter the schema present in your warehouse. Depending on the destination connected to this source, however, the schema may be altered. See the destination's documentation for more details.

Features

Feature Supported Notes
Full Refresh Sync Yes
Incremental Sync Coming soon
Replicate Incremental Deletes Coming soon
Logical Replication (WAL) Coming soon
SSL Support Yes
SSH Tunnel Connection Coming soon
Namespaces Yes Enabled by default
Schema Selection Yes Multiple schemas may be used at one time. Keep empty to process all of existing schemas

Incremental Sync

Incremental sync (copying only the data that has changed) for this source is coming soon.

Getting started

Requirements

  1. Active Redshift cluster
  2. Allow connections from Airbyte to your Redshift cluster (if they exist in separate VPCs)

Setup guide

1. Make sure your cluster is active and accessible from the machine running Airbyte

This is dependent on your networking setup. The easiest way to verify if Airbyte is able to connect to your Redshift cluster is via the check connection tool in the UI. You can check AWS Redshift documentation with a tutorial on how to properly configure your cluster's access here

2. Fill up connection info

Next is to provide the necessary information on how to connect to your cluster such as the host whcih is part of the connection string or Endpoint accessible here without the port and database name (it typically includes the cluster-id, region and end with .redshift.amazonaws.com).

Encryption

All Redshift connections are encrypted using SSL

Changelog

Version Date Pull Request Subject
2022-10-13 15535 Update incremental query to avoid data missing when new data is inserted at the same time as a sync starts under non-CDC incremental mode
0.3.14 2022-09-01 16258 Emit state messages more frequently
0.3.13 2022-05-25 Added JDBC URL params
0.3.12 2022-08-18 14356 DB Sources: only show a table can sync incrementally if at least one column can be used as a cursor field
0.3.11 2022-07-14 14574 Removed additionalProperties:false from JDBC source connectors
0.3.10 2022-04-29 12480 Query tables with adaptive fetch size to optimize JDBC memory consumption
0.3.9 2022-02-21 9744 List only the tables on which the user has SELECT permissions.
0.3.8 2022-02-14 10256 Add -XX:+ExitOnOutOfMemoryError JVM option
0.3.7 2022-01-26 9721 Added schema selection
0.3.6 2022-01-20 8617 Update connector fields title/description
0.3.5 2021-12-24 8958 Add support for JdbcType.ARRAY
0.3.4 2021-10-21 7234 Allow SSL traffic only
0.3.3 2021-10-12 6965 Added SSL Support
0.3.2 2021-08-13 4699 Added json config validator