1
0
mirror of synced 2026-02-01 04:01:50 -05:00
Commit Graph

174 Commits

Author SHA1 Message Date
Davin Chia
a5a142d085 Implement majority of more efficient Redshift Copy Strategy. (#2547)
Instead of inserts, we write the data to S3 and issue a COPY command copying from S3 into redshift.

Use a single file as its sufficiently performant and we do not want to introduce file destination related operations yet.
Use an open source library for uploads as AWS does not natively support streaming loads.

My intention with this PR is to first implement the meat of the write-and-copy strategy. This is mainly centered around the RedshiftCopier class. I plan hook up the RedshiftCopier to the actual Destination class, and implement all the plumbing, in a follow up PR.

Co-authored-by: Davin Chia <davinchia@Davins-MacBook-Pro.local>
2021-03-24 22:26:28 +08:00
Charles
9a81bd6e5c MeiliSearch Destination (#1964) 2021-02-08 18:44:55 -08:00
Charles
285c176a93 remove naming transformer from destination interface (#1953) 2021-02-04 15:46:53 -08:00
Charles
aadfae24bd Iterator-based JDBC Source (and Redshift bugfix) (#1887) 2021-02-02 17:14:14 -08:00
Charles
8184441892 Add logging to JdbcSource and JdbcDestination (#1798)
* gives better visibility into the progress of phase of both the source and destination.

* 1 log per every 10K records in the source

* destination logs each step in destination including setting up tmp tables, creating final tables, cleaning up tmp tables
2021-01-22 22:13:10 -08:00
Christophe Duong
7a741d24b9 Disable ExtendedNameTransformer and use StandardNameTransformer instead (#1786)
* Disable ExtendedNameTransformer and use StandardNameTransformer instead

* Fix acceptance test
2021-01-22 20:53:12 +01:00
Charles
451846ccaf add more logging to jdbc destination (#1775) 2021-01-21 16:37:16 -08:00
Christophe Duong
ebc24f85fe Rename destination tables (#1737) 2021-01-20 17:37:00 +01:00
Christophe Duong
194d6cb5d0 Add metadata prefix to destination internal columns (#1708)
* Add metadata prefix to destination internal columns

* Bumpversion all destinations for new normalization behavior

* Fix acceptance tests
2021-01-19 17:21:15 +01:00
hudsondba
a254234b0a Increasing BATCH_SIZE and timeout sync (#1593)
Co-authored-by: Hudson dos Santos <hudson@flieber.com>
2021-01-11 09:19:47 -08:00
Charles
f9b63d0a9e Update Destination Abstractions (#1456) 2021-01-07 10:37:10 -08:00
Christophe Duong
6d25e65d57 Renaming some classes to be less specific to SQL destinations (#1283) 2020-12-21 15:16:57 +01:00
Charles
25689eea56 add incremental to jooq source (and postgres) (#1172) 2020-12-08 21:14:11 -08:00
Christophe Duong
4ca8760e51 Chris/snowflake incremental (#1194)
* Refactor Snowflake destination and support incremental (#1125)
2020-12-04 19:52:33 +01:00
Christophe Duong
d06392e900 Redshift Destination & refactoring to introduce destination-jdbc
Closes #193
Closes #1126
2020-12-03 18:07:46 +01:00
Charles
cda619697e add cleaned names to api interface (#1138) 2020-11-30 21:30:16 -08:00
Christophe Duong
206d3cbea8 Naming conventions managed in destinations (#1060)
Handled field and table identifier conversions as necessary in order to successfully in destinations
2020-11-25 18:53:23 +01:00
Charles
e7edb2c858 Adding incremental to the catalog data model (#998)
* Add ConfiguredAirbyteCatalog and ConfiguredAirbyteStream
2020-11-18 14:15:59 -08:00
Jared Rhizor
8c7129c307 fix format build (#877) 2020-11-10 09:13:21 -08:00
Sherif A. Nada
9305461fec Add _raw suffix to non-normalized destination table names (#874) 2020-11-10 08:31:27 -08:00
Jared Rhizor
e1d7add50a mostly incremental builds (#817)
* support cached builds for base -> base-java -> snowflake

* use plugin for image building

* fix matching on COPY from

* remove docker.gradle

* tmp commit

* update connectors

* finish rest of build files

* fix ide errors

* more build fixes

* clean up

* clean up for new soruces

* fix spotless

* fix flake problems

* add recommended empty file

* python caching

* fixes upon review

* clean up docker and build test files

* clean up python

* clean up

* fix integration test dependencies

* fix standard tests

* fix

* remove symlink

* re-add requirements to fix normalizatioon build

* fix symlink

* fix dumbest build problem of all

* add missing integration test def

* fix missing dep

* remove class exclusion

* move trim so null source versions are allowed

* rename map

* fix hardcoded value

* remove unnecessary dep

* use dashes for salesforce package name

* fix typo

* DRY and fix test image name

* Fix edit

* assert string is not empty

* build integration test image only for integrationTest

* move code generator to tools and rename docker build tasks

* make source test depend on integration test build, not the other way

* remove guard because the docker build should exist before the integrationtest is applied

* remove comment

* DRY up airbyte-source-test

* fix plugin compilation

* add missing dependency

* rename getTaggedImage to getDevTaggedImage

* fix test vs main docker build bug
2020-11-09 11:00:11 -08:00
Charles
5f45aef149 Add support for sources to java base (#749) 2020-10-30 10:33:51 -07:00
Sherif A. Nada
3cea00913d place test mount roots in /tmp (#623) 2020-10-29 14:43:33 -07:00
Charles
31013556c3 Integrations Reorganization: Bases (#673) 2020-10-21 11:23:49 -07:00