1
0
mirror of synced 2025-12-23 21:03:15 -05:00
Commit Graph

1127 Commits

Author SHA1 Message Date
Augustin
89f531f9fa CAT: handle discovery error on previous catalog retrieval (#23992) 2023-03-14 08:43:39 +00:00
Sergio Ropero
771d21822c feat: Add CDC sync checkpointing based on time or records (#21727)
* This commit adds new functionality that generates checkpoints when doing CDC synchronization.
For that purpose we encapsulate an AirbyteMessage Iterator on a new iterator that handles the
checkpoint messaging.

* Reformat code

* Reformat code

* Reformat code

* Reformat code

* Second attempt with ugly if statement

* Add `isRecordBehindOffset` function to make sure is safe to send the state.
Tests are failing as now it has more state messages:
expected: <1> but was: <3>

* Code formatting

* Add additional check if the record is part of the snapshot load to skip state message.

* Remove comments

* Fix imports

* Fix format

* Add check if the iterator has extra elements so we don't send state message twice (edge case)

* Add a new check to avoid sending multiple state messages with same offset.
Fix PR comments.
Not sending checkpoints... figuring out

* Modify MSSQL and MySQL implementations

* Adds better control on Maps and include a test for time checkpoint.
Also adds extra assert to verify there are no duplicate states

* Formatting

* Improve code documentation and use default for CdcStateHandler new functions

* Sort out missing `final` and types from comments

* Minor improve in checkpoint validation

* format files

* It's 2023!

* Import issues

* Changes after merging master

* Upgrade Debezium version in MySQL

* Bump Postgres and Alloydb

* auto-bump connector version

* Manually bumping version

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-13 14:03:01 +01:00
Ryan Fu
85391864f7 Ryan/periodic buffer flush (#23931)
* Added support for periodic buffer flush with tests and uses env variable

* Improves code readability and encapulates testing logic

* Removed demo changes and created const for tests

* Updated constructor to reuse method signature

* Increases Snowflake parallel integration forks

* Bumps version number, fixes linting issues and constant format

* Generate seed spec
2023-03-10 21:43:39 +00:00
Eugene
6672b9cb3a Destination-S3: Correctly generate int64 values (#23466)
* [17564] Updated s3 avro to use long instead of int


---------

Co-authored-by: etsybaev <etsybaev@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-10 16:17:33 +00:00
Sergio Ropero
e72d6859e4 fix: Reduce errors on Postgres source tests (#23727)
* Split test for readability and increase waiting time as possible culprit of random failure

* Improve testDataContent() output and test all the types without instead of stopping the test in the first one.

* Format and add documentation
2023-03-09 14:31:25 +01:00
Sherif A. Nada
e85eda088e Remove ExtendedNameTrasformer (#23655) 2023-03-07 17:22:08 -08:00
Subodh Kant Chaturvedi
f6ec8769a3 DB sources cdc : upgrade Debezium version to 2.1.2 (#23112)
* WIP

* remove wal2json

* revert test change

* working version for MySQL

* cleanup

* mssql changes

* rename module

* format

* undo unwanted change

* disable backward compatibility spec test

* fix acceptance-test-config.yml

* fix acceptance-test-config.yml

* update doc link

* fix mssql

* review comments

* master merge

* review comments

* disable few mssql tests

* temp commit for mssql tests experiment

* another temp commit

* another temp commit

* revert temp commits

* update test

* ssl support added for mssql cdc

* improve tests

* fix strict encrypt test

* make sql-server state backward compatible

* missed this one

* add error properties

* upgrade version and docs

* remove wal2json reference from docs

* auto-bump connector version

* add extra safety validation

* increase wait time

* auto-bump connector version

* update spec

---------

Co-authored-by: alafanechere <augustin.lafanechere@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-07 18:12:07 +05:30
Subodh Kant Chaturvedi
a521ae580e Revert "Postgres Source : Support JSONB datatype" (#23642)
* Revert "Postgres Source : Support JSONB datatype (#21695)"

This reverts commit 90884d0c7b.

* upgrade version and add changelog

* auto-bump connector version

* update spec

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-02 18:42:39 +05:30
Roman Yermilov [GL]
ea776ef19a SAT ignored fields key for basic read (#22996)
* [SAT] add ignored fields for basic read

* [SAT] ignored_fields validation fixture

* [SAT] set ignored_fields default value

* Update airbyte-integrations/bases/connector-acceptance-test/CHANGELOG.md

Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>

* Automated Change

* [SAT] fixture tests and some small fixes

* Update airbyte-integrations/bases/connector-acceptance-test/CHANGELOG.md

Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>

* Automated Change

---------

Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>
Co-authored-by: roman-yermilov-gl <roman-yermilov-gl@users.noreply.github.com>
2023-02-28 18:11:59 +04:00
Ryan Fu
32ae1b0c94 Logging recordWriter and onStreamFlush completion (#23360)
* Adds additional logging when flushing buffer and writing records

* Removes logging for writeRecord since this will explode log lines

* Added logging when uploading records to stage/bucket

* Fixes log lines to properly capture when records have been uploaded

* Bumps version and fixes logging message to more accurately reflect logic

* auto-bump connector version

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-02-27 11:49:58 -08:00
VitaliiMaltsev
90884d0c7b Postgres Source : Support JSONB datatype (#21695)
* Postgres Source Jsonb updated schema with oneOf definition

* updated json schema definition

* updated json schema definition

* updated tests

* refactoring

* fixed checkstyle

* fixed checkstyle

* updated values mapping

* updated test cases and refactoring

* updated test cases

* refactoring

* added jsonb[] support

* refactoring

* updated json schema

* reverted to schema with oneOf

* updated airbyte-protocol version

* deleted protocol files

* bump version

* auto-bump connector version

* manual bump of postgres-source version

* Automated Change

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: VitaliiMaltsev <VitaliiMaltsev@users.noreply.github.com>
2023-02-27 21:38:37 +02:00
Charles
f83ef9eea7 Remove workers (#23422) 2023-02-24 17:45:44 -08:00
Catherine Noll
7da6a3bb77 Run CATs with local CDK (#23084)
Scripts to 

* Run CATs against the local CDK for one connector

* Run CATs against the local CDK for multiple connectors

* Create a connecter image with the local CDK

---------

Co-authored-by: Alexandre Girard <alexandre@airbyte.io>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>
2023-02-24 16:13:42 -05:00
plenti-jacob-roe
5bf63c6b2f Source MSSQL: Added event_serial_no to cdc metadata (#16798)
* Added event_serial_no as cdc metadata for sql server source connector

* fix SourceAcceptanceTest

* upgrade version

* auto-bump connector version

---------

Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
Co-authored-by: Subodh Kant Chaturvedi <subodh1810@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-02-24 19:55:35 +05:30
Nataly Merezhuk
fafffc4607 🎉 Destination S3: Add root level flattening option to JSON output (#21355)
* Adds flattening option in the UI and to S3JsonlFormatConfig.

* Corrects JsonL Serialized Buffer.

* Corrects null value in JsonlFormatConfig and adds tests.

* Adds flattening as requirement in spec.

* Fixes testDataTypeTestWithNormalization in S3JsonlDestinationAcceptanceTest

* Fixes testDataTypeTestWithNormalization in S3JsonlGzipDestinationAcceptanceTest

* Corrects connector behavior and makes testDataTypeTestWithNormalization pass.

* Delete unnecessarily committed file.

* Fixes formatting.

* Fixes build.

* Adds flattening constants and extracts flattening enum.

* Adds flattening type helper to check the flattening value.

* Corrects some of the errors caused by refactoring.

* Completes refactoring.

* Cleans up refactor.

* Delets unnecessary file.

* Removes additional unnecessary files.

* Fix conflict.

* Formatting fix.

* Removes flattening as a required parameter for Jsonl.

* Requested change: removes constant.

* Requested change: move Flattening enum out of the CSVFormatConfig.

* Requested change: removes override and runs format check.

* Requested changes: add lombok and update spec text.

* Fix format.

* Fixes imports.

* Bumps version and adds changelog.

* Roll back Dockerfile version update.

* Requested changes: explicit imports.

* Requested changes: fix failing check_connector_https_url_only.

* Corrects http url.

* Bumps docker version and corrects changelog date.

* auto-bump connector version

---------

Co-authored-by: octavia-squidington-iii <octavia-bot@airbyte.io>
Co-authored-by: Edward Gao <edward.gao@airbyte.io>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-02-23 13:02:22 -05:00
Sergio Ropero
c0f973e2e4 fix: Permission check failing when privilege is at ROLE level (#23330)
* source-snowflake: use a safer method for parsing a BigInteger cursor value (#22358)

* use a safer method for parsing a BigInteger cursor value

* Add testing

* fix format change

* Fix failing integration tests

* Try removing the failing incremental test

* Try removing the failing incremental test

* Fix failing test

* Add metadata to connector logs (log level, class name, method name and line number) (#23105)

* Issue #17861 Add labels, class, method name and line numbers to connector logs

* Refactored unit test

* fix for warning about UTF8 charset in test class

---------

Co-authored-by: prateekmukhedkar <prateek@airbyte.io>

* This commit fixes the issue when permission is granted at ROLE level instead of USER level.

Missing revoke privileges in the tests.

* Change the query to recursively look for all roles asigned.

Also improve testing.

* Add test for subrole with replication access

* formatting

* Roles don't share attributes, only accesses.
That means that the REPLICATION or SUPERUSER can not be inhered to the user. Because of that, we need to make the user have REPLICATION access directly.

* Bump versions and update alloydb docs

* Roles don't share attributes, only accesses.
That means that the REPLICATION or SUPERUSER can not be inhered to the user. Because of that, we need to make the user have REPLICATION access directly.

* improve comment

* Typo

* Change from checking the permisions in `pg_users` to execute `createConnection` and verifying the connection is fine for CDC.

* Remove unneeded import

* format

* Rename ReplicationConnection class

* Revert "source-snowflake: use a safer method for parsing a BigInteger cursor value (#22358)"

This reverts commit e9efd9878a.

* Revert "Add metadata to connector logs (log level, class name, method name and line number) (#23105)"

This reverts commit a2c80a1fdb.

* Change ConfigError throw point

* Include in try to autoclose the connection

* Bump versions

* auto-bump connector version

* fix SSL failure on check

* format + undo spec changes

* auto-bump connector version

* Manual interaction for source definitions

---------

Co-authored-by: Rodi Reich Zilberman <867491+rodireich@users.noreply.github.com>
Co-authored-by: Prateek Mukhedkar <123108018+prateekmukhedkar@users.noreply.github.com>
Co-authored-by: prateekmukhedkar <prateek@airbyte.io>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: subodh <subodh1810@gmail.com>
2023-02-23 17:43:31 +00:00
Rodi Reich Zilberman
49f97f1142 Integration Branch for DB/DW Sources team for Feb '23 code freeze (#23185)
* source-snowflake: use a safer method for parsing a BigInteger cursor value (#22358)

* use a safer method for parsing a BigInteger cursor value

* Add testing

* fix format change

* Fix failing integration tests

* Try removing the failing incremental test

* Try removing the failing incremental test

* Fix failing test

* Add metadata to connector logs (log level, class name, method name and line number) (#23105)

* Issue #17861 Add labels, class, method name and line numbers to connector logs

* Refactored unit test

* fix for warning about UTF8 charset in test class

---------

Co-authored-by: prateekmukhedkar <prateek@airbyte.io>

* Update docker image and release notes

* auto-bump connector version

* manually bump version on spec

---------

Co-authored-by: Prateek Mukhedkar <123108018+prateekmukhedkar@users.noreply.github.com>
Co-authored-by: prateekmukhedkar <prateek@airbyte.io>
Co-authored-by: Sergio Ropero <sergio@airbyte.io>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-02-22 11:59:23 -08:00
Conor
8b679d7473 Remove unused deps in airbyte-commons-worker (#23238)
* fix failing test from migration

* remove unused code in airbyte-commons-worker
2023-02-19 14:17:52 -06:00
Mikhail Shustov
2ce3c17048 🎉 Destination ClickHouse: bump dbt-clickhouse to v1.4.0 (#23023)
* bump dbt-clickhouse to 1.4.0

* fix clickhouse integration test

* exclude duckdb from tests

* add to changelog

* bump normalization version in definitions

---------

Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
Co-authored-by: Edward Gao <edward.gao@airbyte.io>
2023-02-16 20:15:09 -08:00
Conor
903d95d2ef Download latest oss connector catalog on build and use as reference (#22376)
* download latest connector catalog on build

* format

* Add small todos

* Add skeleton for local provider change

* Load catalog from local

* Remove seed class from localdefprovider

* Get catalog path from config

* refactor arguments out of download catalog

* Move to a constant file method

* Fix arguments

* Rename to CatalogDefinitionsConfig

* Add todos

* Update todo

* Apply formatting

* remove unnessessary try catch

* Refactor IconValidationTask

* Fix null value issue

* Only build oss catalog via github action

* Add LOCAL_CONNECTOR_CATALOG_PATH env var override

* Format

* Add logger

* Copy file directly to avoid circular dep

* Format

* Ensure run before tests

* Move airbyte:init:spec to general dependency

* LOCAL_CATALOG_PATH -> LOCAL_CONNECTOR_CATALOG_PATH

---------

Co-authored-by: Ben Church <ben@airbyte.io>
2023-02-16 16:09:59 -06:00
Davin Chia
5578c8fd8f Revert "Use Micronaut with airbyte-server-wrapped (#22669)" (#23109)
This reverts commit d8ae36fe54 as this wasn't passing Airbyte Prod tess.
2023-02-16 07:45:20 -08:00
Serhii Chvaliuk
a7c8ea74bb Connector acceptance test: Fix docker API - "Error" is optional. (#22987)
* error fix

Signed-off-by: Serhii Chvaliuk <grubberr@gmail.com>

* bump 0.5.4

Signed-off-by: Serhii Chvaliuk <grubberr@gmail.com>

* test_failed_reading fixed

Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>

---------

Signed-off-by: Serhii Chvaliuk <grubberr@gmail.com>
Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>
2023-02-15 04:48:45 -06:00
Jonathan Pearlin
d8ae36fe54 Use Micronaut with airbyte-server-wrapped (#22669)
Support conversion of the airbyte-server-wrapped Cloud module to Micronaut

- Remove @Requires rule from controllers so they will be used in both OSS and Cloud
- Remove airbyte-commons-worker dependency from airbyte-server by moving shared code to new airbyte-commons-converters module

Co-authored-by: Davin Chia <davinchia@gmail.com>
2023-02-14 13:21:45 -08:00
Augustin
9d57772303 connector-acceptance-tests: support custom environment variables (#22937)
* connector-acceptance-tests: support custom environment variables

* update postgres acceptance-test-config.yml
2023-02-14 13:57:38 -06:00
Sherif A. Nada
37501884eb unpin cdk version in normalization build(#22973) 2023-02-13 20:59:20 -08:00
Charles
c4bf76655e format (#22970) 2023-02-13 19:21:54 -08:00
Sherif A. Nada
bec7a26b27 ignore normalization CDK model related linting errors (#22963) 2023-02-13 17:48:23 -08:00
Greg Solovyev
6c8d3f655d Default CH ssl to true and fix the failure if ssl property is missing (#22846) 2023-02-13 17:40:08 -08:00
Peter Hu
e5eac0a4cc use published protocol models jar (#22498)
* add airbyte-protocol to deps.toml

* use published protocol jar for platform

* use published protocol jar for connectors

* point at published jar

* fix dep

* bump gcs storage

* fix build failures in standard-source-test

* fix deps

* downgrade alloy db because it is missing strictness tests

* Revert "downgrade alloy db because it is missing strictness tests"

This reverts commit cc6089d053.

---------

Co-authored-by: cgardens <charles@airbyte.io>
2023-02-13 12:50:43 -06:00
Ben Church
ce770d3eea Catch empty state in incremental SAT (#22353)
* Catch state being empty

* Update test_two_sequential_reads to catch empty state on first read

* Add integration test of empty state

* Fix legacy state test

* Move state_name to variable

* Clean up

* Format

* Fix rogue test
2023-02-09 15:19:55 -08:00
Joe Reuter
d82e01a34f Connector acceptance test: Fix oneof check (#22395)
* fix oneof check

* adjust changelog

* adjust changelog
2023-02-09 20:37:52 +01:00
Ben Church
465e6460ee Add emitted_at SAT test (22240) (#22291) 2023-02-09 08:11:55 -08:00
Xiaohan Song
0ba609c6f1 Fix for acceptance test on discover workflow (#22595)
* fix test

* remove unused var

* add converter into test

* use converters to convert client catalog to proto

* remove cdk related changes

* more cdk remove

* Minor format changes

* remove untrue comment

* Minor format changes

---------

Co-authored-by: Sergio Ropero <42538006+sergio-ropero@users.noreply.github.com>
Co-authored-by: Sergio Ropero <sergio@airbyte.io>
2023-02-09 00:52:04 +00:00
Cole Snodgrass
2e099acc52 update headers from 2022 -> 2023 (#22594)
* It's 2023!

* 2022 -> 2023

---------

Co-authored-by: evantahler <evan@airbyte.io>
2023-02-08 13:01:16 -08:00
Ryan Fu
d21068c989 Tentatively disallowed normalization for DuckDB (#22528) 2023-02-07 20:22:49 -08:00
Xiaohan Song
b89196b24e add a return for the mock (#22535) 2023-02-07 15:59:49 -08:00
Cole Snodgrass
f45718d850 remove commons-cli dependency from integrations (#22525)
* remove commons-cli dependency from integrations

* re-add commons-cli dependency, used in other places
2023-02-07 14:17:46 -08:00
Simon Späti
2bbc4f6f83 🎉 New Destination: DuckDB (#17494)
This is the first version of the DuckDB destination. There are potential edge cases that still need to be taken care of. But looking forward to your feedback.
2023-02-07 11:33:10 +01:00
Edward Gao
517fc6ac10 Normalization: Revert to protocol v0 (#22283)
* Revert "Normalization: handle non-object top-level schemas; treat binary data as string (#22165)"

This reverts commit 8276d03359.

* Revert "Normalization: check for ref type existence (#22161)"

This reverts commit dbe56d6fc2.

* Revert "🎉Updated normalization to handle new datatypes (#19721)"

This reverts commit c1d7736639.

* revert dest definitions

* also dockerfile

* re-add to changelog

* add comment in dockerfile
2023-02-06 10:14:36 -08:00
Ben Church
035d415bc4 📖 Add run one test example to SAT readme (#22292) 2023-02-06 10:10:44 -08:00
Xiaohan Song
36698ce710 Discover worker starts to use API to write schema result (#21875)
* api changes for writing discover catalog

* api changes

* format

* worker change 1

* change return type of the API to return catalogId

* worker to call api

* typo

* 🎉 Source GoogleSheets -  migrated SAT to strictness level (#21399)

* migrated SAT to strictness level

* fixed expected records

* revert file from another source

* changed extension to txt

* changed extension to txt

* 🐛Destination-Bigquery: Added an explicit error message if sync fails due to a config issue (#21144)

* [19998] Destination-Bigquery: Added an explicit error message in sync fails due to a config issue

* ci-connector-ops: split workflows(#21474)

* CI:  nightly build alpha sources and destinations (#21562)

* Revert "Change main class in strict-encrypt destination and bump versions on both destinations to keep them in sync (#21509)" (#21567)

This reverts commit 1d202d1707.

* Fixes webhook updating logic (#21519)

* ci_credentials: disable tooling test run by tox (#21580)

* disable tox

* rename steps

* revert changes on experimental workflow

* do not install tox

* Revert "CI:  nightly build alpha sources and destinations (#21562)" (#21589)

This reverts commit 61f88f3013.

* Security update of default docker images (#21407)

Because there is a lot of CVEs in those releases.

Co-authored-by: Topher Lubaway <asimplechris@gmail.com>

* 📝 add docs for how to add normalization (#21563)

* add docs

* add schema link

* update based on feedback

* 🪟 🚦  E2E tests: clean up matchers (#20887)

* improve serviceTypeDropdownOption selector

* add test ids to PathPopout component(s)

* add unique id's to table dropdowns

* extend submitButtonClick to support optional click options

* update dropdown(pathPopout) matchers

* add test-id to Overlay component

* remove redundant function brackets

* revert changes onSubmit button click

* fix dropDown overlay issue

* move all duplicated intercepters to beforeEach

* add test id's to Connections, Sources and Destinations tables

* add table helper functions

* update source page actions

* intercepter fixes

* update createTestConnection function with optional replication settings

* remove extra Connection name check

* replace "cypress-postgres" with "pg-promise" npm package

* update cypress config

* Revert "update createTestConnection function with optional replication settings"

This reverts commit 8e47c7837b.

* Revert "remove extra Connection name check"

This reverts commit dfb19c7dd4.

* replace openSourceDestinationFromGrid with specific selector

* replace openSourceDestinationFromGrid with specific selector

* turn on test

* add test-id's

* fix selectors

* update test

* update test snapshots

* fix lost data-testid after resolve merge conflicts

* remove extra check

* move clickOnCellInTable helper to common.ts file

* remove empty line and comments

* fix dropdownType

* replace partial string check with exact

* extract interceptors and waiters to separate file

* fix selector for predefined PK

* fix selector

* add comment regarding dropdown

* 🪟 🎨 [Free connectors] Update modal copy (#21600)

* move start/end time options out of optional block (#21541)

* lingering fix

* reflecting api changes

* test fix

* worker to call api to do discover work

* recovered deleted html

* self review

* more converters refactor

* fix connector test

* fix test

* fix

* fix integration test

* add unit test for converter

* static fix

* api client needs to have a timeout in case request does not get responded

---------

Co-authored-by: midavadim <midavadim@yahoo.com>
Co-authored-by: Eugene <etsybaev@gmail.com>
Co-authored-by: Augustin <augustin@airbyte.io>
Co-authored-by: Greg Solovyev <grishick@users.noreply.github.com>
Co-authored-by: Yatsuk Bogdan <yatsukbogdan@gmail.com>
Co-authored-by: Hervé Commowick <github@herve.commowick.fr>
Co-authored-by: Topher Lubaway <asimplechris@gmail.com>
Co-authored-by: Pedro S. Lopez <pedroslopez@me.com>
Co-authored-by: Vladimir <volodymyr.s.petrov@globallogic.com>
Co-authored-by: Joey Marshment-Howell <josephkmh@users.noreply.github.com>
Co-authored-by: Lake Mossman <lake@airbyte.io>
2023-02-06 09:35:16 -08:00
Serhii Chvaliuk
449c3d8ba2 Connector acceptance test: Fix discovered catalog caching for different configs (#22301)
Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>
2023-02-06 18:49:42 +02:00
Joe Reuter
6a10ae3e05 Rename source acceptance test to connector acceptance test (#21846)
Rename source acceptance test to connector acceptance test
2023-02-02 11:38:19 +01:00
Edward Gao
8276d03359 Normalization: handle non-object top-level schemas; treat binary data as string (#22165)
* handle dumb top-level schemas

* version bump

* also definitions

* treat binary as string

* fallback case

* format

* new variable
2023-01-31 15:59:04 -06:00
Edward Gao
dbe56d6fc2 Normalization: check for ref type existence (#22161)
* check for ref type existence

* version bump

* bump normalization version

* format
2023-01-31 11:33:34 -08:00
Ben Church
90828d4bd9 Version connector build statuses (#22029)
* Refactor the job log json to include the docker_version

* Output to versioned folder

* Handle the case where people call the action without connector prefixed

* Retrieve status of each connector

* Use build report statuses in the QA Engine

* Cast build status as an enum
2023-01-30 16:16:16 -08:00
Jimmy Ma
6660b13ad2 Add Airbyte Protocol V1 support. (#20036)
* Add Airbyte Protocol V1 support.

* Fix VersionedAirbyteStreamFactoryTest

* Remove AirbyteMessageMigrationV0 example

* Add Protocol Version constants

* 🎉Updated normalization to handle new datatypes (#19721)

* Updated normalization simple stream processing to handle new datatypes

* Updated normalization nested stream processing to handle new datatypes

* Updated normalization nested stream processing to handle new datatypes

* Updated normalization drop_scd_catalog processing to handle new datatypes

* Updated normalization ephemeral test processing to handle new datatypes

* fixed more tests for normalization

* fixed more tests for normalization

* fixed more tests for normalization

* fixed more tests for normalization

* fixed more issues

* fixed more issues (clickhouse)

* fixed more issues

* fixed more issues

* fixed more issues

* added binary type processing for some DBs

* cleared commented code and moved some hardcodes to processing as macro

* fixed codestyle and cleared commented code

* minor refactor

* minor refactor

* minor refactor

* fixed bool cast error

* fixed dict->str cast error

* fixed is_combining_node cast py check

* removed commented code

* removed commented code

* committed autogenerated normalization_test_output files

* committed autogenerated normalization_test_output files (new files)

* refactored utils.py

* Updated utils.py to use Callable functions and get rid of property_type in is_number and is_bool functions

* committed autogenerated normalization_test_output files (new files)

* fixed typo in TIMESTAMP_WITH_TIMEZONE_TYPE

* updated stream_processor to handle string type first as a wider type

* fixed arrays normalization by updating is_simple_property method as per new approaches

* format

Co-authored-by: Edward Gao <edward.gao@airbyte.io>

* Update airbyte protocol migration (#20745)

* Extract MigrationContainer from AirbyteMessageMigrator

* Add ConfiguredAirbyteCatalogMigrations

* Add ConfiguredAirbyteCatalog to AirbyteMessageMigrations

* Enable ConfiguredAirbyteCatalog migration

* Fix tests

* Remove extra this.

* Add missing docs

* Typo

Co-authored-by: Edward Gao <edward.gao@airbyte.io>

* Data types update: Implement protocol message migrations (#19240)

* Extract MigrationContainer from AirbyteMessageMigrator

* Add ConfiguredAirbyteCatalogMigrations

* Add ConfiguredAirbyteCatalog to AirbyteMessageMigrations

* Enable ConfiguredAirbyteCatalog migration

* set up scaffolding

* [wip] more scaffolding, basic unit test

* minimal green code

* [wip] add failing test for other primitive types

* correct version number

* handle basic primitive type decls

* add implicit cases

* add recursive schema

* formatting

* comment

* support not

* fix indentation

* handle all nested schema cases

* handle boolean schemas

* verify empty schema handling

* cleanup

* extract map

* code organization

* extract method

* reformat

* [wip] more tests, minor fix type array handling

* corrected test

* cleanup

* reformat

* switch to v1

* add support for multityped fields

* missed test case

* nested test class

* basic record upgrade

* implement record upgrades

* slight refactor

* comments+clarificationso

* extract constants

* (partly) correct model classes

* add de/ser

* formatting

* extract constants

* fix json reference

* update docs

* switch to v1 models

* fix compile+test

* add base64 handling

* use vnull

* Data types update: Implement protocol message downgrade path (#19909)

* rough skeleton for passing catalog into migration

* basic test

* more scaffolding

* basic implementation

* add primitives test

* add in other tests (nested fields currently failing)

* add formats

* impleent oneOf handling

* formatting

* oneOf handling

* better tests

* comments + organization

* progress

* basic test case

* downgrade objects, ish

* basic array implementation

* handle numeric failure

* test for new type

* handle array items

* empty schema handling

* first pass at oneof handling

* add more tests+handling

* more tests

* comments

* add empty oneof test case

* format + reorganize

* more reorganize

* fix name

* also downgrade binary data

* only import vnull

* move migrations into v1 package

* extract schema mutation code

* comment

* extract schema migration to new class

* extract record downgrade logic for future use

* format

* fix build after rebase

* rename private method for consistency

* also implement configuredcatalog migrations >.>

* quick and dirty tests

* slight cleanup

* fix tests

* pmd

* pmd test

* null check on message objects

* maybe fix acceptance tests?

* fix name

* extract constants

* more fixes

* tmp

* meh

* fix cdc acc tests

* revert to master source-postgres

* remove log messages

* revert other misc hacks

* integers are valid cursors

* remove unrelated change

* fix build

* fix build more?

* [MUST REVERT] use dev normalization

* capture kube logs

* also here?

* no debug logs?

* delete dup from merging

* add final everywhere

* revert test changes

Co-authored-by: Jimmy Ma <jimmy@airbyte.io>

* On-the-fly migrations of persisted catalogs (#21757)

* On the fly catalog migration for normalization activity

* On the fly catalog migration for job persistence

* On the fly migration for standard sync persistence

* On the fly migration for airbyte catalogs

* Refactor code to share JsonSchema traversal

* Add V0 Data type search function

* PMD and Format

* Fix getOrInsertActorCatalog and ConfigRepositoryE2E tests

* Null-proofing CatalogMigrationV1Helper

* More null checks

* Fix test

* Format

* Add data type v1 support to the FE

* Changes AC test check to check exited ps (#21672)

some docker compose changes no longer show exited
processes.  this broke out test

this change should fix master

tested in a runner that failed

* Move wellknown types mapping to the utility function

* use protocolv1 normalization

---------

Co-authored-by: Topher Lubaway <asimplechris@gmail.com>
Co-authored-by: Edward Gao <edward.gao@airbyte.io>

* Update protocol support range (#21996)

* bump normalization version to 0.3.0

* Add version check on normalization (#22048)

* Add normalization min version check

* Add visible for testing

---------

Co-authored-by: Edward Gao <edward.gao@airbyte.io>
Co-authored-by: Eugene <etsybaev@gmail.com>
Co-authored-by: Topher Lubaway <asimplechris@gmail.com>
2023-01-30 10:17:49 -08:00
Joe Reuter
63064d9f60 Extend SATs to capture UI limitations - fixed version (#21903)
* Revert "Revert "Extend SATs to capture UI limitations (#21451)" (#21896)"

This reverts commit 74b5dbf794.

* fix fixture problem
2023-01-30 09:58:36 +01:00
Yevhen Sukhomud
2226a2e516 Base Java S3: Update Avro TimeWithTimezone schema mapping (#21909)
* 21908 Base Java S3: Update Avro TimeWithTimezone schema mapping

* 21908 Base Java S3: Formatting

* 21908 Base Java S3: fix integration test gcs + S3

* 21908 Base Java S3: fix unit test

* 21908 Base Java S3: fix format
2023-01-30 12:11:43 +07:00
Ryan Fu
aa6afd0aa8 🚨 Destination Snowflake: checkpointing flush/commit and emit STATE message 🚨 (#20631)
* Checkpointing flush/commit and emit STATE message

* Fixed tests for SerializedBufferingStrategy

* Updates BigQuery to support checkpointing and consolidates method naming for uploading from staging (#21028)

* Updates BigQuery to support checkpointing and consolidates method naming for uploading from staging

* Updated messages to reflect method changes

* Updates createTable to include mimic replication by calling createPartitionTable and removes unused copyIntoTargetTable

* Updates the COPY INTO methods to match writing to table

* Fixed comments and non-executed path

* Fixed BufferedStreamConsumerTest to support new logic for checkpointing

* Removed cleanup logic that no longer applies with checkpointing changes

* Checkpointing flush/commit and emit STATE message

* Updates BigQuery to support checkpointing and consolidates method naming for uploading from staging (#21028)

* Updates BigQuery to support checkpointing and consolidates method naming for uploading from staging

* Updated messages to reflect method changes

* Updates createTable to include mimic replication by calling createPartitionTable and removes unused copyIntoTargetTable

* Updates the COPY INTO methods to match writing to table

* Fixed comments and non-executed path

* Resolved BigQuery partitioning tests and parameterized GCS Staging test

* Fixed review comments and bumps version number

* Definition generation
2023-01-26 14:40:37 -08:00