1
0
mirror of synced 2025-12-23 11:57:55 -05:00
Commit Graph

1127 Commits

Author SHA1 Message Date
Davin Chia
21b36e79da Async Destination V0: Async Staging Consumer Factory. (#26371)
Follow up to #26366 .

Pull in the async consumer changes into the Consumer Factory. Also take the chance to split out the StagingConsumerFactory with the goal of clarifying the various general, serial and async functions.

Instead of one massive factory file, split into
- GeneralStagingFunction.java
- AsyncFlush.java
- SerialFlush.java
representing the general buckets of code we have today.

I'm sure we can do smarter things here. This is the bare minimum to unblock us + 'leave things better than we found them'.
2023-05-22 14:05:29 -07:00
Davin Chia
a7442e3c2d Async Destination V0: Async Stream Consumer (#26366)
Follow up after #26324 .

Introduce the AsyncStreamConsumer.

After this, one more PR to add the Staging Consumer changes in.
2023-05-22 13:16:36 -07:00
Davin Chia
988ce24b3f Async Destination V0 - Split up BufferManager (#26331)
Follow up to #26324 - here we split up the BufferManager and add tests and comments.

- Split up the buffer manager class into -> BufferManager, BufferEnqueue and BufferDequeue.
- Move all buffer related code to the buffers package.
- Rename test classes to match this split.
- Add java docs and tests as part of this split.
- Simplify the BufferDequeue interface to return a set streams representing the buffered streams instead of the underlying map of buffers. This lets us keep the memory queue package private.
- all getYMethods now return Optionals for better error handling. This would have resulted in NPEs previously.
2023-05-22 11:16:12 -07:00
Edward Gao
67f3cdb125 🐛 Destination S3: Fix Parquet LZO compression (#26284)
* Revert "Move hadoop-lzo to test dependency (#21085)"

This reverts commit 1241569ba3.

* add basic test

* Automated Change

* version bumps, changelog

* Automated Change

* unused import

* Ran ./gradlew :spotlessJavaApply to trigger GitHub build

* regenerate registry

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: ryankfu <ryan.fu@airbyte.io>
2023-05-22 16:49:45 +00:00
Augustin
80032f73f9 connectors-ci: deprecate slash publish (#25865) 2023-05-22 10:10:56 +02:00
Davin Chia
8bfbef23cb Async Code V0 (#26324)
Split out the smallest set of reasonable changes from #26086 .

My goal was to split out the interface, as well as show how the interface it's meant to be used.

Follow up PRs:
- Split out classes from BufferManager and add more tests there.
- Add in the AsyncConsumer with tests.
- Add in the StagingConsumer factory.
2023-05-20 13:41:54 -07:00
Duy Nguyen
1a45a66133 Source Postgres: Number(38,0) is incorrectly translated to a destination Double instead of BigInt (#25898)
* Translate Numeric with no decimals to BigInt + relevant tests
* Update CDC to handle numeric/decimal arrays correctly
* Revert spacing
* Preserve Decimal scale of numeric columns
* Bumped versions to 2.0.29 + docs
* Bumped versions in metadata.yaml
* Automated Change

---------

Co-authored-by: nguyenaiden <nguyenaiden@users.noreply.github.com>
2023-05-18 13:02:27 -07:00
Joe Bell
9a4be977c1 Destination Bigquery: stop running normalization container for DAT (#25925)
* readme update

* allow passing additional flags to test containers

* remove build dependency

* Automated Change

* versioning updates

* restore denormalized change from master

* formatting changes

* formatting

* Automated Change

* update metadata file

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>
2023-05-18 00:46:32 +00:00
Ben Church
1d856ab943 Feat: Metadata Service remove old catalog system (#26013)
* Remove Definition File References from Python files (#25590)

* Remove check_images_exist.sh

* Update definitions.py

* Update build_report.py

* Update tools/bin/ci_integration_workflow_launcher.py

* Update tools/bin/ci_check_dependency.py

* tools/bin/scan_all_spec_airbyte_secret.py

* Remove qa engine references

* Revert "Remove check_images_exist.sh"

This reverts commit 7675162789.

* Improve get url function

* Add test

* remove scan_all_spec_airbyte_secret.py

* add additional test

* Remove check_images_exist.sh (#25593)

* Remove Definition File References from Java files (LocalDefinitionProvider), shell scripts and docs (#25592)

* Remove CombinedConnectorCatalogGenerator.java

* Update local definition provider

* Update local def test

* Add spec mask downloader

* Make downloader work

* Delete generators and add tests

* REMOVE THE YAML FILES

* Roughly update docs

* Update shell scripts

* Remove unused

* Add connector metadata file doc

* Apply suggestions from code review

Co-authored-by: Augustin <augustin@airbyte.io>

* Additional PR comments

* Run format tasks

---------

Co-authored-by: Augustin <augustin@airbyte.io>

* Remove unused import

* bundle registry

* Ignore future updates

* Update registry

* new registry

* Readd maskeddatainterceptor

* Automated Change

* Remove icon validation

* Automated Change

* Automated Change

* Source Amazon Ads: get rid of `fail_on_extra_columns: false` in SAT (#25913)

* Source Amazon Ads: small schema fixes

* Source Amazon Ads: update changelog

* Source Amazon Ads: update unittest

* Source Amazon Ads: unittest additional property is boolean

* Source Amazon Ads: bump version

* auto-bump connector version

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>

* connectors-ci: make spec-cache / metadata bucket and creds not required for pre-release (#26119)

* Automated Change

---------

Co-authored-by: Augustin <augustin@airbyte.io>
Co-authored-by: bnchrch <bnchrch@users.noreply.github.com>
Co-authored-by: Roman Yermilov [GL] <86300758+roman-yermilov-gl@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-05-16 11:40:11 -07:00
Ryan Fu
95dd113a5f Revert "Splits bases and updates build.gradle files (#25649)" (#26081)
* Revert "Splits bases and updates build.gradle files (#25649)"

This reverts commit c673b0a692.

* Bumps branch to prevent a conflict with publishing

* Forward fixes Snowflake to use singular base-java and develop within a new package within the same module

* Forcing automated change to merge changes
2023-05-15 21:56:40 +00:00
Edward Gao
fb152a9a0a Normalization: Better handling for CDC transactional updates (#25993)
* try this?

* fix tests

* assert cdc values

* handle case where we have lsn but no updated_at

* readability improvements

* tweaks to test

* version bumps + changelogs

* Automated Change

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
2023-05-12 12:53:23 +00:00
Augustin
5c5eab0308 connectors-ci: fix postgres integration testing (#25942) 2023-05-11 21:19:29 +02:00
Ryan Fu
c673b0a692 Splits bases and updates build.gradle files (#25649)
* Splits bases and updates build.gradle files

* Fixed changelog out of sync

* Bumps version number and metadata files

* auto-bump connector version

* Downgraded untouched connector bumps

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-05-06 00:24:40 +00:00
Cynthia Yin
8400d20352 Destination Redshift: deprecate old migration normalization code (#25771)
* first pass normalization

* add pr link

* remove python test & resources

* linting
2023-05-05 14:18:27 -07:00
Edward Gao
ccb6a071a9 Avro-based destinations: map airbyte_type: integer to long (#25570)
* map the other integer schema to long

* fix test + add test

* delete_public_access_block for bucket if public (#25663)

Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>

* 🐛 Source Facebook Marketing: fix `expected records` for  CAT (#25604)

* publish normalization (#25591)

* publish normalization
* bump normalization container version in all the destinations that use it

Co-authored-by: Edward Gao <edward.gao@airbyte.io>
Co-authored-by: edgao <edgao@users.noreply.github.com>

* Bump Airbyte version from 0.44.2 to 0.44.3

* Destination Bigquery: update AIRBYTE_ENTRYPOINT env var for kube process (#25588)

* add AIRBYTE_ENTRYPOINT env var for kube

* amazing, absolute genius

* version bump + changelog

* derp, no need to publish denormalizeid

* fix changelog entry

* auto-bump connector version

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>

* version bumps + changelog

* whoops

* bump metadata

* bump metadatas

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

---------

Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>
Co-authored-by: Serhii Chvaliuk <grubberr@gmail.com>
Co-authored-by: Baz <oleksandr.bazarnov@globallogic.com>
Co-authored-by: Jeff Cowan (Airbyte) <4992320+jcowanpdx@users.noreply.github.com>
Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: benmoriceau <benmoriceau@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-05-05 19:14:37 +00:00
Ryan Fu
1de58e9774 🚨 Removes defunct Azure Blob Storage laoding option for Snowflake 🚨 (#25739)
* Removes defunct Azure Blob Storage laoding option for Snowflake

* Bumps to major version and removes documentation that references AzureBlobStorage

* Updates the destination_definitions.yaml

* Run ProcessResources to match version of 1.0.0 mismatched spec

* Pinning urllib to older version since the 2.0 version removed classes
2023-05-04 22:37:34 +00:00
Cynthia Yin
74a3e2adb7 Destination Redshift: deprecate old migration Java code (#25698)
* first pass

* update changelog

* auto-bump connector version

* bump metadata.yaml

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-05-02 19:41:26 +00:00
Ben Church
0ea1a8143d Remove connectors.md and builds.md (#25597)
* Remove builds.md references

* Remove connectors.md

* Update connectors.md to reference the new report

* Add connector report to readme
2023-04-27 22:08:37 +00:00
Jeff Cowan (Airbyte)
79db9f8e68 Clean up destination bases (#25346)
Changes in this refactor PR
* Use the proper interface name for the OnStartFunction
* Use the proper interface name for the OnCloseFunction
* Create and use a proper interface name for the FlushBufferFunction
* Create and use a proper interface name for the BufferCreateFunction
* Mostly naming consistency changes. These are things caught in static, compile time checks so should be low risk.

---------

Co-authored-by: jcowanpdx <jcowanpdx@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-27 08:47:02 -07:00
Jeff Cowan (Airbyte)
3a308ba48b Pin MarkupSafe for normalization (#25577)
We were running into a CI/CD system-only bug with dbt that requires this workaround to get it working
---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-26 23:16:50 +00:00
Jonathan Pearlin
a38af089e9 Track stream status in source (#24971)
* WIP Track stream status in source

* Revert formatting

* Revert formatting changes

* Remove unnecessary file

* Automated Change

* Automated Change

* Use new stream status trace message

* Rename class

* Remove unnecessary import

* Formatting

* Add tests

* Fix compile issues

* Automated Commit - Formatting Changes

* Remove TODO

* Fix compilation error

* Split STOPPED into INCOMPLETE and COMPLETE

* Remove unused import

* Changelog updates for source-postgres

* Remove unused import

* auto-bump connector version

---------

Co-authored-by: jdpgrailsdev <jdpgrailsdev@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-26 15:14:25 -05:00
Edward Gao
7abda87840 destination bigquery: run normalization inside container if env var is set (#25097)
* super hacky start

* also check that we're writing

* v0 convert normalization logs to airbytemessage

* add start+end logs

* aggregate errors into a single trace?

* pipefail; quick tweaks to log parser

* make spotbugs happy

* more comments, uncomment env var check

* copy in SentryExceptionHelper

* final fixes

* write tests + fix bugs

* move to base-java

* remove outdated comment

* fix spotbugs

* Automated Change

* minor version bump

* changelog

* fix behavior when env var not set

* run normalization even if destination fails

* better logic

* better logging

* oops

* move to base-java

* rebump version

* Automated Change

* auto-bump connector version

* wtf how did this work previously

* auto-bump connector version

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-25 20:11:57 +00:00
Sergio Ropero
819bba174f postgres-cdc checkpointing: fix LSN parsing bug + refactor for efficiency (#24582)
* Fix LSN parsing from Integer to Long

* rebasing

* Rebase

* Rebase

* Other casting

* Lock the file only when reading, so the file is free when parsing the object.
Increased from 1 to 166 checkpoints, and from skipping hundreds of checkpoints to never skip a state.

* Update load function documentation

* bump mysql and mssql

* cdc: refactor to remove debezium dependency from connector packages

* use gradle's shared dependency

* more refactoring

* upgrade docker version

* resolve master merge conflicts

* Automated Change

* minor changes

* resolve merge conflicts

* avoid deserializing multiple times

* simplify

* enable checkpointing for Postgres

* more improvements

* enable assertions

* changelog + bump version

* auto-bump connector version

* auto-bump connector version

* manual bump

---------

Co-authored-by: subodh <subodh1810@gmail.com>
Co-authored-by: subodh1810 <subodh1810@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-19 21:05:50 +05:30
Ella Rohm-Ensing
b053ad6b2c CAT tools/acceptance_test_config_migration: Make relatively configurable per migration, add (hacky) ability to run CAT tests locally for multiple connectors (#24377)
* Pass argument along, add test that should pass and test that should fail

* Add tests with additionalProperties

* Set additionalproperties=false when not set|

* Parametrize test cases

* Make the behavior 'optional'

* Fix parametrization for all combinations

* Improve CI credentials README and rename param

* Update naming to be clearer about columns only

* record_has_unexpected_field > record_has_unexpected_column

* Automated Change

* Hacking the CAT dockerfile and run script to test my changes specifically

* First crack at running CAT on all connectors

* Write during instead of after all tests

* Async-ify it

* Add ability to define max concurrency

* Write successes

* ci_credentials: fix overwriting 'data' before getting nextPageToken

* Adjustible num_semaphores, check to make sure it's an airbyte connector first

* Automated Change

* Make create_issues and create_prs more configurable, add issue for fail_on_extra_columns

* Add ability to pass in sources as a list or from a txt file

* Add logs to issue, make project nullable

* Migrate multiple connectors

* Add cli args

* use ruamel.yaml to preserve ordering

* Separate config loading from config migration

* Add ability to pass in lists of sources to test. Sort output by exit codes. Fix max_concurrency flag

* Default to testing only beta and GA connectors

* Always write test output when available

* Revert "Add cli args"

This reverts commit b538a8c696.

* Remove slash

* Don't run on alpha connetors, handle older config style

* Don't migrate to new format, preserve quotes and long lines

* Automated Change

* Update issue, don't run for alpha connectors

* Automated Change

* Add bypass for extra fields test

* Add bypass for extra fields test

* Rename run_tests script

* Rename module

* Update args usage, small changes

* Refactor create_issues.py

* Clean up run_tests.py

* Sort out arg parsers

* Pull out get_valid_definitions_from_args

* Import definitions module instead of methods

* Use config files to provide constants for each migration

* Handle FileNotFoundError in create_issues.py, improve logging

* Rename to migrations, reference name of folder via utils

* Update readmes for migration modules, add script for getting outputs

* Use tmp dir, correct path for issue reference

* Fix bash script

* Fix create command, pull out test results insertion

* Update call to update_configuration

* add precommit to requirements

* Reorder README

* README cleanup for test and create issues

* README cleanup for create_prs and config_migration

* More readmes! Readmes galore

* allow_beta

* Restore hacky changes to dockerfile and acceptance-test-docker

* Handle 'other' release stages

* Update readme

* Remove TODO, add comments to shell script

* format according to gradle

* format

* Fix formatting

---------

Co-authored-by: marcosmarxm <marcosmarxm@users.noreply.github.com>
Co-authored-by: erohmensing <erohmensing@users.noreply.github.com>
2023-04-18 19:58:31 +00:00
Ryan Fu
0dcc653bd1 Shuffled destination-jdbc into bases folder (#25120)
* Testing changes for reshuffle

* Resolves dependency/race-condition with base classes and docker creation

* Removes dockerfile

* Updates java connectors gradle dependencies
2023-04-17 12:29:03 -07:00
Subodh Kant Chaturvedi
b5a2ceb7fe cdc: refactor to remove debezium dependency from connector packages (#24656)
* cdc: refactor to remove debezium dependency from connector packages

* use gradle's shared dependency

* more refactoring

* upgrade docker version

* Automated Change

* auto-bump connector version

* manual bump

---------

Co-authored-by: subodh1810 <subodh1810@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-11 21:32:17 +05:30
Rodi Reich Zilberman
0bab1756b8 Rename airbyte-config module (#24885)
* rename airbyte-config module

* Automated Commit - Formatting Changes

* sanity

* update import

* update import

* update script

* update script

* update script

* update script

* Automated Change

* Automated Change

* Automated Change

* Automated Change

* update awsdatalake icon

* point slash commands to new path

* sanity

* Automated Commit - Formatting Changes

* sanity

* Automated Change

* Automated Change

* sanity

---------

Co-authored-by: rodireich <rodireich@users.noreply.github.com>
2023-04-06 10:47:30 -07:00
Subodh Kant Chaturvedi
8748a8d6c7 fix-cdc: errors retry debezium property should be less than max retry (#24833)
* fix-cdc: errors retry debezium property should be less than max retry

* add comment

* version bump + changelog

* auto-bump connector version

* Update source-alloydb versions to match source-postgres

* rebump to 2.0.15

* auto-bump connector version

* definitions + regenerate manually

---------

Co-authored-by: Edward Gao <edward.gao@airbyte.io>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: nguyenaiden <duy@airbyte.io>
2023-04-04 19:23:00 -07:00
Rodi Reich Zilberman
cd928a7844 Fix all tests to pass on local and CI environments (#24683)
* test docker behavior on CI env

* Automated Change

* test docker behavior on CI env

* Make all unit and integration tests in source-postgres pass locally

* Fix mysql ssh integration test

* Fix failing test

* Fix source-mssql build

* source-mssql runss tests locally.
Fix compilation errors

---------

Co-authored-by: rodireich <rodireich@users.noreply.github.com>
2023-04-03 12:30:11 -07:00
Joe Bell
ba7f6d2fca Add Equality check to Clickhouse Destination Acceptance Test (#24646)
* add equality check

* fixing another failing test

* update acceptance tests

* Automated Change

* formatting

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>
2023-03-31 11:19:04 -07:00
Edward Gao
0974a05c55 S3 destinations: use STANDARD retry mode (#24736)
* use standard retry mode

* Automated Change

* dest-s3 version bump + changelog

* also in redhsift + snowflake

* auto-bump connector version

* version bumps

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-31 14:50:26 +00:00
Edward Gao
05860064f8 Staging destinations: Fail fast on error during periodic checkpoint (#24671)
* rethrow exception for fail fast

* version bumps + changelog

* auto-bump connector version

* bump versions

* regenerate

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-29 23:49:14 +00:00
Subodh Kant Chaturvedi
29a7e2ebf5 cdc: fix debezium shutdown interruption bug (#24166)
* cdc: fix debezium shutdown interruption bug

* update retry properties

* add test for queue transfer

* refactor to simplify

* rename parameter

* fix test

* add comments

* add check for engine already closed

* disable checkpointing

* bump version

* auto-bump connector version

* manual bump

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-28 19:47:40 +05:30
Sergio Ropero
80da900d70 feat: Enable CDC checkpointing on Postgres (#24373)
* Fix the error reading offset file meanwhile Debezium is writing.
Enable CDC checkpointing to Postgres.
Minor change in the variable name to fit the type.

* Add final statement on exception ;)

* Add comments to CDC Checkpoint tests.
Clean a bit.

* Bump connector versioning

* Add log message

* Fix changelog

* auto-bump connector version

* Manually generate definitions

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-27 14:00:49 +02:00
Greg Solovyev
5f1eb87444 Fix destination integration tests (#24431)
* MS SQL does not support schema change in incremental model
* make schema change test optional
* fix compilation errors in postgres-strict-encrypt destination
* deparallelize integration tests for destination postgres
* deparalellize MS SQL integration tests
* remove broken SSH tunnel test from destination-postgres-strict-encrypt
2023-03-24 10:41:47 -07:00
Ella Rohm-Ensing
9cfc647fa5 Ella/fail on extra fields (#23985)
* Pass argument along, add test that should pass and test that should fail

* Add tests with additionalProperties

* Set additionalproperties=false when not set|

* Parametrize test cases

* Make the behavior 'optional'

* Fix parametrization for all combinations

* Improve CI credentials README and rename param

* Update naming to be clearer about columns only

* record_has_unexpected_field > record_has_unexpected_column

* Automated Change

* Add bypass for beta + ga connectors that failed

* Update docs and TODOs

* Update changelog and dockerfile

* Update TODO

* Update a few neglected connectors

* Remove uploaded file

* Update dockerfile after merge conflict

---------

Co-authored-by: marcosmarxm <marcosmarxm@users.noreply.github.com>
2023-03-23 22:22:44 +00:00
Edward Gao
9b7b30f92b Normalization: Use strict > comparison in incremental mode (#22381)
* copy tests from other branch

* switch to >

* [wip] wire up tests

* make tests work

* fixes

* nicer test structure

* maybe add feature flag?

* pattern matching

* also add version check

* formatting

* refactor test also

* extract test + fix method call

* minor tweaks

* add context to log message

* put workspace id in normalization input

* use non-semver tag

* add flag for version of normalization

* also flag old version

* add test

* missed part of the commit

* format

* add test for null workspace ID

* Revert "also flag old version"

This reverts commit 3be601d16c.

* Revert "missed part of the commit"

This reverts commit 47a67b4631.

* always apply flag, even if we're behind a version

* derp

* Add more logging to the normalization activity

* Update charts and kustomize for the feature flag

* fix clickhouse integration test

* remove replace_identifiers

* Revert "remove replace_identifiers"

This reverts commit 0e7ded5a7b.

* fix replace_identifiers

* garbage debug logs

* stop trying to setup duckdb test

* wake up and choose violence

* fix mssql

* exclude duckdb from tests

* make snowflake happy

* uncomment tests

* derp

* derpderp

* format

* format

* also fix redshift???

* maybe now everything works???

* remove debug logs

* use special docker tag

* bump to new tag

* use random test schema in publish also

* properly cleanup

* remove feature flag stuff

* version bump + changelog

* Automated Commit - Formatting Changes

* bump definitions

---------

Co-authored-by: Jimmy Ma <gosusnp@users.noreply.github.com>
Co-authored-by: Jimmy Ma <jimmy@airbyte.io>
Co-authored-by: octavia-squidington-iii <octavia-bot@airbyte.io>
Co-authored-by: edgao <edgao@users.noreply.github.com>
2023-03-23 09:37:15 -07:00
Joe Reuter
c820c455b8 Connector acceptance tests: Validate grouping and collapsing (#24055)
* more tests

* format

* format

* check orders separately per group

* typo

* add examples to docstring

* prepare release

---------

Co-authored-by: Augustin <augustin@airbyte.io>
2023-03-23 15:56:08 +01:00
VitaliiMaltsev
12eac421de JDBC Sources: remove redundant date-time datatypes formatting (#20760)
* JDBC sources: consolidate date/time values mapping

* fixed Snowflake acceptance tests

* fixed TestJdbcUtils

* refactoring

* update setDate for incremental sync

* fixed CockroachDbJdbcSourceAcceptanceTest

* fixed CockroachDbSourceDatatypeTest

* fixed CockroachDbSourceDatatypeTest

* fixed MsSQLSourceDatatypeTest

* fixed MsSQL cdc datatype test

* refactoring

* fixed mssql cdc test

* fixed Redshift tests

* fixed Redshift tests

* updated db2 datatype tests

* fixed DB2 tests

* fixed DB2 strict-encrypt tests

* fixed Oracle strict-encrypt tests

* refactoring

* updated changelog

* refactoring

* removed redundant overrides

* bump versions

* auto-bump connector version

* JDBC Sources: remove redundant date parsing

* updated Snowflake test

* updated cockroachdb test

* updated date-time datatypes parsing in Postgres Source

* updated formatting in JDBC sources

* updated test cases

* updated Snowflake test

* updated Snowflake, Redshift, Cockroach, DB2 tests

* updated Snowflake, Postgres tests

* updated Redshift tests

* updated MySQL, MsSQL, DB2 tests

* updated MsSQL tests

* updated Oracle tests

* updated Oracle tests

* updated Oracle tests

* fixed some tests

* refactoring

* fixed checkstyle

* fixed TestJdbcUtils

* randomize Reshift test username

* removed redundant test

* updated sanity test

* Automated Change

* format time and bump versions

* auto-bump connector version

* bump versions

* auto-bump connector version

* auto-bump connector version

* manually update definitions

* Automated Change

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: ievgeniit <etsybaev@gmail.com>
Co-authored-by: VitaliiMaltsev <VitaliiMaltsev@users.noreply.github.com>
2023-03-23 14:31:50 +00:00
Augustin
bad5bce8ce CDK: remove unexpected error swallowing on abstract source's check method (#24240) 2023-03-23 13:04:51 +00:00
Lake Mossman
b1fbd6f33f Simplify Github and Postgres forms #2 (#24255)
* add grouping and collapsing fields to postgres source

* add auth group to github source connector

* revert postgres field order changes and adjust group of schemas field

* inject group into ssh tunnel spec for postgres only, through overloaded methods

* Automated Change

* bump Dockerfile versions and update changelogs

* bump strict encrypt version as well

* fix postgres acceptance test

* fix acceptance test again

* fix all postgres acceptance tests

* add newline

* undo other changes to postgres readme file

* add security group to tunnel_method in expected_spec.json

* bump version of strict encrypt

* manually bump versions in seed files

---------

Co-authored-by: lmossman <lmossman@users.noreply.github.com>
2023-03-22 11:56:17 -07:00
Subodh Kant Chaturvedi
e839f8563e fix-postgres-cdc-npe:do not put null in properties (#24271)
* fix-postgres-cdc-npe:do not put null in properties

* version bump + change log

* auto-bump connector version

* manual bump

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-21 23:42:26 +05:30
VitaliiMaltsev
f41d8219df MySQL Source : add CDC heartbeat support (#23984)
* MySQL Source : add CDC heartbeat support

* removed logs

* fixed DebeziumRecordIteratorTest

* Automated Change

* use generics to simplify implementation

* use Duration

* more refactoring

* removed redundant null check

* Automated Change

* bump version

* auto-bump connector version

---------

Co-authored-by: VitaliiMaltsev <VitaliiMaltsev@users.noreply.github.com>
Co-authored-by: subodh <subodh1810@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-21 16:54:45 +00:00
oneshcheret
f6bcc4914f Postgres source: add integration with data dog (#21533)
* Source postgres: add dd for env running locally

* Source postgres: add dd for running in cloud

* auto-bump connector version

* Source postgres: bump postgres strict-encrypt version

* Source postgres: filter datadog agent env variables just for postgres source

* Source postgres: format

* Source postgres: clean code

* Source postgres: pass java opts for all connectors

* Source postgres: temp removing dd agent from image

* Source postgres: add dd agent to image

* Source postgres: temp revert adding dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp hardcoded dd env variable

* Source postgres: temp removing hardcoded dd env variable

* Source postgres: temp added hardcoded dd env variable

* Source postgres: temp added hardcoded dd env variable

* Source postgres: temp added hardcoded dd env variable

* Source postgres: rename to java_opts and pass data dog host

* Source postgres: add vars to kube pods

* Source postgres: add vars to kube pods

* Source postgres: add vars to kube pods

* Source postgres: add Trace to more methods

* Source postgres: add Trace to more methods

* Source postgres: add Trace to more methods

* Source postgres: temp reverting service name removing

* Source postgres: temp reverting service name removing

* Source postgres: temp reverting service name removing

* Source postgres: temp adding trace to integration runner

* Source postgres: temp adding trace to integration runner

* Source postgres: bump postgres source dd version

* Source postgres: bump postgres source dd version

* Source postgres: revert temp changes

* Source postgres: merge with master

* Automated Commit - Formatting Changes

* Source postgres: move dd java agent to base java

* Source postgres: move dd java agent to base java

* Source postgres: clean up

* Source postgres: clean up

* Automated Change

* Source postgres: clean up

* Source postgres: bump version

* Source postgres: bump version for test

* Source postgres: temp bump version

* Source postgres: bump version

* Automated Change

* Source postgres: bump version

* auto-bump connector version

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: sashaNeshcheret <sashaNeshcheret@users.noreply.github.com>
2023-03-21 22:16:09 +05:30
Sergio Ropero
ca2d9c733f fix: Disable CDC checkpointing using a flag (#24275)
* Disable CDC checkpointing until finished the investigation with reading Offset file during syncs

* Bump connector versions

* auto-bump connector version

* Publish postgres manually

* Definitions manually for mysql

* Generated definitions manually mssql

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-03-21 15:32:20 +01:00
Lake Mossman
2d3c48da8d Revert "Simplify postgres and GitHub forms (#24127)" (#24219)
This reverts commit fcc80cb5be.
2023-03-18 19:22:38 -07:00
Eugene
c4fbc8c9a5 S3-Parquet: added handler to process null values in arrays (#23788)
* [22807] Parquet - added handler to null values in arrays
2023-03-18 08:52:37 +00:00
Lake Mossman
fcc80cb5be Simplify postgres and GitHub forms (#24127)
* add grouping and collapsing fields to postgres source

* add auth group to github source connector

* revert postgres field order changes and adjust group of schemas field

* inject group into ssh tunnel spec for postgres only, through overloaded methods

* Automated Change

* bump Dockerfile versions and update changelogs

* bump strict encrypt version as well

* fix postgres acceptance test

* fix acceptance test again

---------

Co-authored-by: lmossman <lmossman@users.noreply.github.com>
2023-03-17 23:02:03 +00:00
Evan Tahler
7538437452 Fixup dead links (#24167) 2023-03-16 22:42:30 +01:00
Edward Gao
294cdbcf4a Normalization Bigquery: Add more reserved words (#24077)
* add current times for bigquery

* bump version + changelog
2023-03-15 18:33:59 +00:00