1
0
mirror of synced 2025-12-22 11:31:02 -05:00
Commit Graph

2421 Commits

Author SHA1 Message Date
Davin Chia
b9a3c0817e 🐛 Release Snowflake Async State Bug. (#28581)
Release #28342 for Snowflake.
2023-07-21 15:38:54 -07:00
Edward Gao
53da5baa7d Destination bigquery 1s1t: fix 1s1t schema change logic; extract TyperDeduper (#28490)
* rename for clarity

* fix cleanup method

* giant commit because I'm irresponsible

* rename constant

* better raw table creation

* fix build?

* move code around

* tweaks

* more code shuffling

* Automated Commit - Format and Process Resources Changes

* add tests

* minor tweak

* remove unimportant methods

* cleanup

* Automated Commit - Format and Process Resources Changes

* derp

* clean up tests

* some more fixes post-merge

* botched merge

* create NoopTyperDeduper

* try and update everything to work?

* tweak comment

* move suffix args to end of list

* fix exception message

* Automated Commit - Format and Process Resources Changes

* add sqlgenerator test for softReset

* only prepare once

* update log message

* do what intellij says

* implement one more test

* less indirection

* Automated Commit - Format and Process Resources Changes

* rename test

* use noop in test

* version bump + changelog

* use stringutils

* fix typo

* flip if-statement

* typo

* simplify logic

* fix schema change logic

* typo

* use spy for clarity

* Automated Commit - Format and Process Resources Changes

* better test teardown

* slightly better logs

* fix exception message

* softReset returns single string

* Automated Commit - Format and Process Resources Changes

* simplify if chain

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
2023-07-21 13:45:19 -06:00
Joe Reuter
6540fa7c91 Langchain destination: Support composite and nested primary keys for deduping (#28556)
* support composite primary keys for deduping

* prepare release

* format

* fix

---------

Co-authored-by: Augustin <augustin@airbyte.io>
2023-07-21 17:17:29 +02:00
Joe Bell
a16cbea2ae Destination BigQuery - Handle Schema Changes (#28382)
* Add ability to detect differences in expected Schemas and perform soft resets

* Remove alter table for overwrite syncs since its unneccessary

* Updates after testing

* pr reorganize

* comments

* add collection util test

* Add Tests

* bump version

* Automated Commit - Format and Process Resources Changes

* Destination BigQuery - Reduce amount of typing and deduping for GCS staging (#28489)

* undo comment out

* centralize t&d logic for staging and standard, add valve to staging

* Share more logic for typing and deduping

* Remove record checking logic and use only time for staging inserts

* Add Javadoc

* Automated Commit - Format and Process Resources Changes

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>

* Change TableNotMigratedException to extend runtime exception, remove SqlGenerator interface method

* Make Lambda slightly more readable

* add test for validating v2 schemas

* change soft reset to single string

* convert back to list, update dockerfile

* remove needless default

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>
2023-07-20 09:25:44 -06:00
Augustin
9815e080cd destination-langchain: fix build (#28509) 2023-07-20 13:22:52 +02:00
Eduard Tudenhoefner
588c7d6f43 Destination Iceberg: Bump Iceberg from 1.1.0 to 1.3.0 and add REST catalog support (#28158)
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2023-07-19 14:45:01 -05:00
Joe Reuter
d5d7e757f8 Vector databases destination: Make available on cloud (#28398)
* make connector available on cloud

* fix build

* prepare release

* Update langchain.md
2023-07-18 18:23:29 +02:00
Ben Church
6fa755f81d Java Pipeline Bump patch bump all java connectors in july (#28345)
* patch bump all java connectors in july

* Bump changelog
2023-07-14 21:01:39 -05:00
Evan Tahler
b81cc031e0 destination-redshift should fail syncs if records or properties are too large, rather than silently skipping records and succeeding (#27993)
* `destination-redshift` will fail syncs if records or properties are too large, rather than silently skipping records and succeding

* Bump version

* remove tests that don't matter any more

* more test removal

* more test removal

---------

Co-authored-by: Augustin <augustin@airbyte.io>
2023-07-14 14:27:12 -05:00
Davin Chia
fa4a278a2d 🐛 Destination Snowflake: Pull in async minor bug fix. (#28315)
* Pull in async minor bug fix.

* Update readme.
2023-07-14 08:53:01 -07:00
Edward Gao
934acaa137 Destination bigquery: rerelease 1s1t behind gate (#27936)
* Revert "Revert "Destination Bigquery: Scaffolding for destinations v2 (#27268)""

This reverts commit 348c577dbb.

* version bumps+changelog

* Speed up BQ by having 2 queries, and not an OR (#27981)

* 🐛 Destination Bigquery: fix bug in standard inserts for syncs >10K records (#27856)

* only run t+d code if it's enabled

* dockerfile+changelog

* remove changelog entry

* Destinations V2: handle optional fields for `object` and `array` types (#27898)

* catch null schema

* fix null properties

* clean up

* consolidate + add more tests

* try catch

* empty json test

* Automated Commit - Formatting Changes

* remove todo

* destination bigquery: misc updates to 1s1t code (#28057)

* switch to checkedconsumer

* add unit test for buildColumnId

* use flag

* restructure prefix check

* fix build

* more type-parsing fixes (#28100)

* more type-parsing fixes

* handle duplicates

* Automated Commit - Format and Process Resources Changes

* add tests for asColumns

* Automated Commit - Format and Process Resources Changes

* log warnings instead of throwing exception

* better log message

* error level

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>

* Automated Commit - Formatting Changes

* Improve protocol type parsing (#28126)

* Automated Commit - Formatting Changes

* Change from T&D every 10k records to an increasing time based interval (#28130)

* fifteen minute t&d

* add typing and deduping operation valve for increased intervals of typing and deduping

* Automated Commit - Format and Process Resources Changes

* resolve bizarre merge conflict

* Automated Commit - Format and Process Resources Changes

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>

* Simplify and speed up CDC delete support [DestinationsV2] (#28029)

* Simplify and speed up CDC delete support [DestinationsV2]

* better QUOTE

* spotbugs?

* recompile dbt image for local arch and use that when building images

* things compile, but tests fail

* tests working-ish

* comment

* fix logic to re-insert deleted records for cursor comparison.

tests pass!

* remove comment

* Skip CDC re-include logic if there are no CDC columns

* stop hardcoding pk (#28092)

* wip

* remove TODOs

---------

Co-authored-by: Edward Gao <edward.gao@airbyte.io>

* update method name

* Automated Commit - Formatting Changes

* depend on pinned normalization version

* implement 1s1t DATs for destination-bigquery (#27852)

* intiial implementation

* Automated Commit - Formatting Changes

* add second sync to test

* do concurrent things

* Automated Commit - Formatting Changes

* clarify comment

* minor tweaks

* more stuff

* Automated Commit - Formatting Changes

* minor cleanup

* lots of fixes

* handle sql vs json null better
* verify extra columns
* only check deleted_at if in DEDUP mode and the column exists
* add full refresh append test case

* Automated Commit - Formatting Changes

* add tests for the remaining sync modes

* Automated Commit - Formatting Changes

* readability stuff

* Automated Commit - Formatting Changes

* add test for gcs mode

* remove static fields

* Automated Commit - Formatting Changes

* add more test cases, tweak test scaffold

* cleanup

* Automated Commit - Formatting Changes

* extract recorddiffer

* and use it in the sql generator test

* fix

* comment

* naming+comment

* one more comment

* better assert

* remove unnecessary thing

* one last thing

* Automated Commit - Formatting Changes

* enable concurrent execution on all java integration tests

* add test for default namespace

* Automated Commit - Formatting Changes

* implement a 2-stream test

* Automated Commit - Formatting Changes

* extract methods

* invert jsonNodesNotEquivalent

* Automated Commit - Formatting Changes

* fix conditional

* pull out diffSingleRecord

* Automated Commit - Formatting Changes

* handle nulls correctly

* remove raw-specific handling; break up methods

* Automated Commit - Formatting Changes

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: octavia-approvington <octavia-approvington@users.noreply.github.com>

* Destinations V2: move create raw tables earlier (#28255)

* move create raw tables

* better log message

* stop building normalization (#28256)

* fix ability to run tests

* disable incremental t+d for now

* Automated Commit - Formatting Changes

---------

Co-authored-by: Evan Tahler <evan@airbyte.io>
Co-authored-by: Cynthia Yin <cynthia@airbyte.io>
Co-authored-by: cynthiaxyin <cynthiaxyin@users.noreply.github.com>
Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: Joe Bell <joseph.bell@airbyte.io>
Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>
Co-authored-by: octavia-approvington <octavia-approvington@users.noreply.github.com>
2023-07-14 09:34:56 -05:00
Joe Reuter
b52e88a977 🎉 New Destination Vector Database (powered by LangChain) (#26184)
* basic version

* polish

* iterate

* keep working

* fix spec

* wip

* improve destination

* basic unit tests

* move embedding dimensionality into embedder

* improve several things

* adjust documentation

* remove unnecessary call

* add some debug information

* fix local destination

* various small fixes

* bring tests into order

* document and add batching to pinecone

* checklist

* improve performance a bit and add test

* fix formatting

* fix metadata

* install C++ 11 on python base

* no more alpine for ci-connector-ops

* remove hard-to-run test

* more documentation

* better documentation

* add icon

* some small adjustments

* review comments

* format

* review comments

---------

Co-authored-by: alafanechere <augustin.lafanechere@gmail.com>
Co-authored-by: Augustin <augustin@airbyte.io>
2023-07-12 20:06:59 +02:00
Eduard Tudenhoefner
2a69ee5eff Destination Iceberg: Bump Iceberg from 1.0.0 to 1.1.0 (#23201)
* Destination Iceberg: Fix compilation

* Iceberg Destination: Bump Iceberg from 1.0.0 to 1.1.0

* add icon and changelog entry

* force aws dependencies used by iceberg

* fix tests as MINIO runs without SSL

* move icon

---------

Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2023-07-11 11:50:18 -05:00
Denys Davydov
a924142791 Destination Google Sheets: migrate to advanced auth (#28035)
* Connector health: source hubspot, gitlab, snapchat-marketing: fix builds

* #27975 destination google sheets: migrate from authSpecification to advancedAuth

* destination google sheets: upd changelog
2023-07-07 13:29:45 +03:00
Joe Bell
b03be1b714 🐛 Destination Snowflake: Merge old snowflake work (#27935)
* Adds data as JsonNode to pass through, running into memory issues so add JVM args to attach VisualVM

* JsonNode

* Lowers the optimal batch size to see if this improvements movement

* Fixes NPE by checking if PartialAirbyteMessage contains a PartialAirbyteRecord

* Fixes config switch when config is not explicitly set (config migration needed)

* Adds logic to check if queue has elements before getting timeOfLastMessage

* Add PartialSerialisedMessage test. (#27452)

* Test deserialise.

* Add tests.

* Simplify and fix tests.

* Format.

* Adds tests for deserializeAirbyteMessage

* Adds tests for deserializeAirbyteMessage with bad data

* Cleans up deserializeAirbyteMessage and throws Exception when invalid message

* More code cleanup

---------

Co-authored-by: ryankfu <ryan.fu@airbyte.io>

* 🤖 Auto format destination-snowflake code [skip ci]

* Cleans up code w/o JVM args & rebase

* 🤖 Auto format destination-snowflake code [skip ci]

* Adds breadcrumb on the STATE message deviation and where the deserialize/serialize is done to unpack

* 🤖 Auto format destination-snowflake code [skip ci]

* Adds back line formatter removed and comment describing rational for lower batchSize

* Bumps Snowflake version and type checks

* Added note to remove PartialAirbyteRecordMessage with low resource testing

* Automated Commit - Format and Process Resources Changes

* Fix issue with multiple namespaces in snowflake not writing to the correct staging schema

* Fix issue with multiple namespaces in snowflake not writing to the correct staging schema

* remove stage name maniuplating method

* update readme

* Source Stripe: update credit_notes expected records (#27941)

* Source Zendesk Talk: update expected records (#27942)

* Source Xero: update expected records (#27943)

* Metadata: Persist Registry entries (#27766)

* DNC

* Update poetry

* Update dagster

* Apply partition

* Get metadata entry

* Use helpers

* Write registry entry to appropriate location

* Delete when registry removed

* Update to use new file (broken)

* Render registry from registry entries

* Run format

* Fix plural issue

* Update to all metadata file blobs

* Fix test

* Update to all blobs

* Add ignore validation error for version logic

* Rename to max_run_request

* Pedros review

* Ella suggestions

Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>

* Update airbyte-ci/connectors/metadata_service/orchestrator/orchestrator/assets/registry_entry.py

Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>

* Update naming

* Add tests for connector type and deletion

* Test safe parse

* Format

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@sers.noreply.github.com>
Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>

* Fix Dagster Deploy Failure (#27955)

* Add pydantic

* Add pydantic to orchestration deploy pipeline

* 🐛 Source Jira: update expected records (#27951)

* Source Jira: update expected records

* Update issues expected records

* Source Zendesk Chat: update expected records (#27965)

* 🐛 Source Pipedrive: update expected records (#27967)

* 🐛 Source Pinterest: update expected records (#27964)

*  Source Amazon-Ads: Add streams for portfolios and sponsored brands v3 (#27607)

* Add stream for sponsored brands v3
* Add new stream Portfolios

* Source Google Search Console: added discover and googleNews to searchType (#27952)

* added discover and googleNews to searchType

* updated changelog

* fixed types for streams

* 🎉 Source Instagram: Improve, refactor `STATE` management (#27908)

* add test for enabling

* update versions

* fix test

* update other snowflake loading method types

* remove standard

---------

Co-authored-by: ryankfu <ryan.fu@airbyte.io>
Co-authored-by: Davin Chia <davinchia@gmail.com>
Co-authored-by: octavia-squidington-iii <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: ryankfu <ryankfu@users.noreply.github.com>
Co-authored-by: Augustin <augustin@airbyte.io>
Co-authored-by: Arsen Losenko <20901439+arsenlosenko@users.noreply.github.com>
Co-authored-by: Ben Church <ben@airbyte.io>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@sers.noreply.github.com>
Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>
Co-authored-by: Anatolii Yatsuk <35109939+tolik0@users.noreply.github.com>
Co-authored-by: Daryna Ishchenko <80129833+darynaishchenko@users.noreply.github.com>
Co-authored-by: Baz <oleksandr.bazarnov@globallogic.com>
2023-07-06 12:28:52 -05:00
Amélie
196fbbcf79 Update broken link in Meilisearch connector documentation (#27828) 2023-07-03 09:19:06 -05:00
Edward Gao
52b8cbe39d Revert "Destination Bigquery: Scaffolding for destinations v2 (#27268)" (#27891)
* Revert "Destination Bigquery: Scaffolding for destinations v2 (#27268)"

This reverts commit ba3e39bb0c.

* bump versions to 1.5.1 everywhere
2023-06-30 20:26:48 -04:00
Evan Tahler
f455c1288d Java DB Destination connector licenses to Elv2 (#27781)
* Java DB Destination connector licenses to Elv2

* PR id for docs

* fix redshift tagging
2023-06-29 12:26:24 -05:00
Edward Gao
ba3e39bb0c Destination Bigquery: Scaffolding for destinations v2 (#27268)
* copy files from edgao branch

* start writing create table statement

* add basic unit test setup

* create a table, probably

* remove outdated todo

* derp, one more column

* ugh

* add partitioning+clustering

* use StringSubstitutor

* substitutions in updateTable

* wip generate update/insert statement

* split up into smaller methods

* handle json types correctly

* rename stuff

* more json_query vs _value stuff

* minor tweak

* super basic test setup

* laying foundation for type parsing

* more stuff

* tweaks

* more progress on type parsing

* fix json_value stuff?

* misc fixes in insert

* fix dedupFinalTable

* add testDedupRaw

* full e2e test

* type parsing: gave up and mirrored the dbt code structure to avoid bugs

* type parsing - more cleanup

* handle column name collisions

* handle tablename collisions...?

* comments

* remove original ns/name from quotedstream

* also javadoc

* remove redundant method

* fix table rename

* add incremental append test

* add full refresh append test

* comment

* call T+D sql in a reasonable location for standard inserts

* add config option

* use config option here

* type parsing - fix fromJsonSchema

* gate everything

* log query + runtime

* add spec option temporarily

* Raw Table Updates

* fix more stuff

* first big pass at toDialectType

* no quotes

* wrap everything in quotes

* resolve some TODOs

* log sql statement in tests

* overwriteFinalTable returns optional

* minor clean up

* add raw dataset override

* try to preserve the original namespace for t+d?

* write to the raw table correctly

* update todos

* write directly to raw table

this is kind of dumb because we're still trying to do tmp table operations,
and we still don't ack state until the end of the entire sync.

* standard inserts write to raw table correctly

* imports + log statements

* move logs + add comment

* explicitly create raw table

* move comment to better place

* Typing issues

* bash attempt

* formatting updates

* formatting updates

* write to the airbyte schema by default unless overriden by config options

* standard inserts truncate raw table at start of sync

* full refresh overwrite will overwrite correctly!

* fix avro record schema parsing

* better raw table recreate

* rename raw table to match standard inserts

* full refresh overwrite does tmp table things

* small clean up

* small clean up

* remove errors entry if no errors

* pull out destination config into singleton

* clean up singleton stuff

* make sure dest config exists when trying to do lookups

* avoid stringifying null

* quick thoughts on alter table

* add basic cdc testcase

* tweak cdc test setup

* rename raw table to match standard inserts

* minor tweak

* delete exact sql string assertions

* switch to JSON type

* minor cleanup

* sql whitespace changes

* explain cdc deletions

* GCS Staging Full Refresh create temp table

* assert schema

* first out of order cdc test

* add another cdc test case (currently failing)

* better test structure

* make this work

* oops, fix test

* stop trying to delete deletion records

* minor improvements to code+test

* enable concurrent test runs on integration test

* move stuff to static initializer

* extract utility method

* formatting

* Move conditional to the base java package, replace conditionals which did not use the typing and deduping flag but should have been.

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* switch back to empty list; write big assert

* minor wording tweaks

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* DestinationConfigTest

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* formatting

* remove ParsedType

* 🤖 Auto format destination-gcs code [skip ci]

* 🤖 Auto format destination-bigquery code [skip ci]

* tests verify every data type

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* full update with all data types

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* move stuff to new base lib

* 🤖 Auto format destination-gcs code [skip ci]

* Automated Commit - Formatting Changes

* 🤖 Auto format destination-bigquery code [skip ci]

* fix test

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* asserts in dedupFinalTable

* better asserts in dedupRawTable

* [wip] test case for all data types

* 🤖 Auto format destination-gcs code [skip ci]

* 🤖 Auto format destination-bigquery code [skip ci]

* AirbyteTypeTest

* Automated Commit - Formatting Changes

* remove comments

* test chooseOneOf

* slightly better test output

* Automated Commit - Formatting Changes

* add some awful pretty print code

* more comment

* minor tweaks

* verify array/object type

* fix test

* handle deletions more correctly

* test toDialectType

* Destinations v2: better namespace handling (#27682)

* [wip] better namespace handling

* 🤖 Auto format destination-bigquery code [skip ci]

* wip also implement in gcs

* get gcs working (?)

* 🤖 Auto format destination-bigquery code [skip ci]

* remove duplicate method

* 🤖 Auto format destination-bigquery code [skip ci]

* fixed my code style settings

* make ci happy?

* 🤖 Auto format destination-bigquery code [skip ci]

* make ci happy?

* remove incorrect test

* blank line change

* initialize singleton

---------

Co-authored-by: octavia-squidington-iii <octavia-squidington-iii@users.noreply.github.com>

* reset args correctly

* Automated Commit - Formatting Changes

* more bash stuff

* parse implicit structs

* initialize singleton in more tests

* Automated Commit - Formatting Changes

* I missed this namespace handling thing

* test more schemas

* fix singular types specified in arrays

* Automated Commit - Formatting Changes

* disable test for unimplemented feature

* initialize singleton

* remove spec options; changelogs+metadata

* randomize namespace

* also bump dockerfile

* unremove namespace sanitizing in legacy mode

* ... disable the correct test

* even more unit test fixes!

* move integration test to integration tests

---------

Co-authored-by: Cynthia Yin <cynthia@airbyte.io>
Co-authored-by: Joe Bell <joseph.bell@airbyte.io>
Co-authored-by: octavia-squidington-iii <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: cynthiaxyin <cynthiaxyin@users.noreply.github.com>
2023-06-29 08:44:37 -07:00
Evan Tahler
e03404bbd4 Google Sheets passes QA check (#27782)
* Google Sheets passes QA check

* bump PR id
2023-06-28 08:19:16 +03:00
Evan Tahler
4fb1f98221 Fix destination-s3 build (#27786)
* bump version

* PR id

* shh normalization, shh

* remove a bunch of arm64 deps?

* might as well match the dockerfile
2023-06-27 17:15:44 -07:00
Evan Tahler
fd34fe3e2b Google Sheets Destination connector licenses to Elv2 (#27780)
* Google Sheets Destination connector licenses to Elv2

* PR Id
2023-06-27 16:31:12 -05:00
Evan Tahler
79dba56923 S3 and GCS connector license to Elv2 (#27725)
* S3 and GCS connector license to Elv2

* docs update

* docs
2023-06-26 18:27:18 -05:00
Evan Tahler
1f6aef98df yum clean all after every yum install to save space (#27555)
* `yum clean all` after every yum install to save space

* docs and versions

* update env clean

* fix python install confusion

---------

Co-authored-by: Augustin <augustin@airbyte.io>
2023-06-23 13:53:22 -07:00
Alex
7088910c99 Docs: Remove egress notice for destination connectors (#27667)
* add info blurb to Cloud Bucket Storage sources and destinations

* remove egress alert from destination docs

* remove egress alert from destinations, add it to sources

* revert sources to match master
2023-06-23 13:21:44 -05:00
Octavia Squidington III
bad8157c44 🤖 Cloud Availability updater: new connectors to deploy [20230621] (#27542) 2023-06-21 10:26:00 +02:00
Kostas Botsas
8057ead798 Fix typos in readme (#27387) 2023-06-15 06:50:18 -05:00
Philip Krauss
18553a28c3 🎉 New destination connector for Xata.io (#24192)
* Add Xata.io Destination Connector

* readme for connectir

* adjust versioning

* missing state propagation

* throw exception if inconsistentcy is found

* set version in dockerfile

* exchange mock for deprecated logger

* put invalid config to file

* changelog in docs

* add xata dest connector to builds.md

* bump dependency to newer sdk version

* bump to latest sdk version

* blank bootstrap

* use class logger

* correct way to use the logger

* unit tests

* add write integration test

* removed irrelevant log line

* raise exception on incorrect message

* e2e test for writing

* add metadata file

* remove unued file

* remove build md

* update metdata and docs

* fix formatting

---------

Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2023-06-14 17:55:17 -03:00
Jove Zhong
e63d371e10 🎉 New Destination: Timeplus (#21226)
* timeplus destination connector to push data to Timeplus via REST API/SDK

* 0.2.0, support nullable columns, add more test data

boolean and pokemon

* mark the apikey as secret

* us.timeplus.cloud as sample URL

* removed the unnecessary pass

* rewrite the logic of when to delete/create stream

* format code, following the PEP-8

* add write integration test

* replacing AirbyteLogger() with logging.getLogger('airbyte')

* add logs to destination.py, refine integration test

* add docs/integrations/destinations/timeplus.md

* add icon.svg

* set to 0.4.0 version

* upgrade timeplus sdk from 1.1.1 to 1.2.1

* add metadata file

* fix docker version

---------

Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2023-06-14 17:33:23 -03:00
Gaëtan Podevijn
eb14b968d8 Destination Databricks: connector supports schema evolution (schemaMerge) #26658 (#26942)
* https://prima-assicurazioni-spa.myjetbrains.com/youtrack/issue/SSDP-165/Enable-schema-evolution-in-the-Airbyte-Databricks-connector (#1)

* Ssdp 165/enable schema evolution in the airbyte databricks connector (#4)

* https://prima-assicurazioni-spa.myjetbrains.com/youtrack/issue/SSDP-165/Enable-schema-evolution-in-the-Airbyte-Databricks-connector

* bump minor version

* bump minor version

* Ssdp 165/add doc (#5)

* https://prima-assicurazioni-spa.myjetbrains.com/youtrack/issue/SSDP-165/Enable-schema-evolution-in-the-Airbyte-Databricks-connector

* bump minor version

* bump minor version

* Edit connector doc
2023-06-08 14:56:34 -03:00
Edward Gao
ae43522659 Destination TiDB: install normalization (#27025)
* install normalization

* changelog

* add to codeowners

* update changelog messages
2023-06-06 00:05:50 +00:00
Edward Gao
bd22b25ccf Destination Mssql: install normalization (#27034)
* install in mssql

* changelog+metadata

* update changelog message
2023-06-06 00:00:39 +00:00
Edward Gao
3aa4512b46 Destination Clickhouse: install normalization (#27036)
* install normalization

* changelog

* update changelog messages
2023-06-05 16:53:43 -07:00
JamesB797
776f650e9c Update pubsub.md correct typo bigquery to pubsub (#26984) 2023-06-05 09:32:24 -05:00
Edward Gao
7a75c0c0e6 Destination Snowflake: Enable in-connector normalization (#25782)
* add support for snowflake

* do useful copy things

* corrections

* add build dependency

* deeeerp

* version bumps + changelog

* try installing gcc-c++

* move stuff around

* copy snowflake

* bump version

---------

Co-authored-by: Cynthia Yin <cynthia@airbyte.io>
2023-06-01 14:14:28 -07:00
Marcos Marx
3aa0c284e2 🎉 New Destination: Vertica (#26718)
* initial code

* added vertica md file

* added vertica svg file

* Update vertica.md

Only Full Refresh Sync supported as per this PR.

* Update vertica.md

set 12.0.x for vertica documentation

* pointing to a database running on a cluster

* Automated Commit - Formatting Changes

* adding metadata.yml file

* move icon and add changelog

* rename icon

---------

Co-authored-by: dhanjimahto <dhanji.km@simplify3x.com>
Co-authored-by: dhanjimahto <122617093+dhanjimahto@users.noreply.github.com>
Co-authored-by: rakeshsimplify3xaccount <127210710+rakeshsimplify3xaccount@users.noreply.github.com>
Co-authored-by: dhanjimahto <dhanjimahto@users.noreply.github.com>
2023-05-30 11:39:23 -03:00
Edward Gao
bbdb3be868 Destination Redshift: Install normalization (#26165)
* do things in-connector

* Automated Change

* version bumps + changelog

* remove unnecessary yum package

* derp

* also build.gradle

* Automated Change

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
2023-05-25 23:40:48 +00:00
Edward Gao
cf2ded2bbb Destination Bigquery: small tweak to clarify logs (#26585)
* make logs less misleading

* version bumps + changelog

* tweak wording
2023-05-25 18:20:40 +00:00
Emma Forman Ling
360e2f9261 🐛 Update Convex destination connector to fix overwrite sync mode (#26103)
* auto-generated destination connector template

* add config and health check

* support full_refresh overwrite

* support full refresh overwrite, full refresh append, and incremental append

* write works!

* lint

* update readme

* add test messages

* list destination??

* surface error messages

* add integration test

* add destination docs

* update source docs

* change integration test to unit test

* add check unit test

* use a map for streams

* make compatible with API changes

* add indexes for primary keys

* poll for indexes to be backfilled

* simplified stream API

* support append_dedup sync mode

* cleanup

* add changelog

* fix changelog

* use indexes ready endpoint

* update types

* rename to include primary key in API, handle namespaces

* fix types

* update docs

* add better test cases

* fix docs link

* update API and headers

* fix formatting

* update API name to streaming_import

* improve request exception and add a test for error formatting

* remove secret config

* use /replace_tables endpoint instead of /clear_tables

* partial revert of replace_tables

* fix

* formatting and fix table names

* update docs

* update metadata and dockerfile

---------

Co-authored-by: Lee Danilek <lee@convex.dev>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2023-05-24 12:12:35 -03:00
Daemonxiao
1e1bdac65d TiDB Destination: add append_dedup mode (#19109)
* TiDB Destination: add append_dedup mode

* update Dockerfile

* update docs and metadata file

---------

Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
2023-05-23 15:28:58 -03:00
Edward Gao
67f3cdb125 🐛 Destination S3: Fix Parquet LZO compression (#26284)
* Revert "Move hadoop-lzo to test dependency (#21085)"

This reverts commit 1241569ba3.

* add basic test

* Automated Change

* version bumps, changelog

* Automated Change

* unused import

* Ran ./gradlew :spotlessJavaApply to trigger GitHub build

* regenerate registry

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: ryankfu <ryan.fu@airbyte.io>
2023-05-22 16:49:45 +00:00
Edward Gao
50e784b844 🐛 Destination Snowflake: mitigate infinite retries in snowflake jdbc driver (#26323)
* switch to forked snowflake jdbc driver

* oops. rebuild with retryCount < 100

* Automated Change

* version bumps + changelog

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
2023-05-20 06:08:34 -07:00
Edward Gao
c25afc4adb 🐛 Destination BigQuery (+denormalized): correctly parse buffer count from config (#26213)
* fix logic in parsing config

* simplify logic

* ugh

* holy moly that took way too many iterations

* version bumps / changelog

* Automated Change

*  Destination Bigquery: stop running normalization container for DAT (#25925)

* readme update

* allow passing additional flags to test containers

* remove build dependency

* Automated Change

* versioning updates

* restore denormalized change from master

* formatting changes

* formatting

* Automated Change

* update metadata file

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>

* fix version (#26218)

* Source Airtable: skip missing streams (#25946)

* Source Airtable: skip missing streams

* Move stream removal to a separate method, cover with tests

* Update changelog

* Fix flake warnings

* Update docs/integrations/sources/airtable.md

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* Update docs/integrations/sources/airtable.md

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* Automated Change

* Update link to docs in warning

* Automated Change

* Automated Change

* Automated Change

* “Empty-Commit”

---------

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>
Co-authored-by: arsenlosenko <arsenlosenko@users.noreply.github.com>

* 🎉 New Source: Ringcentral [Low code CDK] (#25701)

* Initial commit - All test passed

* add stream fax cover

* refactor docs

* fix schema, Added pagination

* Add several streams, fix schema

* fix schema, add streams, refactor docs

* EOF

* Resolve conflicts

* Resolve conflicts

* add metadata file

---------

Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>

* rebump version

* Automated Change

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: Joe Bell <joseph.bell@airbyte.io>
Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>
Co-authored-by: Joe Reuter <joe@airbyte.io>
Co-authored-by: Arsen Losenko <20901439+arsenlosenko@users.noreply.github.com>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>
Co-authored-by: arsenlosenko <arsenlosenko@users.noreply.github.com>
Co-authored-by: btkcodedev <btk.codedev@gmail.com>
Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
2023-05-18 16:45:24 +00:00
Joe Bell
9a4be977c1 Destination Bigquery: stop running normalization container for DAT (#25925)
* readme update

* allow passing additional flags to test containers

* remove build dependency

* Automated Change

* versioning updates

* restore denormalized change from master

* formatting changes

* formatting

* Automated Change

* update metadata file

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>
2023-05-18 00:46:32 +00:00
Ryan Fu
95dd113a5f Revert "Splits bases and updates build.gradle files (#25649)" (#26081)
* Revert "Splits bases and updates build.gradle files (#25649)"

This reverts commit c673b0a692.

* Bumps branch to prevent a conflict with publishing

* Forward fixes Snowflake to use singular base-java and develop within a new package within the same module

* Forcing automated change to merge changes
2023-05-15 21:56:40 +00:00
terencecho
780903ff80 🚨🚨 e2e-test: Standardize e2e testing destination spec (#25776)
* Standardize e2e testing destination

* Change casing

* Fix implementation

* Update versions and docs

* Do major version bump

* Update acceptance tests

* use correct name

* Fix implementation

* Automated Change

* Revert "Automated Change"

This reverts commit 816b66229f.

* Automated Change

* Fix unit test

---------

Co-authored-by: terencecho <terencecho@users.noreply.github.com>
2023-05-15 14:01:54 -07:00
Edward Gao
fb152a9a0a Normalization: Better handling for CDC transactional updates (#25993)
* try this?

* fix tests

* assert cdc values

* handle case where we have lsn but no updated_at

* readability improvements

* tweaks to test

* version bumps + changelogs

* Automated Change

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
2023-05-12 12:53:23 +00:00
Alex
f43cc9f3fd 📝 Add info on egress costs for Cloud storage connectors (#25935)
* add info blurb to Cloud Bucket Storage sources and destinations

* Apply suggestions from code review

Remove extra colon

Co-authored-by: Ben Church <ben@airbyte.io>

---------

Co-authored-by: Ben Church <ben@airbyte.io>
2023-05-09 17:33:49 -05:00
Ryan Fu
c673b0a692 Splits bases and updates build.gradle files (#25649)
* Splits bases and updates build.gradle files

* Fixed changelog out of sync

* Bumps version number and metadata files

* auto-bump connector version

* Downgraded untouched connector bumps

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-05-06 00:24:40 +00:00
Cynthia Yin
7f36de2efc 📚 Destination Redshift: update _airbyte_data column type in docs (#25854) 2023-05-05 16:00:39 -07:00