1
0
mirror of synced 2026-01-06 06:04:16 -05:00
Commit Graph

10794 Commits

Author SHA1 Message Date
Alexandre Girard
0a91a98370 Add descriptions and examples to component schema (#25117)
* Reference docs for backoff strategies

* Docs for most authentication mechansims

* Docs for CheckStream

* MinMaxDatetime

* DeclarativeStream

* DefaultErrorHandler

* CompositeErrorHandler

* update

* Update token expiry date description

* DPath extractor

* Add interpolation_context

* HttpResponseFilter

* RecordFilter

* RecordSelector

* DefaultPaginator

* CursorPagination

* OffsetIncrement

* Page Increment

* PrimaryKey

* HttpRequester

* request option and request path

* Schemas

* Spec

* Add field

* remove fields

* no auth and no pagination

* Delete deprecated comment

* Missing description

* Json Decoder

* OAuthConfigSpecification

* reorder

* add titles, examples, and descriptions for partition routers and datetime based cursor

* updates

* Update

* fix indentation

* Automated Commit - Formatting Changes

* Update as per feedback

* html tag

* generate models

* Update name

* do not use title as class name

* Update

* Add stream_interval and stream_partition to interpolation_context

* Automated Commit - Formatting Changes

* fix path

* format

---------

Co-authored-by: brianjlai <brian.lai@airbyte.io>
Co-authored-by: girarda <girarda@users.noreply.github.com>
2023-04-21 10:58:23 -07:00
Cynthia Yin
fd849d4a0c small docs fixes (#25377) 2023-04-21 10:49:57 -07:00
Marcos Marx
7c9392adf8 Destination S3 Glue: Fix precision and scale of decimal type (#25397)
* destination-s3-glue: Fix precision and scale of decimal type

The decimal type in Hive will default to a precision of 10 and a scale of 0, meaning
that it cannot handle numbers larger than 10 digits. The maximum values for precision
and scale are 38 and 38 respectively. This updates the decimal type specified in
destination schemas to use the maximum values for the broadest compatibility with
numeric values.

If a table is using the default precision of 10 and a number is present which exceeds
that many digits then the query engine will throw a `fieldValue cannot be null` error
message which is very cryptic.

* run seed generator

* Automated Change

* Update connectors.md

* Automated Change

---------

Co-authored-by: Tobias Macey <tmacey@mit.edu>
2023-04-21 13:25:00 -03:00
Ben Church
4b407e23b7 Add git checkout (#25376) 2023-04-21 09:16:05 -07:00
Marcos Marx
123a31e813 🎉 New Destination: Starburst Galaxy (#25399)
Co-authored-by: Mayank Vadariya <48036907+mayankvadariya@users.noreply.github.com>
2023-04-21 12:34:55 -03:00
Joe Reuter
62f3c0bfe0 Conenctor builder: Record processing page (#25267)
* authentication documentation

* some fixes

* fix broken link

* remove session token auth

* review comments

* first version of tutorial

* Revert "first version of tutorial"

This reverts commit bbdeb997f2.

* first version of tutorial

* review comments

* improvements

* partitioning documentation

* review comments

* improvements

* record processing doc

* review comments
2023-04-21 16:20:18 +02:00
Joe Reuter
5edce353a3 Connector builder: Incremental sync documentation (#25238)
* authentication documentation

* some fixes

* fix broken link

* remove session token auth

* review comments

* first version of tutorial

* Revert "first version of tutorial"

This reverts commit bbdeb997f2.

* review comments

* partitioning documentation

* incremental sync

* improvements

* review comments
2023-04-21 16:20:04 +02:00
Pekka Enberg
66ea940743 Fix Java destination template (#25390)
The currently generated Java destination code doesn't compile. Fix it
up.
2023-04-21 09:51:23 -03:00
iberchid
f5d494a451 Source Snapchat Marketing: Add missing story metrics to basic stats (#20897)
* Add missing story metrics to basic stats

* fix expected records and update dockerfile version

* auto-bump connector version

---------

Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-21 09:50:18 -03:00
Chris Wu
7a250d57fd Docs: Add AWS_SECRET_MANAGER to list of available options for storing secrets in documentation (#25374) 2023-04-21 09:32:37 -03:00
Marcos Marx
e473418c0e Update Issues Templates (#25365)
* update issue templates

* add titles and fix multiline text

* Automated Change

* Update issue-cli.yaml

* Update issue-connector.yaml

* Update issue-platform.yaml

* Update issue-connector.yaml
2023-04-21 08:17:13 -03:00
Artem Inzhyyants
eb671190f5 Source Amplitude: update changelog (#25392) 2023-04-21 13:08:49 +02:00
Arsen Losenko
0bebd4bdb4 Source Freshsales: update expected records (#25388) 2023-04-21 13:57:27 +03:00
Arsen Losenko
675df1aab7 Source Faker: update expected records (#25387) 2023-04-21 13:57:19 +03:00
Arsen Losenko
b6c81d688a Source Salesforce: update fields in Account expected_records (#25385) 2023-04-21 13:57:10 +03:00
Edward Gao
d9a50be557 Destination Databricks: use hive_metastore if catalog not set (#25366)
* use hive_metastore if catalog not set

* Automated Change

* add test case

* version bump + changelog

* auto-bump connector version

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-20 17:42:26 -07:00
Ben Church
de6811f3d8 Dagster registry generation and persist (#25260)
* Define legacy and latest registry at same time

* Fix file persist

* Get persist to work

* Expand generate models to all models

* Add new registry types

* Fix class name

* Get valid registry type definition

* Remove uuid hack

* Fix ids for json schemas

* Resolve issues

* Update legacy assets

* Add typed legacy registry

* Fix icon issue

* Regenerate models

* Update spec mask to use registry type

* Move v1 to v0

* Add json sanitized dict helper

* Fix tests and format

* Ensure we only get latest

* Code review comments

* fix missing spec error

* Move registry code to helper
2023-04-20 17:08:26 -07:00
Augustin
7d75e5e0dc fix typo in GHA workflow (#25372) 2023-04-20 17:15:13 -05:00
Augustin
b30a23de7e connectors-ci: use shared pip cache instead of locked (#25370) 2023-04-20 16:26:25 -05:00
Alexandre Girard
1e8cf8f5d5 low-code: Do not apply transforms on AirbyteLogMessages and AirbyteTraceMessages (#25290)
* Check the input type before applying transformations

* format

* remove debug prints
2023-04-20 14:12:22 -07:00
Artem Inzhyyants
aaf4bf056b Source Amplitude: refactor Events stream (#25317)
* Source Amplitude: Refactor Events Stream, based on Python CDK

* Source Amplitude: bump docker version

* Source Amplitude: Fix add get_updated_state

* Revert "Increase memory_limit for Amplitude (#25282)"

This reverts commit 06354ae4e4.

* Source Amplitude: Fix backport compatibility with new config

* Update source_definitions.yaml

* Source Amplitude: update catalogs


---------

Co-authored-by: artem1205 <artem1205@users.noreply.github.com>
2023-04-20 22:14:51 +02:00
Ana Loureiro
a6d6b06303 Source SFTP-Bulk: Support custom CSV separators (#19224)
* manage different separators

* tests

* update tests

* add optional parameter for separator

* add default separator

* typo

* add trailing newline

* fix spec.json order

* bump dockerfile version

* update changelog

* auto-bump connector version

* retrigger checks

* retrigger checks

---------

Co-authored-by: Sunny Hashmi <6833405+sh4sh@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-20 10:02:19 -04:00
Joost Stessens
a6acd135db 🎉 New Destination: Cumul.io (#23153)
* Included Cumul.io Destination connector

* add gradle file

* add dockerfile and format files

* Refactor DestinationCumulio class based on review

This commit refactors the DestinationCumulio class by removing some unused dummy variables and commented code that used to push data in the check method. 
Next to that I fixed a potential bug as pointed out by @grishick who reviewed this code (thanks for that!). Previously, the state message could have been yielded before any buffers were flushed, which could result in incorrectly indicating a synchronization has been completed while data could still be inside the buffer. Now, all buffers will be flushed before yielding that message.

* Chore: add Cumul.io logo svg to airbyte-config/init/src/main/resources/icons/

* Update two unit tests that were failing due to fixing bug in a previous commit

* fix tests

---------

Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2023-04-20 10:38:35 -03:00
Augustin
a342e14324 connectors-ci: build multi platform connector images (#25286) 2023-04-20 08:13:16 +00:00
Ella Rohm-Ensing
eecb080446 update link (#25349) 2023-04-19 19:24:00 -07:00
Akash Kulkarni
395de77a50 [JDBC sources] : Log database indexes per stream (#25345)
* [JDBC sources] : Log database indexes per stream

* Docker & doc bump

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-20 01:05:15 +00:00
Cynthia Yin
ae34a80cc5 Destination BigQuery + BigQuery denormalized: add configurable file buffer count (#25287)
* add configurable file buffer count for dest bq + bq denorm

* update changelog

* changelog nits

* auto-bump connector version

* manually bump connector version for bq denorm

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-20 00:11:14 +00:00
Riley Brook
f97d8a5846 Update managing-airbyte-cloud.md (#25341)
* Update managing-airbyte-cloud.md

* Update managing-airbyte-cloud.md

* Update managing-airbyte-cloud.md
2023-04-19 15:19:42 -07:00
Pedro S. Lopez
7d6eb43ae6 Add version info to publish comment (#25343) 2023-04-19 17:22:06 -04:00
Yevhen Sukhomud
fd70eb614d 24153 Destination Databricks: Fix support for external tables on S3 (#24657)
* 24153 Destination Databricks: Update s3 source external

* 24153 Destination Databricks: Fixed databricks.md formatting

* 24153 Destination Databricks: Update the version

* auto-bump connector version

---------

Co-authored-by: Evan Tahler <evan@airbyte.io>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-19 13:28:32 -07:00
Dominik Wagner
692e8c36c4 🎉Source Mailjet-Mail: Add listrecipient stream (#24689)
* add listrecipient stream

* update schemas to match source api

* bump dockerfile version

* update changelog

* auto-bump connector version

---------

Co-authored-by: sh4sh <6833405+sh4sh@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-19 20:18:20 +00:00
Conor
de3f342213 update to new version of dagger cli (#25340) 2023-04-19 14:47:36 -05:00
Ben Church
20283fe156 Dagger: Return true on valid exit in Dagger Commands (#25333)
* Return true on valid exit

* Update return type to bool
2023-04-19 19:24:22 +00:00
Henri Blancke
af3db20f64 🐛 Destination AWS Datalake: fix airbyte type in complex data types (#24642)
* [UPD] prefer airbyte type in complex data types

Signed-off-by: Henri Blancke <blanckehenri@gmail.com>

* [UPD] bump version

Signed-off-by: Henri Blancke <blanckehenri@gmail.com>

* [UPD] only for integers

Signed-off-by: Henri Blancke <blanckehenri@gmail.com>

* [FIX] changelog table

Signed-off-by: Henri Blancke <blanckehenri@gmail.com>

* [UPD] cast objects that can have additional props to string

Signed-off-by: Henri Blancke <blanckehenri@gmail.com>

* [ADD] tests

Signed-off-by: Henri Blancke <blanckehenri@gmail.com>

* auto-bump connector version

---------

Signed-off-by: Henri Blancke <blanckehenri@gmail.com>
Co-authored-by: Sajarin <sajarindider@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-19 15:12:41 -04:00
Lake Mossman
2b6aa624d9 Builder documentation: Pagination documentation (#25261)
* add offset increment and page increment docs

* add cursor pagination docs

* add full example of requests submitted by connector

* improve examples and explanation of page/cursor increment

* improve descriptions

* add section about accessing last record for cursor pagination

* add recommendation about page size

* add links to low-code and python CDKs
2023-04-19 11:09:56 -07:00
Arsen Losenko
67c7ea8743 Source Twilio: update expected records (#25320) 2023-04-19 20:23:54 +03:00
Joey Marshment-Howell
4a0c85c1a9 use actual release tag (#25311) 2023-04-19 18:12:31 +02:00
Sergio Ropero
819bba174f postgres-cdc checkpointing: fix LSN parsing bug + refactor for efficiency (#24582)
* Fix LSN parsing from Integer to Long

* rebasing

* Rebase

* Rebase

* Other casting

* Lock the file only when reading, so the file is free when parsing the object.
Increased from 1 to 166 checkpoints, and from skipping hundreds of checkpoints to never skip a state.

* Update load function documentation

* bump mysql and mssql

* cdc: refactor to remove debezium dependency from connector packages

* use gradle's shared dependency

* more refactoring

* upgrade docker version

* resolve master merge conflicts

* Automated Change

* minor changes

* resolve merge conflicts

* avoid deserializing multiple times

* simplify

* enable checkpointing for Postgres

* more improvements

* enable assertions

* changelog + bump version

* auto-bump connector version

* auto-bump connector version

* manual bump

---------

Co-authored-by: subodh <subodh1810@gmail.com>
Co-authored-by: subodh1810 <subodh1810@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-19 21:05:50 +05:30
Catherine Noll
2bc79893aa source-amplitude: Only fetch date_time_fields once per request (#25315) 2023-04-19 10:47:59 -04:00
midavadim
0b9847742f 🎉 Source Google Ads - fixed schema type for "data_type: ENUM" and "is_repeated: true" to array of strings (#24945)
* fixed schema type for "data_type: ENUM" and "is_repeated: true" to array of strings

* added SAT for GAQL

* updated version

* Added unittest for get_json_schema

* slight enhancement

* fixed empty lines

* auto-bump connector version

---------

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-19 15:35:54 +03:00
josephkmh
09222b1f0c Bump Airbyte version from 0.44.0 to 0.44.1 v0.44.1 2023-04-19 11:45:34 +00:00
Arsen Losenko
f40a6b2530 Source Hubspot: update expected records (#25309)
* Source Hubspot: update expected records

* Update expected records
2023-04-19 14:08:14 +03:00
Joe Reuter
92ce26bc73 Connector builder: Tutorial (#25153)
* authentication documentation

* some fixes

* fix broken link

* remove session token auth

* review comments

* first version of tutorial

* Revert "first version of tutorial"

This reverts commit bbdeb997f2.

* first version of tutorial

* review comments

* improvements

* improvements

* review comments
2023-04-19 10:55:02 +00:00
Joe Reuter
e19bc57352 remove session token auth (#25142) 2023-04-19 12:27:23 +02:00
Marcos Marx
10db658097 Source Stripe: add account stream (#25136)
* add accounts stream

* add docs

* lint

* fix

* fix

* fix test

* fix tests

* fix account schema

* fix expected records and schema

* remove account from abnormal state

* update dockerfile and update docs

* Update Dockerfile

* Update source_definitions.yaml

* Update test_source stream count

* auto-bump connector version

---------

Co-authored-by: Haithem Souala <haithem.souala@woopit.fr>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-19 13:03:39 +03:00
Augustin
2cdde75ed9 connectors-ci: fix java connectors testing (#25268)
* connectors-ci: include destination-azure-blob-storage gradle project as a destination build dep

* fix integration test return value
2023-04-19 01:05:40 -05:00
Cole Snodgrass
2199f9734d increase postgres heartbeat from 3600->7200 (#25304) 2023-04-18 15:04:43 -07:00
Artem Inzhyyants
ba65654d6c Source Amplitude: Reduce request_time_range (#25281)
* Source Amplitude: Reduce request_time_range

* Source Amplitude: Reduce request_time_range

* Update docs/integrations/sources/amplitude.md

Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>

* Update docs/integrations/sources/amplitude.md

* auto-bump connector version

---------

Co-authored-by: Serhii Lazebnyi <53845333+lazebnyi@users.noreply.github.com>
Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2023-04-18 23:12:01 +02:00
Ben Church
5563179782 Dagster: rename catalog to registry (#25254)
* rename catalog to registry in metadata service

* rename catalog to registry in metadata files

* Run generate models

* Fix missed renames

* Add github personal access token

* Run black

* Automated Change

---------

Co-authored-by: bnchrch <bnchrch@users.noreply.github.com>
2023-04-18 22:15:11 +02:00
Ella Rohm-Ensing
b053ad6b2c CAT tools/acceptance_test_config_migration: Make relatively configurable per migration, add (hacky) ability to run CAT tests locally for multiple connectors (#24377)
* Pass argument along, add test that should pass and test that should fail

* Add tests with additionalProperties

* Set additionalproperties=false when not set|

* Parametrize test cases

* Make the behavior 'optional'

* Fix parametrization for all combinations

* Improve CI credentials README and rename param

* Update naming to be clearer about columns only

* record_has_unexpected_field > record_has_unexpected_column

* Automated Change

* Hacking the CAT dockerfile and run script to test my changes specifically

* First crack at running CAT on all connectors

* Write during instead of after all tests

* Async-ify it

* Add ability to define max concurrency

* Write successes

* ci_credentials: fix overwriting 'data' before getting nextPageToken

* Adjustible num_semaphores, check to make sure it's an airbyte connector first

* Automated Change

* Make create_issues and create_prs more configurable, add issue for fail_on_extra_columns

* Add ability to pass in sources as a list or from a txt file

* Add logs to issue, make project nullable

* Migrate multiple connectors

* Add cli args

* use ruamel.yaml to preserve ordering

* Separate config loading from config migration

* Add ability to pass in lists of sources to test. Sort output by exit codes. Fix max_concurrency flag

* Default to testing only beta and GA connectors

* Always write test output when available

* Revert "Add cli args"

This reverts commit b538a8c696.

* Remove slash

* Don't run on alpha connetors, handle older config style

* Don't migrate to new format, preserve quotes and long lines

* Automated Change

* Update issue, don't run for alpha connectors

* Automated Change

* Add bypass for extra fields test

* Add bypass for extra fields test

* Rename run_tests script

* Rename module

* Update args usage, small changes

* Refactor create_issues.py

* Clean up run_tests.py

* Sort out arg parsers

* Pull out get_valid_definitions_from_args

* Import definitions module instead of methods

* Use config files to provide constants for each migration

* Handle FileNotFoundError in create_issues.py, improve logging

* Rename to migrations, reference name of folder via utils

* Update readmes for migration modules, add script for getting outputs

* Use tmp dir, correct path for issue reference

* Fix bash script

* Fix create command, pull out test results insertion

* Update call to update_configuration

* add precommit to requirements

* Reorder README

* README cleanup for test and create issues

* README cleanup for create_prs and config_migration

* More readmes! Readmes galore

* allow_beta

* Restore hacky changes to dockerfile and acceptance-test-docker

* Handle 'other' release stages

* Update readme

* Remove TODO, add comments to shell script

* format according to gradle

* format

* Fix formatting

---------

Co-authored-by: marcosmarxm <marcosmarxm@users.noreply.github.com>
Co-authored-by: erohmensing <erohmensing@users.noreply.github.com>
2023-04-18 19:58:31 +00:00