mirror of synced 2025-12-21 11:01:41 -05:00
Commit Graph

851 Commits

Author SHA1 Message Date
LiRen Tu
61a18af5a8 Publish source bamboo hr (#5698)
* Add secret variable for source bamboo hr

* Add bamboo hr to connector catalog
2021-08-27 04:28:49 -07:00
Jared Rhizor
164d13866e allow scheduler and server to run on separate nodes (#5506)
* allow scheduler and server to run on separate nodes

* re-add workspace mount for docker compose only

* remove stacktrace printing

* add affinity testing components

* reorder mounts

* just try a two node cluster

* add waiting log line

* seed containers are now axed

* remove unused var

* add comment

* rename to integration-test
2021-08-26 15:51:16 -04:00
Dmytro
3630cdeded 🎉 New Source: Amazon Ads (initial version) (#5023)
* 3772: Amazon Ads source, initial version


Co-authored-by: Dmytro Rezchykov <dmitry.rezchykov@zazmic.com>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>
2021-08-26 11:15:38 +03:00
Serhii Lazebnyi
2ee462a9a1 🎉 New Source-connector for Google Analytics v4 (#5290)
* Add Google Analytics v4 implementation

* Add docs and connector index

* Fix a broken link to Airbyte CDK

* Fix a broken link to source acceptance tests docs

* Add condition for a full refresh or incremental stream

* Add unit tests

* Fix formatting to flake8

* Updated to review

* Added logger for custom reports validation

* Add comments to code

* Updated to format
2021-08-25 17:55:08 +03:00
Serhii Lazebnyi
ce5d8d6758 🎉 New Source: Trello (#5501)
* Add Trello implementation

* Add doc string to TrelloAuthenticator

* Add spaces to Docker file

* Add rate limit balancer

* Add logger to load balancer
2021-08-25 14:29:50 +03:00
Marcos Marx
088a138c0e Add oracle dest creds to get from secrets (#5578)
* add oracle dest creds to get from secrets

* add oracle in publish and test workflow
2021-08-23 16:11:18 -03:00
Dmytro
5560df7747 Add Braintree source connector (#5362)
* 5236: Add Braintree source connector

- Add Braintree source connector based on airbyte CDK
- Remove Braintree singer based connector

Co-authored-by: Dmytro Rezchykov <dmitry.rezchykov@zazmic.com>
2021-08-19 11:49:45 +03:00
Baz
0f55bf63d5 📝 🎉 Source Dixa: Migrated to CI Sandbox, refactored code structure for future support (#5367) 2021-08-18 21:59:33 +03:00
Sherif A. Nada
179d4b366e publish PR #5339 (#5450) 2021-08-17 15:10:22 -07:00
LiRen Tu
79b8fd5c12 Remove seed generation task and seed from YAML files (#5335)
## What
- This is the first PR for #4890.
  - This PR does not remove the config volume.
  - This PR does not mount the directories for the local connectors.
- Resolves #5373.

## How
- Previously, the seed container copied the configs to the storage root. That operation could take some time to complete, so `CONFIG_DIR` might not show up right away, and we could not infer anything from the existence of this directory. Now that the seed generation step has been removed, we can tell immediately whether `CONFIG_DIR` exists.
  - If `CONFIG_DIR` exists, it means the user has just migrated Airbyte from an old version that uses this file system config persistence.
  - Otherwise, we can seed the config persistence from the YAML files.
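
The decision described above can be sketched roughly as follows. This is only an illustration in Python, not the code from this PR: the function name `choose_config_source` and the two returned labels are hypothetical, and only the `CONFIG_DIR` existence check comes from the description.

```python
import tempfile
from pathlib import Path


def choose_config_source(config_dir: Path) -> str:
    """Decide how to populate the config persistence on startup.

    Hypothetical helper: the name and the returned labels are
    illustrative assumptions, not identifiers from the Airbyte codebase.
    """
    # With no seed container racing to create it, the directory's
    # existence can be trusted as soon as we check it.
    if config_dir.is_dir():
        # The user has just migrated from a version that used the
        # file system config persistence.
        return "migrate_from_file_system"
    # No legacy directory: seed the config persistence from the YAML files.
    return "seed_from_yaml"


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        print(choose_config_source(Path(tmp)))
        print(choose_config_source(Path(tmp) / "missing"))
```

Returning a label rather than performing the migration keeps the branch easy to test in isolation.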
2021-08-17 11:40:06 -07:00
Davin Chia
07d854f564 Set up Shortio integration test and release connector. (#5418) 2021-08-16 12:27:53 +08:00
Yaroslav Dudar
5373965ffc 🎉 New Source: Bing Ads (#4911)
Bing Ads Source connector. Streams: Account, Campaign, AdGroup, Ad
2021-08-13 18:13:40 +03:00
midavadim
fd52290f15 Source google spreadsheets: fix error during listing sheets with diagram only (#5233)
* Fix error during listing sheets with diagram only

* added PR number

* fixed formatting

* add dummy test

* added filter for grid sheets, added SAT tests.

* added new GOOGLE_SHEETS_TESTS_CREDS env

* fixed formatting
2021-08-11 21:25:19 +03:00
Vadym
ba322ac580 🚀 Amazon SP: expand endpoint support (#4863)
* Add GET_FBA_INVENTORY_AGED_DATA data

* Add GET_MERCHANT_LISTINGS_ALL_DATA stream support

* Update schemas

* Update configured_catalog.json

* Update connector to airbyte-cdk

* Add amazon seller partner test creds

* Update state sample files

* Apply code format

* Update acceptance-test-config.yml

* Add dummy integration test

* Refactor auth signature.
Update streams.py

* Remove print_function import from auth.py

* Refactor source class.
Add pydantic spec.
PR fixes.

* Add dummy integration test

* Typing added.
Add _create_prepared_request docstring.

* Update docs and spec

* Post merge code fixes
2021-08-06 17:10:17 +03:00
Oleksandr
7660245915 🎉 New Source-connector for Zuora (#4661)
#2664 - closing New Source: Zuora (https://github.com/airbytehq/airbyte/issues/2664)

Co-authored-by: Oleksandr Bazarnov <oleksandr.bazarnov@globallogic.com>
2021-08-05 13:41:57 +03:00
midavadim
5c717607b8 Build report - changed os environment variable name to SLACK_BUILD_REPORT (#5147)
* Build monitoring script init

* Added GH action which run build-report script

* use slack webhook now

* added notification in case of failure, changed status icon

* Fixed os environment variable name

* Fixed os environment variable name to build report webhook
2021-08-03 00:48:44 +03:00
midavadim
8a4e7a4080 Build report (#5104)
* Build monitoring script init

* Added GH action which run build-report script

* use slack webhook now

* added notification in case of failure, changed status icon
2021-08-02 23:44:38 +03:00
George Claireaux
b7c8101cab 🎉 New Source: Apify Dataset (#5081)
* Initial version of Apify Dataset source connector

* Add apify dataset to source definition

* Make sure clean is False by default

* Remove need for user id and token since it is not needed for reading dataset

* Add comment

* Update README

* Add docs to summary

* Add changelog to readme

* Add link to README

* Add PR link

* Initial version of Apify Dataset source connector

* Add apify dataset to source definition

* Make sure clean is False by default

* Remove need for user id and token since it is not needed for reading dataset

* Add comment

* Update README

* Add docs to summary

* Add changelog to readme

* Add link to README

* Add PR link

* Address comments

* Add newline

* added secrets stuff

* added environment more-secrets

* added more-secrets environment

* removed environment more-secrets to add in separate PR

* Docs nits

* Make sure that dataset items come in the correct order

* lint

* User partial function

* lint

* Address comments:

* newline

* format fix

* format

* bump version for formatting fix

Co-authored-by: Matej Hamas <matej.hamas@gmail.com>
2021-08-02 19:57:25 +01:00
Maksym Pavlenok
85b0bb22d2 🎉 Source Zendesk: Migration from Singer to CDK (#4861)
* init the new connector source-zendesk-support

* Finished a development of ZenDesk streams

* Source ZenDesk: finished

* Source ZenDesk: remove unused test files

* Source ZenDesk: format and validate code

* Source Zendesk: update docs

* Remove unused files

* add a stream_slices logic for ticket_comments stream

* 🎉 Python CDK: Allow setting network adapter args on outgoing HTTP requests  (#4493)

* 🎉 Destination S3: support `anyOf` `allOf` and `oneOf` (#4613)

* Support combined restrictions in json schema

* Bump s3 version

* Add more test cases

* Update changelog

* Add more test cases

* Update documentation

* Format code

* SAT: verify `AIRBYTE_ENTRYPOINT` is defined (#4478)

* save changes required for work; TODO locate all places that need to be updated to make test working

* move new test inside test_spec

* apply suggestions

* change return type + add check env = space_joined_entrypoint

* requested

* add check entrypoint with env

* bump SAT --version && changelog update

* merge && fix changelog

* changes

* add dynamic docker runner creator + test having properties

* update the names

* change names

* make fixtures

* upd text

* Update airbyte-integrations/bases/source-acceptance-test/unit_tests/test_spec_unit.py

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* requested changes

* Update airbyte-integrations/bases/source-acceptance-test/unit_tests/test_spec_unit.py

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Update airbyte-integrations/bases/source-acceptance-test/unit_tests/test_spec_unit.py

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* apply requested changes

* change names (requested)

* move binary strings to standard with conversion in builder

* fixing merge-conflict side effect

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Migrate Quickstart to use PokeAPI (#4615)

* Migrate Quickstart to use PokeAPI

* Words words words

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Left isn't right (#4616)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Create on on-oci-vm.md (#4468)

* Create on on-oci-vm.md

Deployment guide for Airbyte on Oracle Cloud Infrastructure (OCI) VM

* Update on-oci-vm.md

Adding the image links and uploading images to the repository

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update on-oci-vm.md

* Add files via upload

* Update on-oci-vm.md

* Add files via upload

* Update on-oci-vm.md

* Update on-oci-vm.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* 🐛  platform: Fix silent failures in sources (#4617)

* add oracle deployment guide to summary.md (#4619)

* Mailchimp fix url-base (#4621)

* minimal change to show acceptance test failure

* exactly fix

* bump version and readme

* upd

* 🎉 New Source: Paypal Transaction (#4240)

* Added spec.json

* Initialization

* added oauth2 authorization

* added spec, check, discover + catalogs/configurared_catalogs

* updated request_params

* added paging, slicing (1d)

* Use oauth2 for paypal

* incremental sync, acceptance test

* incremental sync, acceptance test

* Added spec.json

* Initialization

* added oauth2 authorization

* added spec, check, discover + catalogs/configurared_catalogs

* updated request_params

* added paging, slicing (1d)

* Use oauth2 for paypal

* incremental sync, acceptance test

* incremental sync, acceptance test

* Added spec.json

* Initialization

* added oauth2 authorization

* added spec, check, discover + catalogs/configurared_catalogs

* updated request_params

* added paging, slicing (1d)

* Use oauth2 for paypal

* incremental sync, acceptance test

* updated slices and api limits, added validation for input dates

* added tests, fixed cursor related information in schemas and configured catalogs, removed old comments, re-arranged Base PaypalTransactionStream class

* added input param 'env' to support production and sandbox envs

* added support for sandbox option, updated pattern for optional end date option

* added github secrets

* added support for sandbox option, updated pattern for optional end date option

* fixed Copyright date, removed debug messages

* added docs

* fix for test failure - The sync should produce at least one STATE message

* removed optional parameter 'end_date'

* removed detailed info about balances schema

* Delete employees.json

* Delete customers.json

* Added requests_per_minute rate limit

* added unit tests, added custom backoff

* added test for stream slices with stream state

* removed comments

* updated docs pages

* fixed format for json files

* fixed types in schemas and link to the schema. fixed primary key for Transactions stream

* updated stream slices

* Updated tests, unified stream_slices for both streams, all instance variables instantiated directly in __init__ method

* added CHANGELOG.md

* Added build seeds

* fixed closing double quotation mark

* added paypal entry in builds.md

* add fixture helper

* added paypal transaction generator script

* fixed styling

* maximum allowed start_date is extracted from API response now.

* fixed schemas

* fixed schemas - removed datetime

* now maximum_allowed_start_date is identified by last_refreshed_datetime attr in API response.

* added possibility to specify additional properties

Co-authored-by: Sherif Nada <snadalive@gmail.com>

* set db version after full import is complete (#4626)

* set db version after full import is complete

* check db version in the last step

* add comment

* Fix docs formatting

* Redirect old link to upgrading tutorial (#4635)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Fix broken link in SUMMARY.md

* Airflow Demo: Remove superset in down.sh (#4638)

* Remove superset in down.sh

* Clean up superset containers before creating them in up.sh

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Airflow demo: Clean up scripts and more clearly describe actions (#4639)

* Airflow demo: Script cleanup

* Correct docker compose name for airflow file

* Final fixes

* Clean up airbyte destination

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* 🎉 Add documentation for configuring Kube GCS logging. (#4622)

* Bump version: 0.27.0-alpha → 0.27.1-alpha (#4640)

* 0.27.1 Platform Patch Notes (#4644)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* 🎉 New Source: Zendesk Sunshine (#4359)

* pre-PR

* add git config

* format

* Update airbyte-integrations/connectors/source-zendesk-sunshine/requirements.txt

upd requirements.txt remove extra

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Update airbyte-integrations/connectors/source-zendesk-sunshine/source_zendesk_sunshine/streams.py

backoff time int to float (btw real return type in headers is integer)

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* requested changes

* fix newline absence && rm unnecessary temp file

* url_base to property

* rm extra var coming property

* rm extra var coming property

* save

* finishing updating the documentation

* forgotten definition

* add nullable to pass the test

* fix date in the log

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* 0.27.1 Connector Patch Notes (#4646)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Update connector certification table. (#4647)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* 🐛 Stub out the GCP Env Var in Docker to prevent noisy and harmless errors. (#4642)

* Add this to prevent noisy errors.

* Add hint to Airflow guide about local example (#4656)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* fix version for kube automatic migration support (#4649)

* format zendesk sunshine connector (#4658)

* 🎉 New source: Dixa (#4358)

* Turn on MYSQL normalization flag. (#4651)

* Turn on normalization flag. Bump versions

* Combine admin and settings (#4525)

* Add side menu component

* Add side menu to settings page. Remove admin link from sidebar

* Move NotificationPage

* Move ConfigurationPage

* Add Sources and Destinations pages to Settings. Delete Admin page

* Add MetricsPage

* Edit Notifications and Metrics pages

* Update feedback for metrics and notification pages

* Add update icons data to side menu

* Add AccountPage

* Job history purging (#4575)

* WIP: Job history purging

* Created test cases that handle variations of job history purging configuration

* Typo fix

* Expanded test cases to control for job history on multiple connections at once.

* Handle latest job with saved state correctly regardless of order of ids

* Whitespace

* Externalized sql. Cleaned up constants.

* Cleaned up test case persistence code and structure

* Whitespace and formatting per standard tooling.

* 0.27.1 Announcement Summary (#4678)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* 🐛 Source Sendgrid: add start_time config and correct primary_key (#4682)

* add start_time config and correct primary_key

* correct integration tests

* correct type

* config txt and primary_key

* test to show how automatic migration handles deprecated definitions (#4655)

* test to show definitions not present in latest seed would be deleted in automatic migration

* format

* add deprecated config being used scenario

* Source dixa: fix unit tests (#4690)

* introduce common abstraction for CDC via debezium (#4580)

* wip

* add file

* final structure

* few more updates

* undo unwanted changes

* add abstract test + more refinement

* remove CDC metadata to debezium

* rename class + add missing property

* move debezium to bases + upgrade debezium version + review comments

* downgrade version + minor fixes

* reset to minutes

* fix build

* address review comments

* should return Optional

* use common abstraction for CDC via debezium for mysql (#4604)

* use new cdc abstraction for mysql

* undo wanted change

* pull in latest changes

* use renamed class + move constants to MySqlSource

* bring in latest changes from cdc abstraction

* format

* bring in latest changes

* pull in latest changes

* use common abstraction for CDC via debezium for postgres (#4607)

* use cdc abstraction for postgres

* add files

* ready

* use renamed class + move constants to PostgresSource

* bring in the latest changes

* bring in latest changes

* pull in latest changes

* Source Dixa: Pin tz in ConversationExport.ms_timestamp_to_datetime (#4696)

* Source Dixa: add to connector index (#4701)

* allow injecting filters for server (#4677)

* allow injecting filters

* fmt

* upgrade postgres version for new cdc abstraction (#4702)

* Fix dependencies for Superset demo (#4705)

* Fix superset dependency location

* Add some Superset setup

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* 📚  add SSH instructions for OCI VM setup (#4684)

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* upgrade mysql version for new cdc abstraction (#4703)

* Update with ALTER TABLE statements (#4707)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* remove unused deps (#4512)

Co-authored-by: Davin Chia <davinchia@gmail.com>

* fix config init race condition (#4679)

* 🐛 Destination S3: fix minio output for parquet format

* Bump destination s3 version (#4718)

* Fix scheduler race condition. (#4691)

* Periodic connector tests workflow: add `Accept` header per github docs recommendation (#4722)

* allow launching integration tests from workflow dispatch (#4723)

* Bump version: 0.27.1-alpha → 0.27.2-alpha (#4724)

* 🐛 Source Square: Update _send_request method due to changes in Airbyte CDK (#4645)

* 🎉 Destination Snowflake: tag snowflake traffic with airbyte ID to enable optimizations from Snowflake (#4713)

* 🎉 New source: Typeform (#4541)

Typeform source: Forms and Responses streams

* Upgrade postgres and redshift destination to remove basic_normalization attribute (#4725)

* upgrade snowflake,redshift,postgres to remove basic_normalization

* undo snowflake

* undo snowflaketest

* fix broken assertions for automatic migration tests (#4732)

* Slightly improve sed-based yaml parsing (#4721)

Previous sed did not handle the valid `profile: foo`

* throw exception if we close engine before snapshot is complete + increase timeout for subsequent records (#4730)

* throw exception if we close engine before snapshot is complete + increase timeout for subsequent records

* add comment + bump postgres version to use new changes

* allow publishing airbyte-server to local maven repo (#4717)

* allow publishing airbyte-server to local maven repo

* Stub this out so the name that is created is airbyte-server-0.27.1-alpha.jar and not airbyte-server-0.27.1-alpha-all.jar.

* Add comments.

* see if this fixes build

Co-authored-by: Davin Chia <davinchia@gmail.com>

* CDK: Add initial Destination abstraction and tests (#4719)

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Update docs on GitHub connector now that its Airbyte native (#4739)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Remove statement about Postgres connector being based on Singer (#4740)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* fix flaky migration acceptance test (#4743)

* upgrade fabric8 client (#4738)

* 🎉 Source MSSQL: implementation for CDC (#4689)

* first few classes for mssql cdc

* wip

* mssql cdc working against unit tests

* increment version

* add cdc acceptance test

* tweaks

* add file

* working on comprehensive tests

* change isolation from snapshot to read_committed_snapshot

* finalised type tests

* Revert "change isolation from snapshot to read_committed_snapshot"

This reverts commit 20c6768071.

* small docstring fix

* remove unused imports

* stress test fixes

* minor formatting improvements

* mssql cdc docs

* finish off cdc docs

* format fix

* update connector version

* add to changelog

* fix for sql server agent offline failing cdc enable on tables

* final structure

* few more updates

* undo unwanted changes

* add abstract test + more refinement

* remove CDC metadata to debezium

* use new cdc abstraction for mysql

* undo wanted change

* use cdc abstraction for postgres

* add files

* pull in latest changes

* ready

* rename class + add missing property

* use renamed class + move constants to MySqlSource

* use renamed class + move constants to PostgresSource

* move debezium to bases + upgrade debezium version + review comments

* downgrade version + minor fixes

* bring in latest changes from cdc abstraction

* reset to minutes

* bring in the latest changes

* format

* fix build

* address review comments

* bring in latest changes

* bring in latest changes

* use common abstraction for CDC via debezium for sql server

* remove debezium from build

* finalise PR

* should return Optional

* pull in latest changes

* pull in latest changes

* address review comments

* use common abstraction for CDC via debezium for mysql (#4604)

* use new cdc abstraction for mysql

* undo wanted change

* pull in latest changes

* use renamed class + move constants to MySqlSource

* bring in latest changes from cdc abstraction

* format

* bring in latest changes

* pull in latest changes

* use common abstraction for CDC via debezium for postgres (#4607)

* use cdc abstraction for postgres

* add files

* ready

* use renamed class + move constants to PostgresSource

* bring in the latest changes

* bring in latest changes

* pull in latest changes

* lower version for tests to run on CI

* format

* Update docs/integrations/sources/mssql.md

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* addressing review comments

* fix for testGetTargetPosition

* format changes

Co-authored-by: George Claireaux <george@claireaux.co.uk>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* bump up MSSQL version for cdc (#4694)

* first few classes for mssql cdc

* wip

* mssql cdc working against unit tests

* increment version

* add cdc acceptance test

* tweaks

* add file

* working on comprehensive tests

* change isolation from snapshot to read_committed_snapshot

* finalised type tests

* Revert "change isolation from snapshot to read_committed_snapshot"

This reverts commit 20c6768071.

* small docstring fix

* remove unused imports

* stress test fixes

* minor formatting improvements

* mssql cdc docs

* finish off cdc docs

* format fix

* update connector version

* add to changelog

* fix for sql server agent offline failing cdc enable on tables

* final structure

* few more updates

* undo unwanted changes

* add abstract test + more refinement

* remove CDC metadata to debezium

* use new cdc abstraction for mysql

* undo wanted change

* use cdc abstraction for postgres

* add files

* pull in latest changes

* ready

* rename class + add missing property

* use renamed class + move constants to MySqlSource

* use renamed class + move constants to PostgresSource

* move debezium to bases + upgrade debezium version + review comments

* downgrade version + minor fixes

* bring in latest changes from cdc abstraction

* reset to minutes

* bring in the latest changes

* format

* fix build

* address review comments

* bring in latest changes

* bring in latest changes

* use common abstraction for CDC via debezium for sql server

* remove debezium from build

* finalise PR

* should return Optional

* pull in latest changes

* pull in latest changes

* address review comments

* use common abstraction for CDC via debezium for mysql (#4604)

* use new cdc abstraction for mysql

* undo wanted change

* pull in latest changes

* use renamed class + move constants to MySqlSource

* bring in latest changes from cdc abstraction

* format

* bring in latest changes

* pull in latest changes

* use common abstraction for CDC via debezium for postgres (#4607)

* use cdc abstraction for postgres

* add files

* ready

* use renamed class + move constants to PostgresSource

* bring in the latest changes

* bring in latest changes

* pull in latest changes

* lower version for tests to run on CI

* bump up mssql version for cdc

* format

* Update docs/integrations/sources/mssql.md

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* addressing review comments

* fix for testGetTargetPosition

* format changes

Co-authored-by: George Claireaux <george@claireaux.co.uk>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* fixed broken links and styling (#4747)

* Fix enabling connection in refresh catalog mode (#4527)

* Fix enabling connection in refresh catalog mode

* Do not update deprecated connectors (#4674)

* Do not update deprecated connectors

* Fix various connectorDefinition issues: disappearing button, wrong id used for destination update

* 🐛 Source Slack: add float_ts field (#4683)

* rename float_ts to ts cursor_field

* add float_ts

* change float_ts to number

* change channel_msg

* bump version

* increase default timeout_seconds slack acc test

* timeout_seconds to 1750

* timeout_seconds to 3600 :p

* add changelog for slack connector

* copy docs to webapp docker image (#4522)

* use kube service user for pod sweeper (#4737)

* use kube service user for pod sweeper

* add pod sweeper logs

* temporarily switch to stable for testing

* temporarily remove building steps for kube testing since it can use prod images

* output date strings from date command

* load stable images

* remove loading since it can pull the images

* increase window for success storage to two hours

* revert test logging changes

* 🐛 Source GitHub: fix bug with `IssueEvents` stream and add handling for rate limiting (#4708)

* Few updates for GitHub source

Set correct `cursor_field` for `IssueEvents` stream.
Add rate limit handling.
Add handling for 403 error.
Add handling for 502 error.

Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* 🐛 Fix some api-spec errors. (#4742)

* Source PostHog: Use account information for checking the connection (#4692)

* this should fix the check if no records in annotations stream

* update schemas for new SAT requirements && apply user hint upgrade on wrong api key

* save schema upd

* upd insights schema

* upd insights schema2

* upd insights schema3

* upd insights schema4

* upd insights schema5 (null is joking)

* upd insights schema6 (null is joking)

* upd insights schema7

* upd insights schema8

* upd insights schema8

* bump version && docs

* SAT: Improve error message when data mismatches schema (#4753)

* improve message when data mismatch schema

Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>

* increase sleep duration + show logs in CI (#4756)

* Fixed cockroachdb repo image (#4758)

* Bump version: 0.27.2-alpha → 0.27.3-alpha (#4761)

* update kube docs (#4749)

* fix kube overlay version (#4765)

* Split Platform and Connector Builds (#4514)

* remove second docs check in build(#4766)

* Restore template generator and fix formatting. (#4768)

* connector generate: fix chown logic (#4774)

* Remove example use cases from docs (#4775)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Update README.md

* 🎉 All java connectors: Added configValidator to check, discover, read and write calls (#4699)

* Added configValidator to java connectors

* 🎉 Stripe Source: Fix subscriptions stream to return all kinds of subscriptions (including expired and canceled) (#4669)

#4669 Stripe Source: Fix subscriptions stream to return all kinds of subscriptions (including expired and canceled)
Co-authored-by: Oleksandr Bazarnov <oleksandr.bazarnov@globallogic.com>

* Add note about orphaned Airbyte configs preventing automatic upgrades (#4709)

* Add note about removing orphaned Airbyte configs

* Remove excess baggage

* Add a resetting section to make this more clear.

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Patch 0.27.2 and 0.27.3 platform notes (#4792)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Connector notes for 0.27.3 (#4794)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* Add new logo to GitHub page (#4796)

Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>

* 🎉 New Destination: Google Cloud Storage (#4784)

* Adding Google Cloud Storage as destination

* Removed few comments and amended the version

* Added documentation in docs/integrations/destinations/gcs.md

* Amended gcs.md with the right pull id

* Implemented all the fixes requested by tuliren as per https://github.com/airbytehq/airbyte/pull/4329

* Renaming all the files

* Branch aligned to S3 0.1.7 (with Avro and Jsonl). Removed redundant file by making S3 a dependency for GCS

* Removed some additional duplicates between GCS and S3

* Revert changes in the root files

* Revert jdbc files

* Fix package names

* Refactor gcs config

* Format code

* Fix gcs connection

* Format code

* Add acceptance tests

* Fix parquet acceptance test

* Add ci credentials

* Register the connector and update documentations

* Fix typo

* Format code

* Add unit test

* Add comments

* Update readme

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

Co-authored-by: Marco Fontana <marco.fontana@sohohouse.com>
Co-authored-by: marcofontana.ing@gmail.com <marcofontana.ing@gmail.com>
Co-authored-by: Marco Fontana <MaxwellJK@users.noreply.github.com>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

* 🐛 CDK: Fix logging of initial state value (#4795)

* Update abstract_source.py

* bump

* CHANGELOG.md

Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>

* bug fix: use register api (#4811)

* 🐛  Add missing dependencies for acceptance tests to run. (#4808)

* 🎉 Add Python Destination Template (#4771)

* Format. (#4814)

* 🎉 Migrate config persistence to database (#4670)

* Implement db config persistence

* Fix database readiness check

* Reduce logging noise

* Setup config database in config persistence factory

* Update documentation

* Load seed from yaml files

* Refactor config persistence factory

* Add one more test to mimic migration

* Remove unnecessary changes

* Run code formatter

* Update placeholder env values

* Set default config database parameters in docker compose

Co-authored-by: Christophe Duong <christophe.duong@gmail.com>

* Default setupDatabase to false

* Rename variable

* Set default config db parameters for server

* Remove config db parameters from the env file

* Remove unnecessary environment statements

* Hide config persistence factory (#4772)

* Remove CONFIG_DATABASE_HOST

* Use builder in the test

* Simplify config persistence builder

* Clarify config db connection readiness

* Format code

* Add logging

* Fix typo

Co-authored-by: Christophe Duong <christophe.duong@gmail.com>

* Add a config_id only index

* Reuse record insertion code

* Add id field name to config schema

* Support data loading from legacy config schemas

* Log missing logs in migration test

* Move airbyte configs table to separate directory

* Update exception message

* Dump specific tables from the job database

* Remove postgres specific uuid extension

* Comment out future branch

* Default configs db variables to empty

When defaulting them to the jobs db variables, it somehow does not work.

* Log inserted config records

* Log all db write operations

* Add back config db variables in env file to mute warnings

* Log connection exception to debug flaky e2e test

* Leave config db variables empty

`.env` file does not support variable expansion.

Co-authored-by: Christophe Duong <christophe.duong@gmail.com>
Co-authored-by: Charles <giardina.charles@gmail.com>

* 🎉 Source intercom: migration to CDK (#4676)

* Added Intercom implementation

* Updated segments docs

* Updated _send_request method to new airbyte-cdk version

* Updated cursor field to datetime string

* Added filtering by state for incremental sync

* Updated cursor paths for test incremental sync

* Added dict type validation to get_data method

* Updated catalog

* Updated typing for start_date

* Updated singer seed to cdk seed

* Updated connector docs

* Updated sample config file

* Sorted streams alphabetically

* Removed placeholder comments

* Renamed rate_limit to queries_per_hour

* Updated common sleep time to backoff_time method

* 🎉 New source: Pipedrive connector (#4686)

* Add pipedrive source initial

* Add initial schemas.
Add MVP source implementation.

* Implement MVP streams

* Complete MVP streams implementation

* Apply schema format

* Add test creds

* Update streams.py
Fix schemas

* Update replication_start_date format.
Add extra pagination condition

* Refactor streams, remove unused classes.

* Add pipedrive.md docs file.
Add Pipedrive source definitions.

* Add json source definition.

* Update spec.json

* Add docs mentions throughout the project files

* Make number of Concurrent Jobs configurable. (#4687)

* Explicitly pin ec2 runner version to 2.2.1. (#4823)

This was a mishmash before, partially my fault. Explicitly pinning for now.

* 🐛 Source Facebook: Improve rate limit management (#4820)

* Improve rate limit management

* bump version

* facebook-marketing.md update the changelog

* format and fix

* Source Facebook: fix formatting and publish new version (#4826)

* format

* disable schema validation

* fix urls in AdCreatives stream, enable SAT for creatives

* format

Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>

* Code generator: Update generator to chown docs and config definition directories (#4819)

* Python Demo Destination: KVDB (#4786)

* 📚 CDK: Add python destination tutorial  (#4800)

* 📚 Source Shopify: migrate to new sandbox, update API version to 2021-07 (#4830)

(#4830) Source Shopify: migrate to new sandbox, update API version to 2021-07

Co-authored-by: Oleksandr Bazarnov <oleksandr.bazarnov@globallogic.com>

* 🐛 Source Instagram: Read previous state format and upgrade it (#4805)

* few fixes for user_insights state

* support old state format

* format

* bump

Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>

* Add placeholder (#4816)

* Add update button (#4809)

* Point to new location for connector build status history (#4840)

* Update GAds docs to indicate incremental support

* Add openreplay (#4685)

* Add openreplay

* Add env variables for openreplay

* Add openreplay env for k8s

* 🎉 Source mixpanel: migration to CDK (#4566)

* Mixpanel initiation

* copied schemas and specs file from singer connector

* authentication and a few streams

* Added Funnels + FunnelsList

* Added example of funnel response

* added incremental Funnels stream with tests

* added Annotations, CohortMembers, Engage, Cohorts, Funnels

* added Revenue

* fixed formatting

* fixed variable names

* fixed cohort_members and updated export streams

* moved start_date and date checks into SourceMixpanel class

* added error handling

* added unit test, update docs and ci creds

* fix url base for export stream

* added full and incremental read for export stream

* updated acceptance tests, added limit correction based on number of streams, export cursor is stored in datetime string

* Funnel stream - added complex state which contains state for each funnel

* added attribution windows support and project timezone config

* fixed formatting

* added default timezone

* added dynamic schema generation for Engage and Export streams

* fixed formatting

* fixed ability to pass start_date in datetime format as well

* fixed ability to pass start_date in datetime format as well

* added additional_properties field for dynamic schemas. updates regex for start_date matching to support old config file

* fixed formatting

* export stream - convert all values to default type - string

* added schema ref

* added new properties for funnel stream

* fixed formatting in funnel schema

* added build related files

* update changelog

* fixed and added comments, renamed rate_limit variable

* fixed formatting

* changed normalization for reserved mixpanel attributes like $browser

* alphabetise spec fields

* added description about API limit handling

* updated comment

* Add openreplay variable (#4844)

* 🐛  Sendgrid source: Gracefully handle malformed responses from sendgrid API (#4839)

* Update job description (#4848)

* Update job description

* Create senior-product-manager

* Create founding-account-executive

* Update senior-product-manager

* Update SUMMARY.md

* Add py destination tutorial to summary.md (#4853)

* Update CHANGELOG.md

* 🐛 Kube: Fix Source Ports not releasing. (#4822)

Closes #4660.

On further investigation, it turns out we were not releasing the source ports. This is because of how the Process abstraction works - waitFor calls close under the hood. We were only calling waitFor if the process was still alive. This is determined by the exitValue which comes from the Kubernetes pod's termination status. However, these ports are a local resource and no close calls means they were left dangling, leading to the behaviour we see today.

Explicitly call close after retrieving the exit value of the Kubernetes pod. This better follows traditional assumptions around Processes - if the process returns some exit value, it means all resources associated with that process have been cleaned up.

Also,
- add in a bunch of debug logging for the future.
- have better names for Kubernetes workers to make operations easier.
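The fix described above can be pictured with a small Python sketch. It is not the actual change, which lives in Airbyte's Kubernetes process code and is not shown here. `PortPool` and `PodProcess` are hypothetical names, and the sketch only assumes what the message states: ports are a local resource, and they should be released as soon as an exit value is known.

```python
from typing import List, Optional, Set


class PortPool:
    """Hypothetical pool of local ports that workers borrow and return."""

    def __init__(self, ports: List[int]) -> None:
        self._free: Set[int] = set(ports)

    def take(self) -> int:
        return self._free.pop()

    def give_back(self, port: int) -> None:
        self._free.add(port)

    def free_count(self) -> int:
        return len(self._free)


class PodProcess:
    """Hypothetical stand-in for a process backed by a Kubernetes pod."""

    def __init__(self, pool: PortPool) -> None:
        self._pool = pool
        self._port: Optional[int] = pool.take()
        self._exit_code: Optional[int] = None

    def mark_terminated(self, exit_code: int) -> None:
        # Simulates the pod reporting its termination status.
        self._exit_code = exit_code

    def close(self) -> None:
        # Idempotent: closing twice must not return the port twice.
        if self._port is not None:
            self._pool.give_back(self._port)
            self._port = None

    def exit_value(self) -> int:
        if self._exit_code is None:
            raise RuntimeError("process has not terminated yet")
        # Release local resources as soon as an exit value is known,
        # instead of relying on a later wait call to do it.
        self.close()
        return self._exit_code
```

With this shape, a caller that only ever asks for the exit value still gets the port back into the pool, which is the property the commit message argues for.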

* use new AMI ID for connector builds (#4855)

* Wait for config volume to be ready (#4835)

* Do not create config directory in fs persistence construction

* Run kube acceptance test only for testing purpose

* Wait for config volume to be ready

* Move config volume wait for fs persistence construction

* Restore ci workflow

* Prune imports

* 🎉 New source: US census (#4228)

Co-authored-by: Sherif Nada <snadalive@gmail.com>

* publish US Census (connector) (#4857)

Co-authored-by: Daniel Mateus Pires <dpires@earnestresearch.com>
Co-authored-by: Daniel Mateus Pires <dmateusp@gmail.com>

* 🐛 Source JIRA: Fix DBT failing normalization on `Labels` schema. (#4817)

(#4817) 🐛 Source JIRA: Fix DBT failing normalization on `Labels` schema. 

Co-authored-by: Oleksandr Bazarnov <oleksandr.bazarnov@globallogic.com>

* Rename founding-account-executive to founding-account-executive.md

* Tweak ConfigNotFoundException class (#4821)


* Use internal_api_host env variable

* Source ZenDesk: format and validate code

* refactor import / export endpoints to use the same code path as auto migration (#4797)

* fix build (#4865)

* 📝 Add server version requirement for mysql normalization (#4856)

* 🐛 Destination MySQL: fix problem if source has a column with json (#4825)

* [4583] Fixed MySQL destination failing if source has a column with json data

* hotfix: rename senior PM file to add .md

* 📚 improve mongo docs and param descriptions (#4870)

* Remove duplicated seed repository (#4869)

* add workspace helper (#4868)

* add workspace helper

* fmt

* switch to a fixed limit

* 🐛 Fix Oracle spec to declare `sid` instead of `database` param, Redshift to allow `additionalProperties`, MSSQL test and spec to declare spec type correctly (#4874)

* Kube: Better Port Abstraction. (#4829)

Introduce a better port abstraction whose primary purpose is to confirm that ports are released when the Kube Pod Process is closed.

This prevents issues like #4660

I'm also opening more ports so we can run at least 10 syncs in parallel.
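One way to read "a port abstraction whose primary purpose is to confirm that ports are released" is a lease that cannot outlive its scope. The Python sketch below is an illustration only: `PortLeases`, the port range, and the use of a context manager are assumptions, not the design used in the commit.

```python
import queue
from contextlib import contextmanager
from typing import Iterator


class PortLeases:
    """Hypothetical port pool; names and port range are illustrative only."""

    def __init__(self, first_port: int, count: int) -> None:
        self._ports: "queue.Queue[int]" = queue.Queue()
        for port in range(first_port, first_port + count):
            self._ports.put(port)

    def available(self) -> int:
        return self._ports.qsize()

    @contextmanager
    def lease(self) -> Iterator[int]:
        # Blocks until a port is free, then guarantees it is returned,
        # even if the body of the `with` block raises.
        port = self._ports.get()
        try:
            yield port
        finally:
            self._ports.put(port)
```

Under this sketch, the number of syncs that can run in parallel is bounded by how many ports the pool was created with, so opening more ports raises that bound.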

* Source Zendesk: update docs

* Remove unused files

* add a stream_slices logic for ticket_comments stream

* remove changes of other connections

* add secret Zendesk keys to command configs

* 🐛 Source Zendesk Support: add dummy unit test

* add dummy integration test

* fix Zendesk not loading username and facebook/twitter id #4373

* sort streams alphabetically

* fix test issue with the unsupported field validate_output_from_all_streams

* add info to source_definitions.yaml

* remove json_schema from configured_catalog.json

* add backoff logic

* add unit tests

* move part of unit tests to integration tests

* fix test dependencies

* add a build status

Co-authored-by: Maksym Pavlenok <maksym.pavlenok@globallogic.com>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>
Co-authored-by: LiRen Tu <tuliren@gmail.com>
Co-authored-by: vovavovavovavova <39351371+vovavovavovavova@users.noreply.github.com>
Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>
Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>
Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>
Co-authored-by: Shadab Mohammad <39692236+shadabshaukat@users.noreply.github.com>
Co-authored-by: midavadim <midavadim@yahoo.com>
Co-authored-by: Subodh Kant Chaturvedi <subodh1810@gmail.com>
Co-authored-by: Davin Chia <davinchia@gmail.com>
Co-authored-by: Oliver Meyer <42039965+olivermeyer@users.noreply.github.com>
Co-authored-by: Artem Astapenko <3767150+Jamakase@users.noreply.github.com>
Co-authored-by: Jenny Brown <85510829+airbyte-jenny@users.noreply.github.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
Co-authored-by: Jared Rhizor <jared@dataline.io>
Co-authored-by: Charles <giardina.charles@gmail.com>
Co-authored-by: Varun B Patil <varun.basavaraj.patil@gmail.com>
Co-authored-by: Dmytro <46269553+TymoshokDmytro@users.noreply.github.com>
Co-authored-by: Yaroslav Dudar <flayingfog@gmail.com>
Co-authored-by: Brian Krausz <briankrausz@gmail.com>
Co-authored-by: George Claireaux <george@claireaux.co.uk>
Co-authored-by: oleh.zorenko <19872253+Zirochkaa@users.noreply.github.com>
Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>
Co-authored-by: Eugene <etsybaev@gmail.com>
Co-authored-by: John Lafleur <john@airbyte.io>
Co-authored-by: Anna Lvova <37615075+annalvova05@users.noreply.github.com>
Co-authored-by: Marco Fontana <marco.fontana@sohohouse.com>
Co-authored-by: marcofontana.ing@gmail.com <marcofontana.ing@gmail.com>
Co-authored-by: Marco Fontana <MaxwellJK@users.noreply.github.com>
Co-authored-by: Christophe Duong <christophe.duong@gmail.com>
Co-authored-by: Serhii Lazebnyi <53845333+lazebnyi@users.noreply.github.com>
Co-authored-by: Vadym <vege1wgw@gmail.com>
Co-authored-by: Vladimir remar <vladimir.remar@gmail.com>
Co-authored-by: Oleksandr <bazarnov@me.com>
Co-authored-by: Oleksandr Bazarnov <oleksandr.bazarnov@globallogic.com>
Co-authored-by: Daniel Mateus Pires <dmateusp@gmail.com>
Co-authored-by: Daniel Mateus Pires <dpires@earnestresearch.com>
Co-authored-by: jrhizor <me@jaredrhizor.com>
2021-08-02 17:20:39 +03:00
oleh.zorenko
89ea3122db 🎉 New source: Chargebee (prepare connector for publishing) (#5067)
* Prepare Chargebee connector for publishing

* Update docs

Update `docs/SUMMARY.md` file.
Update `docs/integrations/README.md` file.

* Update changelog

* Implement change request

* Remove `name` field from streams

* Rename env var for Chargebee

Rename from `CHARGEBEE_TEST_CREDS` to `CHARGEBEE_INTEGRATION_TEST_CREDS`.

* Revert "Rename env var for Chargebee"

This reverts commit 7ddc6e0cb1.

* Revert "Revert "Rename env var for Chargebee""

This reverts commit 6df6751034.

* Add custom backoff handler

* Implement change request

* Add comment about why `order` is an empty stream

* Bump connector version
2021-07-30 23:09:18 +03:00
George Claireaux
d9f11bcf6a 🎉 New Source: S3 (+ abstract files source) (#4990)
* minor line length changes

* cdk generated source + oop structure + start of implementation

* fixed some broken syntax stuff

* pre-pyarrow convert

* introducing pyarrow

* skeleton for unit tests

* read working on multiple files

* incremental first draft

* blobfile -> fileclient

* change references of 'blob' to 'file'

* minor tidy to make draft PR

* fixes

* addressed review comments + more unit tests

* finished unit tests

* bugfixes and abstract integration tests framework

* remove old commented stuff

* docstrings

* restructure as source-s3

* Delete playground.py

* integration tests

* acceptance tests and some more reshuffling

* source S3 credentials

* change _airbyte_ columns to _ab_

* update spec with better descriptions and ordering

* created s3 source docs

* source definition

* reverse docstring change in cdk

* reverse docstring change

* reverse change

* reverse docstring change

* remove TODO comments

* add PR to changelog

* removed unused libraries

* formatting & address some review comments

* rename of files/classes for clarity

* addressing review comments

* address reviews

* add s3 source

* building spec with pydantic for provider-specific inheritance

* pydantic spec and improved path pattern with wcmatch.glob

* update path patterns info in doc

* formatting

* tests gzip and bz2 compression on csv

* updated compression support in doc

* forgot to upload bz2 test file

* added pattern validation to dataset

* formatting

* Format.

* ran testScaffoldTemplates & generated this diff

* bumped version because of documentationUrl fix

Co-authored-by: Davin Chia <davinchia@gmail.com>
2021-07-30 15:06:11 +01:00
Dmytro
82867c094a 🎉 New Source: Snapchat Marketing API (#4843) 2021-07-28 22:50:03 +03:00
Charles
b52bdc27ec Bump version: 0.28.1-alpha → 0.28.2-alpha (#5056) 2021-07-28 11:01:16 -07:00
Davin Chia
b6fe893c3e Publish jars on release. (#5046)
- Modify gradle script to add publishing task to all subprojects that we want to publish.
- Modify release script to publish jars on release.
2021-07-28 18:44:36 +08:00
Christophe Duong
9038b5e98f Fix normalization integration tests in CI (#4910) 2021-07-27 10:48:36 +02:00
Subodh Kant Chaturvedi
13bc08c417 implementation to run kube acceptance tests on GKE (#4961)
* change acceptance tests to use GKE resources

* make CI changes + introduce script

* fix path

* fix indentation

* fix indentation

* set home directory

* use another action

* fix sed command

* setup kustomize

* install kubectl

* clean up

* address review comments

* remove comments

* add master and schedule check

* use correct if condition

* fix if condition

* update script and re-test

* format

* fix compilation error

* add always clause

* move gke kube test to a separate file

* remove un-necessary line
2021-07-27 14:17:52 +05:30
Davin Chia
6565a23af5 Test Kube Log4j2 Logging Configuration (#4972)
- Use JUnit 5 tags for a cleaner build set up + make sure tests are not run unnecessarily.
- The various env set ups are replicated in the cloud_storage_logging_test.sh script. Since there isn't an easy way to dynamically inject env vars from within a JVM, this was the cleanest way of doing this.
2021-07-27 13:47:51 +08:00
Charles
4c8ce600a7 Create a test container for running Airbyte with docker-compose (#4970) 2021-07-26 13:19:41 -07:00
LiRen Tu
e8f20b286a Create jobs database tables without init container (#4942)
* Refactor jobs and configs database initialization

* Add unit tests

* Format code

* Refactor code

* Update document

* Fix tests

* Add back init script to create db and user permission

* Remove old schema files

* Dry database instance implementations

* Revert unnecessary changes

* Rename resource directories

* Format code

* Add readme

* Move and rename database schema to jobs database schema

* Introduce table schema interface

* Rearrange packages

* Format code

* Address review comments

* Show more logs for acceptance test

* Do not depend on service uuid for db readiness
2021-07-25 22:33:35 -07:00
Davin Chia
ec123a3a5b Use Cloudrepo.io to host temporary built jars. (#4949)
Continuation of my saga to publish a public jar while we wait for the upstream fix to be merged in.

Turns out using the GCS bucket as a public maven repository still requires GCS login. This is not feasible since it requires all our community contributors to have and sign into gcloud. This also required injecting this everywhere in the build and was extremely ugly.

We could publish to the central maven repository, but it takes 2 days for us to get an OSS account. We'd also have to dedup on artifact id.
I explored GCP's Artifact Registry, but I didn't manage to set it up after an hour. It is also only possible to upload via Maven/Gradle and there were a lot of steps to jump through to authenticate.
I went with CloudRepo as they were the best in terms of set up, UX and performance. I was able to set something up in 10 mins. I was able to upload my Jars manually via the UI. CloudRepo also supports public read-only repositories, which is exactly what we need (publish is still gated on authentication). Ran into a slight hiccup and support got back to me in 30 mins. Frankly think we should consider using them for our private cloud. They are also fairly cheap. Anyone who wants to upload jars will have to use credentials that I've added to the shared-engineering folder. Just navigate to the CloudRepo url.
2021-07-24 14:08:18 +08:00
Sherif A. Nada
804b5607cc remove unused secrets from ci_credentials.sh (#4957) 2021-07-23 13:26:03 -07:00
vovavovavovavova
c403fc230e New Source Cart: Base (#4754)
* pushing auto-generated files for cart source

* New Source Cart: Implementation (#4785)

* save the setup

* preparing the streams::in progress

* adjust new source streams.py

* upd spec

* save schemas without nullable

* fixing with schemas

* save before test

* successful check

* fix discover

* rename wrong name

* read+

* upd configured_catalog

* format

* fixing schemas (doc mismatch a bit with responses also with fields presence) + fix abnormal json

* modify configured_catalog

* gradle jsons

* requested changes

* upd wording

* increase timeouts

* if Cart store name is incorrect: improve user hint && upd failure timeout to be able to check wrong store name (not only wrong token which is fast)

* cleanup spec.json

* latest upd

* New Source Cart (#4588): Documentation (#4859)

* save docs

* forgotten file name

* finish docs?

* Update docs/integrations/sources/cart.md

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Update docs/integrations/sources/cart.md

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Update docs/integrations/sources/cart.md

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Update docs/integrations/sources/cart.md

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* merge && resolve conflict

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* apply add abs_property

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* rm outdated in acceptance.yaml

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>
2021-07-23 12:37:27 +03:00
Davin Chia
73e6f576bb Fix Kube S3 Logging Bug. (#4914)
See bluedenim/log4j-s3-search#101 for a summary.

Because this is upstream, I built the jar locally and uploaded it to a temp gcs bucket serving as a maven repository.

I've added this location and the new version in the build.gradle.

I want to merge this in now since there are several users waiting on this. Created #4915 to follow up with the OSS PR and clean up work.
2021-07-23 12:34:35 +08:00
Andrii Leonets
4c0d1a0399 🎉 New Source: Big Query (#4457)
New Source: BigQuery
2021-07-22 15:39:37 +03:00
Daniel Mateus Pires
c4de62e77a 🎉 New source: US census (#4228)
Co-authored-by: Sherif Nada <snadalive@gmail.com>
2021-07-20 00:17:53 -07:00
midavadim
472857d672 🎉 Source mixpanel: migration to CDK (#4566)
* Mixpanel initiation

* copied schemas and specs file from singer connector

* authentication and a few streams

* Added Funnels + FunnelsList

* Added example of funnel response

* added incremental Funnels stream with tests

* added Annotations, CohortMembers, Engage, Cohorts, Funnels

* added Revenue

* fixed formatting

* fixed variable names

* fixed cohort_members and updated export streams

* moved start_date and date checks into SourceMixpanel class

* added error handling

* added unit test, update docs and ci creds

* fix url base for export stream

* added full and incremental read for export stream

* updated acceptance tests, added limit correction based on number of streams, export cursor is stored in datetime string

* Funnel stream - added complex state which contains state for each funnel

* added attribution windows support and project timezone config

* fixed formatting

* added default timezone

* added dynamic schema generation for Engage and Export streams

* fixed formatting

* fixed ability to pass start_date in datetime format as well

* fixed ability to pass start_date in datetime format as well

* added additional_properties field for dynamic schemas. updates regex for start_date matching to support old config file

* fixed formatting

* export stream - convert all values to default type - string

* added schema ref

* added new properties for funnel stream

* fixed formatting in funnel schema

* added build related files

* update changelog

* fixed and added comments, renamed rate_limit variable

* fixed formatting

* changed normalization for reserved mixpanel attributes like $browser

* alphabetise spec fields

* added description about API limit handling

* updated comment
2021-07-20 02:31:13 +03:00
Sherif A. Nada
a4e4b0289b Point to new location for connector build status history (#4840) 2021-07-19 15:31:27 -07:00
Sherif A. Nada
a4bb30430d Python Demo Destination: KVDB (#4786) 2021-07-19 09:44:36 -07:00
Vadym
24be682163 🎉 New source: Pipedrive connector (#4686)
* Add pipedrive source initial

* Add initial schemas.
Add MVP source implementation.

* Implement MVP streams

* Complete MVP streams implementation

* Apply schema format

* Add test creds

* Update streams.py
Fix schemas

* Update replication_start_date format.
Add extra pagination condition

* Refactor streams, remove unused classes.

* Add pipedrive.md docs file.
Add Pipedrive source definitions.

* Add json source definition.

* Update spec.json

* Add docs mentions throughout the project files
2021-07-19 15:08:23 +03:00
Serhii Lazebnyi
9b2388cd31 🎉 Source intercom: migration to CDK (#4676)
* Added Intercom implementation

* Updated segments docs

* Updated _send_request method to new airbyte-cdk version

* Updated cursor field to datetime string

* Added filtering by state for incremental sync

* Updated cursor paths for test incremental sync

* Added dict type validation to get_data method

* Updated catalog

* Updated typing for start_date

* Updated singer seed to cdk seed

* Updated connector docs

* Updated sample config file

* Sorted streams alphabetically

* Removed placeholder comments

* Renamed rate_limit to queries_per_hour

* Updated common sleep time to backoff_time method
2021-07-19 14:14:51 +03:00
LiRen Tu
e577b4987e 🎉 Migrate config persistence to database (#4670)
* Implement db config persistence

* Fix database readiness check

* Reduce logging noise

* Setup config database in config persistence factory

* Update documentation

* Load seed from yaml files

* Refactor config persistence factory

* Add one more test to mimic migration

* Remove unnecessary changes

* Run code formatter

* Update placeholder env values

* Set default config database parameters in docker compose

Co-authored-by: Christophe Duong <christophe.duong@gmail.com>

* Default setupDatabase to false

* Rename variable

* Set default config db parameters for server

* Remove config db parameters from the env file

* Remove unnecessary environment statements

* Hide config persistence factory (#4772)

* Remove CONFIG_DATABASE_HOST

* Use builder in the test

* Simplify config persistence builder

* Clarify config db connection readiness

* Format code

* Add logging

* Fix typo

Co-authored-by: Christophe Duong <christophe.duong@gmail.com>

* Add a config_id only index

* Reuse record insertion code

* Add id field name to config schema

* Support data loading from legacy config schemas

* Log missing logs in migration test

* Move airbyte configs table to separate directory

* Update exception message

* Dump specific tables from the job database

* Remove postgres specific uuid extension

* Comment out future branch

* Default configs db variables to empty

When defaulting them to the jobs db variables, it somehow does not work.

* Log inserted config records

* Log all db write operations

* Add back config db variables in env file to mute warnings

* Log connection exception to debug flaky e2e test

* Leave config db variables empty

`.env` file does not support variable expansion.
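To illustrate why the variables are left empty rather than pointed at the jobs db values, here is a minimal Python sketch of a reader that, like the `.env` handling described above, performs no variable expansion. `parse_env_file` is a hypothetical helper and the variable names in the sample are used for illustration only.

```python
from typing import Dict


def parse_env_file(text: str) -> Dict[str, str]:
    """Hypothetical minimal .env reader that performs no variable expansion."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        # Skip blanks, comments, and lines that are not KEY=VALUE pairs.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


sample = (
    "DATABASE_USER=docker\n"
    "CONFIG_DATABASE_USER=${DATABASE_USER}\n"
    "CONFIG_DATABASE_PASSWORD=\n"
)
parsed = parse_env_file(sample)
```

In this sketch `parsed["CONFIG_DATABASE_USER"]` is the literal text `${DATABASE_USER}`, not `docker`, while the empty entry stays an empty string that the application can treat as "fall back to a default".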

Co-authored-by: Christophe Duong <christophe.duong@gmail.com>
Co-authored-by: Charles <giardina.charles@gmail.com>
2021-07-19 03:52:40 -07:00
LiRen Tu
a8308c4061 🎉 New Destination: Google Cloud Storage (#4784)
* Adding Google Cloud Storage as destination

* Removed few comments and amended the version

* Added documentation in docs/integrations/destinations/gcs.md

* Amended gcs.md with the right pull id

* Implemented all the fixes requested by tuliren as per https://github.com/airbytehq/airbyte/pull/4329

* Renaming all the files

* Branch aligned to S3 0.1.7 (with Avro and Jsonl). Removed redundant file by making S3 a dependency for GCS

* Removed some additional duplicates between GCS and S3

* Revert changes in the root files

* Revert jdbc files

* Fix package names

* Refactor gcs config

* Format code

* Fix gcs connection

* Format code

* Add acceptance tests

* Fix parquet acceptance test

* Add ci credentials

* Register the connector and update documentations

* Fix typo

* Format code

* Add unit test

* Add comments

* Update readme

Co-authored-by: Sherif A. Nada <snadalive@gmail.com>

Co-authored-by: Marco Fontana <marco.fontana@sohohouse.com>
Co-authored-by: marcofontana.ing@gmail.com <marcofontana.ing@gmail.com>
Co-authored-by: Marco Fontana <MaxwellJK@users.noreply.github.com>
Co-authored-by: Sherif A. Nada <snadalive@gmail.com>
2021-07-16 19:22:12 -07:00
Anna Lvova
28239a87a1 🎉 Stripe Source: Fix subscriptions stream to return all kinds of subscriptions (including expired and canceled) (#4669)
#4669 Stripe Source: Fix subscriptions stream to return all kinds of subscriptions (including expired and canceled)
Co-authored-by: Oleksandr Bazarnov <oleksandr.bazarnov@globallogic.com>
2021-07-16 20:49:57 +03:00
Charles
76adc25820 Split Platform and Connector Builds (#4514) 2021-07-15 10:49:15 -07:00
Jared Rhizor
baed7b4997 use kube service user for pod sweeper (#4737)
* use kube service user for pod sweeper

* add pod sweeper logs

* temporarily switch to stable for testing

* temporarily remove building steps for kube testing since it can use prod images

* output date strings from date command

* load stable images

* remove loading since it can pull the images

* increase window for success storage to two hours

* revert test logging changes
2021-07-14 16:29:10 -07:00
Yaroslav Dudar
20f9c2da62 🎉 New source: Typeform (#4541)
Typeform source: Forms and Responses streams
2021-07-13 13:06:50 +03:00
Sherif A. Nada
168b0b3aa1 Periodic connector tests workflow: add Accept header per github docs recommendation (#4722) 2021-07-12 23:23:24 -07:00
vovavovavovavova
db223a4d06 🎉 New Source: Zendesk Sunshine (#4359)
* pre-PR

* add git config

* format

* Update airbyte-integrations/connectors/source-zendesk-sunshine/requirements.txt

upd requirements.txt remove extra

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* Update airbyte-integrations/connectors/source-zendesk-sunshine/source_zendesk_sunshine/streams.py

backoff time int to float (btw real return type in headers is integer)

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>

* requested changes

* fix newline absence && rm unnecessary temp file

* url_base to property

* rm extra var coming property

* rm extra var coming property

* save

* finishing updating the documentation

* forgotten definition

* add nullable to pass the test

* fix date in the log

Co-authored-by: Eugene Kulak <widowmakerreborn@gmail.com>
2021-07-09 09:55:40 +03:00
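The Zendesk Sunshine entry above notes that the backoff time was changed from int to float even though the header value is an integer. A minimal Python sketch of that conversion follows. `backoff_seconds` is a hypothetical helper, and the `Retry-After` header name is an assumption, since the commit message does not say which header is read.

```python
from typing import Mapping, Optional


def backoff_seconds(headers: Mapping[str, str], default: float = 1.0) -> Optional[float]:
    """Hypothetical helper: seconds to wait before retrying, as a float."""
    raw = headers.get("Retry-After")  # assumed header name
    if raw is None:
        return None  # caller falls back to its own backoff policy
    try:
        # The header carries an integer, but callers expect a float.
        return max(float(raw), 0.0)
    except ValueError:
        # Retry-After may also be an HTTP date; this sketch does not parse it.
        return default
```

Returning `None` when the header is absent lets the caller distinguish "the server gave no hint" from "wait zero seconds".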
midavadim
ca3befee14 🎉 New Source: Paypal Transaction (#4240)
* Added spec.json

* Initialization

* added oauth2 authorization

* added spec, check, discover + catalogs/configured_catalogs

* updated request_params

* added paging, slicing (1d)

* Use oauth2 for paypal

* incremental sync, acceptance test

* incremental sync, acceptance test

* Added spec.json

* Initialization

* added oauth2 authorization

* added spec, check, discover + catalogs/configured_catalogs

* updated request_params

* added paging, slicing (1d)

* Use oauth2 for paypal

* incremental sync, acceptance test

* incremental sync, acceptance test

* Added spec.json

* Initialization

* added oauth2 authorization

* added spec, check, discover + catalogs/configured_catalogs

* updated request_params

* added paging, slicing (1d)

* Use oauth2 for paypal

* incremental sync, acceptance test

* updated slices and api limits, added validation for input dates

* added tests, fixed cursor related information in schemas and configured catalogs, removed old comments, re-arranged Base PaypalTransactionStream class

* added input param 'env' to support production and sandbox envs

* added support for sandbox option, updated pattern for optional end date option

* added github secrets

* added support for sandbox option, updated pattern for optional end date option

* fixed Copyright date, removed debug messages

* added docs

* fix for test failure - The sync should produce at least one STATE message

* removed optional parameter 'end_date'

* removed detailed info about balances schema

* Delete employees.json

* Delete customers.json

* Added requests_per_minute rate limit

* added unit tests, added custom backoff

* added test for stream slices with stream state

* removed comments

* updated docs pages

* fixed format for json files

* fixed types in schemas and link to the schema. fixed primary key for Transactions stream

* updated stream slices

* Updated tests, unified stream_slices for both streams, all instance variables instantiated directly in __init__ method

* added CHANGELOG.md

* Added build seeds

* fixed closing double quotation mark

* added paypal entry in builds.md

* add fixture helper

* added paypal transaction generator script

* fixed styling

* maximum allowed start_date is extracted from API response now.

* fixed schemas

* fixed schemas - removed datetime

* now maximum_allowed_start_date is identified by last_refreshed_datetime attr in API response.

* added possibility to specify additional properties

Co-authored-by: Sherif Nada <snadalive@gmail.com>
2021-07-08 18:47:00 +03:00
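The Paypal Transaction entry above mentions "slicing (1d)". A minimal Python sketch of one-day slicing over a date range is shown below. `daily_slices` and the `start_date` / `end_date` keys are hypothetical and are not taken from the connector's code.

```python
from datetime import datetime, timedelta
from typing import Dict, Iterator


def daily_slices(start: datetime, end: datetime) -> Iterator[Dict[str, str]]:
    """Yield consecutive [start_date, end_date) windows of at most one day."""
    step = timedelta(days=1)
    cursor = start
    while cursor < end:
        # The last window is shortened so it never runs past `end`.
        upper = min(cursor + step, end)
        yield {"start_date": cursor.isoformat(), "end_date": upper.isoformat()}
        cursor = upper
```

Small windows like this keep each API request bounded and give an incremental sync a natural point to record state after every slice.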
Dmytro
b8b205a3ff 🎉 New source: Square (#4439) 2021-07-07 20:20:18 +03:00