1
0
mirror of synced 2026-01-06 15:03:36 -05:00
Commit Graph

61 Commits

Author SHA1 Message Date
N-o-Z
9b57d972af 🐛 Fix S3 destination integration tests to use bucket path (#18031)
* Fix S3 destination tests

* Add Changelog

* CR Fixes

* add unit test

* version bump

* fix dockerfile

* auto-bump connector version

Co-authored-by: Edward Gao <edward.gao@airbyte.io>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-10-25 15:21:32 -07:00
Ryan Fu
7a7091e2dc Updated s3.md changelog to include changes for enforcing TLS (#17545) 2022-10-03 17:54:52 -07:00
Ryan Fu
1d956df434 🚨 Add SSL documentation and check logic for S3 Destination 🚨 (#17340)
* Adds logic to fail upon non-deterministic custom S3 endpoint and documentation for insecure settings

* Reused config factory settings to a single static variable

* Updated error message and example in the spec.json to match expectation of secured endpoint

* Added validation check within the base s3

* Integrated AdaptiveDestinationRunner with S3Destination

* Reduced visibility for testing and fixed AdaptiveDestinationRunner issue

* Adds speicifc secure protocol with S3 and empty endpoint check

* Bumps docker version and adds comments and clearer string methods

* auto-bump connector version [ci skip]

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-10-03 13:56:58 -07:00
Nugusbayev Kanagat
3494405078 🎉 New Destination: R2 Cloudflare (#15296)
* added destination-r2 spec

added base-java-s3 module

updated common s3 lib

local

changed dependency of s3 in all realted modules

minor style fixes

added common module for s3 integration tests

* fix configuration for r2 integration tests

* fix configuration for r2 integration tests

* added docs for destination-r2

* minor import fixes

* mark test as disabled

* fixed imports in destination-snowflake

* restored styling task

* added tests for S3DEstinationConfig

* added upload threads count configuration for r2(due to limitations)

* deleted parquet format type

* Fix import in gcs

* Fix import in redshift

* Fix import in snowflake

* Fix one more import in gcs

* Fix one more import in redshift

* Fix import in databricks

Co-authored-by: Liren Tu <tuliren.git@outlook.com>
2022-09-25 19:20:47 -07:00
Liren Tu
fd547c64d7 🐛 Avro related destinations: fix field name clashing with combined restrict (#16243)
* Add test case for allOf and oneOf

* Bump version

* Add pr id

* Bump gcs version

* Bump dest jdbc

* Bump redshift

* Bump snowflake

* Bump databricks

* Bump bigquery

* Revert "Bump dest jdbc"

This reverts commit f10497e96a.

* Use a new pat to avoid api rate limit

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* Revert databricks bump

* auto-bump connector version [ci skip]

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-09-01 18:06:27 -07:00
Haithem SOUALA
7834eefb96 :fix: Destination S3: connector check ignores the s3_bucket_path (#15207)
* fix: use correct var for s3 bucket path

* typo

* fix - use bucket path instead of bucket name

* fix: use correct var for s3 bucket path

* typo

* fix - use bucket path instead of bucket name

* add fix to changelog

* solve conflict

* solve conflict

* fix - use bucket path instead of bucket name

* solve md file conflict

* solve dockerfile conflict

* auto-bump connector version [ci skip]

* add eof

Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
Co-authored-by: marcosmarxm <marcosmarxm@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-08-24 06:46:29 -03:00
Jagruti Tiwari
8288c16485 fix: replace airbyte oss with airbyte open source (#15885)
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2022-08-24 01:01:53 -03:00
Bhupesh Varshney
4b4b4993c1 Docs: fix broken link to parquet docs (#15632) 2022-08-15 08:02:38 -03:00
VitaliiMaltsev
a280113cce Destination S3: add LZO compression support (#15394)
* Fixed bucket naming for S3

* Destination S3: add LZO compression support for parquet files

* Destination S3: add LZO compression support for parquet files

* implemented logic for aarch64

* removed redundant logging

* updated changelog

* moved intstall of native-lzo lib to Dockerfile

* removed redundant logging

* add unit test for aarch64

* bump version

* auto-bump connector version [ci skip]

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-08-10 20:44:51 +03:00
Greg Solovyev
95aeb39325 Fix typo in change log (#15343)
* Fix typo in several change logs

Co-authored-by: andrii.leonets <aleonets@gmail.com>
2022-08-05 09:13:24 -07:00
oneshcheret
823a79b69a S3, Databricks and Gcs destinations fix test and publish (#15360)
* Postgres source added items for array data type

* Postgres source updated tests for array data type

* S3 destination fix key pair oauth test

* S3 destination clean code

* S3 destination bump version

* S3 destination bump version

* Databricks and gcs destinations bump versions

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-08-05 17:21:56 +03:00
Andrii Leonets
e7b9e00949 Fix multiply log bindings (#14801)
* Fix multiply log bindings

* Exclude slf4j-reload4j

* Exclude slf4j-log4j12 for debezium

* Increase version for debezium related sources and json converter related destinations

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

* rebump bigquery versions

* mark destinations s3, gcs as unpublishd

* auto-bump connector version [ci skip]

* auto-bump connector version [ci skip]

Co-authored-by: subodh <subodh1810@gmail.com>
Co-authored-by: Greg Solovyev <grishick@users.noreply.github.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: Edward Gao <edward.gao@airbyte.io>
2022-08-03 09:36:30 -07:00
Alexander Tsukanov
82297a9b18 🎉Redshift, Databricks, Snowflake, S3 Destinations: Make S3 output filename configurable (#14494)
airbyte-4184: Make S3 output filename configurable
2022-07-15 15:56:21 +03:00
Topher Lubaway
9c6c092a22 Revert "Improving docusaurus sidebar generation (#1927) (#14369)" (#14596)
This reverts commit a2c194a11f.
2022-07-11 15:27:14 -05:00
Mykyta Serbynevskiy
a2c194a11f Improving docusaurus sidebar generation (#1927) (#14369)
* Improving docusaurus sidebar generation (#1927)

* Added "Career & open positions" folder to sidebar, adjusted "Project overview" folder

* Deleted "career-and-open-positions" folder from sidebar
2022-07-08 14:18:27 -05:00
Liren Tu
98d087ba3d Release new destination s3 version to pick up latest change (#14332)
* Bump s3 version

* Update pr id

* auto-bump connector version

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-06-30 21:58:53 -07:00
VitaliiMaltsev
11ea066f47 Remove "additionalProperties": false from specs for connectors with staging (#14114)
* Remove "additionalProperties": false from spec for connectors with staging

* Remove "additionalProperties": false from spec for Redshift destination

* bump versions

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-06-24 20:04:04 +03:00
VitaliiMaltsev
c283d9d159 Deprecate PART_SIZE_MB in connectors using S3/GCS storage (#13753)
* Removed part_size from connectors that use StreamTransferManager

* fixed S3DestinationConfigTest

* fixed S3JsonlFormatConfigTest

* upadate changelog and bump version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* upadate changelog and bump version for Redshift and Snowflake destinations

* auto-bump connector version

* fix GCS staging test

* fix GCS staging test

* auto-bump connector version

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-06-20 14:56:33 +03:00
oneshcheret
8e54f4fd6e S3 and GCS destinations: Updating processing data types for Avro/Parquet formats (#13483)
* S3 destination: Updating processing data types for Avro/Parquet formats

* S3 destination: handle comparing data types

* S3 destination: clean code

* S3 destination: clean code

* S3 destination: handle case with unexpected json schema type

* S3 destination: clean code

* S3 destination: Extract the same logic for Avro/Parquet formats to separate parent class

* S3 destination: clean code

* S3 destination: clean code

* GCS destination: Update data types processing for Avro/Parquet formats

* GCS destination: clean redundant code

* S3 destination: handle case with numbers inside array

* S3 destination: clean code

* S3 destination: add unit test

* S3 destination: update unit test cases with number types.

* S3 destination: update unit tests.

* S3 destination: bump version for s3 and gcs

* auto-bump connector version

* auto-bump connector version

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-06-14 14:15:58 +03:00
Tuhai Maksym
70d48fa874 Destination-S3: Remove configurable part size (#13043)
* Destination-S3: Remove configurable part size

* Changelog updated

* auto-bump connector version

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-05-20 22:48:55 +03:00
Andy
a3658ba13a Update spec to replace markdown (#12797)
* Update spec to replace markdown

* Update version

* S3 Destination: Update change log

* auto-bump connector version

Co-authored-by: oneshcheret <sashkaliife@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-05-12 19:16:00 +03:00
LiRen Tu
bd59726668 Publish new connectors to log offending json fields in json to avro conversion for debugging (#12578)
* Publish new connectors to log debugging info in json to avro conversion

* Add pull request id

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

* auto-bump connector version

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-05-04 11:47:07 -07:00
oneshcheret
fd7f21e071 S3 destination: updating docs regarding certification (#11966)
* S3 destination: updating docs regarding certification

* S3 destination: updating docs by new template

* Apply suggestions from code review

Co-authored-by: Andy <andy@airbyte.io>

* S3 destination: updating docs by new template

Co-authored-by: Andy <andy@airbyte.io>
2022-04-29 18:17:13 +03:00
LiRen Tu
9a0442c143 🎉 Destination s3 / gcs: add option for uncompressed csv and jsonl format (#12167)
* Add gzip compression option

* Add file extension method to s3 format config

* Pass gzip compression to serialized buffer

* Add unit test

* Format code

* Update integration test

* Bump version and update doc

* Fix unit test

* Add extra gzip tests for csv and jsonl

* Make compression an oneOf param

* Migrate csv config to new compression spec

* Migrate jsonl config to new compression spec

* Update docs

* Fix unit test

* Fix integration tests

* Format code

* Bump version

* auto-bump connector version

* Bump gcs version in seed

Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2022-04-22 15:38:28 -07:00
George Claireaux
e57ac9e4bb Destination GCS & S3: bump version (#12274)
* bump version for GCS & S3 Destination

* auto-bump connector version

* gcs version bump

* Update docs

Co-authored-by: subodh <subodh1810@gmail.com>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: Liren Tu <tuliren.git@outlook.com>
2022-04-22 15:10:53 -07:00
andriikorotkov
4ac5d3c8e4 updated docs (#11840) 2022-04-15 17:06:48 +03:00
Christophe Duong
393ba35626 🐛 fix s3/gcs bucket cleanup (#11728)
* Restrict bucket clean up

* bumpversion

* Fix GCS acceptance tests
2022-04-05 18:46:21 +02:00
Christophe Duong
18043c8448 🐛 Make s3 file name backward compatible with <0.2.12 (#11666)
* fix s3 file name
2022-04-04 14:18:14 +02:00
Christophe Duong
f0256626b6 🐛 Fix bucket path for destination s3 (#11496)
* Fix bucket path for destination s3

* bumpversion
2022-03-29 11:39:26 +02:00
Christophe Duong
848bb349b5 🎉 Change destination-s3 buffering to reduce/stabilize memory/thread consumption (#11294)
* Refactor destination-s3 to use the new serialization strategy and get memory usage under control
2022-03-28 17:40:44 +02:00
Eugene
bf9a189be9 🎉Destination-s3: added support for AWS Glue crawler (#11173)
* [4654] Destination-s3: added support for AWS Glue crawler
2022-03-23 20:31:47 +02:00
Greg Solovyev
f48a6a0909 destination-s3: add a test for listObjects permission on destination bucket (#10856)
* Add a test for listObjects permission to destination-s3 connector

* add testIAMUserHasListObjectPermission method to S3Destination
  and call this method from S3Destination::check. Method throws
  an exception if IAM user does not have listObjects permission
  on the destination bucket

* add a unit test to S3DestinationTest to verify that S3Destination::check
  fails if listObjects throws an exception

* add a unit test to S3DestinationTest to verify that S3Destination::check
  succeeds if listObjects succeeds

* Add S3DestinationConfigFactory in order to be able to mock S3 client
  used in S3Destination::check

* Addressing review comments:

 - separate positive and negative unit tests
 - fix formatting
 - reuse s3 client for both positive and negative tests

* Add information about PR #10856 to the changelog

* Prepare for publishing new version:
 * Bump version to 0.2.10 in Dockerfile
 * Bump version to 0.2.10 in changelog

* Update destination-s3 version in connector index

* Update seed spec for destination-s3 connector
2022-03-11 15:12:07 -08:00
andriikorotkov
838ce1418f 🐛 Prevented double slashes in S3 destination path (#10318)
* Prevented double slashes in S3 destination path

* added test and update documentation

* updated documentation
2022-02-15 11:38:59 +02:00
Lake Mossman
3d8a0dc048 Add ExitOnOutOfMemoryError to java connectors and bump versions (#10256) 2022-02-14 15:49:15 -08:00
Álvaro Queiroz
41f89d1ab2 Destination S3: use instanceprofile if credentials are not provided (#9399)
* use instanceprofile to auth if id is not provided

* restore support for using endpoint

* update readme

* update changelog

* update documentation, add setup guide

* Update docs/integrations/destinations/s3.md

Co-authored-by: Edward Gao <edward.gao@airbyte.io>

* minor fixes

* add error message

* now using RuntimeException

* Update airbyte-integrations/connectors/destination-s3/src/main/java/io/airbyte/integrations/destination/s3/S3DestinationConfig.java

Co-authored-by: Edward Gao <edward.gao@airbyte.io>

* bump connector version

* update seed file

Co-authored-by: Edward Gao <edward.gao@airbyte.io>
Co-authored-by: Marcos Marx <marcosmarxm@gmail.com>
2022-01-13 21:32:26 -03:00
Andrii Leonets
c904b44998 BigQuery Destination : Fix GCS processing of Facebook data (#9415)
* Fix GCS Avro file processing with invalid "-" character

* Extend test data to cover the case

* incr ver

* s3 ver upd

* add dependency

* add dependency
2022-01-13 19:16:20 +02:00
LiRen Tu
5f6785dcbf 🐞 Destination S3 & GCS Avro: support array with unknown item type (#9367)
* Support array field with empty items specification

* Remove all exceptions

* Format code

* Bump connector versions

* Bump bigquery versions

* Update docs

* Remove unused code

* Update doc for PR #9363

* Update doc about defaulting all improperly typed fields to string

* Ignore bigquery

* Update version and doc

* Update doc

* Bump version in seed
2022-01-11 23:12:37 -08:00
VitaliiMaltsev
a3100cba8f 🐛 Destination S3: avro and parquet formats have issues with JsonToAvroSchemaConverter (#8574)
* add namespace to avro record type

* refactoring

* Add unit tests

* added CHANGELOG

* fix typo in method name

* fix typo in method name

* fix for jdk 17

* created recursive keys adding

* refactoring

* format code

* cleanup Dockerfile

* refactoring

* removed unneded tests case

* updated namespace generation

* removed unneeded method from AvroNameTransformer

* resolved merge conflicts

* removed unused imports

* reformat the code

* bump version

* bump Bigquery Denormalized version

Co-authored-by: vmaltsev <vitalii.maltsev@globallogic.com>
Co-authored-by: Oleksandr Sheheda <alexandrshegeda@gmail.com>
2021-12-21 17:51:48 +02:00
LiRen Tu
9b75a76720 📑 Update source database performance test docs (#8980)
* Add missing pr number

* Update source database performance test docs

* Update doc

* Update script paths
2021-12-20 20:49:22 -08:00
LiRen Tu
a6c2d058a9 🐞 Destination S3 & GCS: remove excessive logging (#8974)
* Bump s3 and gcs version

* Bump version in seed
2021-12-20 17:47:55 -08:00
Edward Gao
7038533ae0 🎉 Destination Redshift (copy): accept bucket path for staging data (#8607) 2021-12-16 16:41:22 -08:00
Edward Gao
0b8c0f5dff Bump connector versions for publishing (#8717)
version bumps for https://github.com/airbytehq/airbyte/pull/8562
2021-12-10 19:27:26 -08:00
Edward Gao
fc91f67747 move S3Config into destination-s3; update dependencies accordingly (#8562) 2021-12-10 15:51:25 -08:00
LiRen Tu
804fe6bcb7 Bump s3 version to remove excessive logging (#8501)
* Bump s3 version to remove excessive logging

* Bump seed version and update spec
2021-12-03 15:41:14 -08:00
VitaliiMaltsev
065bbf60da 🎉 Destination S3 Support writing timestamps (#7732)
* get date-time format form json schema

* created universal date-time converter

* implemented jsonnode transformation for avro and parquet

* removed unneeded dependency from build.gradle

* fix checkstyle

* add DateTimeUtilsTest

* add AvroRecordHelperTest

* resolve merge conflicts | fix checkstyle

* update LocalTime parsing

* added String type to avro schema for Logical Types, removed date-time conversion

* fix checkstyle

* fix checkstyle

* added static String schema, added comments

* bump version

Co-authored-by: vmaltsev <vitalii.maltsev@globallogic.com>
2021-11-30 12:08:36 +02:00
LiRen Tu
c9c41dcd08 🎉 Destination S3 & GCS: support additional properties (#7288)
* Log json schema

* Use patched json avro converter

* Rename schema

* Update unit test cases

* Fix ab ap field schema conversion

* Rename files

* Add unit test cases

* Fix dependency for databricks

* Bump versions

* Update documentations

* Update gcs doc

* Set additional properties field name

* Revert s3 and gcs version

* Specify extra props fields

* Refactor json avro conversion doc

* Update connector doc

* Fix databricks spec typo

* Bump connector versions in seed
2021-11-03 11:37:24 -07:00
Abhi Vaidyanatha
ae32ecbb27 GitBook: [master] 186 pages and 77 assets modified 2021-10-08 21:17:47 +00:00
Abhi Vaidyanatha
276d914a57 Direct to Discourse tagged list instead of specific issues on Troubleshooting. (#6848) 2021-10-07 00:05:56 -07:00
Abhi Vaidyanatha
fd2f96e5ff Docs Refresh: S3 Destination (#6704) 2021-10-04 00:36:58 -07:00
Eugene
64ce6fd484 🐛Destination S3 and GCS - Fixed connector's bug that prevent writing streams with more than 50GB (#5890)
Co-authored-by: ievgeniit <ievgeniit>
2021-09-17 14:21:07 -07:00