1
0
mirror of synced 2025-12-21 19:11:14 -05:00
Commit Graph

2421 Commits

Author SHA1 Message Date
LiRen Tu
e2c5b1d4c7 🎉 S3 destination: Avro & Jsonl output (#4227)
* Add jsonl format to spec.json

* Implement jsonl writer

* Add documentation

* Add acceptance test

* Update document

* Bump version

* Update document example

* Implement avro writer

* Implement compression codec

* Update documentation

* Revise documentation

* Add more tests

* Add acceptance test

* Format code

* Create helper method for name updater

* Update csv doc with normalization

* Update version date
2021-06-23 14:54:39 -07:00
Christophe Duong
75a1dda07e 🎉 New BigQuery destination with Structured/Repeated Records (#4176) 2021-06-23 16:19:36 +02:00
Marcos Marx
b57736b0ce Documentation correct markdown list in csv and json destination (#4165)
* correct list in markdown csv and json

* add copy cmd and change variable name to used in dest settings
2021-06-21 19:53:26 -03:00
Marcos Marx
f365c0c430 destination mssql correct spec doc info (#4185) 2021-06-17 19:51:36 -03:00
LiRen Tu
53b51c08d1 Bump S3 destination version (#4166) 2021-06-16 11:54:50 -07:00
Prashantha Patali
0884fa74a6 📚More clarity added for Windows based Docker Desktop environments (#4068)
* Additinal details about Windows specific long filename error during cloning

* Corrected the path

* Revert "Corrected the path"

This reverts commit bbd3b78fcb.

* Revert "Additinal details about Windows specific long filename error during cloning"

This reverts commit 0b695eea1a.

* Email is mandatory 'Specify your preferences', Link to docker inst guide

* Use relative paths while linking

* Typo in Destination section

* On Windows, fixing long filename error during cloning

* For Windows with WSL2 and Docker, clarify steps to locate destination local folder

* Link to locating local files on Windows

* Update docs/contributing-to-airbyte/updating-documentation.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/quickstart/set-up-a-connection.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/examples/postgres-replication.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/examples/postgres-replication.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/examples/postgres-replication.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/local-deployment.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/deploying-airbyte/local-deployment.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

Co-authored-by: Patali, Prashanth <ppatali@hidglobal.com>
Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>
2021-06-16 14:14:25 -03:00
LiRen Tu
87552b25e4 🎉 Destination S3: parquet output (#3908)
* Add skeleton code for parquet writer

* Refactor s3 destination code

* Add parquet to spec

* Complete parquet writer

* Change testing data from int to double

* Add acceptance test for parquet writer

* Handle special schema field names

* Format code

* Add parquet config

* Add documentation

* Add unit tests

* Fix typo

* Update document

* Bump version

* Fix date format

* Fix credential filename

* Update doc

* Update test and publish commands

* Refactor s3 format config

* Append compression codec file extension

* Update doc

* Remove compression codec file extension

* Add comments

* Add README, CHANGELOG, and sample configs

* Move changelog

* Use switch statement

* Move filename helper method to base writer

* Rename converter

* Separate test cases

* Drop union type length restriction

* Support array with multiple types

* Move comments to connector doc

* Share config between acceptance tests

* Add doc about additional properties

* Move shared code out of if branch

* Add doc about adding a new format

* Format code

* Bump version to 0.1.4

* Fix default max padding size
2021-06-14 16:49:27 -07:00
Duk Panhavad
d2f9643650 🎉 Destination S3: alternative S3 support (#4038)
Authored by @panhavad

* base from prev PR
* add s3 alternative destination connector feature
* fix testGetOutputFilename
* default using aws
* Update airbyte-integrations/connectors/destination-jdbc/src/main/java/io/airbyte/integrations/destination/jdbc/copy/s3/S3Config.java
2021-06-13 00:57:19 -07:00
Abhi Vaidyanatha
e26f813536 Remove Postgres references in OracleDB Destination Docs (#4043)
Co-authored-by: Abhi Vaidyanatha <abhivaidyanatha@Abhis-MacBook-Pro.local>
2021-06-10 15:29:48 -06:00
jacqueskpoty
8c9bf60bd8 updating integrations destinations local files doccs (#3910)
* updating integrations destinations local files doccs

* updating integrations destinations local files doccs

* Update docs/integrations/destinations/local-json.md

Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>

* updating integrations destinations local files doccs

* updating integrations destinations local files doccs

* Update docs/integrations/destinations/local-csv.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/integrations/destinations/local-csv.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/integrations/destinations/local-csv.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* Update docs/integrations/destinations/local-csv.md

Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>

* updating integrations destinations local files doccs

Co-authored-by: jsewawilson <jwilson@pebblepost.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
Co-authored-by: Abhi Vaidyanatha <abhi@airbyte.io>
2021-06-07 21:34:35 -03:00
Christophe Duong
913c55df6b 🎉 Add dataset location to BigQuery connector (issue #3277) (#3889)
Authored-by: sabifranjo <sabifranjo@gmail.com>
Authored-by: Sabolc Franjo <sabolc.franjo@ev-box.com>
2021-06-07 15:33:47 +02:00
masonwheeler
8dadd1cebd Oracle destination implementation (#3498)
Working implementation of Oracle destination

Co-authored-by: cgardens <giardina.charles@gmail.com>
2021-06-03 16:27:09 -06:00
LiRen Tu
c13b9883e8 🎉 New destination: S3 (#3672)
* Update README icon links

* Update airbyte-specification doc

* Extend base connector

* Remove redundant region

* Separate warning from info

* Implement s3 destination

* Run format

* Clarify logging message

* Rename variables and functions

* Update documentation

* Rename and annotate interface

* Inject formatter factory

* Remove part size

* Fix spec field names and add unit tests

* Add unit tests for csv output formatter

* Format code

* Complete acceptance test and fix bugs

* Fix uuid

* Remove generator template files

They belong to another PR.

* Add unhappy test case

* Checkin airbyte state message

* Adjust stream transfer manager parameters

* Use underscore in filename

* Create csv sheet generator to handle data processing

* Format code

* Add partition id to filename

* Rename date format variable
2021-06-03 09:40:51 -07:00
etsybaev
1fed4b0e2c 🐛 Destination MSSQL: fix issue with unicode symbols handling (ex.\u2028 showed as ?) (#3671)
* [3552] Fixed MSSQL _airbyte_data field's type to support unicode
2021-06-01 09:38:13 +03:00
masonwheeler
a67c769980 MS SQL Server Destination implementation
Fixes issue #613.

Normalization is not yet enabled.  This will have to be added at a later point.
2021-05-17 09:54:52 -06:00
Marcos Marx
a18b0e80d0 GitBook: [master] 161 pages and 75 assets modified 2021-05-16 22:51:02 +00:00
Subodh Kant Chaturvedi
a2ec9ee1e5 mysql destination docs (#3275)
* setup doc for mysql destination

* update the documentation in right places
2021-05-07 13:38:07 +05:30
Jared Rhizor
32ee10e76d snowflake gcs staging (#3183)
* spec

* snowflake gcs

* spec comment

* fmt

* remove sout

* add docs

* format

* respond to comments

* fix

* bump to 0.3.3
2021-05-04 09:59:32 -07:00
Marcos Marx
7c70eb02cd GitBook: [master] 84 pages and 72 assets modified 2021-05-03 12:14:16 +00:00
Christophe Duong
77ffd74b32 Ignore records that are too big in Redshift destinations (instead of failing) (#2988)
* Abort sync if one of the part fails to copy to temp table

* Check for record size when copying data from s3 to redshift

* Handle big record in RedshiftInsertDestination too
2021-04-30 21:04:03 +02:00
Davin Chia
e63ab8490a Make Redshift part size configurable. (#3053)
The Redshift Copy strategy currently has it's part size set to 10 MB. Since S3 allows a file to be broken into max of 10k parts, this results in a 100GB table limit. A user is trying to sync a table of 115GB and running into this issue.

This makes the part size configurable so users can increase this size if needed.
2021-04-27 13:35:35 +08:00
Davin Chia
42686add8a Release connectors with namespace change. (#2990)
Release all connectors affected by namespace change. Includes all JDBC sources and destinations.

Also add documentation for normalisation. Prerequisite to actually releasing 0.21.0-alpha.
2021-04-21 11:35:08 +08:00
Michel Tricot
e4326aa21e Revert "GitBook: [master] 25 pages and 46 assets modified"
This reverts commit 0b4eddc7cf.
2021-04-16 14:20:42 -07:00
Michel Tricot
0b4eddc7cf GitBook: [master] 25 pages and 46 assets modified 2021-04-16 20:29:16 +00:00
Davin Chia
e8c13cf0fe Update documentation to include Redshift COPY strategy. (#2728) 2021-04-06 09:10:19 +08:00
Shrif Nada
43f7a31f77 GitBook: [master] 113 pages modified 2021-02-10 23:13:18 +00:00
Charles
9a81bd6e5c MeiliSearch Destination (#1964) 2021-02-08 18:44:55 -08:00
Sherif A. Nada
ae9fea7f22 GitBook: [sherif/update-connector-health-feb312021] 4 pages modified (#1909)
Co-authored-by: Shrif Nada <sherif@dataline.io>
2021-02-01 09:55:47 -08:00
Christophe Duong
0fd4b5c674 Local File Destinations UX change with destination paths (#1882)
* Add logging info when writing local data files

* Make Local CSV/Json destination always writes in /local (unecessary to specify it in the configs)

* BumpVersion of CSV and Local Json destinations
(throws Exceptions when failing too)
2021-01-29 13:30:03 +01:00
Sherif A. Nada
aeecbcfb04 GitBook: [sherif/bigquery-docs] 3 pages modified (#1883) 2021-01-28 10:44:00 -08:00
Shrif Nada
ff458bbd9d GitBook: [master] 111 pages modified 2021-01-26 04:14:55 +00:00
Jared Rhizor
ee9de795ff add schema grants to snowflake docs (#1836)
* add schema grants to snowflake docs

* remove revoke current grants
2021-01-25 18:54:22 -08:00
Jared Rhizor
4ab0464274 add postgres troubleshooting docs (#1653)
* add postgres troubleshooting docs

* update headings

* add to other db sources

* move connection troubleshooting to faq
2021-01-25 10:56:09 -08:00
John Lafleur
7f865c3ef1 GitBook: [docs_connector_health] 4 pages modified (#1719)
Co-authored-by: John Lafleur <john@dataline.io>
2021-01-20 09:37:28 +11:00
Christophe Duong
194d6cb5d0 Add metadata prefix to destination internal columns (#1708)
* Add metadata prefix to destination internal columns

* Bumpversion all destinations for new normalization behavior

* Fix acceptance tests
2021-01-19 17:21:15 +01:00
Michel Tricot
1214e2b958 Create a simpler Getting Started with demo data (#1408) 2020-12-21 16:09:12 -08:00
John Lafleur
21c2993373 GitBook: [changelog_update] 4 pages modified (#1361)
* GitBook: [changelog_update] 4 pages modified

* Update docs/changelog.md

Co-authored-by: Charles <giardina.charles@gmail.com>

Co-authored-by: John Lafleur <john@dataline.io>
Co-authored-by: Charles <giardina.charles@gmail.com>
2020-12-18 05:11:14 +11:00
Jared Rhizor
1f90cf97a8 local kubernetes support (#1254)
* initial attempt at generating local kube setup from docker compose

* update current state

* mounts not working

* working mounts, failing cors

* working UI

* add remaining todos

* update todos

* A

* use kustomize to select image versions

* kube process builder factory

* fix misalignment

* don't allow any retries for requested jobs

* fix log waiting and path handling

* update todos

* local volume handling

* propagate return code correctly

* update todos

* update docs

* fmt

* add to docs

* fix conflicting config file bug

* fmt

* delete unused file

* remove comment

* add job id and attempt as inputs

* rename to WorkerEnvironment

* fix example custom overlay

* less trigger-happy docs

* rename mounts

* show local csv as not working in kube in the docs

* use config maps for everything

* fix paths

* fix build

* fix stripe integration test usage

* fix papercups on kube
2020-12-16 14:34:55 -08:00
Charles
01cdeafdb1 release v0.8.0-alpha (#1339) 2020-12-16 14:01:57 -08:00
John Lafleur
6668a514f4 GitBook: [master] 10 pages and 335 assets modified 2020-12-04 06:42:08 +00:00
Christophe Duong
d06392e900 Redshift Destination & refactoring to introduce destination-jdbc
Closes #193
Closes #1126
2020-12-03 18:07:46 +01:00
John Lafleur
a5f509bf49 GitBook: [master] 15 pages and 4 assets modified 2020-11-25 19:01:56 +00:00
Christophe Duong
206d3cbea8 Naming conventions managed in destinations (#1060)
Handled field and table identifier conversions as necessary in order to successfully in destinations
2020-11-25 18:53:23 +01:00
Charles
e1f19de716 remove incorrect citation (#940) 2020-11-11 22:58:44 -08:00
John Lafleur
f20e574e8e Update connector changelog and relative links (#903) 2020-11-10 19:26:24 -08:00
Michel Tricot
5ecdfeb012 GitBook: [master] 8 pages and 42 assets modified 2020-11-11 02:46:18 +00:00
Charles
3d3b449472 fix link to csv destination from source file(#885) 2020-11-10 14:55:08 -08:00
John Lafleur
458a91a19e Documentation - Consolidation of documentation around building new/custom connectors + consistency on vocabulary (#868) 2020-11-10 14:46:39 -08:00
Charles
02a93dd259 remove snowflake use of streamRenamer in standard tests (#866) 2020-11-09 23:54:52 -08:00
Michel Tricot
f914670461 GitBook: [master] 11 pages and 21 assets modified 2020-11-07 03:38:16 +00:00