mirror of synced 2025-12-21 11:01:41 -05:00

234 Commits

Author SHA1 Message Date
Teal Larson
2073513689 docs: remove lingering mentions of dbt core (#45110) 2024-09-03 17:16:49 -04:00
Juan
a7863a0e93 Update cdc.md (#44394) 2024-08-19 13:45:00 -04:00
Johnny Schmidt
e9a37d45ac S3 Dest emits V2 fields and captures failures (#42409) 2024-08-14 09:41:29 -07:00
Davin Chia
697eb863f8 docs: add workloads blog post. (#42918) 2024-07-31 15:50:42 -04:00
Davin Chia
aa5d6dd6b3 docs: worker v2 public docs. (#42873)
Update docs to prep for Worker V2 OSS launch.

The main focus is a brief explanation of the what/why/how of the feature. We have a blog post in the works that I'm going to link to for more explanation once it's published.

- Remove old *_WORKER configs.
- Update the diagram to reflect the new flow. Viewing now is not great due to the rendering. Will likely follow up on a different diagram.
- Write Worker V2 explanation. Note that I left all the Docker pieces in place and tried to make things 'flow'. Will follow up here when we deprecate Docker officially.

Co-authored-by: Jimmy Ma <gosusnp@users.noreply.github.com>
2024-07-31 05:44:44 +00:00
Davin Chia
cb90cbe561 docs: add version after which connector secrets are deleted. (#42515)
Update doc to reflect when connector secrets are deleted.
2024-07-24 20:01:22 -04:00
Davin Chia
b656d6e830 docs: document how Airbyte works with Secrets. (#42422)
Document how Airbyte works with secrets.
2024-07-23 15:03:37 -04:00
Davin Chia
372aea7b11 doc: add Resumable Full Refresh documentation. (#41334)
Adding Resumable Full Refresh documentation as we prep for release.
- reorganise the refreshes documentation.
- split resumability into its own documentation and add the resumable full refresh section.

Co-authored-by: Audrey Maldonado <audrey.maldonado@gmail.com>
2024-07-10 23:02:02 -04:00
Lake Mossman
5bdbaa4a4b Update support level terminology in docs (#40723) 2024-07-08 15:37:02 -07:00
Lake Mossman
8e7f5022c2 Revert "Updating terminology Community Connector > Marketplace Connec… (#40710) 2024-07-03 14:03:25 -07:00
Audrey Maldonado
53c5dd6ce2 Updating terminology Community Connector > Marketplace Connector, Certified Connector > Integration (#40678)
Co-authored-by: Natik Gadzhi <natik@respawn.io>
2024-07-02 21:15:29 +00:00
Davin Chia
23b7e013b7 docs: increase the max partial failure attempt to 20. (#39322)
We are increasing the max partial failure attempt to 20 to accommodate the new CDC load algorithm.

This may result in longer jobs. However all jobs that are longer will be moving data, so we'll also increase the chance a job succeeds in the event of transient errors.

Change was made in airbyte-platform. This is the docs update.
2024-06-06 12:57:36 -04:00
Natik Gadzhi
cb80e6922a [tools] prettier rules for .md + formatting cleanup 2024-05-07 08:19:33 -07:00
Evan Tahler
8913b22a3d Remove schema evolution from CDC docs and other tweaks (#37731) 2024-05-01 15:17:44 -07:00
Evan Tahler
f23881ee14 Update supported-data-types and nullability (#36259) 2024-03-18 22:50:18 +00:00
Marius Posta
f90b5d1b62 docs: update and remove some bad content (#36197) 2024-03-15 13:12:37 -07:00
Marius Posta
2c21cfe2be docs: fix or remove broken links (#36185) 2024-03-15 09:07:03 -07:00
Davin Chia
c8bec40cc3 Improve Heartbeating External Docs. (#35934)
Better explanation for:

General philosophy.
What a heartbeat error means.
2024-03-08 15:34:27 -08:00
Davin Chia
e66ec11534 Better Heartbeating External Documentation. (#35932)
In,
- airbytehq/airbyte-platform-internal@96baf5b
- Better Destination Heartbeat Error Messages airbyte-platform-internal#11595

we improve our heartbeat error messages and point users to this external document.

Here, we improve external documentation to help users understand what is happening and what they can do.
2024-03-08 15:07:19 -08:00
Maxime Carbonneau-Leclerc
d6346e1076 Improve documentation on check command (#35542)
Co-authored-by: Ella Rohm-Ensing <erohmensing@gmail.com>
2024-02-23 17:03:03 +00:00
Evan Tahler
2be84313b1 Update airbyte-protocol.md (#35123) 2024-02-10 02:09:54 +00:00
Ryan Waskewich
7ed1e385e3 Update cdc.md - Add MongoDB support (#34671) 2024-01-30 12:38:00 -05:00
Marcos Marx
c8d06f4878 Docs: create overview architecture diagram + reorder docs (#33896) 2024-01-24 08:37:32 -03:00
Natik Gadzhi
68e833104b Docs: review and update connector from scratch documentation and related guides (#33822)
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2024-01-02 16:59:51 -08:00
Davin Chia
bf9218d5c7 Document State Message Principles (#33787) 2023-12-26 14:10:47 -08:00
Davin Chia
4d84cc707b Update Airbyte Protocol documentation. (#33786)
- Update the changelog section.
- Remove all reference to V1 from the docs since we did not do this migration, and folded the new state message functionality into V0.
2023-12-26 12:08:08 -08:00
Catherine Noll
c01bda4678 Docs: information on schemaless sources & destinations (#33595)
Co-authored-by: Evan Tahler <evan@airbyte.io>
2023-12-19 10:57:17 -05:00
Evan Tahler
015c745884 Update airbyte-protocol.md (#33501) 2023-12-14 10:42:02 -06:00
Aaron ("AJ") Steers
d083d15665 Docs fix: ConfiguredAirbyteCatalog refs (#33483) 2023-12-14 08:39:29 -08:00
Joe Bell
e20896cede Adding Mermaid Diagrams to Docs (#32667) 2023-12-12 23:32:36 +00:00
Tim Roes
c5db1e947b 📚 Documentation November 2023 overhaul (#32811)
Co-authored-by: Natalie Kwong <38087517+nataliekwong@users.noreply.github.com>
Co-authored-by: timroes <timroes@users.noreply.github.com>
Co-authored-by: nataliekwong <nataliekwong@users.noreply.github.com>
2023-11-26 11:57:38 -08:00
Richard Gourley
831a5b01ce Small typo fixed - An JSON -> A JSON (#32637) 2023-11-17 06:39:59 -06:00
rijumone
75e71e3378 Update airbyte-protocol.md (#32490) 2023-11-14 14:33:07 -06:00
Joe Bell
c49674acee Update column change image (#32301) 2023-11-08 09:35:23 -08:00
Benoit Moriceau
d61eb3d843 Source+Destination heartbeat docs (#32064)
Co-authored-by: Jose Pefaur <jose.pefaur@gmail.com>
2023-11-01 15:19:53 -07:00
Evan Tahler
30612bccd9 Docs: more removal of dbt for T&D (#31567) 2023-10-23 12:35:41 -05:00
Przemysław Dąbek
e766d05f9d Update references and point to existing locations (#30712)
Co-authored-by: Sajarin <sajarindider@gmail.com>
2023-09-26 15:19:57 -04:00
Marcos Marx
ab34d4dc88 Remove Kustomize references in docs and replace with Helm Chart (#30008) 2023-08-31 10:24:08 -03:00
Nicholas Hawkes
dcc1ea99c2 fix: Typo in incremental-append-deduped.md (#29971) 2023-08-30 07:32:09 -05:00
Evan Tahler
41f0d4923c [docs] remove early access notes from T&D (#29982)
Co-authored-by: Pedro S. Lopez <pedroslopez@me.com>
2023-08-29 23:39:11 +00:00
Ben Church
b666e2e8a3 Remove release stage references in favor of support level or ab_internal (#29232)
* First pass on removing release stage references

* Automated Commit - Format and Process Resources Changes

* Fix metadata tests

* Fix qa engine tests

* Fix pipeline tests

* Update registry report

* Fix support-level arg

* Add dummy change

* Deal with --allow_community

* Improve log message

* Add docs

* Revert "Add dummy change"

This reverts commit 1861b9525a.

* Code review comments

* Run format

* Install connector ops to qa-engine

* Fix CI test

* REVERT ME

* Revert "REVERT ME"

This reverts commit dce9b8e39b.

---------

Co-authored-by: bnchrch <bnchrch@users.noreply.github.com>
2023-08-15 19:10:55 +00:00
Ryan Br
4dba9ddc7f Rbroughan/add retry docs (#29421)
* Update docs with basic attempt information.

* Add backoff and env vars docs.

* Currently -> by default.

* Add retry attempt example tables.
2023-08-15 09:53:24 -07:00
Evan Tahler
cde2c1bb15 docs: _airbyte_meta Errors (#29380)
* lint

* docs: `_airbyte_meta` Errors

* link

* format

* paste from master

* fix toc madness
2023-08-14 19:00:51 -06:00
Evan Tahler
9210547af5 [Docs] No Deduped + History, Append + Deduped is the future! (#29114)
* [Docs] No `Deduped + History`, `Append + Deduped` is the future!

* fix links
2023-08-08 15:07:49 -07:00
Alexandre Cuoci
4de7801d64 Add public docs for typing & deduping (#28902)
* Add public docs for typing & deduping

* bugfix

* fixes

* Apply suggestions from code review

Co-authored-by: Evan Tahler <evan@airbyte.io>

---------

Co-authored-by: Evan Tahler <evan@airbyte.io>
2023-08-01 15:06:34 -07:00
Cynthia Yin
7e4797d90d Destinations V2: clean up AirbyteType code (#28430)
* general cleanup - move stuff around, add more comments

* guarantee `getAirbyteProtocolType` won't handle array values for `type`

* rename OneOf to Union

* simplify union ordering logic

* update testChooseUnion

* fix docs typos

* Automated Commit - Format and Process Resources Changes

* address comments

* Automated Commit - Format and Process Resources Changes

---------

Co-authored-by: cynthiaxyin <cynthiaxyin@users.noreply.github.com>
2023-07-20 14:59:52 -06:00
Edward Gao
934acaa137 Destination bigquery: rerelease 1s1t behind gate (#27936)
* Revert "Revert "Destination Bigquery: Scaffolding for destinations v2 (#27268)""

This reverts commit 348c577dbb.

* version bumps+changelog

* Speed up BQ by having 2 queries, and not an OR (#27981)

* 🐛 Destination Bigquery: fix bug in standard inserts for syncs >10K records (#27856)

* only run t+d code if it's enabled

* dockerfile+changelog

* remove changelog entry

* Destinations V2: handle optional fields for `object` and `array` types (#27898)

* catch null schema

* fix null properties

* clean up

* consolidate + add more tests

* try catch

* empty json test

* Automated Commit - Formatting Changes

* remove todo

* destination bigquery: misc updates to 1s1t code (#28057)

* switch to checkedconsumer

* add unit test for buildColumnId

* use flag

* restructure prefix check

* fix build

* more type-parsing fixes (#28100)

* more type-parsing fixes

* handle duplicates

* Automated Commit - Format and Process Resources Changes

* add tests for asColumns

* Automated Commit - Format and Process Resources Changes

* log warnings instead of throwing exception

* better log message

* error level

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>

* Automated Commit - Formatting Changes

* Improve protocol type parsing (#28126)

* Automated Commit - Formatting Changes

* Change from T&D every 10k records to an increasing time based interval (#28130)

* fifteen minute t&d

* add typing and deduping operation valve for increased intervals of typing and deduping

* Automated Commit - Format and Process Resources Changes

* resolve bizarre merge conflict

* Automated Commit - Format and Process Resources Changes

---------

Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>

* Simplify and speed up CDC delete support [DestinationsV2] (#28029)

* Simplify and speed up CDC delete support [DestinationsV2]

* better QUOTE

* spotbugs?

* recompile dbt image for local arch and use that when building images

* things compile, but tests fail

* tests working-ish

* comment

* fix logic to re-insert deleted records for cursor comparison.

tests pass!

* remove comment

* Skip CDC re-include logic if there are no CDC columns

* stop hardcoding pk (#28092)

* wip

* remove TODOs

---------

Co-authored-by: Edward Gao <edward.gao@airbyte.io>

* update method name

* Automated Commit - Formatting Changes

* depend on pinned normalization version

* implement 1s1t DATs for destination-bigquery (#27852)

* initial implementation

* Automated Commit - Formatting Changes

* add second sync to test

* do concurrent things

* Automated Commit - Formatting Changes

* clarify comment

* minor tweaks

* more stuff

* Automated Commit - Formatting Changes

* minor cleanup

* lots of fixes

* handle sql vs json null better
* verify extra columns
* only check deleted_at if in DEDUP mode and the column exists
* add full refresh append test case

* Automated Commit - Formatting Changes

* add tests for the remaining sync modes

* Automated Commit - Formatting Changes

* readability stuff

* Automated Commit - Formatting Changes

* add test for gcs mode

* remove static fields

* Automated Commit - Formatting Changes

* add more test cases, tweak test scaffold

* cleanup

* Automated Commit - Formatting Changes

* extract recorddiffer

* and use it in the sql generator test

* fix

* comment

* naming+comment

* one more comment

* better assert

* remove unnecessary thing

* one last thing

* Automated Commit - Formatting Changes

* enable concurrent execution on all java integration tests

* add test for default namespace

* Automated Commit - Formatting Changes

* implement a 2-stream test

* Automated Commit - Formatting Changes

* extract methods

* invert jsonNodesNotEquivalent

* Automated Commit - Formatting Changes

* fix conditional

* pull out diffSingleRecord

* Automated Commit - Formatting Changes

* handle nulls correctly

* remove raw-specific handling; break up methods

* Automated Commit - Formatting Changes

---------

Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: octavia-approvington <octavia-approvington@users.noreply.github.com>

* Destinations V2: move create raw tables earlier (#28255)

* move create raw tables

* better log message

* stop building normalization (#28256)

* fix ability to run tests

* disable incremental t+d for now

* Automated Commit - Formatting Changes

---------

Co-authored-by: Evan Tahler <evan@airbyte.io>
Co-authored-by: Cynthia Yin <cynthia@airbyte.io>
Co-authored-by: cynthiaxyin <cynthiaxyin@users.noreply.github.com>
Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: Joe Bell <joseph.bell@airbyte.io>
Co-authored-by: jbfbell <jbfbell@users.noreply.github.com>
Co-authored-by: octavia-approvington <octavia-approvington@users.noreply.github.com>
2023-07-14 09:34:56 -05:00
Jimmy Ma
c7916d798d Add Orchestrator Diagram to the docs (#16675) 2023-07-09 19:20:38 -07:00
Edward Gao
52b8cbe39d Revert "Destination Bigquery: Scaffolding for destinations v2 (#27268)" (#27891)
* Revert "Destination Bigquery: Scaffolding for destinations v2 (#27268)"

This reverts commit ba3e39bb0c.

* bump versions to 1.5.1 everywhere
2023-06-30 20:26:48 -04:00
Edward Gao
ba3e39bb0c Destination Bigquery: Scaffolding for destinations v2 (#27268)
* copy files from edgao branch

* start writing create table statement

* add basic unit test setup

* create a table, probably

* remove outdated todo

* derp, one more column

* ugh

* add partitioning+clustering

* use StringSubstitutor

* substitutions in updateTable

* wip generate update/insert statement

* split up into smaller methods

* handle json types correctly

* rename stuff

* more json_query vs _value stuff

* minor tweak

* super basic test setup

* laying foundation for type parsing

* more stuff

* tweaks

* more progress on type parsing

* fix json_value stuff?

* misc fixes in insert

* fix dedupFinalTable

* add testDedupRaw

* full e2e test

* type parsing: gave up and mirrored the dbt code structure to avoid bugs

* type parsing - more cleanup

* handle column name collisions

* handle tablename collisions...?

* comments

* remove original ns/name from quotedstream

* also javadoc

* remove redundant method

* fix table rename

* add incremental append test

* add full refresh append test

* comment

* call T+D sql in a reasonable location for standard inserts

* add config option

* use config option here

* type parsing - fix fromJsonSchema

* gate everything

* log query + runtime

* add spec option temporarily

* Raw Table Updates

* fix more stuff

* first big pass at toDialectType

* no quotes

* wrap everything in quotes

* resolve some TODOs

* log sql statement in tests

* overwriteFinalTable returns optional

* minor clean up

* add raw dataset override

* try to preserve the original namespace for t+d?

* write to the raw table correctly

* update todos

* write directly to raw table

this is kind of dumb because we're still trying to do tmp table operations,
and we still don't ack state until the end of the entire sync.

* standard inserts write to raw table correctly

* imports + log statements

* move logs + add comment

* explicitly create raw table

* move comment to better place

* Typing issues

* bash attempt

* formatting updates

* formatting updates

* write to the airbyte schema by default unless overridden by config options

* standard inserts truncate raw table at start of sync

* full refresh overwrite will overwrite correctly!

* fix avro record schema parsing

* better raw table recreate

* rename raw table to match standard inserts

* full refresh overwrite does tmp table things

* small clean up

* small clean up

* remove errors entry if no errors

* pull out destination config into singleton

* clean up singleton stuff

* make sure dest config exists when trying to do lookups

* avoid stringifying null

* quick thoughts on alter table

* add basic cdc testcase

* tweak cdc test setup

* rename raw table to match standard inserts

* minor tweak

* delete exact sql string assertions

* switch to JSON type

* minor cleanup

* sql whitespace changes

* explain cdc deletions

* GCS Staging Full Refresh create temp table

* assert schema

* first out of order cdc test

* add another cdc test case (currently failing)

* better test structure

* make this work

* oops, fix test

* stop trying to delete deletion records

* minor improvements to code+test

* enable concurrent test runs on integration test

* move stuff to static initializer

* extract utility method

* formatting

* Move conditional to the base java package, replace conditionals which did not use the typing and deduping flag but should have been.

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* switch back to empty list; write big assert

* minor wording tweaks

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* DestinationConfigTest

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* formatting

* remove ParsedType

* 🤖 Auto format destination-gcs code [skip ci]

* 🤖 Auto format destination-bigquery code [skip ci]

* tests verify every data type

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* full update with all data types

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* move stuff to new base lib

* 🤖 Auto format destination-gcs code [skip ci]

* Automated Commit - Formatting Changes

* 🤖 Auto format destination-bigquery code [skip ci]

* fix test

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-bigquery code [skip ci]

* 🤖 Auto format destination-gcs code [skip ci]

* asserts in dedupFinalTable

* better asserts in dedupRawTable

* [wip] test case for all data types

* 🤖 Auto format destination-gcs code [skip ci]

* 🤖 Auto format destination-bigquery code [skip ci]

* AirbyteTypeTest

* Automated Commit - Formatting Changes

* remove comments

* test chooseOneOf

* slightly better test output

* Automated Commit - Formatting Changes

* add some awful pretty print code

* more comment

* minor tweaks

* verify array/object type

* fix test

* handle deletions more correctly

* test toDialectType

* Destinations v2: better namespace handling (#27682)

* [wip] better namespace handling

* 🤖 Auto format destination-bigquery code [skip ci]

* wip also implement in gcs

* get gcs working (?)

* 🤖 Auto format destination-bigquery code [skip ci]

* remove duplicate method

* 🤖 Auto format destination-bigquery code [skip ci]

* fixed my code style settings

* make ci happy?

* 🤖 Auto format destination-bigquery code [skip ci]

* make ci happy?

* remove incorrect test

* blank line change

* initialize singleton

---------

Co-authored-by: octavia-squidington-iii <octavia-squidington-iii@users.noreply.github.com>

* reset args correctly

* Automated Commit - Formatting Changes

* more bash stuff

* parse implicit structs

* initialize singleton in more tests

* Automated Commit - Formatting Changes

* I missed this namespace handling thing

* test more schemas

* fix singular types specified in arrays

* Automated Commit - Formatting Changes

* disable test for unimplemented feature

* initialize singleton

* remove spec options; changelogs+metadata

* randomize namespace

* also bump dockerfile

* unremove namespace sanitizing in legacy mode

* ... disable the correct test

* even more unit test fixes!

* move integration test to integration tests

---------

Co-authored-by: Cynthia Yin <cynthia@airbyte.io>
Co-authored-by: Joe Bell <joseph.bell@airbyte.io>
Co-authored-by: octavia-squidington-iii <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: edgao <edgao@users.noreply.github.com>
Co-authored-by: cynthiaxyin <cynthiaxyin@users.noreply.github.com>
2023-06-29 08:44:37 -07:00