1
0
mirror of synced 2025-12-21 11:01:41 -05:00
Commit Graph

1266 Commits

Author SHA1 Message Date
Joe Reuter
da5b432255 Vector DB CDK: AzureOpenAIEmbedder (#30136) 2023-09-14 12:41:00 +02:00
clnoll
3fec3cc7b6 🤖 Bump patch version of Airbyte CDK 2023-09-13 14:55:13 +00:00
Catherine Noll
6902b44b0a File-based CDK: update record parse error message (#30347) 2023-09-13 10:45:55 -04:00
Marius Posta
51c67d7eaa gradle: remove airbyteDocker.outputs dependencies (#30314) 2023-09-11 17:16:27 -07:00
Denys Davydov
5248854f2e CAT: changing format or airbyte_type is a breaking change (#30172)
Co-authored-by: davydov-d <davydov-d@users.noreply.github.com>
Co-authored-by: Augustin <augustin@airbyte.io>
2023-09-08 05:55:52 +00:00
Maxime Carbonneau-Leclerc
82b74cd06c Document mockserver usage for CDK development (#30202) 2023-09-07 12:31:01 -04:00
Marius Posta
ef2849e35e gradle: fix airbyteDocker task inputs (#30187) 2023-09-07 03:46:31 -07:00
Maxime Carbonneau-Leclerc
0d5ea947db Formatting issue (#30223) 2023-09-06 21:02:22 +00:00
maxi297
fa81dc1e5a 🤖 Bump patch version of Airbyte CDK 2023-09-06 17:45:53 +00:00
Maxime Carbonneau-Leclerc
48e8816b6b [oncall #2838] migrate parsing errors as config errors (#30209) 2023-09-06 13:38:48 -04:00
flash1293
cae3ac4b71 🤖 Bump patch version of Airbyte CDK 2023-09-06 13:00:48 +00:00
Joe Reuter
f2a8bebdc5 Vector DB CDK: Add "from field" embedding strategy (#30140) 2023-09-06 14:54:17 +02:00
Joe Reuter
56580b70c3 Vector DB CDK: Better error message for misconfigured text fields (#30129)
Co-authored-by: Pedro S. Lopez <pedroslopez@me.com>
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
2023-09-06 10:11:56 +02:00
Marius Posta
be1e1adabd gradle: cleanup (#30060) 2023-09-05 14:05:40 -05:00
maxi297
76ac725665 🤖 Bump patch version of Airbyte CDK 2023-09-03 13:33:46 +00:00
Maxime Carbonneau-Leclerc
5b653676aa Update spec and fix autogenerated headers with skip after (#30123) 2023-09-03 09:26:53 -04:00
Maxime Carbonneau-Leclerc
399b4d1fca File-based CDK: ensure no errors in Sentry given empty CSV (#29944) 2023-09-02 09:40:08 -04:00
flash1293
54d40a9c3c 🤖 Bump patch version of Airbyte CDK 2023-09-01 13:12:44 +00:00
Joe Reuter
7966a4e8f6 Vector DB CDK: Fix id generation, improve config spec, add base test case (#30081) 2023-09-01 15:04:10 +02:00
maxi297
06758650e5 🤖 Bump patch version of Airbyte CDK 2023-08-31 22:39:51 +00:00
Maxime Carbonneau-Leclerc
63d3e40914 [ISSUE #29660] support empty keys with record selection (#30057) 2023-08-31 18:31:09 -04:00
Maxime Carbonneau-Leclerc
fc955e89e2 Up pydantic version in order to fix destination tests (#30034) 2023-08-31 11:12:55 -04:00
Alexandre Girard
7264b3e1d7 Fix mypy issues in AbstractSource + minor refactoring (#29927)
Co-authored-by: girarda <girarda@users.noreply.github.com>
2023-08-31 07:35:17 -07:00
flash1293
6074bcafbf 🤖 Bump patch version of Airbyte CDK 2023-08-29 14:11:36 +00:00
Joe Reuter
a6547456b9 Vector based CDK (#29703)
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
2023-08-29 16:04:32 +02:00
maxi297
408e933159 🤖 Bump patch version of Airbyte CDK 2023-08-28 22:07:17 +00:00
Maxime Carbonneau-Leclerc
e2fb04f72d File-based CDK: allow user to provided column names (#29868) 2023-08-28 18:00:19 -04:00
Marius Posta
3e680675a4 github workflows: repo-wide auto-format (#29798)
Co-authored-by: postamar <postamar@users.noreply.github.com>
2023-08-25 10:20:41 -07:00
maxi297
f49b869094 🤖 Bump patch version of Airbyte CDK 2023-08-25 15:51:18 +00:00
Maxime Carbonneau-Leclerc
82a96e0c69 File-based CDK: allow for extension mismatch (#29835) 2023-08-25 11:44:49 -04:00
maxi297
c9a2166108 🤖 Bump patch version of Airbyte CDK 2023-08-23 21:12:06 +00:00
Maxime Carbonneau-Leclerc
cb2796de0a File-based CDK: Remove excessive logging when there are more fields i… (#29778) 2023-08-23 17:05:54 -04:00
maxi297
d405e3f77b 🤖 Bump patch version of Airbyte CDK 2023-08-23 15:37:34 +00:00
Maxime Carbonneau-Leclerc
40b76a7813 Source S3: v4 rollout/feature parity (#29753) 2023-08-23 11:30:08 -04:00
maxi297
68a6c53010 🤖 Bump patch version of Airbyte CDK 2023-08-21 20:04:28 +00:00
Maxime Carbonneau-Leclerc
b801a3d24f Do not stop processing file on parsing error (#29679) 2023-08-21 15:56:01 -04:00
flash1293
9b1b08bed7 🤖 Bump patch version of Airbyte CDK 2023-08-21 10:51:01 +00:00
Joe Reuter
f8de9d12df CDK: Remove list endpoint (#29581) 2023-08-21 12:43:44 +02:00
Joe Reuter
d293e1cce4 Embedded CDK: run a check before starting to load (#29079) 2023-08-21 12:42:58 +02:00
girarda
255ecef227 🤖 Bump patch version of Airbyte CDK 2023-08-18 03:08:27 +00:00
Alexandre Girard
b4ce532762 low-code: Allow formatting datetimes as milliseconds since unix epoch (#29504)
Co-authored-by: girarda <girarda@users.noreply.github.com>
2023-08-17 18:49:28 -07:00
Maxime Carbonneau-Leclerc
ec290eed8a Issue 27019/doc (#29434)
Co-authored-by: Alexandre Girard <alexandre@airbyte.io>
Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>
2023-08-17 17:28:13 -04:00
Maxime Carbonneau-Leclerc
e9d99630ed Removing validation on skip rows and autogenerated headers (#29488) 2023-08-17 16:14:19 -04:00
clnoll
46f390f7c9 🤖 Bump minor version of Airbyte CDK 2023-08-15 18:11:40 +00:00
Catherine Noll
7c1d6081de File-based CDK: handle legacy path_prefix + globs (#29389) 2023-08-15 12:18:25 -04:00
Brian Lai
5908b85e69 [file-based cdk] Remove CSV quoting_behavior config option (#29388)
* remove CSV quoting_behavior config option

* cleanup after getting latest master
2023-08-14 20:37:38 -04:00
Alexandre Girard
b512fa4628 file-based CDK: Configurable strings_can_be_null (#29298)
* [ISSUE #28893] infer csv schema

* [ISSUE #28893] align with pyarrow

* Automated Commit - Formatting Changes

* [ISSUE #28893] legacy inference and infer only when needed

* [ISSUE #28893] fix scenario tests

* [ISSUE #28893] using discovered schema as part of read

* [ISSUE #28893] self-review + cleanup

* [ISSUE #28893] fix test

* [ISSUE #28893] code review part #1

* [ISSUE #28893] code review part #2

* Fix test

* formatcdk

* first pass

* [ISSUE #28893] code review

* fix mypy issues

* comment

* rename for clarity

* Add a scenario test case

* this isn't optional anymore

* FIX test log level

* Re-adding failing tests

* [ISSUE #28893] improve inferrence to consider multiple types per value

* Automated Commit - Formatting Changes

* [ISSUE #28893] remove InferenceType.PRIMITIVE_AND_COMPLEX_TYPES

* Code review

* Automated Commit - Formatting Changes

* fix unit tests

---------

Co-authored-by: maxi297 <maxime@airbyte.io>
Co-authored-by: maxi297 <maxi297@users.noreply.github.com>
2023-08-14 12:51:27 -07:00
Maxime Carbonneau-Leclerc
12f1304a67 Issue 28893/infer schema csv (#29099) 2023-08-14 15:14:46 -04:00
Catherine Noll
6946052513 Source S3: maintain backwards compatibility between V3 & V4 state messages (#29028) 2023-08-11 11:38:43 -04:00
Alexandre Girard
1a120ecd4b File-CDK (Avro) Set double_as_string to false by default (#29339)
* set double_as_string to false by default

* Use default config when irrelevant to the test

* Update description

* Update the description again
2023-08-10 14:31:52 -07:00