mirror of synced 2026-02-02 07:01:59 -05:00
453 Commits

Author SHA1 Message Date
Yevhenii
11e865177a CDK: Enable debug logging when running availability check (#31368) 2023-10-13 13:00:46 +03:00
Joe Reuter
e35a1f2cd9 File CDK: Allow configuration of parsed records during check and discover from parser (#31281)
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
2023-10-13 09:50:22 +02:00
Joe Reuter
0e4f290065 Vector DB CDK: Fix openai compatible embedder (#31330) 2023-10-13 09:23:00 +02:00
Catherine Noll
8536725944 CDK: URL-encode query parameters and request body (#30407) 2023-10-12 09:56:55 -04:00
Joe Reuter
67324a4b5b Vector DB CDK: Batch by documents separately for each stream and namespace (#31158) 2023-10-12 13:47:27 +00:00
Alexandre Girard
25fc396cdf CDK: ThreadBasedConcurrentStream skeleton and top-level AbstractStream (#30111)
Co-authored-by: girarda <girarda@users.noreply.github.com>
Co-authored-by: Maxime Carbonneau-Leclerc <maxi297@users.noreply.github.com>
Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>
2023-10-11 16:46:02 -07:00
Yevhenii
17136a0c8a CDK: Fix initialize of token_expiry_is_time_of_expiration field (#31279) 2023-10-11 16:35:56 +00:00
Yevhenii
c17fae5855 CDK: create new method for parsing refresh token lifespan (#30698)
Co-authored-by: yevhenii-ldv <yevhenii-ldv@users.noreply.github.com>
2023-10-10 17:08:41 +03:00
Ben Church
4c97b2994a CDK: coerce read records to an iterator (#31122)
Co-authored-by: bnchrch <bnchrch@users.noreply.github.com>
2023-10-06 10:01:29 -07:00
Yevhenii
00452c9bd3 CDK: Enable Page Number/Offset to be set on the first request (#30978)
Co-authored-by: yevhenii-ldv <yevhenii-ldv@users.noreply.github.com>
2023-10-05 15:31:30 +03:00
Roman Yermilov [GL]
e561d5d432 Airbyte CDK: fix none type binary error in parquet parser (#31073) 2023-10-05 15:56:02 +04:00
Anton Karpets
767800d2d7 🐛Airbyte CDK: fix parsing of UUID fields in avro files (#31096) 2023-10-05 10:53:18 +03:00
Joe Reuter
07cc1c389e Vector DB CDK: Fix chunk size for openai embedder (#31067) 2023-10-04 21:51:44 +02:00
Joe Reuter
a1ca73a724 CDK: Add helper for cloud env (#30980) 2023-10-02 19:11:40 +02:00
Joe Reuter
5ab372170b Vector DB CDK: Add embedding option for openai-compatible embedding services (#30137) 2023-10-02 16:21:44 +00:00
Eugene Kulak
1e7b4b1cc5 🐛 CDK: fix deletion of request cache files (#30925)
Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>
2023-09-29 16:32:56 +00:00
Eugene Kulak
5eba3c3b57 CDK: Fix request_cache clearing and move it to tmp folder (#30719)
Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>
2023-09-28 21:27:40 +03:00
Marius Posta
7ae97175a6 gradle: fix repo wide behaviour (#30607) 2023-09-28 05:01:13 -07:00
Martin Hwasser
0964d0e1c9 🐛 Set Azure OpenAI chunk_size to 16. (#30795) 2023-09-27 17:12:25 +02:00
Yevhenii
8cdafabd82 Airbyte CDK: Change Error message if stream is not found (#30723)
Co-authored-by: Yevhenii Kurochkin <ykurochkin@flyaps.com>
2023-09-25 18:13:19 +03:00
Joe Reuter
7e3437f05b Add chunking options to vector_db CDK (#30305)
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
2023-09-25 10:09:37 +00:00
Ben Church
5d8278900f Github Action: Add format.yml workflow (#30604)
Co-authored-by: bnchrch <bnchrch@users.noreply.github.com>
Co-authored-by: octavia-approvington <octavia-approvington@users.noreply.github.com>
2023-09-21 18:11:41 -05:00
Maxime Carbonneau-Leclerc
b335880fda jira invalid user-provided urls generating sentry issues (#30672) 2023-09-21 15:01:17 -04:00
Joe Reuter
a609902106 Vector DB CDK: Split openai embedding calls (#30512) 2023-09-19 14:21:13 +00:00
Edward Gao
8805edcea6 DV2: Better errors in the UI (#30491)
Co-authored-by: edgao <edgao@users.noreply.github.com>
2023-09-18 15:58:56 +00:00
Maxime Carbonneau-Leclerc
b6836ad950 [ISSUE #30353] remove file_type from stream config (#30453) 2023-09-18 08:50:00 -04:00
Maxime Carbonneau-Leclerc
3e41ce7cd6 Maxi297/fix datetime format inference issue (#30442) 2023-09-15 09:40:47 -04:00
Joe Reuter
da5b432255 Vector DB CDK: AzureOpenAIEmbedder (#30136) 2023-09-14 12:41:00 +02:00
Catherine Noll
6902b44b0a File-based CDK: update record parse error message (#30347) 2023-09-13 10:45:55 -04:00
Denys Davydov
5248854f2e CAT: changing format or airbyte_type is a breaking change (#30172)
Co-authored-by: davydov-d <davydov-d@users.noreply.github.com>
Co-authored-by: Augustin <augustin@airbyte.io>
2023-09-08 05:55:52 +00:00
Maxime Carbonneau-Leclerc
0d5ea947db Formatting issue (#30223) 2023-09-06 21:02:22 +00:00
Maxime Carbonneau-Leclerc
48e8816b6b [oncall #2838] migrate parsing errors as config errors (#30209) 2023-09-06 13:38:48 -04:00
Joe Reuter
f2a8bebdc5 Vector DB CDK: Add "from field" embedding strategy (#30140) 2023-09-06 14:54:17 +02:00
Joe Reuter
56580b70c3 Vector DB CDK: Better error message for misconfigured text fields (#30129)
Co-authored-by: Pedro S. Lopez <pedroslopez@me.com>
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
2023-09-06 10:11:56 +02:00
Maxime Carbonneau-Leclerc
5b653676aa Update spec and fix autogenerated headers with skip after (#30123) 2023-09-03 09:26:53 -04:00
Maxime Carbonneau-Leclerc
399b4d1fca File-based CDK: ensure no errors in Sentry given empty CSV (#29944) 2023-09-02 09:40:08 -04:00
Joe Reuter
7966a4e8f6 Vector DB CDK: Fix id generation, improve config spec, add base test case (#30081) 2023-09-01 15:04:10 +02:00
Maxime Carbonneau-Leclerc
63d3e40914 [ISSUE #29660] support empty keys with record selection (#30057) 2023-08-31 18:31:09 -04:00
Alexandre Girard
7264b3e1d7 Fix mypy issues in AbstractSource + minor refactoring (#29927)
Co-authored-by: girarda <girarda@users.noreply.github.com>
2023-08-31 07:35:17 -07:00
Joe Reuter
a6547456b9 Vector based CDK (#29703)
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
2023-08-29 16:04:32 +02:00
Maxime Carbonneau-Leclerc
e2fb04f72d File-based CDK: allow user to provided column names (#29868) 2023-08-28 18:00:19 -04:00
Marius Posta
3e680675a4 github workflows: repo-wide auto-format (#29798)
Co-authored-by: postamar <postamar@users.noreply.github.com>
2023-08-25 10:20:41 -07:00
Maxime Carbonneau-Leclerc
82a96e0c69 File-based CDK: allow for extension mismatch (#29835) 2023-08-25 11:44:49 -04:00
Maxime Carbonneau-Leclerc
cb2796de0a File-based CDK: Remove excessive logging when there are more fields i… (#29778) 2023-08-23 17:05:54 -04:00
Maxime Carbonneau-Leclerc
40b76a7813 Source S3: v4 rollout/feature parity (#29753) 2023-08-23 11:30:08 -04:00
Maxime Carbonneau-Leclerc
b801a3d24f Do not stop processing file on parsing error (#29679) 2023-08-21 15:56:01 -04:00
Joe Reuter
f8de9d12df CDK: Remove list endpoint (#29581) 2023-08-21 12:43:44 +02:00
Joe Reuter
d293e1cce4 Embedded CDK: run a check before starting to load (#29079) 2023-08-21 12:42:58 +02:00
Alexandre Girard
b4ce532762 low-code: Allow formatting datetimes as milliseconds since unix epoch (#29504)
Co-authored-by: girarda <girarda@users.noreply.github.com>
2023-08-17 18:49:28 -07:00
Maxime Carbonneau-Leclerc
ec290eed8a Issue 27019/doc (#29434)
Co-authored-by: Alexandre Girard <alexandre@airbyte.io>
Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>
2023-08-17 17:28:13 -04:00