Yevhenii
|
11e865177a
|
CDK: Enable debug logging when running availability check (#31368)
|
2023-10-13 13:00:46 +03:00 |
|
Joe Reuter
|
e35a1f2cd9
|
File CDK: Allow configuration of parsed records during check and discover from parser (#31281)
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
|
2023-10-13 09:50:22 +02:00 |
|
Joe Reuter
|
0e4f290065
|
Vector DB CDK: Fix openai compatible embedder (#31330)
|
2023-10-13 09:23:00 +02:00 |
|
Catherine Noll
|
8536725944
|
CDK: URL-encode query parameters and request body (#30407)
|
2023-10-12 09:56:55 -04:00 |
|
Joe Reuter
|
67324a4b5b
|
Vector DB CDK: Batch by documents separately for each stream and namespace (#31158)
|
2023-10-12 13:47:27 +00:00 |
|
Alexandre Girard
|
25fc396cdf
|
CDK: ThreadBasedConcurrentStream skeleton and top-level AbstractStream (#30111)
Co-authored-by: girarda <girarda@users.noreply.github.com>
Co-authored-by: Maxime Carbonneau-Leclerc <maxi297@users.noreply.github.com>
Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>
|
2023-10-11 16:46:02 -07:00 |
|
Yevhenii
|
17136a0c8a
|
CDK: Fix initialize of token_expiry_is_time_of_expiration field (#31279)
|
2023-10-11 16:35:56 +00:00 |
|
Yevhenii
|
c17fae5855
|
CDK: create new method for parsing refresh token lifespan (#30698)
Co-authored-by: yevhenii-ldv <yevhenii-ldv@users.noreply.github.com>
|
2023-10-10 17:08:41 +03:00 |
|
Ben Church
|
4c97b2994a
|
CDK: coerce read records to an iterator (#31122)
Co-authored-by: bnchrch <bnchrch@users.noreply.github.com>
|
2023-10-06 10:01:29 -07:00 |
|
Yevhenii
|
00452c9bd3
|
CDK: Enable Page Number/Offset to be set on the first request (#30978)
Co-authored-by: yevhenii-ldv <yevhenii-ldv@users.noreply.github.com>
|
2023-10-05 15:31:30 +03:00 |
|
Roman Yermilov [GL]
|
e561d5d432
|
Airbyte CDK: fix none type binary error in parquet parser (#31073)
|
2023-10-05 15:56:02 +04:00 |
|
Anton Karpets
|
767800d2d7
|
🐛Airbyte CDK: fix parsing of UUID fields in avro files (#31096)
|
2023-10-05 10:53:18 +03:00 |
|
Joe Reuter
|
07cc1c389e
|
Vector DB CDK: Fix chunk size for openai embedder (#31067)
|
2023-10-04 21:51:44 +02:00 |
|
Joe Reuter
|
a1ca73a724
|
CDK: Add helper for cloud env (#30980)
|
2023-10-02 19:11:40 +02:00 |
|
Joe Reuter
|
5ab372170b
|
Vector DB CDK: Add embedding option for openai-compatible embedding services (#30137)
|
2023-10-02 16:21:44 +00:00 |
|
Eugene Kulak
|
1e7b4b1cc5
|
🐛 CDK: fix deletion of request cache files (#30925)
Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>
|
2023-09-29 16:32:56 +00:00 |
|
Eugene Kulak
|
5eba3c3b57
|
CDK: Fix request_cache clearing and move it to tmp folder (#30719)
Co-authored-by: Eugene Kulak <kulak.eugene@gmail.com>
|
2023-09-28 21:27:40 +03:00 |
|
Marius Posta
|
7ae97175a6
|
gradle: fix repo wide behaviour (#30607)
|
2023-09-28 05:01:13 -07:00 |
|
Martin Hwasser
|
0964d0e1c9
|
🐛 Set Azure OpenAI chunk_size to 16. (#30795)
|
2023-09-27 17:12:25 +02:00 |
|
Yevhenii
|
8cdafabd82
|
Airbyte CDK: Change Error message if stream is not found (#30723)
Co-authored-by: Yevhenii Kurochkin <ykurochkin@flyaps.com>
|
2023-09-25 18:13:19 +03:00 |
|
Joe Reuter
|
7e3437f05b
|
Add chunking options to vector_db CDK (#30305)
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
|
2023-09-25 10:09:37 +00:00 |
|
Ben Church
|
5d8278900f
|
Github Action: Add format.yml workflow (#30604)
Co-authored-by: bnchrch <bnchrch@users.noreply.github.com>
Co-authored-by: octavia-approvington <octavia-approvington@users.noreply.github.com>
|
2023-09-21 18:11:41 -05:00 |
|
Maxime Carbonneau-Leclerc
|
b335880fda
|
jira invalid user-provided urls generating sentry issues (#30672)
|
2023-09-21 15:01:17 -04:00 |
|
Joe Reuter
|
a609902106
|
Vector DB CDK: Split openai embedding calls (#30512)
|
2023-09-19 14:21:13 +00:00 |
|
Edward Gao
|
8805edcea6
|
DV2: Better errors in the UI (#30491)
Co-authored-by: edgao <edgao@users.noreply.github.com>
|
2023-09-18 15:58:56 +00:00 |
|
Maxime Carbonneau-Leclerc
|
b6836ad950
|
[ISSUE #30353] remove file_type from stream config (#30453)
|
2023-09-18 08:50:00 -04:00 |
|
Maxime Carbonneau-Leclerc
|
3e41ce7cd6
|
Maxi297/fix datetime format inference issue (#30442)
|
2023-09-15 09:40:47 -04:00 |
|
Joe Reuter
|
da5b432255
|
Vector DB CDK: AzureOpenAIEmbedder (#30136)
|
2023-09-14 12:41:00 +02:00 |
|
Catherine Noll
|
6902b44b0a
|
File-based CDK: update record parse error message (#30347)
|
2023-09-13 10:45:55 -04:00 |
|
Denys Davydov
|
5248854f2e
|
CAT: changing format or airbyte_type is a breaking change (#30172)
Co-authored-by: davydov-d <davydov-d@users.noreply.github.com>
Co-authored-by: Augustin <augustin@airbyte.io>
|
2023-09-08 05:55:52 +00:00 |
|
Maxime Carbonneau-Leclerc
|
0d5ea947db
|
Formatting issue (#30223)
|
2023-09-06 21:02:22 +00:00 |
|
Maxime Carbonneau-Leclerc
|
48e8816b6b
|
[oncall #2838] migrate parsing errors as config errors (#30209)
|
2023-09-06 13:38:48 -04:00 |
|
Joe Reuter
|
f2a8bebdc5
|
Vector DB CDK: Add "from field" embedding strategy (#30140)
|
2023-09-06 14:54:17 +02:00 |
|
Joe Reuter
|
56580b70c3
|
Vector DB CDK: Better error message for misconfigured text fields (#30129)
Co-authored-by: Pedro S. Lopez <pedroslopez@me.com>
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
|
2023-09-06 10:11:56 +02:00 |
|
Maxime Carbonneau-Leclerc
|
5b653676aa
|
Update spec and fix autogenerated headers with skip after (#30123)
|
2023-09-03 09:26:53 -04:00 |
|
Maxime Carbonneau-Leclerc
|
399b4d1fca
|
File-based CDK: ensure no errors in Sentry given empty CSV (#29944)
|
2023-09-02 09:40:08 -04:00 |
|
Joe Reuter
|
7966a4e8f6
|
Vector DB CDK: Fix id generation, improve config spec, add base test case (#30081)
|
2023-09-01 15:04:10 +02:00 |
|
Maxime Carbonneau-Leclerc
|
63d3e40914
|
[ISSUE #29660] support empty keys with record selection (#30057)
|
2023-08-31 18:31:09 -04:00 |
|
Alexandre Girard
|
7264b3e1d7
|
Fix mypy issues in AbstractSource + minor refactoring (#29927)
Co-authored-by: girarda <girarda@users.noreply.github.com>
|
2023-08-31 07:35:17 -07:00 |
|
Joe Reuter
|
a6547456b9
|
Vector based CDK (#29703)
Co-authored-by: flash1293 <flash1293@users.noreply.github.com>
|
2023-08-29 16:04:32 +02:00 |
|
Maxime Carbonneau-Leclerc
|
e2fb04f72d
|
File-based CDK: allow user to provided column names (#29868)
|
2023-08-28 18:00:19 -04:00 |
|
Marius Posta
|
3e680675a4
|
github workflows: repo-wide auto-format (#29798)
Co-authored-by: postamar <postamar@users.noreply.github.com>
|
2023-08-25 10:20:41 -07:00 |
|
Maxime Carbonneau-Leclerc
|
82a96e0c69
|
File-based CDK: allow for extension mismatch (#29835)
|
2023-08-25 11:44:49 -04:00 |
|
Maxime Carbonneau-Leclerc
|
cb2796de0a
|
File-based CDK: Remove excessive logging when there are more fields i… (#29778)
|
2023-08-23 17:05:54 -04:00 |
|
Maxime Carbonneau-Leclerc
|
40b76a7813
|
✨ Source S3: v4 rollout/feature parity (#29753)
|
2023-08-23 11:30:08 -04:00 |
|
Maxime Carbonneau-Leclerc
|
b801a3d24f
|
Do not stop processing file on parsing error (#29679)
|
2023-08-21 15:56:01 -04:00 |
|
Joe Reuter
|
f8de9d12df
|
CDK: Remove list endpoint (#29581)
|
2023-08-21 12:43:44 +02:00 |
|
Joe Reuter
|
d293e1cce4
|
Embedded CDK: run a check before starting to load (#29079)
|
2023-08-21 12:42:58 +02:00 |
|
Alexandre Girard
|
b4ce532762
|
low-code: Allow formatting datetimes as milliseconds since unix epoch (#29504)
Co-authored-by: girarda <girarda@users.noreply.github.com>
|
2023-08-17 18:49:28 -07:00 |
|
Maxime Carbonneau-Leclerc
|
ec290eed8a
|
Issue 27019/doc (#29434)
Co-authored-by: Alexandre Girard <alexandre@airbyte.io>
Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>
|
2023-08-17 17:28:13 -04:00 |
|