mirror of synced 2026-01-14 12:07:57 -05:00
Commit Graph

1085 Commits

Author SHA1 Message Date
brianjlai
d30560ec6a 🤖 patch bump Python CDK to version 3.4.1 2024-07-12 23:11:31 +00:00
Brian Lai
e456bca099 🐛 [RFR for API Sources] Fix bug where checkpoint reader stops syncing too early if first partition is complete (#41658) 2024-07-12 19:00:25 -04:00
lazebnyi
8382098e54 🤖 minor bump Python CDK to version 3.4.0 2024-07-11 13:19:46 +00:00
Anton Karpets
6c439a8859 [file-based cdk]: add config option to limit number of files for schema discover (#39317)
Co-authored-by: askarpets <anton.karpets@globallogic.com>
Co-authored-by: Serhii Lazebnyi <serhii.lazebnyi@globallogic.com>
Co-authored-by: Serhii Lazebnyi <53845333+lazebnyi@users.noreply.github.com>
2024-07-11 15:16:09 +02:00
Brian Lai
9e23b3f89b 🐛 [airbyte-cdk] Fix bug where substreams depending on an RFR parent stream don't paginate or use existing state (#40671) 2024-07-11 02:53:20 -04:00
lazebnyi
e556bbbefe 🤖 minor bump Python CDK to version 3.3.0 2024-07-10 21:22:06 +00:00
Serhii Lazebnyi
f9b5d5b1a7 [airbyte-cdk] add incomplete status to availability check during read (#41034) 2024-07-10 23:18:28 +02:00
lazebnyi
783c85e476 🤖 patch bump Python CDK to version 3.2.1 2024-07-10 15:54:57 +00:00
Serhii Lazebnyi
338f33a435 [airbyte-cdk] follow-up #40681 - flush buffer for each RATE_LIMITED message print (#41463) 2024-07-10 17:50:57 +02:00
lazebnyi
67490ebc48 🤖 minor bump Python CDK to version 3.2.0 2024-07-10 12:04:38 +00:00
Serhii Lazebnyi
f00ed4a925 [airbyte-cdk] add running stream status with rate limit reason to backoff approach (#40681) 2024-07-10 14:00:01 +02:00
lazebnyi
4d32562cb3 🤖 minor bump Python CDK to version 3.1.0 2024-07-10 11:15:37 +00:00
Serhii Lazebnyi
bc60a740a2 [airbyte-cdk] add incomplete stream status to nonexistent stream handling (#40568) 2024-07-10 13:10:30 +02:00
artem1205
d1d365ef17 🤖 major bump Python CDK to version 3.0.0 2024-07-09 10:05:13 +00:00
Artem Inzhyyants
02c5f59ccf ref(airbyte-cdk): use http_client inside HttpStream (#39811)
Signed-off-by: Artem Inzhyyants <artem.inzhyyants@gmail.com>
2024-07-09 12:01:03 +02:00
Boris Staal
1c4ff2ae59 build(airbyte-ci): Test python versions matrix (#40731)
Co-authored-by: Augustin <augustin@airbyte.io>
2024-07-09 11:14:28 +02:00
cmm-airbyte
8be1b6ef1e 🤖 minor bump Python CDK to version 2.4.0 2024-07-08 09:04:15 +00:00
Cristina Mariscal
c4b8212ba7 CDK: Add support for input format parsing at jinja macro format_datetime (#40759)
Co-authored-by: cristina.mariscal <cristina.mariscal@cristina.mariscal--MacBook-Pro---DFJ27FJFXX>
2024-07-08 08:42:09 +00:00
bazarnov
3bc5b512af 🤖 patch bump Python CDK to version 2.3.1 2024-07-08 07:15:32 +00:00
Baz
0c237d81d0 🐛 [CDK, Declarative Source]: fix bug when type is missing for anyOf in nested arrays (#40667) 2024-07-08 10:09:02 +03:00
Augustin
1eac62ff3f airbyte-cdk: run poetry check in CI (#39404) 2024-07-05 15:35:37 -07:00
aldogonzalez8
cb6f2c51b3 🤖 minor bump Python CDK to version 2.3.0 2024-07-05 15:52:37 +00:00
aldogonzalez8
2106854ebc 🤖 minor bump Python CDK to version 2.2.0 2024-07-05 15:29:14 +00:00
Cristina Mariscal
a8e985b7a0 Revert "CDK: Add jinja macro format_datetime_string" (#40747) 2024-07-05 15:04:32 +00:00
aldogonzalez8
c3fe18ea9d 🤖 minor bump Python CDK to version 2.1.0 2024-07-05 14:33:38 +00:00
Aldo Gonzalez
1422786282 feat(Airbyte CDK): add with_json_schema method to ConfiguredAirbyteStreamBuilder (#40737) 2024-07-05 08:27:57 -06:00
Cristina Mariscal
b9c213a473 CDK: Add jinja macro format_datetime_string (#40744) 2024-07-05 13:19:43 +00:00
Natik Gadzhi
4a06230436 feat(python cdk): Allow regex_search in jinja interpolations (#40696)
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
2024-07-03 23:21:51 +00:00
Boris Staal
6dd9b7ab25 chore(cdk): Avoid using time.sleep in unit tests for backoff of http stream (#40239)
Co-authored-by: Alexandre Girard <alexandre@airbyte.io>
2024-07-02 11:47:56 -07:00
Maxime Carbonneau-Leclerc
2b7ef3fb25 Validate error handler fallback (#40570)
Co-authored-by: Serhii Lazebnyi <serhii.lazebnyi@globallogic.com>
2024-06-27 17:03:03 -04:00
Boris Staal
4f6c29a9ca chore(cdk): @deprecated for experimental classes made more explicit and filtered in tests (#40240) 2024-06-26 18:15:41 -07:00
erohmensing
7cfb9e9d11 🤖 major bump Python CDK to version 2.0.0 2024-06-21 00:00:47 +00:00
Ella Rohm-Ensing
fc12432305 airbyte-cdk: only update airbyte-protocol-models to pydantic v2 (#39524)
## What

Migrate protocol messages to Pydantic V2 to speed up emitting records. This gives us a 2.5x boost over V1.

Close https://github.com/airbytehq/airbyte-internal-issues/issues/8333

## How
- Switch to using protocol models generated for pydantic_v2, in a new (temporary) package, `airbyte-protocol-models-pdv2` .
- Update pydantic dependency of the CDK accordingly to v2.
- For minimal impact, still use the compatibility code `pydantic.v1` in all of our pydantic code from airbyte-cdk that does not interact with the protocol models.

## Review guide
1. Check out the code and clear your CDK virtual env (either `rm -rf .venv && python -m venv .venv` or `poetry env list; poetry env remove <env>`). This is necessary to fully clean out the `airbyte_protocol` library, for some reason. Then run `poetry lock --no-update && poetry install --all-extras`. This should install the CDK with the new models.
2. Run unit tests on the CDK
3. Take your favorite connector, point its `pyproject.toml` at the local CDK (see the example in `source-s3`), and try running its tests and its regression tests.

## User Impact

> [!warning]
> This is a major CDK change due to the pydantic dependency change - if connectors use pydantic 1.10, they will break and will need to do similar `from pydantic.v1` updates to get running again. Therefore, we should release this as a major CDK version bump.

## Can this PR be safely reverted and rolled back?
- [x] YES 💚
- [ ] NO 

Even if sources migrate to this version, state format should not change, so a revert should be possible.

## Follow up work - Ella to move into issues

<details>

### Source-s3 - turn this into an issue
- [ ] Update source s3 CDK version and any required code changes
- [ ] Fix source-s3 unit tests
- [ ] Run source-s3 regression tests
- [ ] Merge and release source-s3 by June 21st

### Docs
- [ ] Update documentation on how to build with CDK 

### CDK pieces
- [ ] Update file-based CDK format validation to use Pydantic V2
  - This is doable, and requires a breaking change to change `OneOfOptionConfig`. There are a few unhandled test cases that present issues we're unsure of how to handle so far.
- [ ] Update low-code component generators to use Pydantic V2
  - This is doable, there are a few issues around custom component generation that are unhandled.

### Further CDK performance work - create issues for these
- [ ] Research if we can replace prints with buffered output (write to byte buffer and then flush to stdout)
- [ ] Replace `json` with `orjson`
...

</details>
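
The buffered-output follow-up above (write to a byte buffer, then flush to stdout) could look roughly like the sketch below. This is only an illustration of the idea, not code from the CDK: the function name `emit_buffered` and the `flush_every` parameter are hypothetical, and it uses the standard `json` module rather than `orjson`.

```python
import io
import json
import sys


def emit_buffered(records, out=None, flush_every=1000):
    """Serialize records into an in-memory byte buffer and write them in batches.

    Hypothetical helper: `out` defaults to the binary stdout stream, and
    `flush_every` is the number of records accumulated before each write.
    Returns the number of flushes performed.
    """
    out = out if out is not None else sys.stdout.buffer
    buf = io.BytesIO()
    pending = 0
    flushes = 0
    for record in records:
        # One JSON document per line, encoded once into the buffer.
        buf.write(json.dumps(record).encode("utf-8"))
        buf.write(b"\n")
        pending += 1
        if pending >= flush_every:
            out.write(buf.getvalue())
            out.flush()
            # Reset the buffer for the next batch.
            buf.seek(0)
            buf.truncate()
            pending = 0
            flushes += 1
    if pending:
        # Write whatever is left after the last full batch.
        out.write(buf.getvalue())
        out.flush()
        flushes += 1
    return flushes
```

The point of the design is to replace one `print` call per record with one write per batch; whether that is actually faster for a given connector would still need to be measured.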
2024-06-21 01:53:44 +02:00
Augustin
6d42ecafb0 Augustin/protocolv2 (#39863)
## What

Separate out the `datamodel-codegen` workflow into a dagger workflow. This enables us to, upstack, properly generate the same v1 models as previously. Unfortunately datamodel-codegen's "pydantic v1" output on its v2 versions doesn't output what one would expect - see [issue](https://github.com/koxudaxi/datamodel-code-generator/issues/1950) (thanks AJ!). 

## How
* Convert the script from bash to python (in dagger) and run it via a shell script (to install dagger)

## User Impact
None. The development experience is also unchanged.

## Can this PR be safely reverted and rolled back?
- [x] YES 💚
- [ ] NO 
2024-06-21 01:36:43 +02:00
lazebnyi
2ae1e4c2a1 🤖 minor bump Python CDK to version 1.8.0 2024-06-19 23:16:59 +00:00
Serhii Lazebnyi
a284676a4d feat(airbyte-cdk): add DatetimeIntervalCursor (#39603) 2024-06-20 01:11:13 +02:00
maxi297
9f95601317 🤖 patch bump Python CDK to version 1.7.3 2024-06-19 14:39:39 +00:00
maxi297
0673e8e01b 🤖 patch bump Python CDK to version 1.7.2 2024-06-19 14:03:14 +00:00
maxi297
fbc4ebbf61 🤖 patch bump Python CDK to version 1.7.1 2024-06-19 13:32:18 +00:00
Maxime Carbonneau-Leclerc
0386ca21ae Exclude airbyte-cdk modules from schema discovery (#39586) 2024-06-19 09:29:45 -04:00
artem1205
58b7b7faa6 🤖 minor bump Python CDK to version 1.7.0 2024-06-19 09:25:30 +00:00
artem1205
d4492f171b 🤖 minor bump Python CDK to version 1.6.0 2024-06-19 09:15:20 +00:00
Artem Inzhyyants
f49c8054ad feat(airbyte-cdk): add json_schema from ConfiguredCatalog to Stream (#39522)
Signed-off-by: Artem Inzhyyants <artem.inzhyyants@gmail.com>
2024-06-19 11:12:33 +02:00
maxi297
a026ee203d 🤖 patch bump Python CDK to version 1.5.6 2024-06-19 03:25:14 +00:00
maxi297
ad0abea35e 🤖 patch bump Python CDK to version 1.5.5 2024-06-19 01:57:28 +00:00
maxi297
dc30e1cfd3 🤖 patch bump Python CDK to version 1.5.4 2024-06-19 01:50:28 +00:00
Maxime Carbonneau-Leclerc
5b4bd3485d Allow access to _partition for source-jira (#39576) 2024-06-18 21:47:30 -04:00
maxi297
92d48432c1 🤖 patch bump Python CDK to version 1.5.3 2024-06-18 21:29:40 +00:00
Maxime Carbonneau-Leclerc
ad5d2b48a2 Ensure error message is the same after migration to HttpClient (#39565) 2024-06-18 17:26:48 -04:00
maxi297
94067de9c1 🤖 patch bump Python CDK to version 1.5.2 2024-06-18 17:02:24 +00:00