1
0
mirror of synced 2026-01-23 22:01:53 -05:00
Commit Graph

15 Commits

Author SHA1 Message Date
Artem Inzhyyants
df34893b63 feat(airbyte-cdk): replace pydantic BaseModel with dataclasses + serpyco-rs in protocol (#44444)
Signed-off-by: Artem Inzhyyants <artem.inzhyyants@gmail.com>
2024-09-02 17:48:17 +02:00
Ella Rohm-Ensing
fc12432305 airbyte-cdk: only update airbyte-protocol-models to pydantic v2 (#39524)
## What

Migrating Pydantic V2 for Protocol Messages to speed up emitting records. This gives us 2.5x boost over V1. 

Close https://github.com/airbytehq/airbyte-internal-issues/issues/8333

## How
- Switch to using protocol models generated for pydantic_v2, in a new (temporary) package, `airbyte-protocol-models-pdv2` .
- Update pydantic dependency of the CDK accordingly to v2.
- For minimal impact, still use the compatibility code `pydantic.v1` in all of our pydantic code from airbyte-cdk that does not interact with the protocol models.

## Review guide
1. Checkout the code and clear your CDK virtual env (either `rm -rf .venv && python -m venv .venv` or `poetry env list; poetry env remove <env>`. This is necessary to fully clean out the `airbyte_protocol` library, for some reason. Then: `poetry lock --no-update && poetry install --all-extras`. This should install the CDK with new models. 
2. Run unit tests on the CDK
3. Take your favorite connector and point it's `pyproject.toml` on local CDK (see example in `source-s3`) and try running it's tests and it's regression tests.

## User Impact

> [!warning]
> This is a major CDK change due to the pydantic dependency change - if connectors use pydantic 1.10, they will break and will need to do similar `from pydantic.v1` updates to get running again. Therefore, we should release this as a major CDK version bump.

## Can this PR be safely reverted and rolled back?
- [x] YES 💚
- [ ] NO 

Even if sources migrate to this version, state format should not change, so a revert should be possible.

## Follow up work - Ella to move into issues

<details>

### Source-s3 - turn this into an issue
- [ ] Update source s3 CDK version and any required code changes
- [ ] Fix source-s3 unit tests
- [ ] Run source-s3 regression tests
- [ ] Merge and release source-s3 by June 21st

### Docs
- [ ] Update documentation on how to build with CDK 

### CDK pieces
- [ ] Update file-based CDK format validation to use Pydantic V2
  - This is doable, and requires a breaking change to change `OneOfOptionConfig`. There are a few unhandled test cases that present issues we're unsure of how to handle so far.
- [ ] Update low-code component generators to use Pydantic V2
  - This is doable, there are a few issues around custom component generation that are unhandled.

### Further CDK performance work - create issues for these
- [ ] Research if we can replace prints with buffered output (write to byte buffer and then flush to stdout)
- [ ] Replace `json` with `orjson`
...

</details>
2024-06-21 01:53:44 +02:00
Maxime Carbonneau-Leclerc
9b1d7205f3 Add discover to entrypoint wrapper (#39396) 2024-06-11 10:18:31 -04:00
Aldo Gonzalez
e7508a4572 Airbyte CDK: Add delete method to HttpMocker (#38169) 2024-05-14 08:06:18 -06:00
Brian Lai
b1f3b8a36f [airbyte-cdk] entrypoint wrapper should use per-stream state not legacy format (#35976) 2024-03-11 19:09:08 -04:00
Artem Inzhyyants
0954ad3d3a Airbyte CDK: add interpolation for request options (#35485)
Signed-off-by: Artem Inzhyyants <artem.inzhyyants@gmail.com>
Co-authored-by: Alexandre Girard <alexandre@airbyte.io>
2024-02-22 19:40:44 +01:00
Maxime Carbonneau-Leclerc
60a2618154 [ISSUE #34910] add headers to HttpResponse for test framework (#35105) 2024-02-09 12:19:29 -05:00
Maxime Carbonneau-Leclerc
ca8590e2b4 Have StateBuilder return our actual state object and not simply a dict (#34625) 2024-01-30 08:46:03 -05:00
Anton Karpets
8305d05e52 Airbyte CDK: add POST method to HttpMocker (#34001)
Co-authored-by: maxi297 <maxime@airbyte.io>
2024-01-10 12:44:58 +02:00
Maxime Carbonneau-Leclerc
66edb4b0f0 Issue 32871/more integration test tooling to test events stream (#33305)
Co-authored-by: Catherine Noll <clnoll@users.noreply.github.com>
2023-12-14 09:23:19 -05:00
Maxime Carbonneau-Leclerc
0c2d43fdf9 Issue 32871/extract trace message creation (#33227) 2023-12-11 09:20:45 -05:00
Maxime Carbonneau-Leclerc
d3f2aa548a [ISSUE #33202] allow for loose query params validation (#33226) 2023-12-11 08:57:41 -05:00
Augustin
0b33caecda Revert "[skip ci] formatting: add missing license headers (#33250)" (#33289) 2023-12-11 11:38:37 +01:00
Augustin
60c1cc01ad [skip ci] formatting: add missing license headers (#33250) 2023-12-11 10:15:18 +01:00
Maxime Carbonneau-Leclerc
69cb3a571e [ISSUE #32868] create HttpMocker (#32937) and [ISSUE #32869] response builder (#32983)
Co-authored-by: octavia-approvington <octavia-approvington@users.noreply.github.com>
2023-12-05 08:48:45 -05:00