ClickHouse Migration Guide
Upgrading to 2.0.0
Version 2.0.0 changes both how and where your data is written. Whereas 1.0.0 wrote
all your data as JSON to raw tables in the airbyte_internal database, 2.0.0
separates your schema into typed columns and writes each stream to the database
specified in the configuration, under the un-prefixed table name. You will no longer see
airbyte_internal.{database}_raw__stream_{table} and will instead see
{database}.{table}.
While this is treated as a "breaking change", connections should continue to function with no changes, albeit writing data to a completely different location and in a different form. Any downstream pipelines will therefore need updating to ingest the new data location and format.
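For downstream pipelines that build table names in code, the rename can be sketched as a pair of helpers. This is only an illustration of the two naming patterns quoted above; the function names and the example values `analytics` and `users` are hypothetical, and your actual raw table names may differ depending on configuration.

```python
def old_raw_table(database: str, table: str) -> str:
    """Name of the raw JSON table that 1.0.0 wrote for a stream (default pattern)."""
    return f"airbyte_internal.{database}_raw__stream_{table}"


def new_typed_table(database: str, table: str) -> str:
    """Name of the typed table that 2.0.0 writes for the same stream."""
    return f"{database}.{table}"


# Hypothetical database and stream names, for illustration only.
print(old_raw_table("analytics", "users"))
print(new_typed_table("analytics", "users"))
```

With these example values, the old location is `airbyte_internal.analytics_raw__stream_users` and the new one is `analytics.users`.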
Gotchas
- If the "Hostname" property in your configuration contains the protocol ("http" or "https"), you need to remove it.
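If you manage many connection configurations, a small helper along these lines could normalize the "Hostname" value before you paste it back in. It is a minimal sketch, not part of the connector: the function name `strip_protocol` and the example host are hypothetical, and it only handles the "http" and "https" prefixes mentioned above.

```python
def strip_protocol(hostname: str) -> str:
    """Return the hostname without a leading http:// or https:// prefix."""
    cleaned = hostname.strip()
    for prefix in ("http://", "https://"):
        # Compare case-insensitively, but slice the original string.
        if cleaned.lower().startswith(prefix):
            return cleaned[len(prefix):]
    return cleaned


# Hypothetical host value, for illustration only.
print(strip_protocol("https://clickhouse.example.com"))
```

A value that has no protocol prefix is returned unchanged apart from surrounding whitespace.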
Migrating existing data to the new format
Unfortunately, Airbyte has no way to migrate the existing raw tables to the new typed format. The only "out of the box" way to get your data into the new format is to re-sync it from scratch.
Removing the old tables
Because the new destination has no knowledge of the old destination's table
naming semantics, we will not remove existing data. If you would like to remove
it, you will need to delete all the tables saved in the old format yourself. For most
people these should be under airbyte_internal.{database}_raw__, but this may vary based
on your specific configuration.
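One way to prepare the cleanup is to generate the DROP statements first and review them before running anything. The sketch below assumes the default airbyte_internal.{database}_raw__ prefix described above and that you have already collected the table names yourself, for example with `SELECT name FROM system.tables WHERE database = 'airbyte_internal'`. The function name `drop_statements` and the sample table names are hypothetical.

```python
def drop_statements(table_names, database, internal_db="airbyte_internal"):
    """Build DROP TABLE statements for old-format raw tables of one database.

    Only names starting with "{database}_raw__" are selected; everything
    else in the internal database is left alone.
    """
    prefix = f"{database}_raw__"
    return [
        f"DROP TABLE IF EXISTS `{internal_db}`.`{name}`"
        for name in sorted(table_names)
        if name.startswith(prefix)
    ]


# Hypothetical table listing, for illustration only.
existing = [
    "analytics_raw__stream_users",
    "analytics_raw__stream_orders",
    "unrelated_table",
]
for statement in drop_statements(existing, "analytics"):
    print(statement)
```

The helper only produces text, so nothing is deleted until you run the statements yourself. If your configuration used a different prefix, adjust the filter accordingly and check the output before executing it.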