1
0
mirror of synced 2025-12-30 03:02:21 -05:00
Files
airbyte/docs/integrations/sources/rss.md
Jared Rhizor dffbeb3d20 🎉 New Source: RSS [Python CDK] (#18838)
* add source-rss

* update tests

* add docs

* fix doc link

* changes to pass tests

* update catalog

* Update test_streams.py

* fix time zone issue

* update source def

* auto-bump connector version

Co-authored-by: Vincent Koc <vincentkoc@ieee.org>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2022-11-14 10:51:18 -03:00

1.2 KiB

RSS

Overview

The RSS source allows you to read data from any individual RSS feed.

Output schema

This source is capable of syncing the following streams:

  • items
    • Provides stats about specific RSS items.
    • Most fields are simply kept from RSS items as strings if present (title, link, description, author, category, comments, enclosure, guid).
    • The date field is handled differently. It's transformed into a UTC datetime in a published field for easier use in data warehouses and other destinations.
    • The RSS feed you're subscribing to must have a valid pubDate field for each item for incremental syncs to work properly.
    • Since guid is not a required field, there is no primary key for the feed, only a cursor on the published date.

Features

Feature Supported?
Full Refresh Sync Yes
Incremental - Append Sync Yes
Namespaces No

Requirements / Setup Guide

Only the url of an RSS feed is required.

Performance considerations

None

Changelog

Version Date Pull Request Subject
0.1.0 2022-10-12 18838 Initial release supporting RSS