airbyte/docs/integrations/sources/rss.md
Jared Rhizor dffbeb3d20 🎉 New Source: RSS [Python CDK] (#18838)
Co-authored-by: Vincent Koc <vincentkoc@ieee.org>
Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com>
Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
2022-11-14 10:51:18 -03:00

# RSS
## Overview
The RSS source allows you to read data from any individual RSS feed.
#### Output schema
This source is capable of syncing the following streams:
* `items`
  * Contains one record per item in the RSS feed.
  * Most fields are kept from the RSS item as strings when present (`title`, `link`, `description`, `author`, `category`, `comments`, `enclosure`, `guid`).
  * The date is handled differently: `pubDate` is converted to a UTC datetime and stored in a `published` field, which is easier to use in data warehouses and other destinations.
  * Every item in the feed you subscribe to must have a valid `pubDate` for incremental syncs to work properly.
  * Because `guid` is not a required RSS field, the stream has no primary key, only a cursor on the `published` date.
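
The field handling described above can be illustrated with a short sketch. It is not the connector's actual code: the helper name `item_to_record`, the sample feed, and the choice of the standard library's XML and RFC 822 date parsers are all assumptions made for illustration.

```python
from datetime import timezone
from email.utils import parsedate_to_datetime
import xml.etree.ElementTree as ET

# Fields copied through as strings when present (list taken from the text above).
STRING_FIELDS = [
    "title", "link", "description", "author",
    "category", "comments", "enclosure", "guid",
]


def item_to_record(item: ET.Element) -> dict:
    """Hypothetical helper: turn one <item> element into a flat record."""
    record = {}
    for name in STRING_FIELDS:
        child = item.find(name)
        # Elements without text (for example an attribute-only <enclosure>)
        # are skipped in this sketch.
        if child is not None and child.text is not None:
            record[name] = child.text.strip()

    pub_date = item.findtext("pubDate")
    if pub_date:
        # RSS 2.0 dates use the RFC 822 format; normalize them to UTC.
        parsed = parsedate_to_datetime(pub_date.strip())
        if parsed.tzinfo is None:
            # Assumption: treat dates without a usable offset as UTC.
            parsed = parsed.replace(tzinfo=timezone.utc)
        record["published"] = parsed.astimezone(timezone.utc).isoformat()
    return record


SAMPLE_FEED = """<rss version="2.0"><channel>
  <item>
    <title>Hello</title>
    <link>https://example.com/hello</link>
    <pubDate>Mon, 14 Nov 2022 10:51:18 -0300</pubDate>
  </item>
</channel></rss>"""

records = [item_to_record(i) for i in ET.fromstring(SAMPLE_FEED).iter("item")]
print(records)
```

In this sample the item has no `guid`, so the record carries none, which is why the stream cannot rely on it as a primary key.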
#### Features
| Feature | Supported? |
| :--- | :--- |
| Full Refresh Sync | Yes |
| Incremental - Append Sync | Yes |
| Namespaces | No |
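
As a rough illustration of what Incremental - Append means for a stream whose only cursor is `published`, the sketch below keeps records newer than a saved cursor and advances the cursor. The helper `select_new_records`, the shape of the state dictionary, and the decision to skip undated items are assumptions for illustration, not the connector's documented behavior.

```python
from datetime import datetime


def select_new_records(records, state):
    """Hypothetical helper: return records newer than the saved cursor.

    `state` is assumed to be {} or {"published": "<ISO 8601 UTC string>"}.
    """
    saved = state.get("published")
    cursor = datetime.fromisoformat(saved) if saved else None
    latest = cursor
    fresh = []
    for record in records:
        published = record.get("published")
        if published is None:
            # Without a date the item cannot be compared to the cursor;
            # this sketch simply skips it.
            continue
        when = datetime.fromisoformat(published)
        if cursor is None or when > cursor:
            fresh.append(record)
            if latest is None or when > latest:
                latest = when
    new_state = {"published": latest.isoformat()} if latest else {}
    return fresh, new_state


feed = [
    {"title": "old", "published": "2022-11-13T08:00:00+00:00"},
    {"title": "new", "published": "2022-11-14T13:51:18+00:00"},
    {"title": "undated"},
]

first_batch, state = select_new_records(feed, {})
second_batch, state_after = select_new_records(feed, state)
print(len(first_batch), state, len(second_batch))
```

The second call returns nothing new because the cursor has already advanced past every dated item.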
### Requirements / Setup Guide
Only the `url` of an RSS feed is required.
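
A minimal configuration therefore has a single key. The sketch below builds one and runs a basic sanity check before serializing it to JSON; the feed address is a placeholder and `check_config` is a hypothetical helper, not part of the connector.

```python
import json
from urllib.parse import urlparse


def check_config(config: dict) -> None:
    """Hypothetical pre-flight check; not part of the connector."""
    url = config.get("url")
    if not url:
        raise ValueError("config must contain a non-empty 'url'")
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an http(s) URL: {url!r}")


# Placeholder feed address; replace it with the feed you want to sync.
config = {"url": "https://example.com/feed.xml"}
check_config(config)

config_json = json.dumps(config, indent=2)
print(config_json)
```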
## Performance considerations
None
## Changelog
| Version | Date | Pull Request | Subject |
| :--- | :--- | :--- | :--- |
| 0.1.0 | 2022-10-12 | [18838](https://github.com/airbytehq/airbyte/pull/18838) | Initial release supporting RSS |