* add source-rss * update tests * add docs * fix doc link * changes to pass tests * update catalog * Update test_streams.py * fix time zone issue * update source def * auto-bump connector version Co-authored-by: Vincent Koc <vincentkoc@ieee.org> Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com> Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
38 lines
1.2 KiB
Markdown
38 lines
1.2 KiB
Markdown
# RSS
|
|
|
|
## Overview
|
|
|
|
The RSS source allows you to read data from any individual RSS feed.
|
|
|
|
#### Output schema
|
|
|
|
This source is capable of syncing the following streams:
|
|
* `items`
|
|
* Provides stats about specific RSS items.
|
|
* Most fields are simply kept from RSS items as strings if present (`title`, `link`, `description`, `author`, `category`, `comments`, `enclosure`, `guid`).
|
|
* The date field is handled differently. It's transformed into a UTC datetime in a `published` field for easier use in data warehouses and other destinations.
|
|
* The RSS feed you're subscribing to must have a valid `pubDate` field for each item for incremental syncs to work properly.
|
|
* Since `guid` is not a required field, there is no primary key for the feed, only a cursor on the published date.
|
|
|
|
#### Features
|
|
|
|
| Feature | Supported? |
|
|
| :--- | :--- |
|
|
| Full Refresh Sync | Yes |
|
|
| Incremental - Append Sync | Yes |
|
|
| Namespaces | No |
|
|
|
|
### Requirements / Setup Guide
|
|
|
|
Only the `url` of an RSS feed is required.
|
|
|
|
## Performance considerations
|
|
|
|
None
|
|
|
|
## Changelog
|
|
|
|
| Version | Date | Pull Request | Subject |
|
|
| :--- | :--- | :--- | :--- |
|
|
| 0.1.0 | 2022-10-12 | [18838](https://github.com/airbytehq/airbyte/pull/18838) | Initial release supporting RSS |
|