* add source-rss * update tests * add docs * fix doc link * changes to pass tests * update catalog * Update test_streams.py * fix time zone issue * update source def * auto-bump connector version Co-authored-by: Vincent Koc <vincentkoc@ieee.org> Co-authored-by: Octavia Squidington III <octavia-squidington-iii@users.noreply.github.com> Co-authored-by: Marcos Marx <marcosmarxm@users.noreply.github.com>
1.2 KiB
1.2 KiB
RSS
Overview
The RSS source allows you to read data from any individual RSS feed.
Output schema
This source is capable of syncing the following streams:
items- Provides stats about specific RSS items.
- Most fields are simply kept from RSS items as strings if present (
title,link,description,author,category,comments,enclosure,guid). - The date field is handled differently. It's transformed into a UTC datetime in a
publishedfield for easier use in data warehouses and other destinations. - The RSS feed you're subscribing to must have a valid
pubDatefield for each item for incremental syncs to work properly. - Since
guidis not a required field, there is no primary key for the feed, only a cursor on the published date.
Features
| Feature | Supported? |
|---|---|
| Full Refresh Sync | Yes |
| Incremental - Append Sync | Yes |
| Namespaces | No |
Requirements / Setup Guide
Only the url of an RSS feed is required.
Performance considerations
None
Changelog
| Version | Date | Pull Request | Subject |
|---|---|---|---|
| 0.1.0 | 2022-10-12 | 18838 | Initial release supporting RSS |