🧪 Skills

News Fetcher

Install, configure, validate, and run the news-fetcher Python CLI for aggregating RSS/Atom and HTML news sources with deduplication, clustering, ranking, sou...

v0.1.4
❤️ 0
⬇️ 157
👁 1
Share

Description


name: news-fetcher description: Install, configure, validate, and run the news-fetcher Python CLI for aggregating RSS/Atom and HTML news sources with deduplication, clustering, ranking, source diversity, and summaries. Use when an agent needs to fetch news, create or validate a config, troubleshoot news-fetcher installation, or produce JSON/Markdown/CSV/RSS output from multiple sources.

News Fetcher

Use this skill to get a working news-fetcher installation and run it correctly.

Important

  • Installing the ClawHub skill does not install the Python package.
  • Install the Python package separately with pip.
  • Put global options before run.

Correct:

news-fetcher --config config.yaml --limit 10 run

Wrong:

news-fetcher run --config config.yaml --limit 10

Minimal working install

python3 -m venv .venv
. .venv/bin/activate
pip install --upgrade pip
pip install "git+https://github.com/miniade/news-fetcher.git@v0.1.4"
news-fetcher version

Expect news-fetcher version 0.1.4.

Minimal working config

Generate a starter config:

news-fetcher config example > config.yaml

Or create one manually:

sources:
  - name: BBC News
    url: http://feeds.bbci.co.uk/news/rss.xml
    weight: 1.0
    type: rss

  - name: Reuters Tech
    url: https://www.reutersagency.com/feed/?best-topics=tech
    weight: 1.2
    type: rss

  - name: Example HTML Source
    url: https://example.com/news
    weight: 0.9
    type: html
    selector: main article

thresholds:
  similarity: 0.8
  min_score: 0.3
  cluster_size: 2
  max_per_source: 3

weights:
  content: 0.6
  source: 0.2
  publish_time: 0.2

Validate it:

news-fetcher config validate config.yaml

Common commands

Run with a config:

news-fetcher --config config.yaml --limit 20 run

Write Markdown output:

news-fetcher --config config.yaml --format markdown --output news.md run

Filter by time:

news-fetcher --config config.yaml --since 2026-03-01T00:00:00 run

Raise the score threshold:

news-fetcher --config config.yaml --min-score 0.5 run

Override sources directly from the CLI:

news-fetcher --sources "http://feeds.bbci.co.uk/news/rss.xml,https://news.ycombinator.com/rss" --limit 10 run

HTML sources

For type: html sources, set selector when the site does not use clean <article> blocks.

Example:

sources:
  - name: Example HTML Source
    url: https://example.com/news
    type: html
    selector: .story-card

Troubleshooting

If news-fetcher is missing after installing the skill, install the Python package:

pip install "git+https://github.com/miniade/news-fetcher.git@v0.1.4"

If config validation fails, run:

news-fetcher config validate config.yaml

If results are too sparse, lower thresholds.min_score or raise thresholds.max_per_source.

References

Reviews (0)

Sign in to write a review.

No reviews yet. Be the first to review!

Comments (0)

Sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Compatible Platforms

Pricing

Free

Related Configs