Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Proxy RSS reads through a static IP proxy like QuotaGuard #3551

Open
2 of 4 tasks
jmduke opened this issue Oct 13, 2024 · 3 comments
Open
2 of 4 tasks

Proxy RSS reads through a static IP proxy like QuotaGuard #3551

jmduke opened this issue Oct 13, 2024 · 3 comments
Assignees
Labels
Module: RSS-to-Email Type: Stability Work that will make Buttondown more reliable and decrease manual/operator labor for staff.

Comments

@jmduke
Copy link
Member

jmduke commented Oct 13, 2024

THE PROBLEM: Cloudflare sometimes blocks our RSS fetches because we're a bot (which is fair — we are!).

THE SOLUTION: We register with them as a cool, chill, friendly bot

THE PROBLEM: To verify our incoming traffic, they need an allowlist of IPs — and Heroku doesn't give us sacrosanct ones!

This will let us submit Buttondown as a registered crawler to Cloudflare, which should obviate some issues we have with CF blocking our requests.

I think we can just use Fixie or some similar option (the overall bandwidth is pretty low); the HTTP call is in retrieve_items.

(We will also get to use this for the Validity stuff, if we need to!)

Tasks

@jmduke jmduke added Type: Stability Work that will make Buttondown more reliable and decrease manual/operator labor for staff. Module: RSS-to-Email labels Oct 13, 2024
@catdevnull
Copy link

https://developers.cloudflare.com/bots/reference/verified-bots-policy/
A bot or proxy must have a minimum amount of traffic for Cloudflare to be able find it in the sampled data. The minimum traffic should have more than 1000 requests per day across multiple domains.

do we do this much traffic?

user-agent

ButtondownBot/1.0 or are we already using something else?

  • also, we'll need to start checking robots.txt :/
  • we need to document our bot, do we have something like that already?

@jmduke
Copy link
Member Author

jmduke commented Oct 16, 2024

do we do this much traffic?

yup!

user-agent

we are! and lightweight docs here: https://docs.buttondown.com/rss-to-email#troubleshooting, but can be expanded.

@catdevnull
Copy link

okay, I submitted the bot. now we have to wait. I think they will email you @jmduke

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Module: RSS-to-Email Type: Stability Work that will make Buttondown more reliable and decrease manual/operator labor for staff.
Projects
Status: In progress
Development

No branches or pull requests

2 participants