Commit Graph

147 Commits

Author SHA1 Message Date
Shadowfacts 71203435d6 Don't update feeds when starting GenServer 2024-01-31 14:02:16 -05:00
Shadowfacts fd42f20920 Don't require content for items 2023-12-03 21:33:32 -05:00
Shadowfacts 60a2dcd73f Fix Verge extractor failing for features 2023-12-03 21:30:52 -05:00
Shadowfacts 3e6211c9ba Fix unused binding warnings 2023-12-03 21:28:24 -05:00
Shadowfacts fec640a37a Improve The Verge extractor 2023-07-12 22:11:31 -07:00
Shadowfacts b0089083db Fix inline script tags not being removed 2023-07-12 20:50:17 -07:00
Shadowfacts 2dda6d7f46 Add OIDC login 2023-06-25 15:19:11 -07:00
Shadowfacts 1f94e9080d Filter more things out of Slate and The Verge 2023-06-25 14:12:15 -07:00
Shadowfacts 6dd4f3ca82 Add ELB extractor 2023-06-25 14:06:18 -07:00
Shadowfacts 53cbe0a7e9 Update things, fix warnings 2023-06-25 14:03:16 -07:00
Shadowfacts 86d7ffc7d9 Make regex filters case insensitive 2022-12-05 10:58:49 -05:00
Shadowfacts d7a37b5c64 Fix feed retry handler being handle_cast instead of handle_info 2022-10-24 09:57:54 -04:00
Shadowfacts ff3d9affe5 Fix error when parsing updating feeds without item guids 2022-09-19 10:02:11 -04:00
Shadowfacts 0e90ee527c Remove Sentry breadcrumbs from network requests
There ends up being a multi-hour gap between the breadcrumbs listed and
the error itself, so they're useless.
2022-09-14 18:13:09 -04:00
Shadowfacts e7184a2535 Add extractor for The Verge 2022-09-14 17:47:22 -04:00
Shadowfacts b9be2879ed Fix srcsets overriding rewritten image srcs 2022-07-17 15:13:13 -04:00
Shadowfacts 852db1520f Add birchtree.me extractor 2022-07-17 15:13:08 -04:00
Shadowfacts 87ea1f5624 Ignore errors when there's no feed data 2022-04-18 18:20:46 -04:00
Shadowfacts a7a296b342 Exponential backoff retries for fetching feeds 2022-04-18 18:14:58 -04:00
Shadowfacts bbc729b5ca Handle gzipped http responses 2022-04-18 17:23:26 -04:00
Shadowfacts f0299639e2 Daring Fireball: strip dd tag 2022-01-15 14:53:03 -05:00
Shadowfacts 7b9956a1aa Fix fervor item schema 2022-01-12 18:01:35 -05:00
Shadowfacts 37a802b7a8 Don't put content from builtin extractor through readable_html twice 2021-11-06 12:01:23 -04:00
Shadowfacts d2d4651f1d Add Ars Technica extractor for multi-page articles 2021-11-06 12:00:35 -04:00
Shadowfacts e84ebc473a Add support for external readability implementation 2021-11-06 12:00:35 -04:00
Shadowfacts f1435611ef Switch fervor api to use string ids 2021-10-30 13:52:27 -04:00
Shadowfacts e3ec1d6040
Fix missing clause in scrape stage 2021-10-22 16:20:50 -04:00
Shadowfacts b1c0ba3998
Switch to Mint 2021-10-22 16:17:27 -04:00
Shadowfacts 5ece9cd21c
Yet more Sentry 2021-09-22 19:54:45 -04:00
Shadowfacts 6916647737
Don't try to convert data URIs to data URIs 2021-09-22 19:46:07 -04:00
Shadowfacts 1c2ef3bc51
More Sentry logging 2021-09-22 15:06:45 -04:00
Shadowfacts 64162fee92
Fix not handling non-200 HTTP codes when fetching feed 2021-09-22 14:41:29 -04:00
Shadowfacts fce1bf6c2f
Add Sentry 2021-09-22 13:59:44 -04:00
Shadowfacts 6e0271bf4b
Slate extractor: strip newsletter signup form 2021-09-19 22:32:10 -04:00
Shadowfacts 3b12f62379
Fix items w/o guids getting duplicated on every update 2021-09-12 19:55:19 -04:00
Shadowfacts 4d7843ee5f
Add force update feeds 2021-09-08 20:17:33 -04:00
Shadowfacts ddceb28803
Fix update feeds thinking all items already exist
Repo.exists? needs a query, not the keyword list
2021-09-08 20:10:19 -04:00
Shadowfacts a02ec174be
Use unique index to prevent duplicate items from being created 2021-09-08 09:35:46 -04:00
Shadowfacts cd36b40978
Fix double refreshing feeds 2021-09-08 09:35:37 -04:00
Shadowfacts 162ba74dde
Extract authors from feeds 2021-09-03 17:09:16 -04:00
Shadowfacts 5990d0e4c2
Add Slate extractor 2021-09-03 17:09:10 -04:00
Shadowfacts a85dca5b3d
Add filtering by item content 2021-08-28 12:17:16 -04:00
Shadowfacts 5c8baa2057
Generalize WP lazy-loading stripper 2021-03-31 20:19:01 -04:00
Shadowfacts 0593fcdb9a
Switch to hackney via Tesla 2021-03-31 19:33:19 -04:00
Shadowfacts 33d1cac5e1
Recover from errors in custom extractors 2021-03-31 15:30:17 -04:00
Shadowfacts 26b832b622
Fix whatever.scalzi.com extractor 2021-03-31 15:30:05 -04:00
Shadowfacts 0ded09a65d
Fix redirect handling not working with HTTPoison 2021-03-31 15:29:52 -04:00
Shadowfacts e10a614f3e
Switch back to HTTPoison 2021-03-31 14:43:59 -04:00
Shadowfacts 8e18a415eb
Fix error when attempting to convert image w/o Content-Type header to data URI 2020-10-24 13:37:06 -04:00
Shadowfacts 3bbf42df75
Fix error when sending never-updated feeds via Fever API 2020-10-24 13:32:00 -04:00