Commit Graph

24 Commits

Author SHA1 Message Date
Shadowfacts 6916647737
Don't try to convert data URIs to data URIs 2021-09-22 19:46:07 -04:00
Shadowfacts fce1bf6c2f
Add Sentry 2021-09-22 13:59:44 -04:00
Shadowfacts 5990d0e4c2
Add Slate extractor 2021-09-03 17:09:10 -04:00
Shadowfacts 0593fcdb9a
Switch to hackney via Tesla 2021-03-31 19:33:19 -04:00
Shadowfacts 33d1cac5e1
Recover from errors in custom extractors 2021-03-31 15:30:17 -04:00
Shadowfacts e10a614f3e
Switch back to HTTPoison 2021-03-31 14:43:59 -04:00
Shadowfacts 8e18a415eb
Fix error when attempting to convert image w/o Content-Type header to data URI 2020-10-24 13:37:06 -04:00
Shadowfacts 1beff21fc5
Switch to Mojito for HTTP requests 2020-09-11 19:15:19 -04:00
Shadowfacts 4f16933198
Add gemini protocol feed fetching 2020-07-18 19:27:53 -04:00
Shadowfacts fc2b8f6036
Add basic LiveView pipeline editor, scrape stage config editing 2020-06-08 22:49:45 -04:00
Shadowfacts 4cccab8df0
Remove old code 2020-06-01 18:30:59 -04:00
Shadowfacts c37bed932f
Fix pipeline validation not working 2020-05-31 15:56:27 -04:00
Shadowfacts 4a09ce1cb0
Fix scraping images w/ URLs w/o schemes 2020-02-17 12:09:03 -05:00
Shadowfacts e684737fcd
Implement basic favicon scraping 2019-11-10 14:23:07 -05:00
Shadowfacts c9cc9f2428
Fix crash while scraping images 2019-11-01 18:29:41 -04:00
Shadowfacts 5d38d9567e
Fix error while validating scrape stage options 2019-11-01 18:27:08 -04:00
Shadowfacts 3bc37952d1
Add option to convert images in article content to data URIs 2019-10-31 21:59:55 -04:00
Shadowfacts cfd9f7505a
Rewrite image URLs without hosts to use the host of the article URL 2019-10-31 17:38:16 -04:00
Shadowfacts eec0b918e7
Change extractors to accept/return html trees 2019-10-31 17:12:02 -04:00
Shadowfacts 3192969889
Replace site-specific pipeline stages with new extractor architecture 2019-10-31 16:45:52 -04:00
Shadowfacts 1015fd5162
Add types, Dialyzer, fix Dialyzer warnings 2019-08-30 19:31:38 -04:00
Shadowfacts e55a694194
Add Daring Fireball scraper 2019-07-21 19:04:43 -04:00
Shadowfacts 17310911ce
Add pipeline stage option validation/error reporting 2019-07-21 12:21:28 -04:00
Shadowfacts 0a1909dbc4
Start pipeline system 2019-07-08 22:45:02 -04:00