A web-crawling news aggregation engine using Python and Beautiful Soup, and a Flask application that spits out a slightly too large static frontend from the crawled data.
Live and refreshed hourly at https://minim.jedd.pw
- BBC NEWS UK (http://feeds.bbci.co.uk/news/rss.xml?edition=uk)
- BBC NEWS US (http://feeds.bbci.co.uk/news/rss.xml?edition=us)
- VICE (http://vice.com/rss)
- VOX (http://vox.com/rss/index.xml)
- YOUR-FAVE-ONLINE-TEXT-OUTLET (http://www.asourceyouwroteaniftycrawlingscriptforandsubmittedapullrequestwith.heckyes)
- $git clone https://github.com/Jeddf/Minim.git
- $cd Minim/minimCrawler
- $pip3 install -r requirements.txt
- $python3 crawl.py --host --user --password --db
- $cd ..
- $pip3 install -r requirements.txt
- $python3 Minim.py --host --user --password --db