Task Instructions
Homework 1: Simple Web Scraping

Each day, several online shops publish many flash deals and offers, some of which are interesting to us. As we don't want to check those pages daily, we want a program that lists those offers automatically.

The webpage is: https://www.banggood.com/Flashdeals.html

Your goal is to build a simple Python application that uses the following libraries:

• urllib
• BeautifulSoup4

The application should scrape the product name and both prices (offer price and regular price) from the webpage and print them. The ultimate goal is to use any notification method to send this information to the user.

Requirements

• Code must be developed entirely on github.com. The repository can be private; in that case, add the GitHub user carlesm as a collaborator.
• Code must follow good practice and good Python conventions: variable naming, indentation, documentation, etc.
• Code must work.
• Code should be clean and easy to improve; for instance, printing the product name should be easily replaced by a notification to Telegram or any other service.

Scoring: If all requirements are met, students get 0.25 extra points for Practice 1.

The only deliverable is the address of the GitHub repo. All documentation, considerations, etc. should be in the README.md file in the GitHub repo.
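One possible way to structure such an application is sketched below. It is not a reference solution. The CSS selectors (`li.deal`, `.title`, `.price`, `.price-old`) and all function and class names are hypothetical; the real selectors have to be found by inspecting the page's HTML, and the page may load some data with JavaScript that urllib cannot see. Fetching, parsing, and reporting are kept in separate functions, so that the output step can be replaced without touching the scraping code.

```python
"""Sketch of a flash-deal scraper built on urllib and BeautifulSoup4."""
from dataclasses import dataclass
from typing import Callable, Iterable, List
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup

FLASH_DEALS_URL = "https://www.banggood.com/Flashdeals.html"

# ASSUMPTION: hypothetical placeholder selectors. Inspect the real page
# and replace them with the selectors that match its markup.
ITEM_SELECTOR = "li.deal"
NAME_SELECTOR = ".title"
OFFER_PRICE_SELECTOR = ".price"
REGULAR_PRICE_SELECTOR = ".price-old"


@dataclass
class Deal:
    """One flash deal: product name plus offer and regular price."""
    name: str
    offer_price: str
    regular_price: str


def fetch_html(url: str = FLASH_DEALS_URL) -> str:
    """Download a page with urllib and return its body as text."""
    # Some sites reject urllib's default User-Agent, so send a generic one.
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def parse_deals(html: str) -> List[Deal]:
    """Extract every complete deal found in the given HTML."""
    soup = BeautifulSoup(html, "html.parser")
    deals = []
    for item in soup.select(ITEM_SELECTOR):
        name = item.select_one(NAME_SELECTOR)
        offer = item.select_one(OFFER_PRICE_SELECTOR)
        regular = item.select_one(REGULAR_PRICE_SELECTOR)
        if name is None or offer is None or regular is None:
            continue  # Skip items that lack any of the three fields.
        deals.append(Deal(
            name=name.get_text(strip=True),
            offer_price=offer.get_text(strip=True),
            regular_price=regular.get_text(strip=True),
        ))
    return deals


def format_deal(deal: Deal) -> str:
    """Render one deal as a single line of text."""
    return f"{deal.name}: {deal.offer_price} (regular price {deal.regular_price})"


def report(deals: Iterable[Deal], notify: Callable[[str], object] = print) -> None:
    """Send each deal to `notify`; the default simply prints it."""
    for deal in deals:
        notify(format_deal(deal))


def main() -> None:
    """Fetch the flash deals page and print every deal found."""
    report(parse_deals(fetch_html()))
```

Calling `main()` runs the whole pipeline against the live page. Because `report` accepts any callable that takes a string, printing can be swapped for another notification method by passing a different function as `notify`, for example one that sends the text to a Telegram bot.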