Guangming

Web-crawling Guangming Daily

This is a project dedicated to crawl the articles of Guangming Daily for textual analysis.
The project began in April, 2018, and has been improved by me from time to time.

Usage

Download the guangming.py file.
Run Configuration:

(1) This program requires python 3.0 or higher as interpreter.

(2) Packages: Install requests, beautifulsoup4, pandas

If you don't have pip, follow the instructions on https://pip.pypa.io/en/stable/installing/ to install pip on your computer.

After you have pip, type in the following commands in cmd to install these packages.
```
pip install requests

pip install beautifulsoup4

pip install pandas
```
If you are choosing pycharm to run guangming.py, the latest version of pycharm can intelligently set up the installing process for you.
Run guangming.py, input in the info as required by the program.
That's it! The program will scrape the information from Guangming Daily according to your input.

Recent added features include:

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
guangming.py		guangming.py