Hacked together a Scrapy spider to scrape the Bureau of Transportation Statistics website, which uses an ASP.NET framework

icykip/bts.gov_stats_APS.net_scraper

bts.gov_stats_APS.net_scraper

Scraper for the Bureau of Transportation Statistics website, which runs on an ASP.NET framework. It is likely applicable to other parts of the site.

This is my first attempt at scraping a website outside of class, my first time using Scrapy, and my first time working with an ASP.NET framework, so it probably isn't too pretty, but I learned a lot building it over the last couple of hours and am thankful to have it for my DS4A project.

resources:

libraries:

  • scrapy
  • pandas

Bureau Website: https://transtats.bts.gov/ONTIME/OriginDestination.aspx

ASP.NET framework scraping guide: https://blog.scrapinghub.com/2016/04/20/scrapy-tips-from-the-pros-april-2016-edition

Scrapy documentation: https://docs.scrapy.org/en/latest/index.html
