Skip to content

Salpertio/vimms-lair-scrape

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Vimm's Lair Scraper

This script scrapes game links from Vimm’s Lair for personal or educational use, focusing on Xbox game categories and individual game pages. It dynamically builds URLs based on the site structure to avoid errors, making it easy to adapt for other game directories.

How It Works

  • Category & Game Link Handling: Differentiates between category pages (/vault/Xbox) and game pages (/vault/[game_number]) to build correct URLs.
  • Adaptable for Other Games: To scrape other game directories, simply change xbox_category_url in the script to a different system, e.g., /vault/PS2 for PlayStation 2 games.
Usage Instructions
  1. Install dependencies:

    pip install requests beautifulsoup4
  2. Run the script:

    python script.py
Important Notes
  • Respects Site Limits: This script does not bypass the download-per-client limit, and bypassing it may not be feasible. However, the script does bypass some bot protections by mimicking regular browser behavior through the use of a User-Agent header. Without this header, direct access may result in a block.

  • Disclaimer: Use responsibly and in accordance with Vimm’s Lair’s terms of service. While the script is designed with ethical scraping practices in mind, you're responsible for its use.

  • Proxy Consideration: It did occur to me to use proxies, but the overhead is annoying to me. Proxies add more variables, such as the need to manage multiple IP addresses and handle potential connection issues or delays. Using proxies can also introduce added complexity with rate limiting, as some IPs might be blocked or flagged more quickly, requiring constant monitoring and adjustments. Additionally, proxies can slow down the scraping process due to network latency, especially if rotating between multiple servers. Overall, while proxies could bypass certain limitations, the added overhead and maintenance make it less practical for my use case.

About

a *polite* scraper for vimms lair

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages