Bright Data
Popular repositories Loading
-
-
Amazon-popular-books-dataset
Amazon-popular-books-dataset PublicA dataset sample of the most reviewed and best-selling books on Amazon
-
java-web-scraping
java-web-scraping PublicQuick guide with code example how to use Java for web scraping
-
eCommerce-dataset-samples
eCommerce-dataset-samples PublicA collection of multiple e-commerce dataset samples. Each sample contains over 1,000 records. These datasets are ideal for product trend analysis, pricing strategies, consumer sentiment insights, a…
-
Instagram-dataset-samples
Instagram-dataset-samples PublicSample datasets of over 400 Instagram coding influencers
Repositories
- Rotating-Residential-Proxies Public
Reliable, high-performance residential proxies with 150M+ IPs for seamless, compliant web scraping. Try for free!
luminati-io/Rotating-Residential-Proxies’s past year of commit activity - web-scraping-with-regex Public
Scrape websites in Python using regex, parse HTML effectively, and handle dynamic pages while overcoming regex’s inherent limitations
luminati-io/web-scraping-with-regex’s past year of commit activity - training-ai-models Public
Enhance your AI models by fine-tuning with OpenAI's toolkit. Learn data prep, training steps, and advanced tuning approaches for optimal performance.
luminati-io/training-ai-models’s past year of commit activity - python-requests Public
Python's 'requests' library: learn HTTP methods, parsing responses, proxy usage, timeouts, and more for efficient web scraping.
luminati-io/python-requests’s past year of commit activity - parsing-xml-with-python Public
Parse XML in Python using ElementTree, lxml, SAX, and more for efficient data processing and structured data integration.
luminati-io/parsing-xml-with-python’s past year of commit activity - curl-user-agent Public
Customize and rotate the User-Agent header in cURL for more effective web scraping and better request management.
luminati-io/curl-user-agent’s past year of commit activity - Awesome-Web-Scraping Public
A list of libraries, tools, and APIs for web scraping and data processing. Find everything you need for extracting, managing, and processing data from the web, from HTTP libraries to browser automation tools and proxy services.
luminati-io/Awesome-Web-Scraping’s past year of commit activity - web-scraping-with-lxml Public
Use Python’s lxml library for web scraping static and dynamic content, with examples, proxy integration, and real-world use cases.
luminati-io/web-scraping-with-lxml’s past year of commit activity - web-scraping-with-curl-impersonate Public
Use cURL Impersonate for browser-like web scraping in CLI and Python, with support for proxies, TLS fingerprinting, and anti-bot evasion.
luminati-io/web-scraping-with-curl-impersonate’s past year of commit activity - cloudscraper-in-python Public
Use the cloudscraper Python library to bypass Cloudflare, handle CAPTCHAs, rotate proxies, and scrape protected content effectively.
luminati-io/cloudscraper-in-python’s past year of commit activity