@luminati-io

Bright Data

How the world collects public web data

Popular repositories

  1. luminati-proxy Public

    Luminati HTTP/HTTPS Proxy manager

    JavaScript, 765 stars, 194 forks

  2. Amazon-popular-books-dataset Public

    A dataset sample of the most reviewed and best-selling books on Amazon

    25 stars, 6 forks

  3. api Public

    luminati.io API

    Java, 16 stars, 9 forks

  4. java-web-scraping Public

    A quick guide, with a code example, on how to use Java for web scraping

    16 stars, 4 forks

  5. eCommerce-dataset-samples Public

    A collection of multiple e-commerce dataset samples. Each sample contains over 1,000 records. These datasets are ideal for product trend analysis, pricing strategies, consumer sentiment insights, a…

    14 stars, 2 forks

  6. Instagram-dataset-samples Public

    Sample datasets of over 400 Instagram coding influencers

    12 stars, 2 forks

Repositories

Showing 10 of 279 repositories
  • Rotating-Residential-Proxies Public

    Reliable, high-performance residential proxies with 150M+ IPs for seamless, compliant web scraping. Try for free!

    0 0 0 0 Updated Apr 8, 2025
  • web-scraping-with-regex Public

    Scrape websites in Python using regex, parse HTML effectively, and handle dynamic pages while overcoming regex’s inherent limitations

    0 0 0 0 Updated Apr 8, 2025
  • training-ai-models Public

    Enhance your AI models by fine-tuning with OpenAI's toolkit. Learn data prep, training steps, and advanced tuning approaches for optimal performance.

    0 0 0 0 Updated Apr 8, 2025
  • python-requests Public

    Python's 'requests' library: learn HTTP methods, parsing responses, proxy usage, timeouts, and more for efficient web scraping.

    0 0 0 0 Updated Apr 8, 2025
  • parsing-xml-with-python Public

    Parse XML in Python using ElementTree, lxml, SAX, and more for efficient data processing and structured data integration.

    0 0 0 0 Updated Apr 8, 2025
  • curl-user-agent Public

    Customize and rotate the User-Agent header in cURL for more effective web scraping and better request management.

    0 0 0 0 Updated Apr 8, 2025
  • Awesome-Web-Scraping Public

    A list of libraries, tools, and APIs for web scraping and data processing. Find everything you need for extracting, managing, and processing data from the web, from HTTP libraries to browser automation tools and proxy services.

    4 0 0 0 Updated Apr 6, 2025
  • web-scraping-with-lxml Public

    Use Python’s lxml library for web scraping static and dynamic content, with examples, proxy integration, and real-world use cases.

    0 0 0 0 Updated Apr 2, 2025
  • web-scraping-with-curl-impersonate Public

    Use cURL Impersonate for browser-like web scraping in CLI and Python, with support for proxies, TLS fingerprinting, and anti-bot evasion.

    0 0 0 0 Updated Apr 2, 2025
  • cloudscraper-in-python Public

    Use the cloudscraper Python library to bypass Cloudflare, handle CAPTCHAs, rotate proxies, and scrape protected content effectively.

    0 0 0 0 Updated Apr 2, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.
