Web Scraping

rayobrowse is built for web scraping at scale. It’s used in production on Rayobyte’s scraping platform to process millions of pages per day across some of the most difficult websites.

Modern bot detection checks dozens of signals: user agent, WebGL renderer, canvas fingerprint, font list, screen resolution, timezone, WebRTC leaks, and more. Standard headless Chromium fails these checks immediately.

Each browser session gets a realistic device fingerprint drawn from a database of thousands of real-world profiles. Your scraping code connects via CDP and operates normally; the stealth is handled at the browser level.
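To see what a session actually reports for a few of the signals listed above, you can read them from inside the page. The following is a minimal sketch rather than a rayobrowse feature: `SIGNALS_JS`, `collect_signals`, and `inspect_session` are hypothetical names, and `endpoint` is assumed to be whatever CDP WebSocket URL your rayobrowse instance exposes.

```python
from typing import Any, Dict

# JavaScript run inside the page. It reads a handful of commonly checked
# signals; real detection scripts look at many more.
SIGNALS_JS = """
() => {
  const gl = document.createElement("canvas").getContext("webgl");
  let renderer = null;
  if (gl) {
    const ext = gl.getExtension("WEBGL_debug_renderer_info");
    renderer = ext
      ? gl.getParameter(ext.UNMASKED_RENDERER_WEBGL)
      : gl.getParameter(gl.RENDERER);
  }
  return {
    userAgent: navigator.userAgent,
    webglRenderer: renderer,
    screen: screen.width + "x" + screen.height,
    timezone: Intl.DateTimeFormat().resolvedOptions().timeZone,
  };
}
"""


def collect_signals(page: Any) -> Dict[str, Any]:
    """Return the signals the page's JavaScript environment reports."""
    return page.evaluate(SIGNALS_JS)


def inspect_session(endpoint: str) -> Dict[str, Any]:
    """Connect over CDP, open a blank page, and collect its signals."""
    # Imported here so collect_signals can be used without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.connect_over_cdp(endpoint)
        try:
            page = browser.new_context().new_page()
            page.goto("about:blank")
            return collect_signals(page)
        finally:
            browser.close()
```

Calling `inspect_session(...)` with your connection URL and printing the result gives a quick sanity check that the user agent, WebGL renderer, screen size, and timezone look like a plausible real device.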

The simplest approach for scraping:

from playwright.sync_api import sync_playwright

with sync_playwright() as p:
    # Connect to the running browser over CDP; session options go in the query string.
    browser = p.chromium.connect_over_cdp(
        "ws://localhost:9222/connect?headless=true&os=windows&proxy=http://user:pass@host:port"
    )
    page = browser.new_context().new_page()
    page.goto("https://target-site.com")
    content = page.content()  # rendered HTML of the page
    browser.close()
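Once the proxy or operating system varies per job, hand-editing the connection string becomes error-prone. Below is a minimal sketch of building it from parameters, assuming the endpoint accepts the same `headless`, `os`, and `proxy` query parameters as the example above. `build_endpoint` and `fetch_html` are hypothetical helper names, not part of rayobrowse, and the `headless=false` value and the handling of percent-encoded characters are assumptions to verify against your instance.

```python
from typing import Optional
from urllib.parse import urlencode


def build_endpoint(
    base: str = "ws://localhost:9222/connect",
    headless: bool = True,
    os_name: str = "windows",
    proxy: Optional[str] = None,
) -> str:
    """Assemble the CDP WebSocket URL (hypothetical helper)."""
    # Assumption: "false" is accepted as the opposite of headless=true.
    params = {"headless": "true" if headless else "false", "os": os_name}
    if proxy:
        params["proxy"] = proxy
    # Keep ":", "/" and "@" literal so the result matches the hand-written
    # URL; other reserved characters (e.g. "&" in a password) are escaped.
    return f"{base}?{urlencode(params, safe=':/@')}"


def fetch_html(url: str, endpoint: str) -> str:
    """Open url in a remote browser session and return the rendered HTML."""
    # Imported here so build_endpoint works without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.connect_over_cdp(endpoint)
        try:
            page = browser.new_context().new_page()
            page.goto(url)
            return page.content()
        finally:
            # Close the session even if navigation raises.
            browser.close()
```

For example, `fetch_html("https://target-site.com", build_endpoint(proxy="http://user:pass@host:port"))` connects with the same URL as the hand-written version, while the `try`/`finally` ensures the session is released when a page fails to load.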