Web Crawling API
Crawl any site. Every page. Zero infrastructure. Crawl entire websites and discover every URL automatically: follow internal links, respect robots.txt, handle JavaScript-rendered navigation, and return structured sitemaps or full page content at scale. BrowserSolver handles the infrastructure so your team can focus on what matters.
Quick Start
A single HTTP request is all you need. No SDK, no dependencies.
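As a minimal sketch, here is what that single request can look like using only the Python standard library. The `/api/crawl` path comes from the FAQ below; the base URL, the `url` and `max_pages` field names, and the Bearer-token auth scheme are assumptions, so check your dashboard for the exact values.

```python
# Minimal sketch of a crawl request using only the Python standard library.
# The endpoint path /api/crawl is documented; the base URL, field names,
# and auth scheme shown here are assumptions.
import json
import urllib.request

def build_crawl_request(seed_url, api_key, max_pages=50):
    """Build a POST request that starts a crawl from seed_url."""
    payload = {"url": seed_url, "max_pages": max_pages}
    return urllib.request.Request(
        "https://api.browsersolver.com/api/crawl",  # assumed base URL
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )

# To actually start the crawl:
# with urllib.request.urlopen(build_crawl_request("https://example.com", "YOUR_KEY")) as resp:
#     result = json.load(resp)
```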
How It Works
Send a Request
POST a JSON body or use GET with query parameters. No SDK or library needed.
Get Your Results
BrowserSolver launches a cloud browser, crawls the site by following internal links, and returns each page's URL, status code, title, metadata, and content as structured JSON.
Use It Anywhere
Feed clean Markdown or HTML into knowledge bases, search indexes, LLM pipelines, or monitoring systems. Works everywhere structured data works.
Features
Full-Site Discovery
Start from any seed URL and follow all internal links automatically. Get a complete map of every page on a domain in one request.
JavaScript Navigation
Real Chromium renders client-side routing, infinite scroll, and dynamic menus before crawling. No content is missed.
Configurable Depth & Scope
Set crawl depth, page limits, URL include/exclude patterns, and domain scope to target exactly the content you need.
Structured Output
Each crawled page returns its URL, status code, title, metadata, and full Markdown or HTML content as structured JSON.
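To illustrate the shape of that output, here is a sketch of one crawled-page record and a small helper that merges pages into a single corpus. The fields mirror the list above (URL, status code, title, metadata, content), but the exact key names are assumptions.

```python
# Sketch of one crawled-page record. The fields mirror the docs (URL, status
# code, title, metadata, content); the exact key names are assumptions.
sample_page = {
    "url": "https://example.com/docs/intro",
    "status_code": 200,
    "title": "Introduction",
    "metadata": {"description": "Getting started guide"},
    "content": "# Introduction\n\nWelcome to the docs.",
}

def pages_to_markdown_corpus(pages):
    """Concatenate successfully crawled pages into one Markdown corpus."""
    ok = [p for p in pages if p["status_code"] == 200]
    return "\n\n---\n\n".join(f"<!-- {p['url']} -->\n{p['content']}" for p in ok)
```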
Use Cases
- Building knowledge bases from documentation sites
- SEO audits and broken link detection across entire domains
- Competitive intelligence by crawling competitor websites
- Training LLMs on domain-specific web content
- Content aggregation for news and research platforms
- Automated site change monitoring and archiving
Frequently Asked Questions
How does the Web Crawling API work?
POST a seed URL to the /api/crawl endpoint. BrowserSolver launches a cloud Chromium browser, follows all internal links up to your configured depth, and returns each page's URL, title, metadata, and full content as Markdown or HTML. No local browser or infrastructure needed.
Does the crawler handle JavaScript-rendered navigation?
Yes. BrowserSolver uses a real Chromium browser to render each page before extracting links. Client-side routing, lazy-loaded menus, and SPAs are all handled correctly. Content hidden behind JavaScript is never missed.
How do I control which pages get crawled?
Set max_pages to cap the total crawl size, crawl_depth to limit link-following depth, and use include_patterns or exclude_patterns (regex or glob) to include or skip URLs matching specific patterns.
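Putting those knobs together, a scoped crawl payload might look like the sketch below. The parameter names `max_pages`, `crawl_depth`, `include_patterns`, and `exclude_patterns` come from the answer above; the overall payload shape is an assumption.

```python
# Sketch of a scoped crawl: only /docs/ pages, skipping changelogs.
# Parameter names come from the docs; the payload shape is an assumption.
def scoped_crawl_payload(seed_url):
    return {
        "url": seed_url,
        "max_pages": 200,          # hard cap on total pages crawled
        "crawl_depth": 3,          # follow links at most 3 hops from the seed
        "include_patterns": ["*/docs/*"],       # glob: only documentation pages
        "exclude_patterns": [".*changelog.*"],  # regex: skip changelog pages
    }
```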
What output formats does the crawl API return?
Each crawled page returns structured JSON with the URL, status code, title, and content. Content can be Markdown (clean, LLM-ready text), HTML (raw source), or plain text. The full crawl result is available via a status polling endpoint.
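Because the full result arrives via polling, client code typically loops on the status endpoint. A sketch of that loop follows; the docs confirm a status polling endpoint exists, but the `status`/`pages`/`error` response fields used here are assumptions, and `fetch_status` stands in for whatever HTTP GET you make against the crawl's status URL.

```python
# Sketch of polling for a finished crawl. A status endpoint is documented,
# but the response fields (status, pages, error) assumed here may differ.
import time

def wait_for_crawl(fetch_status, poll_interval=5.0, timeout=600.0):
    """Poll fetch_status() until the crawl finishes or times out.

    fetch_status is any callable returning the status JSON, e.g. an
    HTTP GET against the crawl's status URL.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        status = fetch_status()
        if status.get("status") == "completed":
            return status["pages"]
        if status.get("status") == "failed":
            raise RuntimeError(f"crawl failed: {status.get('error')}")
        time.sleep(poll_interval)
    raise TimeoutError("crawl did not finish in time")
```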
Can I crawl authenticated sites?
Yes. Pass session cookies, Authorization headers, or use BrowserSolver's profile system to persist a logged-in browser state across crawl requests. Authenticated content is fully accessible.
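A sketch of what an authenticated payload might look like is below. The docs confirm you can pass session cookies or Authorization headers; the `cookies` and `headers` field names and their structure are assumptions.

```python
# Sketch of an authenticated crawl payload. Passing cookies or Authorization
# headers is documented; the field names and structure here are assumptions.
def authed_crawl_payload(seed_url, session_cookie=None, bearer_token=None):
    payload = {"url": seed_url}
    if session_cookie:
        payload["cookies"] = [
            {"name": "session", "value": session_cookie, "domain": ".example.com"}
        ]
    if bearer_token:
        payload["headers"] = {"Authorization": f"Bearer {bearer_token}"}
    return payload
```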
Is there a free tier to get started?
Yes. Sign up for a free API key and start with the included free credits. No credit card required to get started.
Ready to build without browser headaches?
Join engineering teams shipping AI agents and automation at scale. No browser fleet to manage, no infrastructure to maintain: just call the API and go.