HTML API Request Builder
The easiest way to get started with ScrapingRocket is by using our HTML API Request Builder. This tool allows you to interactively build and test scraping requests, making it simple to scrape data from websites with advanced options such as JavaScript rendering and custom task automation.
-
Extractor Function (Optional): Write your own Extractor Function to extract custom data from the website’s HTML. You can use our Extractor Sandbox to generate your extractor and customize it for your needs. Check it out here.
-
JavaScript Rendering: To render websites that rely on JavaScript to load content, you can enable the JavaScript Rendering option by ticking the box. This will ensure that the website behaves as if it were being opened in a real browser.
-
JS Scenario Automation: If you want to automate specific actions on a page, such as clicking buttons, waiting for elements to appear, filling out forms, scrolling, or executing custom JavaScript, you can use our JS Scenario. Here's an example of how it works:
{
"instructions": [
{"click": "#button_id"},
{"wait": 1000},
{"wait_for": "#slow_div"},
{"scroll_x": 1000},
{"scroll_y": 1000},
{"fill": ["#input_1", "value_1"]},
{"evaluate": "console.log(window)"}
]
}
This simple JSON configuration can be used to automate a variety of web scraping tasks and custom interactions on websites.
Request Options
In the HTML API Request Builder, you can customize the request with the following options:
-
Wait (in ms): Define a delay in milliseconds between actions. For example, you can wait 30 seconds (30000 ms) for the page to fully load.
-
Wait For (CSS Selector): Wait for a specific element to load before proceeding with the scraping process.
-
Wait Browser: If you need more time for the page to finish loading, you can enable this option.
-
Proxy (Optional): You can specify your own proxies or use our Premium Proxies. Choose the country of the proxy to optimize your scraping based on geographical location.
-
Custom Headers (Optional): You can set custom HTTP headers to mimic specific browser requests or handle any anti-scraping measures.
-
Cookies (Optional): Set cookies that may be needed for scraping or to preserve session data.
Captcha Resolution
If the website you're trying to scrape requires CAPTCHA solving, don't worry! ScrapingRocket provides an option to resolve CAPTCHA challenges seamlessly, ensuring your scraping tasks continue smoothly.
Subscription via Our Website
Once you’re ready to scale up, you can sign up for a subscription plan directly on our website. We offer flexible pricing options, including:
- Custom Proxies: Use your own proxies or select from a range of premium proxies available in various locations.
- Advanced Features: Use our visual selector tool to create Cheerio configurations, automate tasks, and scrape websites that require JavaScript rendering.
- Affordable Pricing: Our pricing is designed to fit various needs, from personal projects to large-scale business applications. You can view our pricing options here.
For more information and to get started, visit our subscription page.