Types
Browser
Extracts text and HTML content from web pages
The Browser step allows you to extract content from web pages by providing a URL. You can configure how the content is extracted with the following options:
URL
Enter the complete URL of the web page you want to extract content from.
Strip HTML
Enable this option if you want to remove all HTML tags and only extract the text content from the page. When disabled, the raw HTML will be preserved.
CSS Selector
Optionally specify a CSS selector (e.g. ‘#report’, ‘.content’) to target specific elements on the page. The step will:
- Wait for the specified selector to be available before extracting content
- Extract content from the entire page after the selector is found
If no selector is provided, the step will immediately capture and extract content from the entire page.
Security Considerations
When using the Browser step, keep these security best practices in mind:
- Only access web pages you have permission to scrape
- Be mindful of the website’s robots.txt file and terms of service
- Avoid extracting sensitive information
Was this page helpful?