ScrapingAnt

Learn how to use ScrapingAnt with Composio

Overview

Enum

SCRAPINGANT

Description

ScrapingAnt is a web scraping API that provides tools for data extraction, including Chrome page rendering, low-latency rotating proxies, JavaScript execution, and unlimited parallel requests.

Authentication Details

scrapingant_api_key: string (required)

Actions

This tool extracts content from a given URL and converts it to Markdown. It is particularly useful for preparing text for large language models (LLMs) and retrieval-augmented generation (RAG) systems. It supports the GET, POST, PUT, and DELETE methods.

Action Parameters

block_resource: array
browser: boolean
cookies: string
js_snippet: string
method: string (defaults to get)
proxy_country: string
proxy_type: string
return_page_source: boolean
url: string (required)
wait_for_selector: string

Action Response

data: object
error
successful: boolean
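
As a rough illustration of the parameters above, the sketch below assembles an argument dictionary for this action and checks the `method` value against the four methods the description lists. The helper name `build_markdown_params` is hypothetical, and how the resulting dictionary is passed to Composio is left out because this page does not show it.

```python
from typing import Any, Dict, Optional

# Methods named in the action description; the documented default is "get".
ALLOWED_METHODS = {"get", "post", "put", "delete"}


def build_markdown_params(
    url: str,
    method: str = "get",
    browser: Optional[bool] = None,
    wait_for_selector: Optional[str] = None,
    proxy_country: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble arguments for the markdown action (hypothetical helper)."""
    if not url:
        raise ValueError("url is required")
    method = method.lower()
    if method not in ALLOWED_METHODS:
        raise ValueError(f"unsupported method: {method}")
    params = {
        "url": url,
        "method": method,
        "browser": browser,
        "wait_for_selector": wait_for_selector,
        "proxy_country": proxy_country,
    }
    # Drop optional parameters that were not set.
    return {key: value for key, value in params.items() if value is not None}
```

Validating locally like this is only a convenience: it catches a missing `url` or a misspelled method before any request is made.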

This tool extracts structured data from a web page using ScrapingAnt's AI-powered extraction. You provide a URL and an AI query (prompt) describing the data you want, and the tool returns that data in a structured format. It supports additional parameters for browser rendering, proxies, and cookies to handle dynamic content and localization.

Action Parameters

cookies: string
enable_javascript: boolean
extract_properties: string (required)
proxy_country: string
proxy_type: string
return_text: boolean
url: string (required)
wait_for_selector: string

Action Response

data: object
error
successful: boolean
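
A minimal sketch of preparing arguments for this action, assuming `extract_properties` carries the free-form AI query as a single string. The helper name `build_extract_params` and the example query text are illustrative and do not come from this page.

```python
from typing import Any, Dict, Optional


def build_extract_params(
    url: str,
    extract_properties: str,
    enable_javascript: Optional[bool] = None,
    return_text: Optional[bool] = None,
    wait_for_selector: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble arguments for the AI extraction action (hypothetical helper)."""
    # Both of these are marked as required in the parameter list.
    if not url:
        raise ValueError("url is required")
    if not extract_properties.strip():
        raise ValueError("extract_properties is required")
    params = {
        "url": url,
        "extract_properties": extract_properties.strip(),
        "enable_javascript": enable_javascript,
        "return_text": return_text,
        "wait_for_selector": wait_for_selector,
    }
    # Keep only the parameters that were actually provided.
    return {key: value for key, value in params.items() if value is not None}


# Illustrative query; the exact wording the extractor expects is an assumption.
example = build_extract_params(
    "https://example.com/product/1",
    "product title, price, availability",
    enable_javascript=True,
)
```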

This tool retrieves the current API credit usage for the authenticated ScrapingAnt account, so you can monitor credit consumption and compare it against your subscription limits.

Action Parameters

Action Response

data: object
error
successful: boolean
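
Every action on this page returns the same three response fields, so one small helper can cover all of them. The sketch below assumes the response arrives as a dictionary with `data`, `error`, and `successful` keys. `ActionError` and `unwrap` are hypothetical names, and the contents of `data` in the example are placeholders, since this page does not document the usage fields.

```python
from typing import Any, Mapping


class ActionError(RuntimeError):
    """Raised when an action reports successful == False."""


def unwrap(response: Mapping[str, Any]) -> Any:
    """Return the `data` payload, or raise if the action failed."""
    if not response.get("successful"):
        message = response.get("error") or "action failed without an error message"
        raise ActionError(str(message))
    return response.get("data", {})


# Placeholder payloads; the real field names inside `data` are not documented here.
ok = unwrap({"data": {"example_field": 1}, "error": None, "successful": True})
```

Raising on failure keeps calling code from silently reading an empty `data` object when the credit check did not succeed.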

This tool scrapes a web page using the ScrapingAnt API and returns the HTML content of the specified URL. You can customize the scraping behavior by enabling a headless browser, using proxies, waiting for specific elements, executing JavaScript, passing cookies, and blocking certain resources.

Action Parameters

block_resource: array
browser: boolean (defaults to True)
cookies: string
js_snippet: string
proxy_country: string
proxy_type: string
return_page_source: boolean
url: string (required)
wait_for_selector: string

Action Response

data: object
error
successful: boolean
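
The customization options above can be modelled as a small data class, which makes the documented default (`browser` is True) explicit. This is a sketch only: `ScrapeRequest` is a hypothetical name, and the `block_resource` value shown in the example is illustrative rather than a confirmed list of accepted values.

```python
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class ScrapeRequest:
    """Arguments for the scrape action (hypothetical container)."""

    url: str
    browser: bool = True  # documented default
    block_resource: List[str] = field(default_factory=list)
    cookies: Optional[str] = None
    js_snippet: Optional[str] = None
    proxy_country: Optional[str] = None
    proxy_type: Optional[str] = None
    return_page_source: Optional[bool] = None
    wait_for_selector: Optional[str] = None

    def to_params(self) -> Dict[str, Any]:
        """Return a dict containing only the parameters that were set."""
        if not self.url:
            raise ValueError("url is required")
        return {
            key: value
            for key, value in asdict(self).items()
            if value is not None and value != []
        }


# Wait for a selector and skip images; "image" is an illustrative value.
request = ScrapeRequest(
    url="https://example.com",
    wait_for_selector="#content",
    block_resource=["image"],
)
```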

This tool scrapes a target URL and returns an extended JSON response. It uses ScrapingAnt's /v2/extended endpoint, which provides richer information than the standard scraping tool, including the page HTML, cookies, headers, and additional details.

Action Parameters

url: string (required)

Action Response

data: object
error
successful: boolean
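
For orientation, the sketch below shows how a direct request to the /v2/extended endpoint might be put together. Only the endpoint path comes from this page. The host `https://api.scrapingant.com` and the `x-api-key` header name are assumptions, and the function builds the request without sending it.

```python
from typing import Dict, Tuple
from urllib.parse import urlencode

# Assumed API host; only the /v2/extended path is stated on this page.
API_BASE = "https://api.scrapingant.com"


def extended_request(url: str, api_key: str) -> Tuple[str, Dict[str, str]]:
    """Build the request URL and headers for the extended endpoint."""
    if not url:
        raise ValueError("url is required")
    # urlencode percent-encodes the target URL so it survives as a query value.
    query = urlencode({"url": url})
    headers = {"x-api-key": api_key}  # assumed header name
    return f"{API_BASE}/v2/extended?{query}", headers


request_url, request_headers = extended_request(
    "https://example.com/a?b=1", "YOUR_API_KEY"
)
```

When the action is called through Composio, the stored `scrapingant_api_key` would normally take the place of this manual header handling.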