Apify is a web scraping and automation platform that provides thousands of ready-made scrapers (called Actors) for popular websites. Cargo’s native integration with Apify allows you to run actors, execute pre-configured tasks, and fetch scraped data directly from your workflows.
How to set up Apify
To connect Apify, you need an API token from your Apify account. This integration requires your own Apify account and API credentials.
Connection details
| Field | Description |
|---|
| API Token | Your Apify API token from account settings |
Get your API token from the
Apify Console under Settings → Integrations.
Apify actions
Run Actor
Execute any Apify actor with custom input parameters.
Use cases
- Web scraping – Run scrapers for LinkedIn, Twitter, Instagram, Amazon, etc.
- Data extraction – Extract structured data from any website
- Lead generation – Scrape contact information from directories
- Competitive analysis – Monitor competitor websites and pricing
Configuration
| Field | Description |
|---|
| Actor | Select from your available actors |
| Input | JSON input parameters for the actor (optional) |
| Memory (MB) | Memory limit for the run (default: 2048 MB) |
| Timeout | Maximum run duration in seconds (default: 3600) |
Output
| Field | Description |
|---|
| runId | Unique identifier for the run |
| status | Final status (SUCCEEDED, FAILED, etc.) |
| datasetId | ID of the dataset containing results |
| keyValueStoreId | ID of the key-value store |
| startedAt | When the run started |
| finishedAt | When the run completed |
Use the datasetId from the output with the Get Dataset extractor to retrieve the scraped data.
Run Task
Execute a pre-configured Apify task. Tasks are saved actor configurations that can be reused.
Use cases
- Recurring scrapes – Run the same scraping job repeatedly
- Standardized workflows – Use pre-configured settings for consistent results
- Team collaboration – Share task configurations across team members
Configuration
| Field | Description |
|---|
| Task | Select from your available tasks |
| Input Override | JSON input to override task defaults (optional) |
| Memory (MB) | Memory limit for the run (optional) |
| Timeout | Maximum run duration in seconds (optional) |
Output
Same as Run Actor.
Apify data models
Create data models from Apify datasets to power your workflows.
Get Dataset
Fetch items from an Apify dataset. Use this to retrieve results from a previous actor or task run.
Configuration
| Field | Description |
|---|
| Dataset ID | The ID of the dataset (from a previous run) |
| Limit | Maximum items to fetch (default: 1000) |
| Clean data | Remove empty items and hidden fields (default: on) |
The dataset schema is automatically detected from the data, so columns will match whatever the actor returns.
Common workflows
Scrape and enrich leads
- Run Actor – Use a LinkedIn scraper to extract company data
- Get Dataset – Fetch the scraped results
- Enrich – Use other integrations to add contact information
Monitor competitors
- Run Task – Execute a pre-configured price monitoring scraper
- Get Dataset – Retrieve the latest pricing data
- Compare – Analyze changes against your own pricing
Build prospect lists
- Run Actor – Scrape a directory or database
- Get Dataset – Pull the extracted contacts
- Push to CRM – Send qualified leads to HubSpot or Salesforce
Popular Apify Actors
| Actor | Description |
|---|
| Web Scraper | General-purpose website scraper |
| Google Search Scraper | Extract Google search results |
| LinkedIn Scraper | Scrape LinkedIn profiles and companies |
| Instagram Scraper | Extract posts, profiles, and hashtags |
| Amazon Scraper | Product data from Amazon |
| Twitter Scraper | Tweets, profiles, and trends |