Overview
Integration details
| Class | Package | Serializable | JS support | Version |
|---|---|---|---|---|
| BrightDataWebScraperAPI | langchain-brightdata | ✅ | ❌ |
Tool features
| Native async | Returns artifact | Return data | Pricing |
|---|---|---|---|
| ❌ | ❌ | Structured data from websites (Amazon products, LinkedIn profiles, etc.) | Requires Bright Data account |
Setup
The integration lives in thelangchain-brightdata package.
Instantiation
Here we show how to instantiate an instance of the BrightDataWebScraperAPI tool. This tool allows you to extract structured data from various websites including Amazon product details, LinkedIn profiles, and more using Bright Data’s Dataset API. The tool accepts the following parameter during instantiation:bright_data_api_key(required, str): Your Bright Data API key for authentication.
Invocation
Basic Usage
Advanced Usage with Parameters
Customization Options
The BrightDataWebScraperAPI tool accepts several parameters for customization:| Parameter | Type | Description |
|---|---|---|
url | str | The URL to extract data from |
dataset_type | str | Type of dataset to use (see available types below) |
zipcode | str | Optional zipcode for location-specific data |
keyword | str | Search keyword (required for amazon_product_search) |
first_name | str | First name (required for linkedin_people_search) |
last_name | str | Last name (required for linkedin_people_search) |
num_of_reviews | str | Number of reviews (required for facebook_company_reviews) |
num_of_comments | str | Number of comments (for youtube_comments, default: 10) |
days_limit | str | Days to limit results (for google_maps_reviews, default: 3) |
Available dataset types (44 datasets)
E-commerce (10 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
amazon_product | Product details, pricing, specs | url (with /dp/) |
amazon_product_reviews | Customer reviews and ratings | url (with /dp/) |
amazon_product_search | Search results from Amazon | keyword, url |
walmart_product | Walmart product data | url (with /ip/) |
walmart_seller | Walmart seller information | url |
ebay_product | eBay product data | url |
homedepot_products | Home Depot product data | url |
zara_products | Zara product data | url |
etsy_products | Etsy product data | url |
bestbuy_products | Best Buy product data | url |
LinkedIn (5 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
linkedin_person_profile | Professional profile data | url |
linkedin_company_profile | Company information | url |
linkedin_job_listings | Job listing details | url |
linkedin_posts | Post content and engagement | url |
linkedin_people_search | Search for people | url, first_name, last_name |
Business intelligence (2 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
crunchbase_company | Company funding, investors, metrics | url |
zoominfo_company_profile | B2B company intelligence | url |
Instagram (4 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
instagram_profiles | Profile data and stats | url |
instagram_posts | Post content and engagement | url |
instagram_reels | Reel content and metrics | url |
instagram_comments | Comments on posts | url |
Facebook (4 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
facebook_posts | Post content and engagement | url |
facebook_marketplace_listings | Marketplace listing data | url |
facebook_company_reviews | Company reviews | url, num_of_reviews |
facebook_events | Event details | url |
TikTok (4 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
tiktok_profiles | Profile data and stats | url |
tiktok_posts | Video content and metrics | url |
tiktok_shop | Shop product data | url |
tiktok_comments | Comments on videos | url |
YouTube (3 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
youtube_profiles | Channel profile data | url |
youtube_videos | Video content and metrics | url |
youtube_comments | Comments on videos | url, num_of_comments (default: 10) |
Google (3 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
google_maps_reviews | Business reviews from Maps | url, days_limit (default: 3) |
google_shopping | Shopping product data | url |
google_play_store | App store data | url |
Other platforms (9 datasets)
| Dataset Type | Description | Required Inputs |
|---|---|---|
apple_app_store | iOS app data | url |
x_posts | X (Twitter) post data | url |
reddit_posts | Reddit post data | url |
github_repository_file | GitHub file content | url |
yahoo_finance_business | Financial business data | url |
reuter_news | News article data | url |
zillow_properties_listing | Real estate listing data | url |
booking_hotel_listings | Hotel listing data | url |