Get fresh datasets from popular websites
No more maintaining scrapers or bypassing blocks – just structured and validated data tailored to your business needs.
Trusted by 20,000+ customers worldwide
Billions of records at your service
- 120+ domains
- 190+ datasets
- 7.7K+ data sample downloads
LinkedIn people profiles
Amazon products
LinkedIn company information
Instagram - Profiles
Crunchbase companies information
Linkedin job listings information
Zillow properties listing information
Instagram - Posts
LinkedIn posts
X (formerly Twitter) - Posts
Google Maps full information
TikTok - Profiles
Facebook - Pages Posts by Profile URL
Youtube - Videos posts
Amazon Reviews
TikTok - Posts
Indeed job listings information
Shopee - products
Companies information enriched dataset
Walmart - products
Employees business enriched dataset
TikTok Shop
YouTube - Profiles
Glassdoor companies overview information
IMDB media
X (formerly Twitter) - Profiles
Airbnb Properties Information
Google maps reviews
Reddit- Posts
Yahoo Finance business information
Google News
Instagram - Reels
Booking Hotel Listings
Glassdoor companies reviews
Shein- Products
LinkedIn profiles Jobs Listings
Yelp businesses overview
Facebook - Comments
Instagram - Comments
Zoominfo companies information
pitchbook companies information
Glassdoor job listings information
Otodom Poland
Google Shopping
Amazon sellers info
Amazon products global dataset
G2 software product overview
eBay
Github repository
Facebook - Posts by group URL
Home Depot US
Amazon best seller products
Australia real estate properties
Facebook Marketplace
Facebook - Posts by post URL
Etsy
Google Play Store
TikTok - Comments
Trustpilot business reviews
G2 software - product reviews
Amazon products search
Booking Listings Search
Goodreads books
Yelp businesses reviews
Reddit - Comments
Amazon Walmart
World population
Zillow price history
Zara - Products
Indeed companies info
Wikipedia articles
Zoopla properties listing information
Target
Facebook - Profiles
Pinterest - Posts
Youtube - Comments
Lazada - Products
Best Buy products
NBA players' stats
Walmart sellers info
Lowes.com
Facebook Events
Ikea - Products
Realtor international properties listings
Sephora products
BBC news
OLX Brazil - marketplace ads
Xing social network
Facebook - Pages and Profiles
Ozon.ru products
Facebook - Reels by profile URL
Google Play Store reviews
Facebook Company Reviews
Google Shopping products search US
Wayfair products
Creative Commons Images
Myntra products
Owler companies information
Slintel 6sense company information
Digikey - Products
H&M - Products
Naver products
US lawyers directory
Webmotors Brasil - Cars Listings
Tokopedia Products
Manta businesses
Apple App Store reviews
Mouser - Products
CNN news
Agoda Properties Listings
Wildberries.ru products
Zonaprop Argentina - Properties Listing
Carsales Cars Listings search page information
Quora posts
Pinterest - Profiles
VentureRadar company information
Zalando products
Inmuebles24 Mexico - Properties Listings
Chileautos Chile - Cars Listings
Yapo Chile - marketplace ads
Asos - Products
Trustradius product reviews
Lazada - Reviews
Vimeo - Videos posts
Bluesky - Posts
World zipcodes
Hermes- Products
Lego - Products
mercadolivre.com.br products
Metrocuadrado - Properties Listings
Home Depot CA
Chanel Products
Dior - Products
Toctoc - Properties Listings
Lazada products search (GMV)
Top 500 Bluesky Profiles
Kroger.com
Ashleyfurniture - Products
Apple App Store
Creative Commons 3D Models
Properati Argentina and Colombia - Properties Listings
Infocasas Uruguay - Properties Listings
AE.com - Complete Products
Mango Products
Balenciaga.com - Products
Mediamarkt.de products
Fanatics.com - Products
Toysrus - Products
Zara Home Products
Crateandbarrel - Products
Rona.ca products
Carters.com - Products
Loewe.com - Products
Prada.com - Products
Fendi Products
Delvaux - Products
Massimo Dutti - Products
Bottegaveneta.com - Products
Ysl.com - Products
Raymourflanigan.com - Products
Snapchat posts
Mybobs.com - Products
Macys.com
Mattressfirm - Products
Sleepnumber.com - Products
Celine.com - Products
Berluti.com - Products
llbean.com - Products
La-z-boy.com - Products
Montblanc - Products
Moynat.com - Products
Costco products
Micro Center Products
B&H Products
Filter any dataset with a single prompt
Describe exactly what you need, and let AI apply the perfect filters in seconds.
- Describe data needs in plain English
- AI applies accurate filters automatically
- Narrow huge datasets to only what matters to you
- Cut costs by skipping irrelevant data
- Export filtered data in your preferred format
Maximize value with strategic cost savings
Smart Data Updates
Access only "New Records" or "Updated Records," ensuring you pay only for what you need"
Dataset Bundles
Gain greater value by purchasing two or more datasets together, with exclusive discounts.
Volume Discounts
Get more for less with significant savings when purchasing large datasets or updates subscriptions
Enriched Datasets
Save time and resources with pre-built datasets that combine multiple sources into one clean dataset
Couldn’t find what you’re looking for?
Tell us about your project, and we’ll find the right data to help turn your ideas into reality.
Datasets Pricing
- Clean and validated
- Refreshed monthly
- JSON/CSV/Parquet
Power AI Agents Instantly
Our datasets are AI/LLM-optimized: clearly structured, well-documented, with code and recipes for easy LLM/chatbot integration.
Structured & Clean
Pre-processed data with consistent schemas, perfect for AI model training and inference.
Code Examples
Ready-to-use Python, Node.js, cURL, PHP, Go, Java, and Ruby snippets for easy integration with AI workflows.
Documentation
curl --request GET
--url https://api.brightdata.com/datasets/snapshots/{id}/download
--header 'Authorization: Bearer '
import requests
url = "https://api.brightdata.com/datasets/snapshots/{id}/download"
headers = {"Authorization": "Bearer "}
response = requests.get(url, headers=headers)
print(response.json())
const url = 'https://api.brightdata.com/datasets/snapshots/{id}/download';
const options = {method: 'GET', headers: {Authorization: 'Bearer '}, body: undefined};
try {
const response = await fetch(url, options);
const data = await response.json();
console.log(data);
} catch (error) {
console.error(error);
}
HttpResponse response = Unirest.get("https://api.brightdata.com/datasets/snapshots/{id}/download")
.header("Authorization", "Bearer ")
.asString();
require 'uri'
require 'net/http'
url = URI("https://api.brightdata.com/datasets/snapshots/{id}/download")
http = Net::HTTP.new(url.host, url.port)
http.use_ssl = true
request = Net::HTTP::Get.new(url)
request["Authorization"] = 'Bearer '
response = http.request(request)
puts response.read_body
Any website. Any data. Your way.
Effortless Data Filtering
Easily customize datasets with AI-powered tools—no coding needed, and only pay for what you need.
Dynamic Data Updates
Get full updates, new records, or updates to existing data, with flexible subscription options.
Developer-Friendly API
Filter and retrieve data directly to your application, streamlining your workflow.
Flexible Delivery Options
Export data via S3, API, Webhook, and more, to match your infrastructure.
Multiple Output Formats
Receive data in JSON, CSV, Parquet, or compressed formats to suit your needs.
Data Integrity Insights
Access detailed fill rates and stats to ensure data meets your specific requirements.
Discover how data can work for you
Power AI and LLMs with rich, endless data
Access high-quality datasets to train and optimize AI and ML models for personalized content, image recognition, and LLM advancements.
RELEVANT DATASETS:
Text, Images, Videos ,3D Models, and more.
Transform data into smarter investments
Leverage financial data to track company growth, spot market trends, and benchmark industry performance.
RELEVANT DATASETS:
LinkedIn Companies, CrunchBase, Enriched companies data, and more.
Drive growth with sales opportunities and insights
Enhance your lead database, uncover new opportunities, prioritize high-value prospects, automate lead scoring, and identify purchase intent.
RELEVANT DATASETS:
LinkedIn People, Enriched employee data, and more.
Gain a competitive edge with real-time intelligence
Analyze marketing metrics, brand sentiment, influencer performance, and campaign success while staying informed on competitor pricing, regulatory updates, and talent trends to refine strategies and maintain a market advantage.
RELEVANT DATASETS:
Instagram profiles, TikTok posts, Facebook groups, and more.
Discover data-driven property opportunities
Monitor listing data, market trends, and property forecasts to identify investment opportunities, predict market shifts, and make smarter real estate investments.
RELEVANT DATASETS:
Zillow properties, Airbnb, and more.
Top-Rated by Users
Bright Data is a leading web data platform, trusted by over 20,000 customers worldwide. It offers award-winning proxy networks, AI-powered web scrapers, and business-ready datasets, enabling efficient and reliable data collection across various industries.
Datasets FAQs
What are Bright Data’s Marketplace Datasets?
Bright Data Dataset Marketplace are validated collections of high-quality datasets covering various topics, sourced from various reliable and diverse public online data sources. These datasets are meticulously gathered, cleaned, and structured to provide valuable business insights.
What types of datasets are available through Bright Data?
Bright Data offers diverse datasets spanning industries such as AI and LLMs, e-commerce, finance, travel, social media, and more. These datasets encompass various data types, including text, images, videos, and structured data, providing comprehensive coverage for different analytical needs.
Are the datasets in the marketplace customizable?
Yes, we get that different projects have unique requirements. This is why we offer customization options for datasets, allowing users to tailor the data to specific parameters such as timeframes, geographic regions, or specific data fields. This ensures that the datasets you receive are perfectly suited to your needs.
Are Bright Data Datasets ethically sourced?
Bright Data prioritizes ethical data-sourcing practices. They adhere to strict ethical guidelines and comply with all relevant regulations to ensure that the data provided is obtained ethically and legally. Additionally, Bright Data is committed to maintaining the privacy and security of data subjects and users.
Can I trust the quality of Bright Data Datasets?
Yes. Each dataset undergoes rigorous quality assurance processes to ensure accuracy, reliability, and relevance. Additionally, we continuously update and refresh our datasets to reflect the latest information, ensuring that users always have access to the most current data.
What are some common use cases for Bright Data Datasets?
Common use cases include machine learning and AI model training, product enrichment, market research, trend analysis, sentiment analysis.
What data formats and delivery methods does Bright Data support?
Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet. Datasets can be delivered via Snowflake, Webhook, Google Cloud, Email, PubSub, Amazon S3, SFTP or Azure. You can also iInitiate requests through API for on-demand data.
What If I want fresh, up-to-date datasets?
Not a problem. Before proceeding to checkout, you will be able to define the time range of the data freshness you would like to get.
What is the difference between pre-collected and fresh data?
You can choose between instantly available datasets, with data dating back from a few days to a couple of months, or freshly collected data.
Do you have subscription options?
Yes. You can subscribe to any dataset and receive fresh data directly to your storage on a daily, weekly, monthly, quarterly or yearly basis.