Historical datasets
Learn from historical datasets to make more data-driven decisions, such as identifying trends and forecasting future conditions, evaluating performance, and improving business strategies.
- Available as a custom dataset request
- Millions of records avaialbe from tens of websites
- 100% compliant scraping
{
"type": "object",
"fields": {
"historical_data": {
"type": "array",
"active": true,
"items": {
"type": "object",
"fields": {
"title": {
"type": "text",
"active": true,
"sample_value": "Advances in Cancer Treatment: A Historical Overview"
},
"authors": {
"type": "text",
"active": true,
"sample_value": "Smith J, Lee A, Patel R"
},
"journal_info": {
"type": "text",
"active": true,
"sample_value": "J Cancer Hist. 1982 Jan;15(1):13-25."
},
"publication_date": {
"type": "text",
"active": true,
"sample_value": "1982-01-01"
},
"doi": {
"type": "text",
"active": true,
"sample_value": "10.1097/00005042-198201000-00003"
},
"pmid": {
"type": "text",
"active": true,
"sample_value": "31421234"
},
"abstract_snippet": {
"type": "text",
"active": true,
"sample_value": "This article explores key developments in cancer treatment across decades, analyzing breakthroughs in chemotherapy, radiotherapy, and immunotherapy."
},
"publication_type": {
"type": "text",
"active": true,
"sample_value": "Historical Article"
},
"citation_count": {
"type": "integer",
"active": true,
"sample_value": 157
},
"research_trend_analysis": {
"type": "text",
"active": true,
"sample_value": "Increased focus on immunotherapy from the 1990s onward"
},
"link_to_full_text": {
"type": "url",
"active": true,
"sample_value": "https://pubmed.ncbi.nlm.nih.gov/31421234/"
}
}
}
},
"related_searches": {
"type": "array",
"active": true,
"items": {
"type": "object",
"fields": {
"related_search_term": {
"type": "text",
"active": true,
"sample_value": "historical trends in cancer research"
},
"related_search_link": {
"type": "url",
"active": true,
"sample_value": "https://pubmed.ncbi.nlm.nih.gov/?term=historical+trends+cancer+research"
}
}
}
},
"url": {
"type": "url",
"required": true,
"active": true
}
}
}
Historical dataset sample
Automated dataset creation platform
-
Initial setup
Add the URLs of your target website.
-
Sample creation
Get AI-generated schema and sample. Set up validation rules.
-
Proof of concept
The scraper is built based on schema and validation rules.
-
Data collection & delivery
Data is collected and delivered.
Custom Dataset Pricing
- AI-Generated schema & sample
- Control over data validation
- Real-time product quantity est.
- Daily, Weekly, Monthly, Custom
Historical datasets datasets tailored to your needs
Data subscription
Subscribe to access datasets at a significantly reduced cost.
File output formats
JSON, NDJSON, JSON Lines, CSV, Parquet. Optional .gz compression.
Flexible delivery
Snowflake, Amazon S3 bucket, Google Cloud, Azure, and SFTP.
Scalable data
Scale without worrying about infra, proxy servers, or blocks.
Cost savings
Customize any dataset using filters and formatting options.
Code maintenance
Datasets are maintained based on website structure changes.
Simplified integrations
Benefit from integrations with Snowflake and AWS.
24/7 support
A dedicated team of data professionals is here to help.
Leaders in compliance
Data is ethically obtained and compliant with all privacy laws.
Get structured and reliable Historical datasets data
We’ll provide the data while you focus on the rest
High-volume web data
With our unblocking capabilities and round-the-clock IP rotation we ensure access to all data points on a website.
Data for immediate use
Every aspect of the data collection process is thoroughly validated as part of our robust data validation process.
Automated data flow
Create custom schedules to automate data delivery and watch the data flow seamlessly into your storage.