Hacker News dataset

Get insight into technology trends, startup advancements, and the pulse of the technology community

  • Available as a custom dataset
  • Tap into all major Hacker News data points
  • 100% compliant scraping
Request dataset
Hacker news dataset
                              {
  "type": "object",
  "fields": {
    "posts": {
      "type": "array",
      "active": true,
      "items": {
        "type": "object",
        "fields": {
          "post_id": {
            "type": "text",
            "active": true,
            "sample_value": "12345678"
          },
          "title": {
            "type": "text",
            "active": true,
            "sample_value": "New AI breakthrough in machine learning"
          },
          "author": {
            "type": "text",
            "active": true,
            "sample_value": "johndoe"
          },
          "points": {
            "type": "integer",
            "active": true,
            "sample_value": 150
          },
          "comment_count": {
            "type": "integer",
            "active": true,
            "sample_value": 42
          },
          "post_url": {
            "type": "url",
            "active": true,
            "sample_value": "https://news.ycombinator.com/item?id=12345678"
          },
          "submission_date": {
            "type": "text",
            "active": true,
            "sample_value": "2023-10-25T12:34:56Z"
          },
          "post_type": {
            "type": "text",
            "active": true,
            "sample_value": "story"
          },
          "tags": {
            "type": "array",
            "active": true,
            "items": {
              "type": "text",
              "sample_value": "AI"
            }
          }
        }
      }
    },
    "related_searches": {
      "type": "array",
      "active": true,
      "items": {
        "type": "object",
        "fields": {
          "related_search_term": {
            "type": "text",
            "active": true,
            "sample_value": "machine learning"
          },
          "related_search_link": {
            "type": "url",
            "active": true,
            "sample_value": "https://news.ycombinator.com/search?query=machine+learning"
          }
        }
      }
    },
    "url": {
      "type": "url",
      "required": true,
      "active": true,
      "sample_value": "https://news.ycombinator.com"
    }
  }
}
                              
                            

Hacker News dataset sample

Choose from fully managed or self-managed datasets. The fully managed dataset offers a hands-off experience managed by our partners, while self-managed custom datasets allow you to set up the project and validation rules yourself. The Hacker News dataset data points may include: post title, author, points, comment count, post URL, submission date, and more.
THE PROCESS

Automated dataset creation platform

Streamline your data-collection process so you can focus on what matters.
  1. Initial setup

    Add the URLs of your target website.

  2. Sample creation

    Get AI-generated schema and sample. Set up validation rules.

  3. Proof of concept

    The scraper is built based on schema and validation rules.

  4. Data collection & delivery

    Data is collected and delivered.

Custom Dataset Pricing

CUSTOM DATASET
Subscription
Starting from
$300/month
One time
Starting from
$1,000
Proof of Concept
One time
$500
  • AI-Generated schema & sample
  • Control over data validation
  • Real-time product quantity est.
  • Daily, Weekly, Monthly, Custom

Hacker News datasets tailored to your needs

Get easy to use, well-structured datasets for any use case

Data subscription

Subscribe to access datasets at a significantly reduced cost.

File output formats

JSON, NDJSON, JSON Lines, CSV, Parquet. Optional .gz compression.

Flexible delivery

Snowflake, Amazon S3 bucket, Google Cloud, Azure, and SFTP.

Scalable data

Scale without worrying about infra, proxy servers, or blocks.

Cost savings

Customize any dataset using filters and formatting options.

Code maintenance

Datasets are maintained based on website structure changes.

Simplified integrations

Benefit from integrations with Snowflake and AWS.

24/7 support

A dedicated team of data professionals is here to help.

Leaders in compliance

Data is ethically obtained and compliant with all privacy laws.

Get structured and reliable Hacker News data

We’ll provide the data while you focus on the rest

High-volume web data

With our unblocking capabilities and round-the-clock IP rotation we ensure access to all data points on a website.

Data for immediate use

Every aspect of the data collection process is thoroughly validated as part of our robust data validation process.

Automated data flow

Create custom schedules to automate data delivery and watch the data flow seamlessly into your storage.

How companies use Hacker News datasets

Venture trends

Investors and venture capitalists use the Hacker News dataset to identify budding startups, emerging investment trends, and sectors gaining popularity. Examining discussions and sentiment around new technologies and entrepreneurial ventures helps spot potential investment opportunities and forecast shifts in the tech landscape.
Get dataset
venture trends

Innovation planning

Hacker News' dataset provides an in-depth view of the tech industry's landscape and allows organizations to benchmark their innovations, stay abreast of technological breakthroughs, and formulate future-focused business strategies based on discussions and trends highlighted in the dataset.
Get dataset
innovation planning

Sector monitoring

A rapidly evolving digital landscape requires companies to monitor discussions around technology and startups to foresee and manage risks. Real-time community engagement allows companies to swiftly address issues that could adversely affect their reputation and market position.
Get dataset
Risk assessment

Get your Hacker News dataset today.