GitHub Scraper

Scrape GitHub and collect public data such as username, URL, code language, code, number of lines, size, number of issues, and much more.

  • Dedicated account manager
  • Retrieve results in multiple formats
  • Scrape GitHub on demand via API or no-code scrapers
No credit card required

Effortlessly scrape GitHub data

GitHub Scraper API
Use this API to start collecting data with specified parameters

  • API-based scraper
    Use our interface to build your API request
  • Automation at scale
    Build your own scheduler to control the frequency
  • Delivery
    Deliver the data to your preferred storage or download it
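The "build your own scheduler" step can be sketched in a few lines of Python. `trigger_collection` is a hypothetical stand-in for whatever call starts your scrape (for example, a POST to the trigger endpoint); it is an assumption for illustration, not part of any SDK.

```python
import time

# Hypothetical scheduler sketch: trigger a collection at a fixed
# interval. `trigger_collection` is any zero-argument callable that
# starts one scrape run and returns an identifier for it.
def run_on_schedule(trigger_collection, interval_seconds, max_runs):
    """Trigger a collection `max_runs` times, waiting between runs."""
    results = []
    for run in range(max_runs):
        results.append(trigger_collection())
        if run < max_runs - 1:
            time.sleep(interval_seconds)
    return results

# Usage: a dummy trigger run three times with no delay.
snapshots = run_on_schedule(lambda: "snapshot-id", 0, 3)
print(snapshots)
```

In practice you would set `interval_seconds` to match your desired collection frequency, or hand the same callable to a cron job or task queue instead.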
GitHub No-Code Scraper
Use this "Plug and Play" scraper to start collecting data

  • Control-panel-based scraper
    The entire interaction happens within our control panel
  • Easy to use
    Add your input to the scraper and you are ready to go
  • Retrieve results from the control panel
    Results can be downloaded directly from the control panel
Web Scrapers

Available GitHub scrapers

Remove the need to develop and maintain scraping infrastructure. Simply extract high-volume web data while ensuring scalability and reliability, using web scraper APIs or no-code scrapers.

GitHub repository

URL, ID, code language, code, number of lines, user name, user URL, size, and more.
Views: 633+
Downloads: 42+

GitHub repository - discover GitHub code by repository URL

URL, ID, code language, code, number of lines, user name, user URL, size, and more.
Views: 633+
Downloads: 42+

GitHub repository - discover new records by search URL

URL, ID, code language, code, number of lines, user name, user URL, size, and more.
Views: 633+
Downloads: 42+

Just want GitHub data? Skip scraping.
Purchase a GitHub dataset

CODE EXAMPLES

Easily scrape GitHub data without worrying about being blocked.

Input
JSON
curl -H "Authorization: Bearer API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '[{"url":"https://github.com/TheAlgorithms/Python/blob/master/divide_and_conquer/power.py"},{"url":"https://github.com/AkarshSatija/msSync/blob/master/index.js"},{"url":"https://github.com/WerWolv/ImHex/blob/master/main/gui/source/main.cpp"}]' \
  "https://api.brightdata.com/datasets/v3/trigger?dataset_id=gd_lyrexgxc24b3d4imjt&format=json&uncompressed_webhook=true"
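The same trigger call can be sketched in Python. This minimal sketch only builds the request pieces (URL, headers, query parameters, body) without sending anything over the network; `API_TOKEN` is a placeholder, the dataset ID is the one from the curl example, and the helper name is our own.

```python
import json

# Sketch: assemble the pieces of the trigger request shown in the curl
# example. No HTTP call is made here; pass these to your HTTP client.
TRIGGER_URL = "https://api.brightdata.com/datasets/v3/trigger"

def build_trigger_request(api_token, page_urls,
                          dataset_id="gd_lyrexgxc24b3d4imjt"):
    """Return (url, headers, params, body) for a dataset trigger call."""
    headers = {
        "Authorization": f"Bearer {api_token}",
        "Content-Type": "application/json",
    }
    params = {
        "dataset_id": dataset_id,
        "format": "json",
        "uncompressed_webhook": "true",
    }
    body = json.dumps([{"url": u} for u in page_urls])
    return TRIGGER_URL, headers, params, body

url, headers, params, body = build_trigger_request(
    "API_TOKEN",
    ["https://github.com/TheAlgorithms/Python/blob/master/divide_and_conquer/power.py"],
)
print(params["dataset_id"], len(json.loads(body)))
```

Sending the built request with any HTTP library (for example `requests.post(url, headers=headers, params=params, data=body)`) reproduces the curl call above.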
Output
JSON
[
  {
    "db_source": "1741323179844",
    "timestamp": "2025-03-07",
    "url": "https:\/\/github.com\/videolan\/vlc\/blob\/master\/modules\/demux\/mpeg\/pes.h?raw=true",
    "id": "3299208@modules\/demux\/mpeg\/pes.h",
    "code_language": "C",
    "code": [
      "\/*****************************************************************************",
      " * pes.h: PES Packet helpers",
      " *****************************************************************************",
      " * Copyright (C) 2004-2015 VLC authors and VideoLAN",
      " *",
      " * This program is free software; you can redistribute it and\/or modify it",
      " * under the terms of the GNU Lesser General Public License as published by",
      " * the Free Software Foundation; either version 2.1 of the License, or"
    ],
    "num_lines": 168,
    "user_name": "videolan"
  },
  {
    "db_source": "1741323179844",
    "timestamp": "2025-03-07",
    "url": "https:\/\/github.com\/reactos\/reactos\/blob\/master\/modules\/rostests\/apitests\/user32\/GetUserObjectInformation.c?raw=true",
    "id": "105627846@modules\/rostests\/apitests\/user32\/GetUserObjectInformation.c",
    "code_language": "C",
    "code": [
      "\/*",
      " * PROJECT:     ReactOS API tests",
      " * LICENSE:     LGPLv2.1+ - See COPYING.LIB in the top level directory",
      " * PURPOSE:     Test for GetUserObjectInformation",
      " * PROGRAMMERS:   Thomas Faber \u003Cthomas.faber@reactos.org\u003E",
      " *\/",
      "",
      "#include \u0022precomp.h\u0022"
    ],
    "num_lines": 421,
    "user_name": "reactos"
  },
  {
    "db_source": "1741323179844",
    "timestamp": "2025-03-07",
    "url": "https:\/\/github.com\/ravynsoft\/ravynos\/blob\/main\/contrib\/tcpdump\/print-gre.c?raw=true",
    "id": "334777857@contrib\/tcpdump\/print-gre.c",
    "code_language": "C",
    "code": [
      "\/*\t$OpenBSD: print-gre.c,v 1.6 2002\/10\/30 03:04:04 fgsch Exp $\t*\/",
      "",
      "\/*",
      " * Copyright (c) 2002 Jason L. Wright (jason@thought.net)",
      " * All rights reserved.",
      " *",
      " * Redistribution and use in source and binary forms, with or without",
      " * modification, are permitted provided that the following conditions"
    ],
    "num_lines": 412,
    "user_name": "ravynsoft"
  },
  {
    "db_source": "1741323179844",
    "timestamp": "2025-03-07",
    "url": "https:\/\/github.com\/aeron-io\/aeron\/blob\/master\/aeron-driver\/src\/test\/c\/aeron_position_test.cpp?raw=true",
    "id": "16621659@aeron-driver\/src\/test\/c\/aeron_position_test.cpp",
    "code_language": "C++",
    "code": [
      "\/*",
      " * Copyright 2014-2025 Real Logic Limited.",
      " *",
      " * Licensed under the Apache License, Version 2.0 (the \u0022License\u0022);",
      " * you may not use this file except in compliance with the License.",
      " * You may obtain a copy of the License at",
      " *",
      " * https:\/\/www.apache.org\/licenses\/LICENSE-2.0"
    ],
    "num_lines": 206,
    "user_name": "aeron-io"
  },
  {
    "db_source": "1741323179844",
    "timestamp": "2025-03-07",
    "url": "https:\/\/github.com\/carbon-language\/carbon-lang\/blob\/trunk\/toolchain\/check\/testdata\/struct\/reorder_fields.carbon?raw=true",
    "id": "259463685@toolchain\/check\/testdata\/struct\/reorder_fields.carbon",
    "code_language": "Carbon",
    "code": [
      "\/\/ Part of the Carbon Language project, under the Apache License v2.0 with LLVM",
      "\/\/ Exceptions. See \/LICENSE for license information.",
      "\/\/ SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception",
      "\/\/",
      "\/\/ AUTOUPDATE",
      "\/\/ TIP: To test this file alone, run:",
      "\/\/ TIP:   bazel test \/\/toolchain\/testing:file_test --test_arg=--file_tests=toolchain\/check\/testdata\/struct\/reorder_field...",
      "\/\/ TIP: To dump output, run:"
    ],
    "num_lines": 150,
    "user_name": "carbon-language"
  }
]
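Records in the format above can be post-processed directly. A minimal sketch that totals lines of code per language, using only field names present in the sample output (`code_language`, `num_lines`, `user_name`):

```python
import json

# Toy batch using the same fields as the sample output above.
sample = json.loads("""[
  {"code_language": "C", "num_lines": 168, "user_name": "videolan"},
  {"code_language": "C", "num_lines": 421, "user_name": "reactos"},
  {"code_language": "C++", "num_lines": 206, "user_name": "aeron-io"}
]""")

def lines_per_language(records):
    """Total num_lines per code_language across a batch of records."""
    totals = {}
    for rec in records:
        totals[rec["code_language"]] = (
            totals.get(rec["code_language"], 0) + rec["num_lines"]
        )
    return totals

print(lines_per_language(sample))  # {'C': 589, 'C++': 206}
```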
DEPLOY FASTER

One API call. Tons of data.

Data Discovery

Detects data structures and patterns to ensure efficient, targeted extraction of data.

Bulk Request Handling

Reduces server load and optimizes data collection for high-volume scraping tasks.

Data Parsing

Efficiently converts raw HTML into structured data, easing data integration and analysis.

Data Validation

Ensures data reliability and saves time on manual checks and preprocessing.
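A validation pass like the one described can be sketched in a few lines. The required-field schema below is an assumption based on the data points listed on this page, not an official schema.

```python
# Assumed record schema, taken from the fields advertised on this page.
REQUIRED = {"url": str, "code_language": str, "num_lines": int, "user_name": str}

def validate_record(record):
    """Return a list of problems; an empty list means the record passes."""
    problems = []
    for field, expected in REQUIRED.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], expected):
            problems.append(f"wrong type for field: {field}")
    return problems

ok = {"url": "https://github.com/a", "code_language": "C",
      "num_lines": 10, "user_name": "a"}
print(validate_record(ok))  # []
print(validate_record({"url": "https://github.com/a", "num_lines": "10"}))
```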

UNDER THE HOOD

Never worry about proxies and CAPTCHAs again

  • Automatic IP Rotation
  • CAPTCHA Solver
  • User Agent Rotation
  • Custom Headers
  • JavaScript Rendering 
  • Residential Proxies

PRICING

GitHub Scraper API subscription plans

Pay as you go
$1.50 / 1K records
No commitment
Start free trial

Pay-as-you-go without a monthly commitment
25% OFF
Growth
$0.95 / 1K records (regular price $1.27)
$499 billed monthly
Start free trial
Use this coupon code: APIS25

Tailored for teams looking to scale their operations
25% OFF
Business
$0.84 / 1K records (regular price $1.12)
$999 billed monthly
Start free trial
Use this coupon code: APIS25

Designed for large teams with extensive operational needs
25% OFF
PREMIUM
$0.79 / 1K records (regular price $1.05)
$1999 billed monthly
Start free trial
Use this coupon code: APIS25

Advanced support and features for critical operations
Enterprise
For industry leaders: Elite data services for top-tier business requirements
Contact us
  • Account Manager
  • Custom packages
  • Premium SLA
  • Priority support
  • Tailored onboarding
  • SSO
  • Customizations
  • Audit Logs
We accept these payment methods:
BEST-IN-CLASS DX

Easy to start. Easier to scale.

Unmatched Stability

Ensure consistent performance and minimize failures by relying on the world’s leading proxy infrastructure.

Simplified Web Scraping

Put your scraping on auto-pilot using production-ready APIs, saving resources and reducing maintenance.

Unlimited Scalability

Effortlessly scale your scraping projects to meet data demands, maintaining optimal performance.

API for Seamless GitHub Data Access

Comprehensive, Scalable, and Compliant GitHub Data Extraction

FLEXIBLE

Tailored to your workflow

Get structured data in JSON, NDJSON, or CSV files through Webhook or API delivery.
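The delivery formats named above differ only in how the same records are serialized. A minimal sketch producing all three from two toy records:

```python
import csv
import io
import json

# Two toy records serialized as JSON (one array), NDJSON (one JSON
# object per line), and CSV — the delivery formats listed above.
records = [
    {"url": "https://github.com/a", "num_lines": 10},
    {"url": "https://github.com/b", "num_lines": 20},
]

as_json = json.dumps(records)
as_ndjson = "\n".join(json.dumps(r) for r in records)

buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=["url", "num_lines"])
writer.writeheader()
writer.writerows(records)
as_csv = buf.getvalue()

print(len(as_ndjson.splitlines()))  # 2
```

NDJSON is usually the most convenient for large deliveries, since each line can be parsed independently without loading the whole payload.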

SCALABLE

Built-in infrastructure and unblocking

Get maximum control and flexibility without maintaining proxy and unblocking infrastructure. Easily scrape data from any geo-location while avoiding CAPTCHAs and blocks.

STABLE

Battle-proven infrastructure

Bright Data’s platform powers 20,000+ companies worldwide, offering peace of mind with 99.99% uptime and access to 150M+ real user IPs covering 195 countries.

COMPLIANT

Industry leading compliance

Our privacy practices comply with data protection laws, including the EU data protection regulatory framework (GDPR) and the CCPA – respecting requests to exercise privacy rights and more.

GitHub Scraper API use cases

Scrape GitHub user profile data

Scrape workflows and keep up to date with trends

Scrape GitHub data to find new deployments on public repositories

Read GitHub enterprise profile and billing data

Why 20,000+ Customers Choose Bright Data

100% Compliant

Scraped data is ethically obtained and compliant with all privacy laws.

24/7 Global Support

A dedicated team of data professionals is here to help.

Complete Data Coverage

Access 150 million+ global IPs to scrape data from any website.

Unmatched Data Quality

Advanced technologies and validation methods for quality data.

Powerful Infrastructure

Scrape high-volume data without getting blocked.

Custom Solutions

Get tailored solutions to meet unique needs and goals.

Want to learn more?

Talk to an expert to discuss your scraping needs

GitHub Scraper API FAQs

What is the GitHub Scraper API?
The GitHub Scraper API is a powerful tool designed to automate data extraction from the GitHub website, allowing users to efficiently gather and process large volumes of data for various use cases.

How does the GitHub Scraper API work?
The GitHub Scraper API works by sending automated requests to the GitHub website, extracting the necessary data points, and delivering them in a structured format. This process ensures accurate and quick data collection.

What data points can be collected with the GitHub Scraper API?
Data points that can be collected with the GitHub Scraper API include URL, ID, code, number of lines, user name, user URL, size, number of issues, fork count, and other relevant data.

Is the GitHub Scraper API compliant with data protection regulations?
Yes, the GitHub Scraper API is designed to comply with data protection regulations, including GDPR and CCPA. It ensures that all data collection activities are performed ethically and legally.

Can I use the GitHub Scraper API for competitive analysis?
Absolutely! The GitHub Scraper API is ideal for competitive analysis, allowing you to gather insights into your competitors' activities, trends, and strategies on the GitHub website.

Can I integrate the GitHub Scraper API with my existing tools?
The GitHub Scraper API offers seamless integration with various platforms and tools. You can use it with your existing data pipelines, CRM systems, or analytics tools to improve your data processing capabilities.

Are there usage limits for the GitHub Scraper API?
There are no specific usage limits for the GitHub Scraper API, offering you the flexibility to scale as needed. Prices start from $0.001 per record, ensuring cost-effective scalability for your web scraping projects.

Is dedicated support available for the GitHub Scraper API?
Yes, we offer dedicated support for the GitHub Scraper API. Our support team is available 24/7 to assist you with any questions or issues you may encounter while using the API.

Which delivery destinations are supported?
Amazon S3, Google Cloud Storage, Google PubSub, Microsoft Azure Storage, Snowflake, and SFTP.

Which output file formats are available?
JSON, NDJSON (JSON Lines), CSV, and compressed .gz files.
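A compressed (.gz) NDJSON delivery can be read with the Python standard library alone. A minimal sketch that round-trips in memory; a real delivery would be a downloaded .gz file read the same way:

```python
import gzip
import json

# Two NDJSON records, gzip-compressed and then parsed back.
ndjson = b'{"url": "https://github.com/a"}\n{"url": "https://github.com/b"}\n'
compressed = gzip.compress(ndjson)

records = [json.loads(line)
           for line in gzip.decompress(compressed).splitlines() if line]
print(len(records))  # 2
```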