Skip to content

Lubitzy/markcuban-scraper

Repository files navigation

Mark Cuban Companies Web Scraper

Node Axios Cheerio License: MIT

🔍 Preview Output

image

🚀 How it Works

  1. Fetches the main page using Axios
  2. Collects all /companies/ links
  3. Loads each company page
  4. Extracts brand name, website, and social links
  5. Saves everything into example-output.csv

A lightweight Node.js web scraper built with Axios and Cheerio.
This scraper collects company information from markcubancompanies.com, including:

  • Brand name
  • Official website
  • Facebook link
  • Instagram link

All results are exported into a clean CSV file (example-output.csv).
This project is part of my portfolio to demonstrate real-world web scraping ability.


🚀 Features

  • Automatically extracts all company URLs
  • Detects official brand websites (external links only)
  • Detects Facebook and Instagram using hostname matching
  • Fallback logic when website links are not clearly labeled
  • Custom User-Agent and polite delay between requests
  • Clean CSV output with escaped formatting

▶️ Installation

Install all required dependencies:

npm install

▶️ Run the Scraper

Run the script using:

node scrape-mcc.js

📄 Output (CSV)

The scraper generates a CSV file:

example-output.csv

About

Sample Node.js web scraper that extracts company website and social media links. Includes a CSV output for demonstration.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors