- Fetches the main page using Axios
- Collects all /companies/ links
- Loads each company page
- Extracts brand name, website, and social links
- Saves everything into example-output.csv
A lightweight Node.js web scraper built with Axios and Cheerio.
This scraper collects company information from markcubancompanies.com, including:
- Brand name
- Official website
- Facebook link
- Instagram link
All results are exported into a clean CSV file (example-output.csv).
This project is part of my portfolio to demonstrate real-world web scraping ability.
- Automatically extracts all company URLs
- Detects official brand websites (external links only)
- Detects Facebook and Instagram using hostname matching
- Fallback logic when website links are not clearly labeled
- Custom User-Agent and polite delay between requests
- Clean CSV output with escaped formatting
Install all required dependencies:
npm installRun the script using:
node scrape-mcc.jsThe scraper generates a CSV file:
example-output.csv