DataMassif
HTML LinkExtractor

HTML LinkExtractor

Scraping external and internal links from the specified site can be performed on internal links up to the selected level.

HTML LinkExtractor

Scraper for external and internal links from a specified website. Supports multi-page scraping and navigation through internal pages of a website to a specified depth, allowing you to go through all pages of the website, collecting internal and external links. Has built-in tools for bypassing CloudFlare protection.

Data collected

  • Number of external links
  • Number of internal links
  • External links: links themselves
    • anchors
    • anchors stripped of HTML tags
    • nofollow parameter
    • tag in full
  • Internal links:
    • links themselves
    • anchors
    • anchors stripped of HTML tags
    • nofollow parameter
    • tag in its entirety
  • Array with all collected pages

Use Cases

  • Obtaining a complete site map (saving all internal links)
  • Obtaining all external links from the site
  • Checking backlinks to your site

Similar scrapers

Other tools in the "Content & Backlink Scrapers" category.

HTML ArticleExtractor

HTML ArticleExtractor

Collects articles from web pages: title, content with and without HTML markup

HTML EmailExtractor

HTML EmailExtractor

Scraping email addresses from website pages

HTML TextExtractor

HTML TextExtractor

Text block scraper, allows you to collect content from arbitrary websites

Net HTTP

Net HTTP

Downloads the specified page, supports multi-page scraping