HTML LinkExtractor
Scraper that collects external and internal links from a specified website. It supports multi-page scraping and can follow internal links to a specified depth, so it can walk through every page of the site and collect both internal and external links. It also has built-in tools for bypassing Cloudflare protection.
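The depth-limited navigation described above can be pictured as a breadth-first crawl. The following is only a minimal sketch of that idea, not the scraper's actual implementation: the names `crawl`, `HrefParser`, and the `fetch` callback are hypothetical, and the page source is injected through `fetch` so the example runs without network access.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit


class HrefParser(HTMLParser):
    """Collects the raw href value of every <a> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def crawl(start_url, fetch, max_depth):
    """Breadth-first crawl of internal pages, at most max_depth hops from start_url.

    fetch is an assumed callback: it takes a URL and returns that page's HTML.
    """
    host = urlsplit(start_url).netloc
    seen = {start_url}                  # pages that are queued or already visited
    queue = deque([(start_url, 0)])
    internal, external = set(), set()
    while queue:
        url, depth = queue.popleft()
        parser = HrefParser()
        parser.feed(fetch(url))
        parser.close()
        for href in parser.hrefs:
            # Resolve relative links and drop the #fragment part.
            link = urldefrag(urljoin(url, href)).url
            parts = urlsplit(link)
            if parts.scheme not in ("http", "https"):
                continue                # skip mailto:, javascript:, and similar
            if parts.netloc != host:
                external.add(link)
                continue
            internal.add(link)
            # Only follow internal links while the depth limit allows it.
            if link not in seen and depth < max_depth:
                seen.add(link)
                queue.append((link, depth + 1))
    return {
        "pages": sorted(seen),
        "internal": sorted(internal),
        "external": sorted(external),
    }


# Usage with an in-memory "site" instead of real HTTP requests.
SITE = {
    "https://example.com/": '<a href="/a">A</a> <a href="https://ext.org/">E</a>',
    "https://example.com/a": '<a href="/b">B</a> <a href="/">Home</a>',
    "https://example.com/b": '<a href="/c">C</a>',
}
report = crawl("https://example.com/", lambda url: SITE.get(url, ""), max_depth=1)
```

Links found on pages at the depth limit are still recorded, but the pages they point to are not visited. Raising `max_depth` widens the crawl by one hop at a time.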
Data collected
- Number of external links
- Number of internal links
- External links:
  - links themselves
  - anchors
  - anchors stripped of HTML tags
  - nofollow parameter
  - tag in full
- Internal links:
  - links themselves
  - anchors
  - anchors stripped of HTML tags
  - nofollow parameter
  - tag in full
- Array with all collected pages
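To make the collected fields concrete, here is a rough sketch of how one page's links could be split into internal and external, with the anchor, the tag-free anchor, the nofollow flag, and the full tag recorded for each. It uses only Python's standard library and is an illustration under assumptions: `LinkCollector`, `extract_links`, and the result keys are hypothetical names, not the scraper's real output format.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Records each <a href> with its anchor markup, anchor text, and rel flag."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.links = []
        self._cur = None  # link currently being built, while inside <a>...</a>

    def _finish(self):
        cur, self._cur = self._cur, None
        cur["tag"] += cur["anchor"] + "</a>"  # rebuild the tag in full
        cur["anchor_text"] = " ".join(cur["anchor_text"].split())
        self.links.append(cur)

    def handle_starttag(self, tag, attrs):
        raw = self.get_starttag_text()
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            if self._cur is not None:  # unclosed previous <a>: close it first
                self._finish()
            rel = (attrs.get("rel") or "").lower().split()
            self._cur = {"href": attrs["href"], "anchor": "", "anchor_text": "",
                         "nofollow": "nofollow" in rel, "tag": raw}
        elif self._cur is not None:
            self._cur["anchor"] += raw  # nested markup stays in the anchor

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are kept verbatim inside the anchor.
        if self._cur is not None:
            self._cur["anchor"] += self.get_starttag_text()

    def handle_endtag(self, tag):
        if self._cur is None:
            return
        if tag == "a":
            self._finish()
        else:
            self._cur["anchor"] += f"</{tag}>"

    def handle_data(self, data):
        if self._cur is not None:
            self._cur["anchor"] += data
            self._cur["anchor_text"] += data


def extract_links(html, base_url):
    """Split the links of one page into internal and external, relative to base_url."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    if parser._cur is not None:  # page ended inside an unclosed <a>
        parser._finish()
    host = urlsplit(base_url).netloc.lower()
    result = {"internal": [], "external": []}
    for link in parser.links:
        url = urljoin(base_url, link.pop("href"))
        parts = urlsplit(url)
        if parts.scheme not in ("http", "https"):
            continue  # ignore mailto:, javascript:, and similar
        link["url"] = url
        kind = "internal" if parts.netloc.lower() == host else "external"
        result[kind].append(link)
    result["internal_count"] = len(result["internal"])
    result["external_count"] = len(result["external"])
    return result


page = ('<p><a href="/about">About <b>us</b></a> '
        '<a rel="nofollow" href="https://other.org/x">Other</a></p>')
links = extract_links(page, "https://example.com/")
```

Here a link counts as internal only when its host matches the host of `base_url` exactly. Treating subdomains as internal would be a separate design choice.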
Use Cases
- Obtaining a complete site map (saving all internal links)
- Obtaining all external links from the site
- Checking backlinks to your site