
Hyperlink Extraction

You may want to collect and preserve the hyperlinks from one or more webpages. We recommend using the application Screaming Frog SEO.

On the Screaming Frog SEO page you will find a step-by-step guide specifically for link extraction.

Another option is Digital Methods’ tool Link Ripper. This software runs in a web browser: you enter one or more web addresses, and the software pulls out all the hyperlinks from those webpages. It can extract internal as well as external hyperlinks.
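To illustrate what extracting internal and external hyperlinks from a page involves, here is a minimal Python sketch. It is not the Link Ripper's own code. The names LinkCollector and extract_links are hypothetical, and the sketch assumes you already have the page's HTML as a string. It treats a link as internal when its host name matches the host name of the page, which is a simplification (subdomains count as external here).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(html, page_url):
    """Return (internal, external) lists of absolute http(s) links in html."""
    collector = LinkCollector()
    collector.feed(html)
    page_host = urlparse(page_url).netloc.lower()
    internal, external = [], []
    for href in collector.hrefs:
        # Resolve relative links such as "/about" against the page address.
        absolute = urljoin(page_url, href)
        parsed = urlparse(absolute)
        if parsed.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript: and similar non-web links
        if parsed.netloc.lower() == page_host:
            internal.append(absolute)
        else:
            external.append(absolute)
    return internal, external
```

Calling extract_links(html, "https://example.com/index.html") returns two lists, so the internal and external links can be saved or counted separately.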

If you need to extract hyperlinks from the text of a web page that you have already found, for instance the result page of a Google search, you may want to use Digital Methods’ Harvester tool. This software also runs in a web browser: you paste in any text, including web source code, and the software pulls out all the hyperlinks from the text.
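The idea of pulling hyperlinks out of arbitrary text can be sketched in a few lines of Python. This is an illustration only, not the Harvester's implementation. The function name harvest_urls and the regular expression are assumptions made for this example, and the pattern is deliberately simple: it only finds addresses that start with http:// or https://, and it trims trailing punctuation, which may cut off the rare address that really ends in such a character.

```python
import re

# Assumed pattern: an http(s) address runs until whitespace, a quote or an angle bracket.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+")


def harvest_urls(text):
    """Return the unique URLs found in text, in order of first appearance."""
    seen = set()
    urls = []
    for match in URL_PATTERN.findall(text):
        # Drop punctuation that usually belongs to the sentence, not the URL.
        url = match.rstrip(".,;:!?)")
        if url and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls
```

Because the function works on plain strings, the input can be copied text, a saved search result page, or raw HTML source code.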

Finally, if you need to extract the hyperlinks from a larger set of web pages and websites, we recommend using Digital Methods’ Issue Crawler.

Important: Please note that the data handling by these services may not be compliant with GDPR.

To use the Issue Crawler, you need to create a free account; the software then runs in your web browser. The Issue Crawler is made to create hyperlink network analyses, but the first step of such an analysis is crawling the websites to be included in the network, which means visiting the specified websites and extracting their hyperlinks. You can therefore use the Issue Crawler as a hyperlink extraction tool without using the network analysis itself. Just run the network analysis, and once it is done you can export the list of hyperlinks that the software generated (the option ‘retrieve startingpoints and network URLs’).
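The crawling step described above (visit the specified websites, extract their hyperlinks, optionally follow them) can be sketched generically in Python. This is not the Issue Crawler's algorithm. The names crawl, fetch and max_depth are hypothetical, and the sketch assumes the caller supplies a fetch function that returns a page's HTML as text, or None if the page cannot be retrieved.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class _Hrefs(HTMLParser):
    """Collects href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.found.extend(v for k, v in attrs if k == "href" and v)


def crawl(start_urls, fetch, max_depth=1):
    """Breadth-first crawl; returns a sorted list of (source, target) link pairs.

    fetch is a caller-supplied function mapping a URL to HTML text, or None.
    """
    queue = deque((url, 0) for url in start_urls)
    visited = set()
    edges = set()
    while queue:
        url, depth = queue.popleft()
        if url in visited:
            continue
        visited.add(url)
        html = fetch(url)
        if html is None:
            continue
        parser = _Hrefs()
        parser.feed(html)
        for href in parser.found:
            # Make the link absolute and drop any "#fragment" part.
            target, _fragment = urldefrag(urljoin(url, href))
            if not target.startswith(("http://", "https://")):
                continue
            edges.add((url, target))
            if depth < max_depth:
                queue.append((target, depth + 1))
    return sorted(edges)
```

The resulting list of source and target pairs is the raw material for a hyperlink network, and it can just as well be exported as a plain list of extracted links. Passing the fetch function in as a parameter keeps the sketch independent of any particular download library.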

Application: Please visit CDMM's Screaming Frog SEO page.

Service: https://wiki.digitalmethods.net/Dmi/ToolLinkRipper

Works on: