Data Collection

This page contains lists of tools for collecting digital data from various types of online sources, plus a list of existing collections.

About the Lists
The tools and services listed here are tried and tested, and CDMM works continuously to maintain and expand the selection.

Should you find a tool that you think we should consider adding to the list, or should you encounter something that no longer works or has changed drastically, please let us know by contacting asger [at] cc.au.dk and nb [at] cc.au.dk.

Tip
Links open in new tabs, so you can click every link of interest in a category to get a detailed overview.

Disclaimer
CDMM accepts no liability for tools or services from third parties. Technical issues or support questions concerning such tools or services must be directed to the developers' own support channels.


Web Pages

Save web pages as PDF files with good preservation of visual content and links.


Archive a webpage directly in The Internet Archive as an immediate copy and a more stable reference.


Powerful browser for web archiving purposes. Built-in screenshot function that supports manual scrolling.


Mac application for screenshots, supports scrolling webpages.


Screen recorder feature on Mac.


Free screen capture of images and video. Supports autoscroll.
Can record streaming video. Windows only.


Save web pages as single HTML files. Powerful tool for saving web pages, even from social media.


Screen capture of images and video. Supports autoscroll.
Can record streaming video.


Subscription service for screenshots, supports scrolling webpages and several modes of delivery.


Generate a timeline and thumbnails showing the development of a web page over time.


Subscription service monitoring web pages for changes with alerts to the subscriber.


Mac application, can save web pages as PDF files with good preservation of visual content and links.

Websites

Can automatically save entire websites as HTML. Vulnerable to scripts and loops. Fast but complex.


Can save web pages by elements as you click them. Results are stored online, and may be downloaded. Includes an experimental autopilot function.


Can automatically save entire websites as HTML. Vulnerable to scripts and loops.


A web scraper that also supports API extraction from social media.


Screen recorder feature on Mac.


Free screen capture of images and video. Supports autoscroll.
Can record streaming video. Windows only.


Screen capture of images and video. Supports autoscroll.
Can record streaming video.


Subscription service for screenshots, supports scrolling webpages and several modes of delivery.


Subscription service monitoring web pages for changes with alerts to the subscriber.

Social Media

Download files from Instagram accounts, hashtags and locations.


Download videos from TikTok accounts and hashtags.


Download videos, channels, playlists etc. from YouTube, Vimeo or other popular video sharing services.


Various methods for downloading videos from Facebook.


Can extract data from Facebook, YouTube, and Amazon. Functions limited by service providers' policies. Steep learning curve.


Tutorial guide for Facebook via Facepager. Warning: Limited use, steep learning curve.


A web scraper that also supports API extraction from social media.


Screen recorder feature on Mac.


Free screen capture of images and video. Supports autoscroll.
Can record streaming video. Windows only.


Screen capture of images and video. Supports autoscroll.
Can record streaming video.


Online service that explores the networks of Spotify artists.


Firefox plugin, can save most embedded video files.


Extract data for YouTube channels and videos.


Record video and chats from YouTube live streams.

Hyperlinks

Find external links pointing back to a specific website.


A short guide to extracting hyperlinks from web pages or websites.


Can extract (scrape) hyperlinks from a list of web page URLs.
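
As a rough illustration of what hyperlink extraction involves, the following Python sketch collects the links from a single page using only the standard library. It is not taken from any of the tools listed here: the names LinkCollector and extract_links are hypothetical, and the sketch assumes the HTML has already been downloaded and is passed in as a string.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in an HTML document."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url  # URL of the page, used to resolve relative links
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links such as "/about" into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return all hyperlinks found in `html` as absolute URLs, in document order."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links


if __name__ == "__main__":
    # Illustrative sample markup, not real data.
    sample = '<p><a href="/about">About</a> <a href="https://example.org/">External</a></p>'
    for link in extract_links(sample, "https://example.com/index.html"):
        print(link)
```

To process a list of web page URLs, the same function could be called once per page after fetching each page's HTML. Links that are generated by scripts after the page loads will not be found this way.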

Video

Download files from Instagram accounts, hashtags and locations.


Download videos from TikTok accounts and hashtags.


Download videos, channels, playlists etc. from YouTube, Vimeo or other popular video sharing services.


Various methods for downloading videos from Facebook.


Screen recorder feature on Mac.


Free screen capture of images and video. Supports autoscroll.
Can record streaming video. Windows only.


Screen capture of images and video. Supports autoscroll.
Can record streaming video.


Firefox plugin, can save most embedded video files.


Mac application, can save most embedded video files.

Audio

Download videos, channels, playlists etc. from YouTube, Vimeo or other popular video sharing services.


Screen recorder feature on Mac. Audio only option.


Free screen capture of images and video. Audio only option.
Supports autoscroll. Can record streaming video. Windows only.


Record sound playing on the computer, such as radio live streams.


Firefox plugin, can save most embedded video files (or audio only from the videos).


Mac application, can save most embedded video files (or audio only from the videos).

Mobile Media

Screen recorder feature on Mac.


Screen recording feature on Android 11 and later.


Screen recording feature on Apple’s mobile devices with iOS 11 and later.

Radio, TV, Newspapers, Books

A short list of Danish media archives.


Experimental API for publicly available data and metadata at the Royal Danish Library.


Danish media archives of newspapers and older books.

Existing Collections

This is a list of tried and recommended collections of media content and data.

It is not an exhaustive or complete list of existing collections.


A short list of Danish media archives.


Experimental API for publicly available data and metadata at the Royal Danish Library.


Danish media archives of newspapers and older books.


Early web data collections published at The Internet Archive.


The Danish national web archive. Restricted access.


The world's largest web archive. Open for all users.


Advice for looking further into existing web archives.


A selection of academic collections and datasets from social media.


User uploaded social media collections.