
Data Collection

CDMM accepts no liability for tools or services from third parties. Technical issues or support questions concerning such tools or services must be directed to the developers' own support.

Below you will find tools to help create your own collection, and a list of existing collections.

Create Your Own Collection


Download videos, channels, playlists etc. from YouTube, Vimeo or other popular video sharing services.


Can automatically save entire websites as HTML. Vulnerable to scripts and loops. Fast but complex.


Find external links pointing back to a specific website.


Can save web pages by elements as you click them. Results are stored online, and may be downloaded. Includes an experimental autopilot function.


Can extract data from Twitter, Facebook, YouTube, and Amazon. Functions limited by service providers' policies. Steep learning curve.


Tutorial guide for Facebook via Facepager. Warning: Limited use, steep learning curve.


Save web pages as PDF files with good preservation of visual content and links.


Can automatically save entire websites as HTML. Vulnerable to scripts and loops.


Archive a webpage directly in The Internet Archive as an immediate copy and a more stable reference.


Powerful browser for web archiving purposes. Built-in screenshot function that supports manual scrolling.


Mac application for screenshots, supports scrolling webpages.


Screen recorder feature on Mac.


Can extract (scrape) hyperlinks from a list of web page URLs.


Screen recording feature on Android 11 and later.


Screen recording feature on Apple’s mobile devices with iOS 11 and later.


Free screen capture of images and video. Supports autoscroll.
Can record streaming video. Windows only.


Screen capture of images and video. Supports autoscroll.
Can record streaming video.


Record sound playing on the computer, such as radio live streams.


Subscription service for screenshots, supports scrolling webpages and several modes of delivery.


Generate a timeline and thumbnails showing the development of a website over time.


Archive data from a Twitter account. Seems to be limited to 100 tweets per update.


Advice for various approaches to archiving Twitter content.


Firefox plugin, can save most embedded video files.


Mac application, can save most embedded video files.


Subscription service monitoring web pages for changes with alerts to the subscriber.


Can save web pages locally in the same manner as Conifer. Includes Autopilot function for automatic scrolling.


Mac application, can save web pages as PDF files with good preservation of visual content and links.


Extract data for YouTube channels and videos.


Record video and chats from YouTube live streaming.


The tools listed above are tried and trusted, and CDMM works continuously to maintain and expand the selection.

Many other tools for digital data collection exist, and they vary widely in pricing, quality and trustworthiness.

Should you find a tool that you think we should consider adding to the list, or should you encounter a tool or a tutorial in our lists that no longer works or has changed drastically, please let us know by contacting asger [at] cc.au.dk and nb [at] cc.au.dk.

Existing Collections


A short list of Danish media archives.


Experimental API for publicly available data and metadata at the Royal Danish Library.


Danish media archives of newspapers and older books.


Early web data collections published at The Internet Archive.


The Danish national web archive. Restricted access.


The world's largest web archive. Open for all users.


Advice for looking further into existing web archives.


A selection of academic collections and datasets from social media.


User-uploaded social media collections.