DE webarchive

The dataset consists of all the content from the .de top level domain as crawled by the Internet Archive.

Tags
Data and Resources
To access the resources you must log in
  • Internet Archive Wayback MachineHTML

    The original dataset is accessible through the Internet Archive's Wayback...

    The resource: 'Internet Archive Wayback ...' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Trans National Access
AccessibilityMode API Access
Attribution requirements See https://archive.org/about/terms.php
Availability On-Line
Basic rights Other rights
ChildrenData No
Consent obtained also covers the envisaged transfer of the personal data outside the EU No
Consent of the data subject No
CreationDate 1994-12-02 - 2013-09-30
Creator Internet Archive, San Francisco
DataProtectionDirective Unknown
DiskSize 60000000
Distribution requirements No re-distribution allowed
Field/Scope of use Research only
Format application/warc,application/arc
IP/Copyrights Content in the dataset may fall under copyright law
Item URL http://data.d4science.org/ctlg/ResourceCatalogue/de_webarchive
http://data.d4science.org/ctlg/ResourceCatalogue/de_webarchive
ManifestationType Replica
Personal data was manifestly made public by the data subject Yes
PersonalData Yes
PersonalSensitiveData No
ProcessingDegree Primary
Restrictions on use Access according to the Internet Archive's Terms of Use (https://archive.org/about/terms.php). No replicas may be provided. Content in the dataset may fall under copyright and data protection law.
Semantic Coverage germany
Sublicense rights No
Territory of use World Wide
ThematicCluster Web Analytics
TimeCoverage 1994-12-02 /2013-09-30
system:type SoBigData.eu: Dataset
Management Info
Field Value
Author Gerhard Gossen
Maintainer Gerhard Gossen
Version 1
Last Updated 29 June 2018, 11:32 (CEST)
Created 29 June 2018, 11:32 (CEST)