19 items found

Availability: On-Site Tags: Web data

Filter Results
  • Dataset

    Air Quality Datasets over L'Aquila Region

    These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.
  • Access required...

    ×

    Dataset

    Private Post-earthquake Reconstruction Progress Datasets over L'Aquila City

    Reconstruction data sets, provided by the National Public Entities of USRA and USRC. These data sets are stored in CSV files and provide comprehensive information related to...
  • TrainingMaterial

    Introduction to Data Curation

    This course is an introduction to data collection, data preparation & transformation and data analysis. It contains the essential concepts for a researcher in order to...
    • PDF
      The resource: 'Introduction to Data Curation' is not accessible as guest user. You must login to access it!
  • Dataset

    ClueWeb09

    The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on...
  • Dataset

    Global Peace Index data

    A dataset of the Global Peace Index (GPI), which ranks 163 independent states and territories according to their level of peacefulness. The GPI covers 99.7 per cent of the...
  • Dataset

    NYSE transactions

    This dataset contains financial data on the price of the top 250 most liquid assets of New York Stock Exchange (NYSE) from 2006 to 2014. The dataset contains transactions,...
  • Dataset

    FED data

    March 2001- September 2013 quarterly data of US banks' holdings. The number of financial institutions present in the data is pretty stable during quarters, starting from...
  • Dataset

    Retail market dataset

    The dataset contains purchases of Unicoop Tirreno customers, description and information of the shops (both small shops and supermarkets) and the customers.
  • Dataset

    Retail Market Data

    This dataset contains Retail Market Data about food products, from 2007, for about 130 shops of an Italian Distribution chain. Data are of about 1 M of Active Clients, and...
  • Dataset

    Russell 3000 stock prices

    This dataset contains the price and volume of the 3000 stocks belonging to the Russell 3000 Index, roughly corresponding to the 3000 more capitalized stocks. Traded volume and...
  • Dataset

    .ee Web archive

    .ee Web archive consisting of snapshots from 2015
  • Dataset

    Articles and comments of major Estonian newspapers

    The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016.
  • Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
  • Dataset

    German Academic Web

    The dataset contains regular crawls of the websites for German academic institutions.
  • Dataset

    MSN Search query log

    The data consists of an MSN Search query log excerpt with 15 million queries, from US users, sampled over one month of activity. Data attributes made available per query: 1)...
  • Dataset

    CoPhIR

    The CoPhIR (Content-based Photo Image Retrieval) Test-Collection has been developed to make significant tests on the scalability of the SAPIR project infrastructure (SAPIR:...
    • The resource: 'cophir.isti.cnr.it' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Interactive Learning Environments

    King’s College London developed a variety of data science materials based on R and Python. R is a de facto standard in statistical computing and visualisation, while our...
    • ZIP
      The resource: 'Rstudio docker image' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'VirtualBox' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Swirl courses' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Efficiency - Effectiveness Trade-offs in Learning to Rank

    This tutorial provides an 'Introduction to Learning to Rank' and focuses on 'Dealing with the Efficiency/Effectiveness trade-off in Web Search'. Moreover, it provides two...
    • PDF
      The resource: 'Introduction to Learning ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Dealing with the ...' is not accessible as guest user. You must login to access it!
    • python
      The resource: 'Hands-on Session 1 ' is not accessible as guest user. You must login to access it!
    • python
      The resource: 'Hands-on Session 2 ' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Publicly available ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Istella Learning to Rank ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Jupyter Notebooks

    King’s College London has developed complete stories around Jupyter Notebooks that form easy recipes for reproducible methods in social data science. Jupyter...
    • ZIP
      The resource: 'Historical Cultures Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Prediction Modelling ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Social and Cultural ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Social Sensing Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Visual Arts Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Ananke Guide' is not accessible as guest user. You must login to access it!
    • mp4
      The resource: 'Ananke Guide Video' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).