-
Wikinews dataset
This dataset consists of a sample of 365 news published by Wikinews from November 2004 to June 2014 and annotated with about 5000 entities, each associated with a saliency...-
JSON
The resource: 'entity-saliency' is not accessible as guest user. You must login to access it!
-
JSON
-
Amazon Network
Network was collected by crawling Amazon website. It is based on Customers Who Bought This Item Also Bought feature of the Amazon website. If a product i is frequently...-
HTML
The resource: 'Amazon Network ' is not accessible as guest user. You must login to access it!
-
HTML
-
The Italian Music Dataset
The dataset is built by exploiting the Spotify and SoundCloud APIs. It is composed of over 14,500 different songs of both famous and less famous Italian musicians. Each song...-
JSON
The resource: 'Dataset' is not accessible as guest user. You must login to access it!
-
JSON
-
German Academic Web
The dataset contains regular crawls of the websites for German academic institutions. -
GERDAQ Dataset
This is a benchmark dataset of annotated search-engine queries. Mentions of entities in search-engine queries are tagged with the entity they refer to. Wikipedia is used as...-
XML
The resource: 'GERDAQ dataset' is not accessible as guest user. You must login to access it!
-
XML
-
MSN Search query log
The data consists of an MSN Search query log excerpt with 15 million queries, from US users, sampled over one month of activity. Data attributes made available per query: 1)... -
Egonetworks
This package contains classes and functions for the structural analysis of ego networks. An ego network is a simple model that represents a social network from the point of... -
MaxAndSam Network Reconstruction Method
This method reconstructs socio-economic and financial networks from partial information, i.e., the knowledge of intrinsic node-specific properties and of the number of...-
RAR
The resource: 'Reconstruction of ...' is not accessible as guest user. You must login to access it!
-
RAR
-
F1-Communities
F1-Communities is a novel approach to evaluate community detection algorithms on ground truth. It leverages precision and recall to provide a scalable measure that allows to... -
DEMON
DEMON is a local-first approach to community discovery, able to unveil the modular organization of real complex networks. This is achieved by democratically letting each node...-
HTML
The resource: 'DEMON Source Code' is not accessible as guest user. You must login to access it!
-
HTML
-
QuickRank
QuickRank is an efficient Learning to Rank toolkit providing multi-threaded C++ implementation of several algorithms: GBRT, LambdaMART, Oblivious GBRT / LambdaMART,...-
URL
The resource: 'Quick Rank Test' is not accessible as guest user. You must login to access it!
-
URL
The resource: 'Quick Rank Train' is not accessible as guest user. You must login to access it!
-
URL
The resource: 'Quick Rank Train No Validation' is not accessible as guest user. You must login to access it!
-
URL
-
Maximum entropy network reconstruction
This method reconstructs a bipartite network by using the Maximum Entropy principle. This method is useful to assess aggregated and single bank's systemicness and vulnerability... -
Tail granger causality network construction
This method constructs a causality network by implementing Granger-causality tests for extreme events in multivariate time series. -
ArchiveSpark
ArchiveSpark is an Apache Spark framework for easy data access, processing, extraction as well as derivation for Web archives and archival collections. It has a simple and... -
NDlib-rest
Network Diffusion Library REST Service. This project offers a REST interface for the NDlib Python library. -
Dictionary creator
This tool creates a dictionary with inverse document frequency (idf) values from the Google NGrams dataset. -
WIRE dataset
This dataset consists of 503 pairs of Wikipedia entities drawn from the New York Times dataset with a human assigned relatedness score. The domain experts based their... -
Wikipedia Word Embeddings
Embeddings were created through applying word2vec skipgram to a corpus of wikipedia non-stub articles from a December 2015 English dump with the following parameters: -cbow 0... -
Amazon reviews
This (link to the) dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014. This dataset includes reviews...-
HTML
The resource: 'Julian McAuley's repository.' is not accessible as guest user. You must login to access it!
-
HTML