200 items found

Filter Results
  • SoBigData.eu: Method

    Noun Phrase Chunker

    Base Noun Phrase Chunker, producing NounChunk annotations.
    • method-engine
      The resource: 'Run method' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Method

    Open Nlp German Pipeline

    This method is the German tokeniser, sentence splitter and POS tagger from Apache OpenNLP.
    • method-engine
      The resource: 'Run method' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Method

    Part Of Speech Tagger For Tweets

    This service tags tweets with part-of-speech information, e.g. nouns and verbs.
    • method-engine
      The resource: 'Run method' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Method

    QuickRank

    QuickRank is an efficient Learning to Rank toolkit providing multi-threaded C++ implementation of several algorithms: GBRT, LambdaMART, Oblivious GBRT / LambdaMART,...
    • URL
      The resource: 'Quick Rank Test' is not accessible as guest user. You must login to access it!
    • URL
      The resource: 'Quick Rank Train' is not accessible as guest user. You must login to access it!
    • URL
      The resource: 'Quick Rank Train No Validation' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Method

    Epidemic Sentiment Analysis

    This tool is a sentiment analysis framework inspired by models of epidemic spreading, which aims to extend sentiment-tagged lexicons. It is easily extendable to multiple...
  • SoBigData.eu: Application

    WAT

    WAT is an entity linker, namely a tool that identifies meaningful substrings (called "spots") in an unstructured English text and link each of them to the unambiguous entity...
    • HTML
      The resource: 'Link to the Application' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    bond yield_equity log-returns_CDS spreads

    Financial data used to construct a bipartite network of systemically important banks and sovereign bonds.
  • SoBigData.eu: Dataset

    WEIBO interactions

    This dataset is obtained from the 2012 WISE Challenge: built upon the logs of the popular Chinese micro-blog service WEIBO, its interactions represent mentions of users in...
    • HTML
      The resource: 'Original data' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Sheffield NERD Tweet Corpus

    The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...
    • FINF
      The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
  • SoBigData.eu: Method

    Soccer teams ranking simulator

    This algorithm simulates the outcomes of an entire season of each team of a football league only relying on technical data (i.e., excluding the goals scored), by exploiting a...
  • SoBigData.eu: Method

    Measurement Expression Annotator

    This method annotates numbers and measurement expressions, with their normalised values in the Intenational System units.
    • method-engine
      The resource: 'Run method' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Method

    Twitter Opinion Mining English

    This tool recognises opinionated sentences in English tweets and it classifies them as positive or negative. It also indicates emotion type, author and target of the opinion,...
    • method-engine
      The resource: 'Run method' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Emergency Tweets 2009 L'Aquila earthquake

    This dataset comprises 1,100 Italian tweets shared in the aftermath of the 2009 L’Aquila earthquake (https://en.wikipedia.org/wiki/2009_L%27Aquila_earthquake). The earthquake...
    • ZIP
      The resource: 'EAQ-LAQ.zip' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    WIRE dataset

    This dataset consists of 503 pairs of Wikipedia entities drawn from the New York Times dataset with a human assigned relatedness score. The domain experts based their...
    • CSV
      The resource: 'WIRE dataset' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    ClueWeb09

    The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on...
  • SoBigData.eu: Dataset

    Wikinews dataset

    This dataset consists of a sample of 365 news published by Wikinews from November 2004 to June 2014 and annotated with about 5000 entities, each associated with a saliency...
    • JSON
      The resource: 'entity-saliency' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Open data from NervousNet

    This dataset contains anonymized proximity information sent by 154 mobile phones (both Android and iPhone) via phone apps. These information are sent by bluetooth beacons every...
    • ZIP
      The resource: 'open data from NervousNet' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    GPS Origin Destination Matrix in Tuscany

    This dataset is the origin and destination matrix among the municipalities of Tuscany extracted starting from GPS tracks of private vehicles collected from 2014-02-10 to...
    • CSV
      The resource: ' GPS Origin Destination Matrix' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Twitter fake followers

    Fake followers are fake accounts massively created to follow a target account and that can be bought from online markets. In other words, their goal is that of increasing the...
You can also access this registry using the API (see API Docs).