-
Boilerplate Removal using a Neural Sequence Labeling Model
The extraction of main content from web pages is an important task for numerous applications, ranging from usability aspects, like reader views for news articles in web... -
A Critical Reassessment of the Saerens-Latinne-Decaestecker Algorithm
We critically re-examine the Saerens-Latinne-Decaestecker (SLD) algorithm, a well-known method for estimating class prior probabilities (“priors”) and adjusting posterior...