Package: boilerpipeR
Maintainer: Mario Annau <mario.annau@gmail.com>
License: Apache License (== 2.0)
Title: Interface to the boilerpipe Java library by Christian
        Kohlschutter (http://code.google.com/p/boilerpipe/)
Authors@R: c(person("Mario", "Annau", role = c("aut", "cre"),
    email = "mario.annau@gmail.com"))
Description: Generic extraction of the main text content from HTML files;
    removal of ads, sidebars, and headers using the boilerpipe Java library.
    The boilerpipe extraction heuristics perform robustly across a wide range
    of website templates.
Version: 1.1
Date: 2013-01-10
Depends: rJava
Collate: 'Extractor.R' 'onload.R' 'boilerpipeR-package.R'
Packaged: 2014-01-11 15:40:21 UTC; hornik
Author: Mario Annau [aut, cre]
NeedsCompilation: no
Repository: CRAN
Date/Publication: 2014-01-11 16:56:38
