Package: boilerpipeR
Version: 1.2
Date: 2014-05-11
Title: Interface to the boilerpipe Java library by Christian
        Kohlschutter (http://code.google.com/p/boilerpipe/)
Authors@R: c(person("Mario", "Annau", role = c("aut", "cre"),
    email = "mario.annau@gmail.com"))
Imports: rJava
Suggests: RCurl
Description: Generic extraction of main text content from HTML files; removal
    of ads, sidebars and headers using the boilerpipe Java library. The
    extraction heuristics from boilerpipe perform robustly across a wide
    range of web site templates.
License: Apache License (== 2.0)
URL: https://github.com/mannau/boilerpipeR
BugReports: https://github.com/mannau/boilerpipeR/issues
Packaged: 2014-05-12 06:36:45 UTC; mario
Author: Mario Annau [aut, cre]
Maintainer: Mario Annau <mario.annau@gmail.com>
NeedsCompilation: no
Repository: CRAN
Date/Publication: 2014-05-12 09:22:31
