fast content extraction

https://github.com/c4milo/boilerpipe

# Sep 6, 2013