Extract text contained within an HTML document

WWW: http://search.cpan.org/dist/HTML-ContentExtractor/
