HTML::FormatText - Format HTML as plaintext
version 2.16
    use HTML::TreeBuilder;
    $tree = HTML::TreeBuilder->new->parse_file("test.html");

    use HTML::FormatText;
    $formatter = HTML::FormatText->new(leftmargin => 0, rightmargin => 50);
    print $formatter->format($tree);
or, more simply:
    use HTML::FormatText;
    my $string = HTML::FormatText->format_file(
        'test.html',
        leftmargin  => 0,
        rightmargin => 50
    );
HTML::FormatText is a formatter that outputs plain text. All character attributes (bold/italic/underline) are ignored. Formatting of HTML tables and forms is not implemented.
HTML::FormatText is built on HTML::Formatter, and the documentation for that module applies here as well, especially the descriptions of the new, format_file, and format_string methods.
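As a minimal sketch of the format_string class method inherited from HTML::Formatter, the following formats an in-memory HTML string rather than a file. The sample HTML and the variable names are illustrative assumptions, not part of the module's documentation.

```perl
use strict;
use warnings;
use HTML::FormatText;

# Illustrative input: the bold and italic markup should be dropped,
# since character attributes are ignored by this formatter.
my $html = '<h1>Title</h1><p>Some <b>bold</b> and <i>italic</i> text.</p>';

# format_string takes the HTML first, then the same options as new().
my $text = HTML::FormatText->format_string(
    $html,
    leftmargin  => 0,
    rightmargin => 50,
);

print $text;
```

The result should contain the paragraph text with no tags; the exact spacing and heading decoration depend on the formatter's layout rules.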
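You can specify the following parameters when constructing the formatter:
leftmargin (alias lm): The column of the left margin. The default is 3.
rightmargin (alias rm): The column of the right margin. The default is 72.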
This software is copyright (c) 2016 by Nigel Metheringham, 2002-2005 Sean M Burke, 1999-2002 Gisle Aas.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.