Html2text

If you have an HTML file and you just want the text without markup, you can of course display the file in Konqueror and copy the text and paste it into a new file. However, if you want to do a similar thing for a large number of files, a command-line tool is more useful.

The html2text command reads an HTML file and outputs plain text, having stripped out the HTML tags. You can even run it against a URL:

[email protected]:~ > html2text http://news.bbc.co.uk

Was this article helpful?

0 0

Post a comment