w_title(html2stx)dnl w_doc_id(h2sman)dnl w_section(1)dnl w_author(Panu A. Kalliokoski)dnl w_man_desc(convert HTML documents into Stx) ! SYNOPSIS ''html2stx'' [ /file/ ] ! DESCRIPTION ''html2stx'' takes the given /file/, which should contain an HTML document, and converts it to structured text (Stx). If no file is given, standard input is read instead. The program does not attempt to convert every possibly convertible piece of markup into Stx. For example, w_lt`'font`'w_gt tags are simply ignored. This tends to result in a nice, clean, beautiful document. (If it doesn't, the source document probably does not contain enough information to start with.) ! OPTIONS None. ! DIAGNOSTICS ''html2stx'' is a python script and will throw an exception if something goes amiss. In this case, the return value will be non-zero. ! SEE ALSO ''stx2any'' (1), _Stx markup reference_ (''PREFIX/share/doc/stx2any/examples/Stx-ref.txt'') ! BUGS - The word wrapping algorithm is probably not very clever. - Sometimes there are extra linebreaks in the output. - Probably many others. ! AUTHOR This manual page was written by w_author. ''html2stx'' is derived from the ''html2text'' utility by Aaron Swartz. ''html2text'' is a utility for converting html into "Markdown" structured text; the changes required to make it work for Stx were done by Panu Kalliokoski.