convert html to xml
I have a growing bunch of studies which I compose in LaTeX markup. I
currently these post in PDF format, on-line and freely-accessible. I
created the web site with the Debian package make4ht.
But in their present form, the studies are not readily findable by
search engines. And if they are not listed by search engines, it does
little good to publish them.
WordPress is touted as the best platform for search engine
optimization (S.E.O.). I plan to post my studies on a WordPress web
site and have someone knowledgeable apply S.E.O. techniques.
The problem is finding a way to import the studies into WordPress.
The studies are lengthy and complex, with much use of italic, bold
italic, and footnotes. Manual transcription is not practical.
HTML looked promising, and make4ht produces good HTML. But the
various plug-ins I have found for importing HTML do not work with the
current version of WordPress.
The next promising solution is XML; it appears that WordPress is able
to read XML. But I have not yet found a Debian package which is able
to convert HTML to XML.
RLH
Reply to: