On 30/08/2025 11:17, Russell L. Harris wrote:
I have a growing bunch of studies which I compose in LaTeX markup. I currently post these in PDF format, on-line and freely-accessible. I created the web site with the Debian package make4ht. But in their present form, the studies are not readily findable by search engines.
It is not uncommon for me to get PDF files in search results, which is why I suspect that something is wrong with your PDFs. Are they generated to be sent to a printer or to be published on a web site? Is "pdftotext FILE.PDF -" able to extract readable text? Does "pdfinfo FILE.PDF" list author, title, etc.? Do the links to these files have descriptive context?
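If there are many files, the two checks can be scripted. The following is only a minimal sketch, assuming poppler-utils (which provides pdfinfo and pdftotext) is installed. The helper names parse_pdfinfo, missing_fields and check_pdf are hypothetical, and treating Title and Author as the fields worth checking is my assumption, not a documented search engine requirement.

```python
import subprocess

# Fields a search engine is likely to use; this particular list is an assumption.
WANTED_FIELDS = ("Title", "Author")


def parse_pdfinfo(output):
    """Parse the 'Key: value' lines printed by pdfinfo into a dict."""
    info = {}
    for line in output.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


def missing_fields(info, wanted=WANTED_FIELDS):
    """Return the wanted fields that are absent or empty."""
    return [name for name in wanted if not info.get(name)]


def check_pdf(path):
    """Run pdfinfo and pdftotext on one file and summarize the result."""
    info_out = subprocess.run(
        ["pdfinfo", path], capture_output=True, text=True, check=True
    ).stdout
    text_out = subprocess.run(
        ["pdftotext", path, "-"], capture_output=True, text=True, check=True
    ).stdout
    return {
        "missing": missing_fields(parse_pdfinfo(info_out)),
        "has_text": bool(text_out.strip()),
    }
```

Calling check_pdf("FILE.PDF") returns a dict; a non-empty "missing" list or a false "has_text" would point at the kind of problem I suspect.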
WordPress is touted as the best platform for search engine optimization (S.E.O.).
I often see pages with poor metadata in search results. I admit there are other aspects, like markup and CSS suitable for smartphones, but again I suspect other issues with your documents. Reading Google's recommendations might provide some insights. Tuning the TeX4ht output a bit may be enough.
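To see what the generated pages currently offer, something like the following could be run over the HTML files. It is a rough sketch using only the Python standard library. The names HeadAudit and audit_html are hypothetical, and the three items checked (title, meta description, meta viewport) are my guess at a reasonable minimum, not an official list.

```python
from html.parser import HTMLParser


class HeadAudit(HTMLParser):
    """Record the page title and the named <meta> tags of a document."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.title = ""
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self.in_title = True
        elif tag == "meta" and attrs.get("name"):
            self.meta[attrs["name"].lower()] = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title += data


def audit_html(html):
    """Return the checked items that are missing or empty."""
    parser = HeadAudit()
    parser.feed(html)
    problems = []
    if not parser.title.strip():
        problems.append("title")
    for name in ("description", "viewport"):
        if not parser.meta.get(name, "").strip():
            problems.append("meta " + name)
    return problems
```

An empty list from audit_html(open("study.html").read()) means all three items are present; "study.html" is just a placeholder file name.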
The problem is finding a way to import the studies into WordPress.
You have asked this before. It seems that active subscribers on this list do not have this specific experience. I expect there are plenty of ways to import content into WordPress. You may ask your question in some LaTeX community. You may also ask in some WordPress community which formats are suitable for import (though perhaps not many participants there are familiar with LaTeX).
The next promising solution is XML;
Do you realize that XML is a rather generic data format? You need some specific format *based* on XML.
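To illustrate what "a specific format based on XML" means: WordPress export files use WXR, an extension of RSS 2.0 with extra namespaces. Below is a minimal sketch that builds a WXR-style document from (title, HTML body) pairs. The function name make_wxr and the helper q are hypothetical. Real export files carry many more elements (channel title, author, post IDs, dates), and I have not tested whether the importer accepts a file this small, so treat it as a picture of the idea rather than a working import file.

```python
import xml.etree.ElementTree as ET

# Namespace URIs as they appear in WXR 1.2 export files.
NS = {
    "content": "http://purl.org/rss/1.0/modules/content/",
    "wp": "http://wordpress.org/export/1.2/",
}
for prefix, uri in NS.items():
    ET.register_namespace(prefix, uri)


def q(prefix, name):
    """Build a namespace-qualified tag name for ElementTree."""
    return "{%s}%s" % (NS[prefix], name)


def make_wxr(posts):
    """Build a minimal WXR-style document from (title, html_body) pairs."""
    rss = ET.Element("rss", {"version": "2.0"})
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, q("wp", "wxr_version")).text = "1.2"
    for title, body in posts:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        ET.SubElement(item, q("content", "encoded")).text = body
        ET.SubElement(item, q("wp", "post_type")).text = "post"
        # Import as drafts so nothing is published by accident (my choice).
        ET.SubElement(item, q("wp", "status")).text = "draft"
    return ET.tostring(rss, encoding="unicode")
```

The HTML bodies could come from the TeX4ht output you already generate, so the XML is only a container around content produced elsewhere.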