<[[Brennen]]> WareLogging. pdftohtml. apt-get install pdftohtml # noframe option seems to reduce nonsense in output. # -i ignores images pdftohtml -noframes -i ./pdf_filename # if the PDF has copy protection stuff going on, # this will ignore it: pdftohtml -nodrm -noframes -i ./pdf_filename