1 to 3 months –
10-30 hrs/week –
I have a clear photo (jpg) of a page of a book which has been converted to pdf format. I would like a utility (able to run on linux and preferably with python bindings) which is able to convert the pdf to a mobi format. The text may contain non-latin characters (chinese, greek, etc.) and contains images and tables.
Ideally the resulting mobi document would have the text as text, while tables and images are showed as images ...