Dear @linux and @academicchatter folks:

Please suggest libre/open source tools that allow for the extraction of text and images from scientific pdf documents?

P.S: I’m on a linux machine. Would like something terminal friendly, if possible!

  • Carunga@feddit.de
    link
    fedilink
    arrow-up
    2
    ·
    7 months ago

    Try Zotero. It is a complete literature databas but it’s PDF reader is very good at extracting images and text. Works on all OS, web and mobile. Native Linux client has been very smooth for me. Oh, terminal it doesn’t do though. If you want to extract a large amount in an automated way, its probably not the right tool.