# pdftotext
> Convert PDF files to plain text format.
> More information: <https://www.xpdfreader.com/pdftotext-man.html>.

- Convert `filename.pdf` to plain text and print it to `stdout`:

`pdftotext {{filename.pdf}} -`

- Convert `filename.pdf` to plain text and save it as `filename.txt`:

`pdftotext {{filename.pdf}}`

- Convert `filename.pdf` to plain text and preserve the layout:

`pdftotext -layout {{filename.pdf}}`

- Convert `input.pdf` to plain text and save it as `output.txt`:

`pdftotext {{input.pdf}} {{output.txt}}`

- Convert pages 2, 3 and 4 of `input.pdf` to plain text and save them as `output.txt`:

`pdftotext -f {{2}} -l {{4}} {{input.pdf}} {{output.txt}}`
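
The commands above handle one file at a time. As a rough sketch of how they might be combined for a whole directory, the shell function below runs `pdftotext -layout` on each PDF and writes the text next to the source file. The name `convert_all` is a hypothetical helper for illustration, not part of `pdftotext`; only the `-layout` flag and the input/output arguments shown above are used.

```shell
# Sketch: convert every PDF in a directory to text, preserving layout.
# `convert_all` is a hypothetical helper name, not a pdftotext feature.
convert_all() {
    dir="${1:-.}"                      # default to the current directory
    for pdf in "$dir"/*.pdf; do
        # Skip the unexpanded glob pattern when no PDFs match.
        [ -e "$pdf" ] || continue
        # Write the text next to the source: foo.pdf -> foo.txt
        pdftotext -layout "$pdf" "${pdf%.pdf}.txt"
    done
}
```

Call it as `convert_all path/to/directory`. The output name is passed explicitly so the mapping from input to output is visible, even though `pdftotext` would pick the same `.txt` name by default.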