tldr/pages/common/ocrmypdf.md

29 lines
945 B
Markdown
Raw Permalink Normal View History

2021-07-23 18:28:55 +01:00
# ocrmypdf
> Generate a searchable PDF or PDF/A from a scanned PDF or an image of text.
> More information: <https://ocrmypdf.readthedocs.io/en/latest/cookbook.html>.
- Create a new searchable PDF/A file from a scanned PDF or image file:
`ocrmypdf {{path/to/input_file}} {{path/to/output.pdf}}`
- Replace a scanned PDF file with a searchable PDF file:
`ocrmypdf {{path/to/file.pdf}} {{path/to/file.pdf}}`
2021-07-23 18:28:55 +01:00
- Skip pages of a mixed-format input PDF file that already contain text:
`ocrmypdf --skip-text {{path/to/input.pdf}} {{path/to/output.pdf}}`
- Clean, de-skew, and rotate pages of a poor scan:
`ocrmypdf --clean --deskew --rotate-pages {{path/to/input_file}} {{path/to/output.pdf}}`
- Set the metadata of the searchable PDF file:
`ocrmypdf --title "{{title}}" --author "{{author}}" --subject "{{subject}}" --keywords "{{keyword; key phrase; ...}}" {{path/to/input_file}} {{path/to/output.pdf}}`
- Display help:
`ocrmypdf --help`