tldr/pages/linux/pdfgrep.md

24 lines
678 B
Markdown
Raw Normal View History

2017-10-25 21:41:07 +01:00
# pdfgrep
> Search text in PDF files.
- Find lines that match pattern in a PDF:
`pdfgrep {{pattern}} {{file.pdf}}`
- Include file name and page number for each matched line:
`pdfgrep --with-filename --page-number {{pattern}} {{file.pdf}}`
2017-11-01 22:29:10 +00:00
- Do a case insensitive search for lines that begin with "foo" and return the first 3 matches:
2017-10-25 21:41:07 +01:00
`pdfgrep --max-count {{3}} --ignore-case {{'^foo'}} {{file.pdf}}`
2017-11-01 22:27:26 +00:00
- Find pattern in files with a .pdf extension in the current directory recursively:
2017-10-25 21:41:07 +01:00
2017-10-26 00:34:41 +01:00
`pdfgrep --recursive {{pattern}}`
2017-11-01 22:27:26 +00:00
- Find pattern on files that match a specific glob in the current directory recursively:
2017-10-26 00:34:41 +01:00
2017-10-26 00:35:50 +01:00
`pdfgrep --recursive --include {{'*book.pdf'}} {{pattern}}`