tldr/pages/common/pup.md

28 lines
530 B
Markdown
Raw Normal View History

2017-10-08 17:00:15 +01:00
# pup
> HTML parsing tool.
- Transform raw HTML file into a cleaned, indented, and colored format:
`cat {{index.html}} | pup --color`
- Filter HTML by element tag name:
`cat {{index.html}} | pup '{{tag}}'`
- Filter HTML by id:
`cat {{index.html}} | pup '{{tag#id}}'`
- Filter HTML by attribute value:
`cat {{index.html}} | pup '{{tag[attribute="value"}}'`
- Print all text from selected nodes and children:
2017-10-08 17:13:22 +01:00
`cat {{index.html}} | pup '{{tag}} text{}'`
2017-10-08 17:00:15 +01:00
- Print HTML as JSON:
`cat {{index.html}} | pup '{{tag}} json{}'`