# XML
Encode and decode to and from XML. Whitespace is not preserved across round trips, but the order of the fields is.

As YAML does not have the concept of attributes, XML attributes are converted to regular fields with a prefix to prevent clobbering. The prefix defaults to "+"; use the `--xml-attribute-prefix` flag to change it.

Consecutive XML nodes with the same name are assumed to be arrays.

All values in XML are assumed to be strings, but you can use `from_yaml` to parse them into their correct types:
```bash
yq e -p=xml '.myNumberField |= from_yaml' my.xml
```

Some XML nodes have attributes followed by plain content, e.g.:
```xml
<cat name="tiger">meow</cat>
```
The content of such a node is set as a field in the map with the key "+content". Use the `--xml-content-name` flag to change this.

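
The mapping rules described so far (prefixed attributes, a content field, repeated names becoming arrays, all values kept as strings) can be illustrated with a minimal Python sketch. This is not yq's implementation: `xml_to_dict`, `element_to_value`, and the two constants are hypothetical names chosen for illustration, and the sketch groups every repeated child name, not only consecutive ones.

```python
import xml.etree.ElementTree as ET

# Assumed to mirror the documented defaults of --xml-attribute-prefix
# and --xml-content-name; both names are illustrative only.
ATTRIBUTE_PREFIX = "+"
CONTENT_NAME = "+content"


def element_to_value(element):
    """Convert one element into a str (plain leaf) or a dict."""
    text = (element.text or "").strip()
    children = list(element)
    if not element.attrib and not children:
        return text  # plain leaf: the value stays a string

    mapping = {}
    # Attributes become regular fields, prefixed to avoid clobbering children.
    for name, value in element.attrib.items():
        mapping[ATTRIBUTE_PREFIX + name] = value
    for child in children:
        value = element_to_value(child)
        if child.tag in mapping:
            # Simplification: any repeated name is collected into a list.
            existing = mapping[child.tag]
            if isinstance(existing, list):
                existing.append(value)
            else:
                mapping[child.tag] = [existing, value]
        else:
            mapping[child.tag] = value
    # Plain content next to attributes or children gets its own field.
    if text:
        mapping[CONTENT_NAME] = text
    return mapping


def xml_to_dict(source):
    """Parse an XML string and return a dict keyed by the root tag."""
    root = ET.fromstring(source)
    return {root.tag: element_to_value(root)}


print(xml_to_dict('<cat name="tiger">meow</cat>'))
# {'cat': {'+name': 'tiger', '+content': 'meow'}}
print(xml_to_dict("<zoo><animal>cat</animal><animal>dog</animal></zoo>"))
# {'zoo': {'animal': ['cat', 'dog']}}
```

The sketch ignores comments, namespaces, and mixed content; it is only meant to show the shape of the resulting map.
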
## Parse xml: with comments
Comments are preserved on a best-effort basis.

Given a `sample.xml` file of:
```xml
<!-- before cat -->
<cat>
  <!-- in cat before -->
  <x>3<!-- multi
line comment
for x --></x>
  <!-- before y -->
  <y>
    <!-- in y before -->
    <d><!-- in d before -->z<!-- in d after --></d>
    <!-- in y after -->
  </y>
  <!-- in_cat_after -->
</cat>
<!-- after cat -->
```
then
```bash
yq e -p=xml '.' sample.xml
```
will output
```yaml
# before cat
cat:
  # in cat before
  x: "3" # multi
  # line comment
  # for x
  # before y

  y:
    # in d before
    d: z # in d after
    # in y after

# after cat
```