#cbor #edn #diagnostics

bin+lib cbor-edn

Converter and processor for CBOR Diagnostic Notation (EDN)

4 releases

0.0.4 Jun 18, 2024
0.0.3 Jun 14, 2024
0.0.2 Jun 13, 2024
0.0.1 Jun 12, 2024

#545 in Parser implementations

MIT/Apache

185KB
3.5K SLoC

cbor-edn

Tools for processing CBOR Diagnostic Notation (EDN)

The parser used by this crate is a PEG (Parsing Expression Grammer) parser built from the ABNF used in the EDN specification.

The crate's main types represent not only the parsed items but also all the parts that have no bearing on the translation to CBOR (whitespace, comments, and optional trailing commas) and choices that may or may not influence the CBOR (encoding indicators). This allows detailed manipulation (for example inside comments) and a delayed processing of application oriented literals.

Parsed values are expected to round-trip to identical representations when serialized. Most manipulations of the values will ensure that their serialization output can also be round-tripped from the internal format to the EDN serialization and back into the internal format, but this can not be provided by all. (For example, removing all optional commas while retaining comments would make the previous distinction between whether a comment was before or after a comma indistinguishable).

Correct parsing does not guarantee that the value can also be encoded into CBOR. While there are aspects that could be handled at parsing time and are not (eg. tag numbers exceeding the encodable number space), there are cases that can not be handled by a library without further context or privileges (eg. the e'' application oriented literal that needs application context, or the ref'' application oriented literal that defers to relative files, accessing which can involve file or network access). Consequentially, conversion to CBOR through the various .to_cbor() methods is inherently fallible.

Completeness

Known limitations are:

  • Support for inspecting and constructing CBOR items is incomplete. The most common types can be constructed; contructing or inspecting more exotic items is possible through parsing hand-crafted EDN/CBOR and using the generated serializations, respectively.

  • Options for attaching comments and space are limited; [Item::set_comment()] is currently the only way to get data in (apart from, again, parsing hand-crafted EDN).

  • Indenting EDN works for the easy cases, but more exotic cases such as overflowing the limited width, long keys, or hash comments, easily disrupt the visual result.

Security

This library does not access network or file system in any surprising ways and does not endanger memory safety on its own. The main threat in using it is not resource bound: even without packed CBOR, heavy nesting can easily overflow the stack, and the float conversions are costly in time. Unless resource usage per user is limited, it is recommended to limit untrusted user input to the length of repeated { characters that do not yet overflow the stack.

The crate has not been audited internally or externally. As the licenses state, the software is provided "as is".

CLI application

Some functionality is available through a binary included with this crate:

$ echo "[1, 2, 'x', ip'2001:db1::/64']" | cbor-edn diag2diag
[1, 2, 'x', ip'2001:db1::/64']

License: MIT OR Apache-2.0

Dependencies

~6–15MB
~265K SLoC