#parser #string-parser #logstash

dissect

Logstash inspired dissect extractor

13 unstable releases (6 breaking)

0.7.3 Oct 26, 2023
0.7.1 Sep 26, 2023
0.7.0 May 24, 2023
0.5.0 Nov 3, 2022
0.1.2 Mar 4, 2020

#237 in Programming languages

Download history 634/week @ 2024-06-17 303/week @ 2024-06-24 316/week @ 2024-07-01 334/week @ 2024-07-08 221/week @ 2024-07-15 436/week @ 2024-07-22 446/week @ 2024-07-29 135/week @ 2024-08-05 202/week @ 2024-08-12 34/week @ 2024-08-19 82/week @ 2024-08-26 149/week @ 2024-09-02 96/week @ 2024-09-09 102/week @ 2024-09-16 346/week @ 2024-09-23 173/week @ 2024-09-30

725 downloads per month
Used in 2 crates (via tremor-script)

Apache-2.0

67KB
1.5K SLoC

Rust 1K SLoC // 0.0% comments Shell 235 SLoC // 0.0% comments

Dissect   Latest Version Build Status Quality Checks License Checks Security Checks Code Coverage

dissect parser


Dissect parsing inspired by logstash's dissect plugin.

Parses a string into a map.

Use as a library

The dissect parser was designed so that KV style parsing could be embedded into tremor's scripting language for extract operations.

The parser can also be used standalone. A fairly gnarly example of parsing logs from this libraries tests illustrates better than words can:

  let pattern = r#"%{syslog_timestamp} %{syslog_hostname} %{?syslog_prog}: %{syslog_program_aux}[%{syslog_pid:int}] %{request_unix_time} %{request_timestamp} %{request_elapsed_time} %{server_addr}:%{server_port:int} %{remote_addr}:%{remote_port:int} "%{response_content_type}" %{response_content_length} %{request_status} %{bytes_sent} %{request_length} "%{url_scheme}" "%{http_host}" "%{request_method} %{request_url} %{request_protocol}" "%{http_referer}" "%{http_user_agent}" "%{http_x_forwarded_for}" "%{http_ttrue_client_ip}" "%{remote_user}" "%{is_bot}" "%{admin_user}" "%{http_via}" "%{response_location}" "%{set_cookie}" "%{http_cookie}" "%{moawsl_info}" "%{php_message}" "%{akamai_edgescape}" "%{uid_info}" "%{geoip_country}" "%{geoip_region}" "%{geoip_city}" "%{geoip_postal}" "%{geoip_dma}" "%{server_id}" "%{txid}" "%{hpcnt}" "%{client_accept}" "%{client_accept_charset}" "%{client_accept_encoding}" "%{client_accept_language}" "%{client_accept_datetime}" "%{client_pragma}" "%{client_transfer_encoding}" "%{client_attdeviceid}" "%{client_wap_profile}" %{weblog_end}"#;
  let p = lex("%{name}%{_}%{_(|)}%{age}");
  assert!(lex(pattern).is_ok());
  assert!(p.is_ok());

Dependencies

~6MB
~104K SLoC