#symbol #string #grammar #rules #generation #generate

branchy

Provides tools for generating strings and sequences using context-free grammars

4 releases

0.2.1 May 14, 2021
0.2.0 Dec 31, 2019
0.1.1 Dec 29, 2019
0.1.0 Dec 29, 2019

#2060 in Algorithms

MIT/Apache

51KB
757 lines

docs.rs Build Status

branchy

This crate provides basic tools for generation of text strings and other sequences using context-free grammars.

Text generation example

The following example demonstrates random generation of a short sentence based on a sequence of input symbols and a set of rules, which may be applied to the symbol sequence until it is fully expanded.

use branchy::{
    Symbol,
    Rule,
    ExpanderBuilder
};

let input = vec![Symbol::Nonterminal("person"), Symbol::Terminal("comes from a"), Symbol::Nonterminal("location")];

let rules = vec![
    Rule::new("person", vec![Symbol::Nonterminal("name")]),
    Rule::new(
        "person",
        vec![Symbol::Nonterminal("name"), Symbol::Terminal("the"), Symbol::Nonterminal("occupation")]
    ),
    Rule::new("name", vec![Symbol::Terminal("Alice")]),
    Rule::new("name", vec![Symbol::Terminal("Bob")]),
    Rule::new("occupation", vec![Symbol::Terminal("blacksmith")]),
    Rule::new("occupation", vec![Symbol::Terminal("baker")]),
    Rule::new("location", vec![Symbol::Nonterminal("size"), Symbol::Nonterminal("settlement_type")]),
    Rule::new("size", vec![Symbol::Terminal("small")]),
    Rule::new("size", vec![Symbol::Terminal("big")]),
    Rule::new("settlement_type", vec![Symbol::Terminal("village")]),
    Rule::new("settlement_type", vec![Symbol::Terminal("town")])
];

let mut expander = ExpanderBuilder::from(rules).build();

let expansion_result = expander.expand(input);
assert!(expansion_result.is_ok());

let expanded_string = expansion_result.unwrap().join(" ");
println!("{}", expanded_string);

When run, this example prints out a sentence, similar to the ones below:

Alice the blacksmith comes from a big village

Bob comes from a small town

Bob the baker comes from a big town

As you can see, both the input sequence and the rules of the grammar are described in terms of Nonterminal (ones that can be further expanded) and Terminal symbols. All of the rules have a non-terminal symbol value on their left-hand side and a sequence which may contain both Nonterminal and Terminal symbols on their right-hand side.

The "magic" happens in Expander's expand() method, which repeatedly selects and applies matching rules until the sequence is fully expanded (i.e contains only terminal symbols).

By default, UniformRandomRuleSelector is used to select rules while expanding, therefore the result is randomized. As we'll see below, this can be changed, if needed, via ExpanderBuilder.

Using a custom rule selector

When constructing an Expander, you can provide your own rule selector via ExpanderBuilder's with_rule_selector() method.

The following example defines a custom rule selector, which always chooses the first matching rule, and then uses it in generation of a short phrase. As you can see, rule selectors need to implement at least the select_matching_rule() method from the RuleSelector trait.

use branchy::{
    Symbol,
    Rule,
    ExpanderBuilder,
    RuleSelector
};

struct AlwaysFirstRuleSelector;

impl<Nt, T> RuleSelector<Nt, T> for AlwaysFirstRuleSelector {
    fn select_matching_rule<'a>(&self, matching_rules: &[&'a Rule<Nt, T>]) -> Option<&'a Rule<Nt, T>> {
        if matching_rules.is_empty() {
            None
        } else {
            Some(matching_rules[0])
        }
    }
}

let input = vec![Symbol::Terminal("Have a"), Symbol::Nonterminal("adjective"), Symbol::Nonterminal("time_of_day")];

let mut expander = ExpanderBuilder::new()
    .with_new_rule("adjective", vec![Symbol::Terminal("wonderful")])
    .with_new_rule("adjective", vec![Symbol::Terminal("great")])
    .with_new_rule("time_of_day", vec![Symbol::Terminal("afternoon")])
    .with_new_rule("time_of_day", vec![Symbol::Terminal("evening")])
    .with_rule_selector(AlwaysFirstRuleSelector)
    .build();

let expanded_string = expander.expand(input).unwrap().join(" ");

assert_eq!(expanded_string, "Have a wonderful afternoon");

This example also sets the rules of the grammar directly on ExpanderBuilder via the with_new_rule() method. See the documentation of ExpanderBuilder for more helper methods.

Logging

To help you debug your grammars, branchy provides the ExpansionLogger trait, which you can implement in order to be notified of the progress of the expansion and the steps it takes.

For example, in order to log the rules selected at each step of the expansion, you can implement the on_nonterm_expanded() method. The following example writes a message via println!() on every step.

use branchy::{
    Symbol,
    Rule,
    ExpanderBuilder,
    ExpansionLogger,
    TerminalValue,
    NonterminalValue
};

struct StdOutLogger;

impl<Nt, T> ExpansionLogger<Nt, T> for StdOutLogger
    where Nt: NonterminalValue + std::fmt::Debug,
          T:  TerminalValue + std::fmt::Debug
{
    fn on_nonterm_expanded(&mut self, expanded_nonterm_value: &Nt, rule: &Rule<Nt, T>) {
        println!("expanded {:?} to {:?}", expanded_nonterm_value, rule.replacement);
    }
}

let input = vec![
    Symbol::Terminal("There is a"),
    Symbol::Nonterminal("site_description"),
    Symbol::Terminal("to the"),
    Symbol::Nonterminal("direction"),
    Symbol::Terminal("of the town.")
];

let rules = vec![
    Rule::new("site_description", vec![Symbol::Nonterminal("adjective"), Symbol::Nonterminal("site")]),
    Rule::new("adjective", vec![Symbol::Terminal("huge")]),
    Rule::new("adjective", vec![Symbol::Terminal("dark")]),
    Rule::new("site", vec![Symbol::Terminal("forest")]),
    Rule::new("site", vec![Symbol::Terminal("cave")]),
    Rule::new("direction", vec![Symbol::Terminal("north")]),
    Rule::new("direction", vec![Symbol::Terminal("east")])
];

let mut expander = ExpanderBuilder::from(rules)
    .with_logger(StdOutLogger)
    .build();

expander.expand(input).unwrap();

This example produces output similar to the following:

expanded "site_description" to [Nonterminal("adjective"), Nonterminal("site")]
expanded "adjective" to [Terminal("dark")]
expanded "site" to [Terminal("cave")]
expanded "direction" to [Terminal("east")]

Generating non-text sequences

Even though the primary use-case for branchy is generating text strings, it can be used for grammars producing other kinds of sequences. Any type implementing Clone + PartialEq can be used for values of non-terminal symbols and any type implementing Clone can be used for terminals. See NonterminalValue and TerminalValue traits.

Dependencies

~1.4–2MB
~36K SLoC