`ltlfilt`

Changing syntaxes
Altering the formula
Filtering
Using --format and --output

This tool is a filter for LTL formulas. (It will also work with PSL formulas.) It can be used to perform a number of tasks. Essentially:

converting formulas from one syntax to another,
transforming formulas,
selecting formulas matching some criterion.

Changing syntaxes

Because it read and write formulas, ltlfilt accepts all the common input and output options.

Additionally, if no -f or -F option is specified, and ltlfilt will read formulas from the standard input if it is not connected to a terminal.

For instance the following will convert two LTL formulas expressed using infix notation (with different names supported for the same operators) and convert it into LBT's syntax.

ltlfilt -l -f 'p1 U (p2 & GFp3)' -f 'X<>[]p4'

U p1 & p2 G F p3
X F G p4

Conversely, here is how to rewrite formulas expressed using the LBT's Polish notation. Let's take the following four formulas taken from examples distributed with scheck:

cat >scheck.ltl<<EOF
! | G p0 & G p1 F p3
| | X p7 F p6 & | | t p3 p7 U | f p3 p3
& U & X p0 X p4 F p1 X X U X F p5 U p0 X X p3
U p0 & | p0 p5 p1
EOF

These can be turned into something easier to read (to the human) with:

ltlfilt --lbt-input -F scheck.ltl

!(Gp0 | (Gp1 & Fp3))
p3 | Xp7 | Fp6
((Xp0 & Xp4) U Fp1) & XX(XFp5 U (p0 U XXp3))
p0 U (p1 & (p0 | p5))

Altering the formula

The following options can be used to modify the formulas that have been read.

    --boolean-to-isop      rewrite Boolean subformulas as irredundant sum of
                           products (implies at least -r1)
    --define[=FILENAME]    when used with --relabel or --relabel-bool, output
                           the relabeling map using #define statements
    --exclusive-ap=AP,AP,...   if any of those APs occur in the formula, add
                           a term ensuring two of them may not be true at the
                           same time.  Use this option multiple times to
                           declare independent groups of exclusive
                           propositions.
    --from-ltlf[=alive]    transform LTLf (finite LTL) to LTL by introducing
                           some 'alive' proposition
    --negate               negate each formula
    --nnf                  rewrite formulas in negative normal form
    --relabel[=abc|pnn]    relabel all atomic propositions, alphabetically
                           unless specified otherwise
    --relabel-bool[=abc|pnn]   relabel Boolean subexpressions, alphabetically
                           unless specified otherwise
    --remove-wm            rewrite operators W and M using U and R (this is
                           an alias for --unabbreviate=WM)
    --remove-x             remove X operators (valid only for
                           stutter-insensitive properties)
-r, --simplify[=LEVEL]     simplify formulas according to LEVEL (see below);
                           LEVEL is set to 3 if omitted
    --sonf[=PREFIX]        rewrite formulas in suffix operator normal form
    --sonf-aps[=FILENAME]  when used with --sonf, output the newly introduced
                           atomic propositions
    --unabbreviate[=STR]   remove all occurrences of the operators specified
                           by STR, which must be a substring of "eFGiMRW^",
                           where 'e', 'i', and '^' stand respectively for
                           <->, ->, and xor.  If no argument is passed, the
                           subset "eFGiMW^" is used.

As with randltl, the -r option can be used to simplify formulas.

ltlfilt --lbt-input -F scheck.ltl -r

F!p0 & (F!p1 | G!p3)
p3 | Xp7 | Fp6
Fp1 & XX(XFp5 U (p0 U XXp3))
p0 U (p1 & (p0 | p5))

You may notice that operands of n-ary operators such as & or | can be reordered by ltlfilt even when the formula is not changed otherwise. This is because Spot internally order all operands of commutative and associative operators, and that this order depends on the order in which the subformulas are first encountered. Adding transformation options such as -r may alter this order. However this difference is semantically insignificant.

Formulas can be easily negated using the -n option, rewritten into negative normal form using the --nnf option, and the W and M operators can be rewritten using U and R using the --remove-wm option (note that this is already done when a formula is output in Spin's syntax).

Another way to alter formula is to rename the atomic propositions it uses. The --relabel=abc will relabel all atomic propositions using letters of the alphabet, while --relabel=pnn will use p0, p1, etc. as in LBT's syntax.

ltlfilt --lbt-input -F scheck.ltl -r --relabel=abc

F!a & (F!b | G!c)
a | Xb | Fc
Fa & XX(XFb U (c U XXd))
a U (b & (a | c))

Note that the relabeling is reset between each formula: p3 became c in the first formula, but it became d in the third.

Another use of relabeling is to get rid of complex atomic propositions such as the one shown when presenting lenient mode:

ltlfilt --lenient --relabel=pnn -f '(a < b) U (process[2]@ok)'

p0 U p1

Finally, there is a second variant of the relabeling procedure that is enabled by --relabel-bool=abc or --relabel-book=pnn. With this option, Boolean subformulas that do not interfere with other subformulas will be changed into atomic propositions. For instance:

ltlfilt -f '(a & !b) & GF(a & !b) & FG(!c)' --relabel-bool=pnn
ltlfilt -f '(a & !b) & GF(a & !b) & FG(!c & a)' --relabel-bool=pnn

p0 & GFp0 & FGp1
p0 & p1 & GF(p0 & p1) & FG(p0 & p2)

In the first formula, the independent a & !b and !c subformulas were respectively renamed p0 and p1. In the second formula, a & !b and !c & a are dependent so they could not be renamed; instead a, !b and c were renamed as p0, p1 and p2.

This option was originally developed to remove superfluous formulas from benchmarks of LTL translators. For instance the automata generated for GF(a|b) and GF(p0) should be structurally equivalent: replacing p0 by a|b in the second automaton should turn in into the first automaton, and vice-versa. (However algorithms dealing with GF(a|b) might be slower because they have to deal with more atomic propositions.) So given a long list of LTL formulas, we can combine --relabel-bool and -u to keep only one instance of formulas that are equivalent after such relabeling. We also suggest to use --nnf so that !FG(a -> b) would become GF(p0) as well. For instance here are some LTL formulas extracted from an industrial project:

ltlfilt --nnf -u --relabel-bool <<EOF
G (hfe_rdy -> F !hfe_req)
G (lup_sr_valid -> F lup_sr_clean )
G F (hfe_req)
reset && X G (!reset)
G ( (F hfe_clk) && (F ! hfe_clk) )
G ( (F lup_clk) && (F ! lup_clk) )
G F (lup_sr_clean)
G ( ( !(lup_addr_5_ <-> (X lup_addr_5_)) || !(lup_addr_6_ <-> (X lup_addr_6_)) || !(lup_addr_7_ <-> (X lup_addr_7_)) || !(lup_addr_8_ <-> (X lup_addr_8_)) ) -> ( (X !lup_sr_clean) && X ( (!( !(lup_addr_5_ <-> (X lup_addr_5_)) || !(lup_addr_6_ <-> (X lup_addr_6_)) || !(lup_addr_7_ <-> (X lup_addr_7_)) || !(lup_addr_8_ <-> (X lup_addr_8_)) )) U lup_sr_clean ) ) )
G F ( !(lup_addr_5_ <-> (X lup_addr_5_)) || !(lup_addr_6_ <-> (X lup_addr_6_)) || !(lup_addr_7_ <-> (X lup_addr_7_)) || !(lup_addr_8_ <-> (X lup_addr_8_)) )
(lup_addr_8__5__eq_0)
((hfe_block_0__eq_0)&&(hfe_block_1__eq_0)&&(hfe_block_2__eq_0)&&(hfe_block_3__eq_0))
G ((lup_addr_8__5__eq_0) -> X( (lup_addr_8__5__eq_0) || (lup_addr_8__5__eq_1) ) )
G ((lup_addr_8__5__eq_1) -> X( (lup_addr_8__5__eq_1) || (lup_addr_8__5__eq_2) ) )
G ((lup_addr_8__5__eq_2) -> X( (lup_addr_8__5__eq_2) || (lup_addr_8__5__eq_3) ) )
G ((lup_addr_8__5__eq_3) -> X( (lup_addr_8__5__eq_3) || (lup_addr_8__5__eq_4) ) )
G ((lup_addr_8__5__eq_4) -> X( (lup_addr_8__5__eq_4) || (lup_addr_8__5__eq_5) ) )
G ((lup_addr_8__5__eq_5) -> X( (lup_addr_8__5__eq_5) || (lup_addr_8__5__eq_6) ) )
G ((lup_addr_8__5__eq_6) -> X( (lup_addr_8__5__eq_6) || (lup_addr_8__5__eq_7) ) )
G ((lup_addr_8__5__eq_7) -> X( (lup_addr_8__5__eq_7) || (lup_addr_8__5__eq_8) ) )
G ((lup_addr_8__5__eq_8) -> X( (lup_addr_8__5__eq_8) || (lup_addr_8__5__eq_9) ) )
G ((lup_addr_8__5__eq_9) -> X( (lup_addr_8__5__eq_9) || (lup_addr_8__5__eq_10) ) )
G ((lup_addr_8__5__eq_10) -> X( (lup_addr_8__5__eq_10) || (lup_addr_8__5__eq_11) ) )
G ((lup_addr_8__5__eq_11) -> X( (lup_addr_8__5__eq_11) || (lup_addr_8__5__eq_12) ) )
G ((lup_addr_8__5__eq_12) -> X( (lup_addr_8__5__eq_12) || (lup_addr_8__5__eq_13) ) )
G ((lup_addr_8__5__eq_13) -> X( (lup_addr_8__5__eq_13) || (lup_addr_8__5__eq_14) ) )
G ((lup_addr_8__5__eq_14) -> X( (lup_addr_8__5__eq_14) || (lup_addr_8__5__eq_15) ) )
G ((lup_addr_8__5__eq_15) -> X( (lup_addr_8__5__eq_15) || (lup_addr_8__5__eq_0) ) )
G (((X hfe_clk) -> hfe_clk)->((hfe_req->X hfe_req)&&((!hfe_req) -> (X !hfe_req))))
G (((X lup_clk) -> lup_clk)->((lup_sr_clean->X lup_sr_clean)&&((!lup_sr_clean) -> (X !lup_sr_clean))))
EOF

G(a | Fb)
GFa
a & XG!a
G(Fa & F!a)
G((((a & Xa) | (!a & X!a)) & ((b & Xb) | (!b & X!b)) & ((c & Xc) | (!c & X!c)) & ((d & Xd) | (!d & X!d))) | (X!e & X((((a & Xa) | (!a & X!a)) & ((b & Xb) | (!b & X!b)) & ((c & Xc) | (!c & X!c)) & ((d & Xd) | (!d & X!d))) U e)))
GF((a & X!a) | (!a & Xa) | (b & X!b) | (!b & Xb) | (c & X!c) | (!c & Xc) | (d & X!d) | (!d & Xd))
a
G(!a | X(a | b))
G((!b & Xb) | ((!a | Xa) & (a | X!a)))

Here 29 formulas were reduced into 9 formulas after relabeling of Boolean subexpression and removing of duplicate formulas. In other words the original set of formulas contains 9 different patterns.

An option that can be used in combination with --relabel or --relabel-bool is --define. This causes the correspondence between old and new names to be printed as a set of #define statements.

ltlfilt -f '(a & !b) & GF(a & !b) & FG(!c)' --relabel-bool=pnn --define --spin

#define p0 (a && !b)
#define p1 (!c)
p0 && []<>p0 && <>[]p1

This can be used for instance if you want to use some complex atomic propositions with third-party translators that do not understand them. For instance the following sequence show how to use ltl3ba to create a neverclaim for an LTL formula containing atomic propositions that ltl3ba cannot parse:

ltlfilt -f '"proc@loc1" U "proc@loc2"' --relabel=pnn --define=ltlex.def --spin |
  ltl3ba -F - >ltlex.never
cat ltlex.def ltlex.never

#define p0 ((proc@loc1))
#define p1 ((proc@loc2))
never { /* p0 U p1  */
T0_init:
	if
	:: (!p1 && p0) -> goto T0_init
	:: (p1) -> goto accept_all
	fi;
accept_all:
	skip
}

As a side note, the tool ltldo might be a simpler answer to this syntactic problem:

ltldo ltl3ba -f '"proc@loc1" U "proc@loc2"' --spin

never {
T0_init:
  if
  :: ((proc@loc1) && (!(proc@loc2))) -> goto T0_init
  :: (proc@loc2) -> goto accept_all
  fi;
accept_all:
  skip
}

This case also relabels the formula before calling ltl3ba, and it then rename all the atomic propositions in the output.

An example showing how to use the --from-ltlf option is on a separate page.

Filtering

ltlfilt supports many ways to filter formulas:

    --accept-word=WORD     keep formulas that accept WORD
    --ap=RANGE             match formulas with a number of atomic
                           propositions in RANGE
    --boolean              match Boolean formulas
    --bsize=RANGE          match formulas with Boolean size in RANGE
    --equivalent-to=FORMULA   match formulas equivalent to FORMULA
    --eventual             match pure eventualities
    --guarantee            match guarantee formulas (even pathological)
    --implied-by=FORMULA   match formulas implied by FORMULA
    --imply=FORMULA        match formulas implying FORMULA
    --liveness             match liveness properties
    --ltl                  match only LTL formulas (no PSL operator)
-N, --nth=RANGE            assuming input formulas are numbered from 1, keep
                           only those in RANGE
    --obligation           match obligation formulas (even pathological)
    --persistence          match persistence formulas (even pathological)
    --recurrence           match recurrence formulas (even pathological)
    --reject-word=WORD     keep formulas that reject WORD
    --safety               match safety formulas (even pathological)
    --size=RANGE           match formulas with size in RANGE
    --stutter-insensitive, --stutter-invariant
                           match stutter-insensitive LTL formulas
    --suspendable          synonym for --universal --eventual
    --syntactic-guarantee  match syntactic-guarantee formulas
    --syntactic-obligation match syntactic-obligation formulas
    --syntactic-persistence   match syntactic-persistence formulas
    --syntactic-recurrence match syntactic-recurrence formulas
    --syntactic-safety     match syntactic-safety formulas
    --syntactic-stutter-invariant, --nox
                           match stutter-invariant formulas syntactically
                           (LTL-X or siPSL)
    --universal            match purely universal formulas
-u, --unique               drop formulas that have already been output (not
                           affected by -v)
-v, --invert-match         select non-matching formulas

Most of the above options should be self-explanatory. For instance the following command will extract all formulas from scheck.ltl which do not represent guarantee properties.

ltlfilt --lbt-input -F scheck.ltl -v --guarantee

!(Gp0 | (Gp1 & Fp3))

Combining ltlfilt with randltl makes it easier to generate random formulas that respect certain constraints. For instance let us generate 10 formulas that are equivalent to a U b:

randltl -n -1 a b | ltlfilt --equivalent-to 'a U b' -n 10

b W (a U b)
a U b
!(!a R !b)
b | (a U b)
(a xor (a & b)) U b
a U ((a | !a) R b)
(b <-> !b) U (a U b)
(a | b) U b
(a U b) & (b U !FX(0))
(a U b) <-> ((b | X(0)) -> b)

The -n -1 option to randltl will cause it to output an infinite stream of random formulas. ltlfilt, which reads its standard input by default, will select only those equivalent to a U b. The output of ltlfilt is limited to 10 formulas using -n 10. (As would using | head -n 10.) Less trivial formulas could be obtained by adding the -r option to randltl (or equivalently adding the -r and -u option to ltlfilt).

Another similar example, that requires two calls to ltlfilt, is the generation of random pathological safety formulas. Pathological safety formulas are safety formulas that do not look so syntactically. We can generate some starting again with randltl, then ignoring all syntactic safety formulas, and keeping only the safety formulas in the remaining list.

randltl -r -n -1 a b | ltlfilt -v --syntactic-safety | ltlfilt --safety -n 10

F((!b & GF!b) | (b & FGb))
a | G((a & GFa) | (!a & FG!a))
XXG(!a & (Fa W Gb))
G(Ga | (F!a & X!b))
b W ((!b & (a W XG!b)) | (b & (!a M XFb)))
Xa W (b | ((!b M F!a) R !a))
Xa | (a & ((!a & F!b) | (a & Gb))) | (!a & ((a & F!b) | (!a & Gb)))
(b M a) & XGb
Xb | (!a U !b)
(!b & (!b W !a)) | (b & (b M a))

ltlfilt's filtering ability can also be used to answer questions about a single formula. For instance is a U (b U a) equivalent to b U a?

ltlfilt -f 'a U (b U a)' --equivalent-to 'b U a'

a U (b U a)

The command prints the formula and returns an exit status of 0 if the two formulas are equivalent. It would print nothing and set the exit status to 1, were the two formulas not equivalent.

Is the formula F(a & X(!a & Gb)) stutter-invariant?

ltlfilt -f 'F(a & X(!a & Gb))' --stutter-invariant

F(a & X(!a & Gb))

Yes it is. And since it is stutter-invariant, there exist some equivalent formulas that do not use X operator. The --remove-x option gives one:

ltlfilt -f 'F(a & X(!a & Gb))' --remove-x

F(a & ((a & (a U (!a & Gb)) & ((!b U !a) | (b U !a))) | (!a & (!a U (a & !a & Gb)) & ((!b U a) | (b U a))) | (b & (b U (!a & !b & Gb)) & ((!a U !b) | (a U !b))) | (!b & (!b U (!a & b & Gb)) & ((!a U b) | (a U b))) | (!a & Gb & (G!a | Ga) & (Gb | G!b))))

We could even verify that the resulting horrible formula is equivalent to the original one:

ltlfilt -f 'F(a & X(!a & Gb))' --remove-x | ltlfilt --equivalent-to 'F(a & X(!a & Gb))'

F(a & ((a & (a U (!a & Gb)) & ((!b U !a) | (b U !a))) | (!a & (!a U (a & !a & Gb)) & ((!b U a) | (b U a))) | (b & (b U (!a & !b & Gb)) & ((!a U !b) | (a U !b))) | (!b & (!b U (!a & b & Gb)) & ((!a U b) | (a U b))) | (!a & Gb & (G!a | Ga) & (Gb | G!b))))

It is therefore equivalent (otherwise the output would have been empty).

The difference between --size and --bsize lies in the way Boolean subformula are counted. With --size the size of the formula is exactly the number of atomic propositions and operators used. For instance the following command generates 10 random formulas with size 5 (the reason randltl uses --tree-size=8 is because the "tree" of the formula generated randomly can be reduced by trivial simplifications such as !!f being rewritten to f, yielding formulas of smaller sizes).

randltl -n -1 --tree-size=8 a b | ltlfilt --size=5 -n 10

!F!Ga
X!(a U b)
!G(a & b)
(b W a) W 0
b R X!b
GF!Xa
FGXX(0)
Xb & Ga
a xor !Fb
a xor FXb

With --bsize, any Boolean subformula is counted as "1" in the total size. So F(a & b & c) would have Boolean-size 2. This type of size is probably a better way to classify formulas that are going to be translated as automata, since transitions are labeled by Boolean formulas: the complexity of the Boolean subformulas has little influence on the overall translation. Here are 10 random formula with Boolean-size 5:

randltl -n -1 --tree-size=12 a b | ltlfilt --bsize=5 -n 10

F(1 U X(0))
Gb xor Fa
FX!Fa
(a -> !b) & XFb
0 R (a U !b)
XXa R !b
(!a & !(!a xor b)) xor (0 R b)
GF(1 U b)
(a U b) R b
!(Ga M 1)

Using `--format` and `--output`

The --format option can be used the alter the way formulas are output. The list of supported %-escape sequences are recalled in the --help output:

%<                         the part of the line before the formula if it
                           comes from a column extracted from a CSV file
%>                         the part of the line after the formula if it comes
                           from a column extracted from a CSV file
%%                         a single %
%b                         the Boolean-length of the formula (i.e., all
                           Boolean subformulas count as 1)
%f                         the formula (in the selected syntax)
%F                         the name of the input file
%h, %[vw]h                 the class of the formula is the Manna-Pnueli
                           hierarchy ([v] replaces abbreviations by class
                           names, [w] for all compatible classes)
%L                         the original line number in the input file
%[OP]n                     the nesting depth of operator OP.  OP should be a
                           single letter denoting the operator to count, or
                           multiple letters to fuse several operators during
                           depth evaluation.  Add '~' to rewrite the formula
                           in negative normal form before counting.
%r                         wall-clock time elapsed in seconds (excluding
                           parsing)
%R, %[LETTERS]R            CPU time (excluding parsing), in seconds; Add
                           LETTERS to restrict to (u) user time, (s) system
                           time, (p) parent process, or (c) children
                           processes.
%s                         the length (or size) of the formula
%x, %[LETTERS]X, %[LETTERS]x   number of atomic propositions used in the
                           formula;  add LETTERS to list atomic propositions
                           with (n) no quoting, (s) occasional double-quotes
                           with C-style escape, (d) double-quotes with
                           C-style escape, (c) double-quotes with CSV-style
                           escape, (p) between parentheses, any extra
                           non-alphanumeric character will be used to
                           separate propositions

As a trivial example, use

--latex --format='$%f$'

to enclose formula in LaTeX format with $...$ .

But --format can be useful in more complex scenarios. For instance you could print only the line numbers containing formulas matching some criterion. In the following, we print only the numbers of the lines of scheck.ltl that contain guarantee formulas:

ltlfilt --lbt-input -F scheck.ltl --guarantee --format=%L

2
3
4

We could also prefix each formula by its size, in order to sort the file by formula size:

ltlfilt --lbt-input scheck.ltl --format='%s,%f' | sort -n

7,p0 U (p1 & (p0 | p5))
7,p3 | Xp7 | Fp6
9,!(Gp0 | (Gp1 & Fp3))
20,((Xp0 & Xp4) U Fp1) & XX(XFp5 U (p0 U XXp3))

More examples of how to use --format to create CSV files are on a separate page

The --output option interprets its argument as an output filename, but after evaluating the %-escape sequence for each formula. This makes it very easy to partition a list of formulas in different files. For instance here is how to split scheck.ltl according to formula sizes.

ltlfilt --lbt-input scheck.ltl --output='scheck-%s.ltl'
wc -l scheck*.ltl

1 scheck-20.ltl
2 scheck-7.ltl
1 scheck-9.ltl
4 scheck.ltl
8 total

ltlfilt

Table of Contents

Changing syntaxes

Altering the formula

Filtering

Using --format and --output

`ltlfilt`

Using `--format` and `--output`