Crates.io | tree-sitter-souffle |
lib.rs | tree-sitter-souffle |
version | 0.4.0 |
source | src |
created_at | 2022-10-21 22:44:10.567489 |
updated_at | 2022-10-24 18:54:28.984072 |
description | Soufflé grammar for the tree-sitter parsing library |
homepage | https://github.com/langston-barrett/tree-sitter-souffle |
repository | https://github.com/langston-barrett/tree-sitter-souffle |
id | 693973 |
size | 776,408 |
A tree-sitter grammar for Soufflé.
The grammar is fairly complete. It parses:
There are a few known issues.
This grammar aims to support the following use-cases:
It does not currently aim to support round-trip printing (i.e., code formatting and refactoring).
This parser has limited support for parsing C pre-processor #line directives, which may be helpful for analysis tasks. To avoid over-complicating the grammar, it does not handle directives in the middle of top-level entities (e.g., in between two conjuncts of a rule). You can configure your preprocessor to not emit such tokens (-P for mcpp).
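If changing the preprocessor configuration is not an option, a possible fallback is to blank out line directives in the preprocessed text before handing it to the parser. The sketch below is not part of this crate. The helper names is_line_directive and blank_line_directives are hypothetical, and it assumes directives appear either as #line N "file" or in the GCC style # N "file". Note that it discards the location information entirely, so it only suits tasks that do not need it.

```rust
/// Returns true if `line` looks like a preprocessor line directive.
/// Assumption: directives look like `#line 12 "file.dl"` or `# 12 "file.dl"`.
fn is_line_directive(line: &str) -> bool {
    // A directive must start with `#` (leading whitespace is tolerated).
    let rest = match line.trim_start().strip_prefix('#') {
        Some(rest) => rest.trim_start(),
        None => return false,
    };
    // Skip the optional `line` keyword, which must be followed by whitespace.
    let rest = match rest.strip_prefix("line") {
        Some(after) => {
            if !after.starts_with(|c: char| c.is_whitespace()) {
                return false;
            }
            after.trim_start()
        }
        None => rest,
    };
    // What remains must begin with the line number.
    rest.starts_with(|c: char| c.is_ascii_digit())
}

/// Replaces every line directive with an empty line, so the remaining
/// lines keep their positions in the preprocessed text.
fn blank_line_directives(source: &str) -> String {
    source
        .lines()
        .map(|line| if is_line_directive(line) { "" } else { line })
        .collect::<Vec<_>>()
        .join("\n")
}
```

Blanking the lines rather than deleting them keeps the row numbers reported by the parser aligned with the preprocessed file.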
The grammar doesn't mirror the structure of the Soufflé C++ parser implementation nor the grammar as presented in the Soufflé documentation. It instead tries to map nonterminals to more abstract categories, i.e., categories that match the way we think about the language. For instance, the Soufflé C++ parser doesn't have a nonterminal for constants, whereas this parser does. As another example, the Soufflé documentation has this specification for type declarations:
type_decl ::= '.type' IDENT ("<:" type_name | "=" ( type_name ( "|" type_name )* | record_list | adt_branch ( "|" adt_branch )* ))
This grammar instead has something more like
type_decl ::= subtype | type_synonym | type_union | record | adt
so that the parse tree encodes a bit more "semantic" information. The hope is that these choices make it easier to write tree-sitter queries.
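As a rough illustration of why this shape is convenient for consumers, the five alternatives map naturally onto a sum type. The Rust sketch below is hypothetical: the TypeDecl enum, its fields, and the category method are made up for this example and are not an API of this crate, and the actual node names in the grammar may differ. The Soufflé snippets in the comments are only meant to suggest what each category covers.

```rust
/// Hypothetical mirror of the five type-declaration categories above.
#[derive(Debug, PartialEq)]
enum TypeDecl {
    /// e.g. `.type Even <: number`
    Subtype { name: String, base: String },
    /// e.g. `.type Id = number`
    TypeSynonym { name: String, target: String },
    /// e.g. `.type Value = Int | Str`
    TypeUnion { name: String, members: Vec<String> },
    /// e.g. `.type Pair = [a: number, b: symbol]`
    Record { name: String, fields: Vec<(String, String)> },
    /// e.g. `.type Expr = Lit { n: number } | Neg { e: Expr }`
    Adt { name: String, branches: Vec<String> },
}

impl TypeDecl {
    /// Returns the category name used in the production above.
    /// A single match is enough; no token inspection is needed.
    fn category(&self) -> &'static str {
        match self {
            TypeDecl::Subtype { .. } => "subtype",
            TypeDecl::TypeSynonym { .. } => "type_synonym",
            TypeDecl::TypeUnion { .. } => "type_union",
            TypeDecl::Record { .. } => "record",
            TypeDecl::Adt { .. } => "adt",
        }
    }
}
```

With the flat production from the Soufflé documentation, a consumer would instead have to look at the operator and the shape of the right-hand side to recover the same distinction.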
For the same reason, the grammar doesn't specify keywords and operators like .functor, eqrel, *, and :- as their own nonterminals, but rather inlines them into the grammar.
Run
tree-sitter test
or equivalently
npm test
This script downloads the Soufflé, ddisasm, cclyzer++, and other repos and attempts to parse their Datalog files.
./scripts/parse-examples
CHANGELOG.md
Cargo.toml
git checkout main && git pull origin && git tag -a vX.Y.Z -m vX.Y.Z && git push --tags
cargo publish