-
-
Notifications
You must be signed in to change notification settings - Fork 425
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ambiguities and Priorities in 1.2.1 earley #1443
Comments
Hi @MaxOstrowski , What arguments are you giving to Lark? Specifically as the "ambiguity" parameter? |
We tried all of them, none of them made any difference, so we thought that we might have used it wrong? |
Thank you for reporting this bug. Looks like the issue is caused by a bugfix I included in v1.2.1. I just created a PR with a fix: #1444 If you can test it, let me know if it fixed the issue. I will probably make a new release later today. |
Okay, new version released - https://github.com/lark-parser/lark/releases/tag/1.2.2 It should be now fixed. Let me know otherwise. Sorry for the inconvenience! |
Actually I will re-open. Perhaps your problem isn't entirely solved. P.S. regarding the efficiency concerns - Earley must be aware of all the ambiguities in order to parse the text correctly. Lark stores the ambiguities in the SPPF structure, which is fairly efficient. I think the final step can be rewritten a bit more efficiently, because we currently produce the trees even for the root solutions that we don't return. It just takes a bit of reshuffling, I'll try and do it tomorrow. |
Regarding our problem, with the newest version (1.2.2) we do not get any ambiguous solutions anymore. Thanks a lot! |
Since lark version 1.2.1, there seem to be (more?) ambiguities produced for our syntax.
While the syntax is ambiguous, we used prioritiies to fix this before.
So whenever a condition ends with a number, like:
x = 5.
there are 2 interpretations, either the dot comes from the start rule, and 5 is an INTEGER, or the DOT comes from a float
5.
and the dot from the start rule is omitted.The priority
unsigned_integer.1: INT
fixed this before.but it feels like a big waste of performance that all possible ambiguities are produced.
Is there a way to enforce only producing the most preferred ast?
Might be related to #1441.
The text was updated successfully, but these errors were encountered: