Vocabulary interface missing an iterable or equivalent method #904

grosenberg · 2015-06-15T04:45:34Z

Depreciation of Recognizer.getTokenNames() presently leaves no equivalent or convenient method to iterate over the valid set of symbolic token names. Since the underlying String[] is now private, cannot iterate based on size.

Adding a Vocabulary.getSymbolicNames() method, returning a String[], would be helpful.

sharwell · 2015-06-15T14:35:48Z

💡 ATN.maxTokenType should provide you with the upper bound.

parrt · 2015-06-20T00:14:57Z

I agree that we should have a way to iterate or get the list but I don't think we can change the interface unless we make a major version, such as 4.6 but I'm heading to 4.5.1 shortly. @sharwell do you have a suggestion for an enhancement to the interface?

sharwell · 2015-06-20T19:57:29Z

We can certainly update the VocabularyImpl class, and you can change the generated VOCABULARY field to be an instance of VocabularyImpl instead of just Vocabulary.

As for the specific API, I'm not sure yet because Vocabulary is meant to expose multiple different pieces of information at each index, but iterating over it would only provide a single piece of information.

grosenberg · 2015-06-20T21:54:21Z

@sharwell @parrt - the need for a list or iterable occurs, e.g., in an editor to enable quick fix help as symbolic names are being typed in. Thus the need/use is not index related. Just the single method returning the symbolic names is needed.

parrt · 2015-10-12T16:12:36Z

ha! I just needed this myself.

On Sat, Jun 20, 2015 at 2:54 PM, GRosenberg notifications@github.com
wrote:

@sharwell https://github.com/sharwell @parrt https://github.com/parrt

the need for a list or iterable occurs, e.g., in an editor to enable
quick fix help as symbolic names are being typed in. Thus the need/use is
not index related. Just the single method returning the symbolic names is
needed.

—
Reply to this email directly or view it on GitHub
#904 (comment).

Dictation in use. Please excuse homophones, malapropisms, and nonsense.

msteiger · 2016-03-24T17:33:15Z

Same problem here. I have ~~two~~ three alternative suggestions that would at least circumvent the issue that the user might be interested in any of the three names:

int Vocabulary.getTokenCount() - this would allow for index-based iterations and is a simple alternative to the deprecated method. This would restrict the Vocabulary token type values to the range [0..n] though.

int[] Vocabulary.getTokenIndices() - this would provide an arbitrary collection of ints, which would map to the range [0..n] for all existing cases, but supports other token type values, such as -1 as well.

List<Integer> Vocabulary.getTokenIndices() - similar, but more nice features such as immutability, toString and stream support. It could be backed by an AbstractList that computes (immutable) integer entries on the fly.

I would probably go for the last proposed suggestion as it the most flexible and elegant one. I can draft the required code changes and submit a pull request, if desired.

parrt · 2016-03-24T21:02:26Z

Seems like getTokenCount() would be enough; only EOF is -1 and we could specify that as a special case. Or maybe getMaxTokenType() as there is no requirement that all token types are used. getSymbolicNames() would also be dang convenient. Can methods be added to an interface and be binary compatible with code that currently refs the interface?

msteiger · 2016-03-25T09:45:05Z

Yes, they are compatible (I checked with a quick test). While an Iterable for symbolic names would be convenient at first, it might not be ideal as you cannot get the corresponding literal name from it. If iterating over names is a common use case, it might make sense though, imho.

Alternative approach: a triple that contains symbolic, literal and display name and an Iterable that can be acquired from Vocabulary. This would give all three corresponding names for each entry.

parrt added the type:improvement label Jun 20, 2015

msteiger mentioned this issue Mar 25, 2016

Add Vocabulary.getMaxTokenType() #1146

Merged

parrt added this to the 4.5.3 milestone Mar 29, 2016

parrt closed this as completed in #1146 Mar 29, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vocabulary interface missing an iterable or equivalent method #904

Vocabulary interface missing an iterable or equivalent method #904

grosenberg commented Jun 15, 2015

sharwell commented Jun 15, 2015

parrt commented Jun 20, 2015

sharwell commented Jun 20, 2015

grosenberg commented Jun 20, 2015

parrt commented Oct 12, 2015

msteiger commented Mar 24, 2016

parrt commented Mar 24, 2016

msteiger commented Mar 25, 2016

Vocabulary interface missing an iterable or equivalent method #904

Vocabulary interface missing an iterable or equivalent method #904

Comments

grosenberg commented Jun 15, 2015

sharwell commented Jun 15, 2015

parrt commented Jun 20, 2015

sharwell commented Jun 20, 2015

grosenberg commented Jun 20, 2015

parrt commented Oct 12, 2015

msteiger commented Mar 24, 2016

parrt commented Mar 24, 2016

msteiger commented Mar 25, 2016