[minor] Bit-index support (subword assignment) #26

zyedidia · 2022-07-12T02:44:46Z

This is a proposal for adding subword assignment support to FIRRTL. The full proposal can be viewed here: subword.pdf (EDIT: now out-of-date).

As a summary, this change would allow indexing expressions of type UInt or SInt to assign specific bits. This proposal only covers single-bit subword assignment at a static index. The "bit-index" expression must be used as a sink. For example, this would allow:

input x : UInt<4>
input y : UInt<1>
output z : UInt<4>

z <= x
z[0] <= y
z[1] <= bits(z, 0, 0)

The proposal gives an algorithm for transforming subword assignment to existing FIRRTL by rewriting using vectors (UInt<1>[n]). I have a prototype CIRCT implementation at https://github.com/zyedidia/circt/tree/subword-assignment that performs the transformation in the proposal.

Let me know what you think!

(I also included a small fix for the makefile since typing make on a fresh clone didn't work).

Makefile

darthscsi

Although not unique to this proposal, the assignment form doesn't compose well on the lhs. E.g., how do I set 1 bit of one element of a register of type vector of uints?

Second question is how does this participate in combinatorial cycle detection?

Third question is how does this work with width inference?

Finally, should this be a thing? There is certainly value in expressivity, but in an IR intended for transformation, sub-word updates of state are rather annoying to deal with. They may make sense in the output, but can that just be an peephole optimization? Internally standard compiler approach is to treat state in an SSA form with a single write, so this would be a RMW internally until code generation at which point it would be pattern-matched into a subword update (similarly to vector element updates or single field updates).

spec.md

ekiwi · 2022-07-19T14:42:28Z

Finally, should this be a thing? There is certainly value in expressivity, but in an IR intended for transformation, sub-word updates of state are rather annoying to deal with

Subword assignments would be a neat feature for the frontend, mostly because Verilog developers have internalized a certain coding style that can heavily rely on these kinds of assignments. One good example is the TinyAES core which was rather awkward to translate to Chisel.
I do agree that internally this should be converted into SSA. My one attempt to add subword assignments to firrtl, essentially tried to remove them very early on in the compilation flow by minimally splitting signals that are subword assigned.

zyedidia · 2022-07-21T00:00:48Z

Thanks for the feedback! Sorry for the wall of text, I've tried to respond to some of the concerns appropriately.

How does this participate in width inference?

That is a great question, and the proposal should be expanded with a dicussion of that. I think these are the options:

No participation: if you perform a bit-index on an integer with an unspecified width, you must also fully assign to it somewhere else to cause the width to be inferred.
No participation: the bit-index can only be used on integers where the width is specified.
It only participates in width inference when used as an L-value (currently as proposed the bit-index can only exist as an L-value) and an integer with an unspecified width is inferred to be at least i bits wide if it is used as x[i]. If the width is inferred somewhere else to be a value that is incompatible (a width less than i), then an error is raised.
It participates in width inference when used as both an L-value and an R-value in the same way as option 3 (assuming we add support for this being used as an R-value).

Options 1 and 2 are the simplest. Option 3 is perhaps more consistent with the rest of the spec because the bit-index information is used to infer the width. Option 4 is similar to 3 but relies on some other changes, and I'd like to note that currently the bits primop does not participate in width inference on its operand and changing that would be a major change. I'm not sure which one should be chosen, and this is definitely something to discuss.

How does this participate with combinatorial cycle detection?

There is a section in the proposal on combinational loops. The current answer is that combinational loops are allowed if the RHS only depends on bits that are distinct from the bit being assigned. Any operation that uses x depends on all bits of x, except for the bits operation, which only depends on the extracted bits of x (hi to lo). This allows writing certain "loops" (though the rule implies that they end up not being loops at the bit-level) that use the bits primop to extract distinct bits, like those shown in some of the examples in the proposal (e.g., example 6, which is from the Chisel issue tracker).

The FIRRTL spec currently has no wording on combinational loops (that I could find), so I am not sure if this information should go in the spec or if it is up to the implementation.

How do I set 1 bit of one element of a register of type vector of uints?

I think this composes fine on the LHS, although please let me know if there's a mistake I'm missing. For example:

reg r : UInt<4>[2], clock
r[0][1] <= ...

would set bit 1 of element 0. This is allowed by the "reference" production from the FIRRTL language definition.

Questions regarding the syntax, and assigning one bit vs multiple bits:

I think these are good questions with several possible solutions. There is a short discussion of this in the "Multi-bit subword assignment" section in the proposal. I think the possible solutions exist on a spectrum in a tradeoff between having inconsistencies and making large changes to the spec. The current proposal does have inconsistency between the bit-index and the bits primop (the bit-index can only be used on the LHS and only extracts one bit, while the bits primop can only be used on the RHS and extracts a range), but makes a relatively small change to the spec and leaves the way open for future larger enhancements that would bring more consistency (e.g., having a unified slicing operator for integers and vectors).

For example, one alternative is to use bits(x, 1, 0) <= ... syntax instead. This would make the bits primop the only primop that can be used on the LHS. This would also require changing the FIRRTL language definition (specifically the reference production), while the current syntax does not require a change to that grammar. In addition, there is perhaps a long-term goal of creating a general and unified slice operator (using bracket syntax such as x[hi:lo]) for both integers and vectors. Using bits on the LHS does not move towards that, while the current proposal does make a small move in that direction (because in that case x[i] would be allowed as an L-value on integers as well).

There are some other alternatives briefly listed in the full proposal, and a short list of the additional questions they raise.

Perhaps the bit-index should also be allowed on the RHS. I am not sure. But if that is not added in this proposal, it can be added in a future change if desired.

Overall, maybe the best thing would be to introduce [hi:lo]/[i] syntax for integers (and perhaps the slicing for vectors too?) and remove the bits primop. This proposal is a forwards-compatible step in that direction but doesn't go all the way.

spec.md

ekiwi · 2022-07-25T20:46:33Z

spec.md

+1>`.{firrtl} (even if `x` is an `SInt`) and the type of `x[i]`.{firrtl} is
+`UInt<1>`.{firrtl}.
+
+The bit-index can be used as a sink or source. When used as a source,


Have you considered restricting your proposal to L-value bit slices?
All R-value uses seem to already be covered by bits and thus it might help to focus the proposal on sub-word assignments.

It's true that if it isn't restricted to L-value slices there is redundancy with the bits primop, but I think the intention is to move to a new syntax that is consistent for both L and R-values, and phase out the bits primop in the future. This also has the benefit of making the Chisel emission simpler, since it doesn't have to emit different syntax based on whether the index is an L-value or an R-value.

But I think the intention is to move to a new syntax that is consistent for both L and R-values, and phase out the bits primop in the future.

I don't think that is necessary. As I said, firrtl is an IR, so it can be more explicit about some things than a user-facing language. I am very much opposed to introducing duplicate functionality since it will make the compiler more complicated for little to no gain.

I am very much opposed to introducing duplicate functionality since it will make the compiler more complicated for little to no gain.

To clarify: the plan is to remove bits for this exact reason and replace it with bit-index.

Is the concern about atomicity of updates to the spec? I was planning to just remove bits in a separate PR in favor of bit index.

To clarify: the plan is to remove bits for this exact reason and replace it with bit-index.

Why not keep bits? As @darthscsi pointed out above, bits is easier to parse and does not conflict with sub-access. Otherwise to distinguish sub-access and bit-index, one would have to know the type of the expression.

Is the concern about atomicity of updates to the spec? I was planning to just remove bits in a separate PR in favor of bit index.

Yes. I believe that before merging into main, the complete proposal should be considered.

In addition to that, removing bits will make for a back-wards incompatible change and I do not see a good reason for this, when we could just stick with the old syntax and either extend it to l-values or come up with an alternative syntax specifically for l-values. The handling of l-value and r-value bit-index is quite different anyways.

Why not keep bits? As @darthscsi pointed out above, bits is easier to parse and does not conflict with sub-access. Otherwise to distinguish sub-access and bit-index, one would have to know the type of the expression.

It seems clean to have a single unified op for extraction.

FIRRTL really screwed up here with not adding type information to each operation. Any sane parser is going to track the types of references and uses that to build up its internal FIRRTL IR (which necessarily must include type information). Hence, I'm not super concerned about this. I do admit that this means foo[a] <= bar is ambiguous without extra context. However, foo[a] <= bar : uint<8>, uint<1> is not and would be a great direction to that the FIRRTL textual format.

Yes. I believe that before merging into main, the complete proposal should be considered.

In addition to that, removing bits will make for a back-wards incompatible change and I do not see a good reason for this, when we could just stick with the old syntax and either extend it to l-values or come up with an alternative syntax specifically for l-values. The handling of l-value and r-value bit-index is quite different anyways.

I'm fine to just remove bits entirely here then.

I don't know if backwards compatibility should be a goal here. We're attempting to make it easy for FIRRTL compilers to check if they support a given FIRRTL text via: #30. I guess my concern is that it seems weird to try to be backwards compatible for bits on the RHS of a connect when the fundamental change is to extend the spec in an entirely backwards incompatible way. Or: SFC will be "incompatible" with FIRRTL 2.0.0+ after this change even though it will work for Chisel designs where a user doesn't use bit index.

Or: SFC will be "incompatible" with FIRRTL 2.0.0+ after this change even though it will work for Chisel designs where a user doesn't use bit index.

My main point is that it will be harder to support older FIRRTL versions with the same compiler if we make this change. If we only add a new operation, then we trivially support older FIRRTL versions. However, if we replace bits with [..], then we will still have to keep around the old code to support older FIRRTL versions. Not an insurmountable problem for sure. But there also does not seem to be a real upside to switching from bits to [...].

spec.md

zyedidia · 2022-08-05T00:29:22Z

There is now a PR in CIRCT that implements this via read-modify-write during the expand-whens pass: llvm/circt#3658.

seldridge · 2022-08-05T20:04:50Z

Logging approval from @azidar via offline discussion.

ekiwi · 2022-08-23T12:17:30Z

Any chance that we focus this change only on l-value bit-indices (aka subword assignments)?
Why try to fix what is not broken and create extra work by requiring parsers to track types in order to disambiguate between SubAccess ([...]) and Bits (after this proposal also [...])? Since modules might be declared after they are instantiated, knowing all types would require a two pass parser, complicating things quite a bit.

seldridge · 2022-08-23T16:58:29Z

Why try to fix what is not broken and create extra work by requiring parsers to track types in order to disambiguate between SubAccess ([...]) and Bits (after this proposal also [...])?

A parser already needs to track types assuming the parser only wants to build valid IR. 😉 The SFC parser has the appearance of not tracking types, but only because it splits parsing into parsing + InferKinds/CheckKinds + InferTypes/CheckTypes.

Since modules might be declared after they are instantiated, knowing all types would require a two pass parser, complicating things quite a bit.

We already have this problem assuming that we want to reject invalid IR in the parser. However, it's not that bad as it just means a parser need to (1) parse module definitions and (2) parse modules bodies. This is the natural split that arises when building a fast, parallel parser where each module body is parsed in parallel.

ekiwi · 2022-08-23T17:29:00Z

A parser already needs to track types assuming the parser only wants to build valid IR.

There is a difference between a parser and a type checker. Parsing is normally context-free whereas type checking does need a context.

We already have this problem assuming that we want to reject invalid IR in the parser.

Again, "type checker" =/= "parser"

ekiwi · 2022-08-23T18:21:56Z

But anyways, arguing about what a parser should and should not do isn't very helpful.
My argument is that continuing to use bits(..., ..., ...) for R-value bit extraction would be the simplest solution with no important downside that I can see. Otherwise this PR needs to change the bits section in "Expressions" to the new syntax and we need to patch all firrtl serializers. Parsers will have to be able to deal with both legacy and new syntax in order to remain backwards-compatible and the original firrtl compiler would need a more complicated parser. The same would be true for essentially any non-handcrafted parser. So anyone using YACC or antlr or similar would have to hack around the ambiguity around bit-index and SubAccess.

zyedidia · 2022-08-23T20:04:40Z

I don't think the [x:y] syntax is ambiguous at the context-free language level -- there aren't multiple possible derivation trees when purely parsing (no types). For type checking, the type checker already needs to be able to look up the type of the value being indexed for vector indexing, so I don't see how this is different.

More generally, I think these are the choices regarding syntax and their downsides:

Use bits() for the LHS and RHS
- Downsides: syntax is inconsistent with vector indexing; the bits() syntax is specifically intended to convey that it can only be used as an R-value (it has a "function call"-like syntax); this involves changing the FIRRTL grammar to allow bits() on the LHS, which is a bit cumbersome: do we remove bits from the primop non-terminal and make it a reference, even though its syntax is like a primop and not like a reference?
Use [x:y] for the LHS and RHS
- Downsides: not backwards compatible, but this can be done for the next major version release of FIRRTL.
Use bits() only for the RHS and [x:y] only for the LHS
- Downsides: inconsistent syntax between the two bits operators; more cumbersome to restrict in the grammar (needs a new non-terminal that can only be used on the LHS, since reference can currently be used on both RHS and LHS).
Allow bits() or [x:y] for the RHS and [x:y] only for the LHS (current proposal)
- Downsides: duplicate syntax for the RHS.
Allow bits() or [x:y] for both the LHS and RHS (current implementation in CIRCT)
- Downsides: duplicate syntax for the RHS and LHS; requires more changes to the FIRRTL grammar to allow bits() on the LHS (move bits to reference instead of primop?). Because of how the CIRCT parser works this is simpler to implement in CIRCT, and accepts a superset of the programs allowed by 4. Removing bits() in the future would resolve this mismatch.

All of these changes require at least a minor version increase, and only 2 requires a major version since it disallows bits() on the RHS. A big reason to have versions is to be able to improve FIRRTL more aggressively without worrying as much about older FIRRTL compilers.

ekiwi · 2022-08-23T20:13:49Z

Thanks for presenting this list. I generally prefer option 1 because it is the easiest to parse, but I can see now why you might prefer a different option. One thing to consider is that there are other primops that we might want to consider allowing as L-values in the future: tail, head and cat. (tail and head just being special cases of bits and cat being something that is allowed in Verilog.) Thus the concept of distinguishing between R-value only and L/R-value primops might make some sense.

ekiwi · 2022-08-23T20:21:29Z

This proposal has been pending for a long time. How about we all talk this through at the next Chisel meeting on Monday August 29th and make sure we get to a place where we can merge this into the spec on that day?

ekiwi · 2022-08-24T15:15:45Z

I started implementing this proposal for the firrtl compiler. One thing that needs to be clarified is how bit-indices interact with DontCare. I.e., describe how you are allowed to do something like x[0] is invalid.

zyedidia · 2022-08-24T17:34:47Z

I think it's fine to say that it will mark the particular bits in the integer as invalid. I think this is essentially the same as assigning a special constant with a bitindex, so I'm not sure it really needs any specific changes. I can add some clarifying wording to the invalidates section though.

ekiwi · 2022-08-24T17:36:09Z

I can add some clarifying wording to the invalidates section though.

I think that is a good idea. Just clarifying that bit-indices are allowed in a is invalid statement.

mwachs5

just adding a blocking request-changes to ensure we update the revision history and the title of this PR to correctly reflect the type of change this is (I believe minor as it is a feature-add?)

ekiwi · 2022-08-25T18:06:42Z

Here is my prototype implementation of this spec change for the firrtl compiler: chipsalliance/firrtl#2545

These are my tests, please let me know if any of them disagree with your intention behind the spec: https://github.com/ekiwi/firrtl/blob/sub-word-assign-2/src/test/scala/firrtlTests/SubWordAssignmentTests.scala

zyedidia · 2022-09-01T19:39:13Z

I have updated the revision history and marked this as a minor version change. I think if everyone is in agreement this is ready to go. I think once this is merged, llvm/circt#3658 will be ready to merge, and I will open a draft PR for Chisel that uses the IR nodes from Kevin's FIRRTL implementation to provide a Chisel API for this. Thanks everyone!

revision-history.yaml

jackkoenig

One minor suggestion but otherwise I think this is ready to go!

darthscsi · 2022-09-14T21:24:15Z

spec.md

@@ -990,6 +990,9 @@ sub-element in the vector.
 Invalidating a component with a bundle type recursively invalidates each
 sub-element in the bundle.

+Invalidating a particular subset of bits in an integer is possible by


This is starting to sound like invalid is being used for don't care.
For normal wires/regs/outputs, how does invalidating a subset of bits get you anything that starting with an entire invalid value which is later partially written not get you?

darthscsi · 2022-09-14T21:25:49Z

spec.md

+
+A value that is bit-indexed must be fully initialized at the bit-level. There
+must be a valid assignment accounting for every bit in the value. Registers are
+implicitly initialized with their current contents.


This is redundant with the section on registers.

darthscsi · 2022-09-14T21:26:50Z

spec.md

+Bit-indexing does not participate in width inference (see
+[@sec:width-inference]), and if a bit-index is applied to a value with an
+unspecified width, that value must have another use that allows its width to be
+inferred. Otherwise this causes an error.


Why not say that a bit-index forces the width to be at least sufficient to the slice bounds?

I think that is definitely desirable, but I think the motivation for not participating in width inference was so that the the bit-index would be interchangeable with the bits primop (when used as an R-value), and then if/when bits gets removed the width inference behavior you describe could be added.

If we want to add width inference support as part of this proposal, then perhaps this proposal should also add width inference for the bits primop.

I think either approach is reasonable.

darthscsi

I would suggest not allowing invalidate of bit slices. It adds complexity without adding anything useful, I think.

I'm not sure why you wouldn't let bit slices participate in width inference.

seldridge reviewed Jul 13, 2022

View reviewed changes

Makefile Outdated Show resolved Hide resolved

zyedidia mentioned this pull request Jul 13, 2022

[FIRRTL] Add bit-index support (subword assignment) llvm/circt#3525

Closed

darthscsi reviewed Jul 19, 2022

View reviewed changes

spec.md Show resolved Hide resolved

spec.md Outdated Show resolved Hide resolved

spec.md Outdated Show resolved Hide resolved

spec.md Outdated Show resolved Hide resolved

ekiwi reviewed Jul 25, 2022

View reviewed changes

spec.md Outdated Show resolved Hide resolved

ekiwi reviewed Jul 25, 2022

View reviewed changes

spec.md Show resolved Hide resolved

mwachs5 requested changes Aug 24, 2022

View reviewed changes

zyedidia added 8 commits September 1, 2022 10:53

Add subword assignment to spec (bit-index expression)

51a68cf

Highlight inline code

86bcca2

Update wording on connection semantics

b2dff2e

Update spec to include bit slicing

30c6c30

Note on memory ports

c6e26cd

Add ref in width section

2e26a19

Note on initialization

65ebef3

Minor update to wording

267e1df

zyedidia added 2 commits September 1, 2022 10:53

Mention bit-index in invalidate section

2872b62

Add additional clarification about UInt vs SInt

eb59122

zyedidia force-pushed the subword-assignment branch from c06edb4 to eb59122 Compare September 1, 2022 17:53

Update revision history

c72b34d

zyedidia changed the title ~~Bit-index support (subword assignment)~~ [minor] Bit-index support (subword assignment) Sep 1, 2022

jackkoenig reviewed Sep 13, 2022

View reviewed changes

revision-history.yaml Outdated Show resolved Hide resolved

jackkoenig approved these changes Sep 13, 2022

View reviewed changes

Separate bullet point

8e077c7

mwachs5 approved these changes Sep 14, 2022

View reviewed changes

darthscsi reviewed Sep 14, 2022

View reviewed changes

darthscsi approved these changes Sep 14, 2022

View reviewed changes

mwachs5 mentioned this pull request Dec 23, 2022

When to use <- or <= #52

Closed

zyedidia mentioned this pull request Apr 28, 2023

[FIRRTL] Subword assignment support via rewriting to read-modify-write llvm/circt#3658

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[minor] Bit-index support (subword assignment) #26

[minor] Bit-index support (subword assignment) #26

zyedidia commented Jul 12, 2022 •

edited

Loading

darthscsi left a comment

ekiwi commented Jul 19, 2022

zyedidia commented Jul 21, 2022

ekiwi Jul 25, 2022

zyedidia Aug 5, 2022

ekiwi Aug 5, 2022

seldridge Aug 5, 2022

ekiwi Aug 5, 2022

seldridge Aug 5, 2022

ekiwi Aug 5, 2022

zyedidia commented Aug 5, 2022

seldridge commented Aug 5, 2022

ekiwi commented Aug 23, 2022 •

edited

Loading

seldridge commented Aug 23, 2022

ekiwi commented Aug 23, 2022

ekiwi commented Aug 23, 2022

zyedidia commented Aug 23, 2022

ekiwi commented Aug 23, 2022 •

edited

Loading

ekiwi commented Aug 23, 2022

ekiwi commented Aug 24, 2022

zyedidia commented Aug 24, 2022

ekiwi commented Aug 24, 2022

mwachs5 left a comment

ekiwi commented Aug 25, 2022

zyedidia commented Sep 1, 2022

jackkoenig left a comment

darthscsi Sep 14, 2022

darthscsi Sep 14, 2022

darthscsi Sep 14, 2022

zyedidia Sep 16, 2022

darthscsi left a comment

[minor] Bit-index support (subword assignment) #26

Are you sure you want to change the base?

[minor] Bit-index support (subword assignment) #26

Conversation

zyedidia commented Jul 12, 2022 • edited Loading

darthscsi left a comment

Choose a reason for hiding this comment

ekiwi commented Jul 19, 2022

zyedidia commented Jul 21, 2022

How does this participate in width inference?

How does this participate with combinatorial cycle detection?

How do I set 1 bit of one element of a register of type vector of uints?

Questions regarding the syntax, and assigning one bit vs multiple bits:

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zyedidia commented Aug 5, 2022

seldridge commented Aug 5, 2022

ekiwi commented Aug 23, 2022 • edited Loading

seldridge commented Aug 23, 2022

ekiwi commented Aug 23, 2022

ekiwi commented Aug 23, 2022

zyedidia commented Aug 23, 2022

ekiwi commented Aug 23, 2022 • edited Loading

ekiwi commented Aug 23, 2022

ekiwi commented Aug 24, 2022

zyedidia commented Aug 24, 2022

ekiwi commented Aug 24, 2022

mwachs5 left a comment

Choose a reason for hiding this comment

ekiwi commented Aug 25, 2022

zyedidia commented Sep 1, 2022

jackkoenig left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

darthscsi left a comment

Choose a reason for hiding this comment

zyedidia commented Jul 12, 2022 •

edited

Loading

ekiwi commented Aug 23, 2022 •

edited

Loading

ekiwi commented Aug 23, 2022 •

edited

Loading