BITFIELD and BITFIELD_RO feature #2107

slorello89 · 2022-04-19T14:37:14Z

Adding BITFIELD and BITFIELD_RO support as part of #2055.

Couple notes about this command

It's variadic in the allowable sub-commands, so I provided a single command for each sub-command, and then split out the sub-commands into their own structures and allowed you to arbitrarily create an array of sub-commands you want to execute.
The literal return-type of single sub-commands are an array of integers, however I abstracted that away and just have them yield a single 64 bit signed integer, which meant some updates to handle multi-bulk return-types in the single-value processor for long and long?
This is smart enough to switch between BITFIELD and BITFIELD_RO if the RO command is available and it determines that the all the commands are read-only. My question though, is there a better way to check for BITFIELD_RO availability, I just do a check against the version for 6.2.0 which is when BITFIELD_RO was added.
The BITFIELD command's encodings top out at i64/u63, so an i64 should be a suitable return type regardless of what the encoding type is.

NickCraver · 2022-04-19T20:36:58Z

src/StackExchange.Redis/APITypes/BitfieldSubCommand.cs

+/// <summary>
+/// Represents a Bitfield GET, which returns the number stored in the specified offset of a bitfield at the given encoding.
+/// </summary>
+public sealed class BitfieldGet : BitfieldSubCommand


I think the approach is a bit heavy here - we're doing a lot of class creation on what seems like should be structs local to the command generation itself and not public API surface area. I'm happy to yank this local and try other approaches - will try and play some tonight.

Interested to see what you come up with, this seemed liked best way to me. It's a weird command because of how variadic and structured it is. There's other commands where you just pass in arrays of literals and you can work everything out from that, but in this case you're really expecting things in a very specific order. Kind of tricky to design an API around, might be why it's been around for 6 years without making it into the library. I will tell you though I give a talk on bit-operations and the inability to use bitfield natively has been a real bummer 😆 so I have a bit of skin in this one.

My thinking is: these are Get*Message methods in RedisDatabase like others - they are just specific overloads. I don't know why we'd need classes on these (maybe should change the implementation on the other PR already in as well). This approach allocates another object to make the message as well as exposes them on the public API but as far as I can tell they don't need to be public. I think we can simplify these subcommands to some of those methods and remove the extra classes/allocations here overall.

Maybe for the individual commands yes, but the command itself is weirdly variadic (you can execute an arbitrary number of subcommand, it's almost like a script), so you need some structure to maintain them all in, which is the thinking behind having them all broken out into different classes with an abstract driver. The alternative would be to just expose the single-commands and just let folks batch/pipeline them, but it sort of breaks the command's API.

Well I poked at this for a bit and then git just royally screwed me with a UI change recently on main vs. current branch merging in my tool and lost everything so I'm hands off tonight. Basically I had BitfieldSubCommand -> IBitfieldSubCommand, internal members on that interface still, explicitly implemented on members, and all the API changes that entails. However, I'm not sure this is a good path or not - I don't like the amount of allocations we're doing for what is intended to be, on the Redis side, a very efficient/optimized op. Generally our audiences who want this functionality will also care about the cost (I know I would).

Overall, this one may wait, would like to talk it over with @mgravell on surface area and I'm supposed to be out this week. Given git giving me signs, gonna step away for now :)

NickCraver · 2022-04-27T12:08:51Z

@slorello89 Haven't forgotten about this, just also unlikely to get so much time-wise here before trip next week. Overall want I want to try is instead of an array feeding into the method, we can have each operation as a struct (internal, not exposed). Then using a builder model (actually on the API) we can build up the command to pass in. Ultimately having a linked list of structs, each with a ref to the next via internal interface or something (need to play - just spitballing).

Then all we're creating is a single object for the command - it could even be based on Message and control the writing internally. Given the audience for this almost certainly cares about performance I'd like to make it as efficient as reasonably possible but need a few hours to hack at that idea. If you wanna give it a stab by all means! Just didn't want this hanging without an update :)

slorello89 · 2022-04-27T12:30:26Z

Hey @NickCraver - no worries - I'm aware you were on break (I feel bad you were taking any time out at all to look at this stuff).

Just so I'm clear what you're looking for. For the variadic command, you want to convert the command classes -> structs, remove them from the public API, and use a builder to create a linked list of those structs under the hood and use the builder in the public API:

long?[] StringBitfield(RedisKey key, BitfieldCommandBuilder builder, CommandFlags flags = CommandFlags.None);

Couple other questions:

Are you ok with the non-variadic commands? Internally they would be converted to use the structs rather than the classes
Are you ok with the Signedness & BitfieldOverflowHandling enums?

NickCraver · 2022-04-27T16:44:09Z

Yeah that looks roughly what I'm guessing at - I think the enums themselves are fine (given they'll be args somewhere either way) - can revisit naming in the end, don't need to get it perfect to figure out the API surface here!

slorello89 · 2022-04-28T01:39:33Z

@NickCraver - what do you think of this?

I blasted out the classes/structs for the sub-commands and let the builder stand on its own. There didn't seem that much sense to have the individual structures broken out with the builder in place, the non-variadic ones could have used them I suppose but then that's an awful lot of repeated code and it seemed a bit redundant to have the builder initialize a bunch of structs when it could just maintain the values independently.

It maintains a LinkedList of the values that you're going to pass into Redis and the Build message builds the command message.

Inside the write for the message, it IS using a foreach, so it's incurring a bit of extra cost to allocate the enumerator, perhaps the better approach is to just use the CommandKeyValuesMessage and the LinkedList of arguments to an array when initializing the message (those writes are being done in one of the more performance-sensitive areas right?)

NickCraver · 2022-05-16T02:41:04Z

Playing with the builder in https://github.com/StackExchange/StackExchange.Redis/tree/craver/bitfield-exp - I think after trying this out having all the newed up struct is odd and I think we can mostly make them an implementation detail instead. Will poke more tomorrow I hope!

shacharPash · 2022-12-12T13:31:45Z

@NickCraver
So what is actually missing here?
I would be happy to complete what is missing so that this PR can be merged

NickCraver · 2022-12-13T15:33:43Z

@slorello89 I don't think there's something missing so much as the current API is extremely heavy and we never got back to designing what this should look like in .NET. I don't have an ETA on getting back to it currently.

mgravell

has potential - lots of thoughts; sorry this fell into a hole

mgravell · 2023-11-27T19:20:51Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+/// </summary>
+public class BitfieldCommandBuilder
+{
+    private readonly LinkedList<RedisValue> _args = new LinkedList<RedisValue>();


Linked list is usually suboptimal; honestly I think we should default to List-T here

Been a little while since I put this together so my recollection might be a bit off - IIRC my thinking was:

We don't know the size of the array ahead of time - hence we can't really initialize a List without knowing it won't need to be resized.

LinkedLists allow O(1) tail insertion

We only need to enumerate when sending it, hence the O(N) scan is what you'd expect anyway.

Though now that I think of it, I suppose you are performing allocation each time you perform the addlast 🤔

frankly, linked-list just barely gets used - I'd sooner use a List<T>, but again I wonder whether this is actually a List<SomeUnionStructThatIsGetSetAndIncrby>, so each element in the list is not an argument but a logical operation - this might also make it much easier to do very efficient single-shot operations, which I expect to be relatively common. Let me have a think here - I like the ideas in this PR, but I think we can iterate the API a little.

mgravell · 2023-11-27T19:24:14Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+    /// <param name="offset">The offset into the bitfield to increment.</param>
+    /// <param name="increment">The value to increment by.</param>
+    /// <param name="overflowHandling">How overflows will be handled when incrementing.</param>
+    public BitfieldCommandBuilder Incrby(BitfieldEncoding encoding, BitfieldOffset offset, long increment, BitfieldOverflowHandling overflowHandling = BitfieldOverflowHandling.Wrap)


I wonder if this should be Increment for consistency with StringIncrement

Makes sense.

mgravell · 2023-11-27T19:25:14Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+    /// <param name="offset">The offset into the bitfield for the subcommand.</param>
+    public BitfieldCommandBuilder Get(BitfieldEncoding encoding, BitfieldOffset offset)
+    {
+        _eligibleForReadOnly = true;


surely we can only ever set this to false, with it starting true? if we Get(...).Set(...).Get(...) we're not eligible for readonly

yep, definitely.

mgravell · 2023-11-27T19:27:25Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+
+internal class BitfieldCommandMessage : Message
+{
+    private readonly LinkedList<RedisValue> _args;


ditto List, although I think if possible we should skip an alloc here and go straight to an array; for now I'd settle for List-T, though

mgravell · 2023-11-27T19:30:31Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+/// <summary>
+/// The encoding that a sub-command should use. This is either a signed or unsigned integer of a specified length.
+/// </summary>
+public readonly struct BitfieldEncoding


should have full struct equality impl - IEquatable, override GetHashCode, override Equals (via => obj is BitFieldEncoding other && Equals(other);

mgravell · 2023-11-27T19:34:43Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+/// An offset into a bitfield. This is either a literal offset (number of bits from the beginning of the bitfield) or an
+/// encoding based offset, based off the encoding of the sub-command.
+/// </summary>
+public readonly struct BitfieldOffset


ditto struct equality

mgravell · 2023-11-27T19:35:58Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+    /// <summary>
+    /// Returns the BitfieldOffset as a RedisValue.
+    /// </summary>
+    internal RedisValue RedisValue => $"{(ByEncoding ? "#" : string.Empty)}{Offset}";


hmmm; can't cache this one - I wonder if RedisValue is hampering us here, and we should be using a custom internal struct with a custom writer.... meh, leave it like this for now, we can change that later - let's just get it correct for now

mgravell · 2023-11-27T19:42:23Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+    /// </summary>
+    /// <param name="byEncoding">Whether or not the BitfieldOffset will work off of the sub-commands integer encoding.</param>
+    /// <param name="offset">The number of either bits or encoded integers to offset into the bitfield.</param>
+    public BitfieldOffset(bool byEncoding, long offset)


I wonder if this should be (long offset, long byEncoding = true), perhaps even with an implicit conversion operator that does the same

mgravell · 2023-11-27T19:42:38Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+    public BitfieldOffset(bool byEncoding, long offset)
+    {
+        ByEncoding = byEncoding;
+        Offset = offset;


Does this have to be non-negative?

mgravell · 2023-11-27T19:44:41Z

src/StackExchange.Redis/APITypes/BitfieldCommandBuilder.cs

+    /// <summary>
+    /// Returns the BitfieldOffset as a RedisValue.
+    /// </summary>
+    internal RedisValue RedisValue => $"{(ByEncoding ? "#" : string.Empty)}{Offset}";


note: this is unnecessarily allocatey in the raw offset case; should be IMO: => ByEncoding ? new($"#{Offset}") : new(Offset);

mgravell · 2023-11-27T19:56:01Z

added thoughts, but overall I agree that the API looks weird with the custom structs; I wonder if we can hide those and just take more args on the methods; @NickCraver thoughts on how we resurrect this?

slorello89 · 2023-11-27T21:40:25Z

@mgravell - merged main and pushed up.

With structs vs extra args, I guess my main thought was it would be a fair number of args - SET would go from 3->5, INCRBY would go from 3/4->5/6, GET would go from 2->4. So it's right around the threshold where I'd consider introducing new structures. But to your and Nick's point, it is a fair amount of allocation for something that is meant to be quite lightweight.

So think just nix those structures, move to LinkedList->List, some minor name changes?

mgravell · 2023-11-28T09:07:16Z

I want to play with an alternative API before we get too carried away. This feels pretty complex at the moment. What I have in mind is something like:

public readonly struct BitfieldOperation
{
  long offset, value
  SomeInternalEnum : byte opType
  byte encoding // low 6 == width
  // high 2 == by bit/enc, signed/unsigned
  public static BitfieldOperation Get(long offset,
      byte width, bool unsigned = false, offsetByBit = false)
    public static BitfieldOperation Set(long offset,
      byte width, bool unsigned = false, offsetByBit = false)
   public static BitfieldOperation Increment(long offset,
      byte width, BitfieldOverflow overflow = /* Todo */, bool unsigned = false, offsetByBit = false)

  // Other bits not shown
}

With the main bitfield methods taking either a single BitfieldOperation or an array.

Internally, we can check the BitfieldOperation if the operand(s) to compute RO. No need for the builder, list, or packs of RedisValue - we should also be able to have each operation say "I contribute this many args", so we can do efficient write

But more importantly, it allows simple usage at the call site, i.e.

dB.Bitfield(key, [ BitfieldOperation.Get(...), BitfieldOperation.Set(...) ])

Thoughts?

slorello89 · 2023-11-29T14:40:39Z

Seems pretty sensible, I think my only thing would be to initialize the offset/encoding as their proper RedisValues in Get Set Increment (can toss validation errors there) WDYT?

mgravell · 2023-11-29T18:03:05Z

I think my only thing would be to initialize the offset/encoding as their proper RedisValues

It seems to me likely that we can avoid that entirely - ultimately we don't need a RedisValue here, IMO

slorello89 · 2023-11-29T18:17:03Z

@mgravell - Isn't it going to be implicitly cast to a bulk string when it's written to the socket anyway?

mgravell · 2023-12-01T14:59:06Z

@slorello89 yes and no; we have power to do anything we like there - we could, for example, stackalloc a small chunk, write the prefix+value to that, and write that combination to the output, zero allocs. Or possibly zero copy. It doesn't need to be a string, necessarily

slorello89 · 2023-12-01T15:02:47Z

@mgravell - I updated the API per your comments. An additional thought:

I find these signatures a bit weird: BitfieldOperation Set(long offset, byte width, long value, bool offsetByBit = true, bool unsigned = false) - because it disconnects the offset/encoding parts of the command. A more 'sensible' one to me would be BitfieldOperation Set(long offset, bool offsetByBit, byte width, bool unsigned, long value) - I know in this case they'd no longer be optional parameters, but should they be? Idk just a thought.

slorello89 · 2023-12-01T15:18:31Z

@mgravell that's fair - but is there anywhere currently that's being done (in the context of writing out to the pipeline?)

NickCraver · 2023-12-08T19:44:54Z

@slorello89 It took me a while to get over the array allocations on this for a lightweight API until I realized: "users are probably calling it the same way over and over"...in which case with proper usage that shouldn't be a problem, we just need examples of where they're re-using the same array passed in every time. It's really hard to judge the optional params because there's zero usage of this today to go by.

Is there any chance you have examples from other projects, that are calling this API set and what they're usually doing? I'd argue: if they're used most of the time, we can just not make them optional at all (we can even intelligently pack the struct better).

This and other things has made me realize just how much we're lacking a samples section (keeps coming up with logging and event handlers), that would be equally helpful in evaluating PR/API additions too. Not asking for that here, just saying: top of mind and likely a good prototype approach once that's in place.

slorello89 · 2023-12-11T13:22:44Z

@NickCraver - looking across the client ecosystem it looks like there are a couple of approaches to this API:

Just send a bunch of strings and figure it out yourself e.g. jedis, go-redis
Builder pattern with arbitrary arguments. e.g. redis-py
Strongly-typed builder pattern e.g. lettuce or redisson

I can't say I've seen many usages of this command in the wild, and I've only ever heard a few people ask for the command in .NET (I've pointed them to the ad-hoc API) - but I suspect people tend to lean on the builder patterns where available.

Examples are always nice, we are working on getting examples for us to use in code snippets in redis.io across the language communities (the issue that got the ball rolling on this again is part of that effort). See an example of these snippets here

slorello89 added 6 commits April 18, 2022 15:43

WIP

68d6d79

Bitfield feature

1695639

Merge branch 'main' into feature/bitfield

125885a

accurate reporting of num args for incr

5660713

removing unused parsing methods

bdb9128

some formatting comment updates

7554d6d

slorello89 requested a review from NickCraver April 19, 2022 14:37

slorello89 added 3 commits April 19, 2022 12:36

merge

5daca75

moving enums

c0fe65d

using new remarks pattern nick introduced

3928113

slorello89 mentioned this pull request Apr 19, 2022

Missing Commands in StackExchange.Redis #2055

Open

NickCraver reviewed Apr 19, 2022

View reviewed changes

properly checking feature flag

b3c16c4

decreasing public-api surface area, using a builder.

d3cc041

NickCraver self-assigned this May 10, 2022

NickCraver added 4 commits May 15, 2022 21:07

Merge remote-tracking branch 'origin/main' into feature/bitfield

95eb393

Fix spacing

9dae591

Simplify a bit, update docs

7c6f62f

Save all the files dammit

5f07290

slorello89 mentioned this pull request Nov 27, 2023

Create C# code snippets for bitfields #2607

Closed

mgravell reviewed Nov 27, 2023

View reviewed changes

Merge branch 'main' into feature/bitfield

d87f66b

api updates per marcs comments

1392d20

encoding -> string

a00aba5

BITFIELD and BITFIELD_RO feature #2107

Are you sure you want to change the base?

BITFIELD and BITFIELD_RO feature #2107

Conversation

slorello89 commented Apr 19, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

NickCraver commented Apr 27, 2022

slorello89 commented Apr 27, 2022

NickCraver commented Apr 27, 2022

slorello89 commented Apr 28, 2022

NickCraver commented May 16, 2022

shacharPash commented Dec 12, 2022

NickCraver commented Dec 13, 2022

mgravell left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mgravell commented Nov 27, 2023

slorello89 commented Nov 27, 2023

mgravell commented Nov 28, 2023 • edited

slorello89 commented Nov 29, 2023

mgravell commented Nov 29, 2023

slorello89 commented Nov 29, 2023

mgravell commented Dec 1, 2023

slorello89 commented Dec 1, 2023

slorello89 commented Dec 1, 2023

NickCraver commented Dec 8, 2023

slorello89 commented Dec 11, 2023

slorello89 commented Apr 19, 2022 •

edited

mgravell commented Nov 28, 2023 •

edited