Managing SignalR ConnectionIds (or why you shouldn't)

This is my entry for C# Advent 2020! Thank you to Matthew Groves for inviting me to participate.

Over the past couple of weeks, I've been actively working on a new course to teach ASP.NET Core developers on how to effectively use SignalR within their applications.

One of the commons issues I see when reading questions on StackOverflow or Reddit is that many SignalR users do not know how to effectively manage connections to their hubs. Or worse, they make mistakes that'll cause performance problems in the long run.

Overview: Connections

Let's level the playing field a bit before diving into recommendations. What is a connection? A

connection

is the reference from a client to a server. In many cases, this is an open WebSocket that is used by either the client or server to send messages.

But there are also cases where a client will default to Long Polling to communicate. This isn't one CONNECTION, it's several because each connection has to reconnect, disconnect, and repeat. That's primarily why I'll call a connection a reference, because, in each socket connection to the server, the client will say "I'm connection ID XYZABC" to identify.

SignalR does a lot of work underneath the covers to bind this connection reference to internal data, such as user principles if you're using Identity.

The important thing to note with connections, and more importantly connection IDs, is that they are temporarily identifiers for a connection. If you have multiple tabs or windows open, each of these page instances will have its unique identifier. Heck, if you refresh - the ID will change.

Back in 2012, as I was learning more and more about SignalR, I wrote a post on how to persist ConnectionIds across page instances. Turns out - that was horrible advice and you shouldn't do. However, overriding connection ID generation is a tactic you could use - as long as connections always get a unique ID. That's a future post.

Do not track ConnectionIds

I've got myself into a discussion with folks about this before, but I do not advise tracking ConnectionIds. More than once, I've seen this particular workflow:

On connection: map connection ID, user name, roles, etc within the database or some caching system (Redis, for example).

On disconnection: removing the mapping from the data store.

Use case: I need to send a notification to all connections for user "Kevin".

var connectionIds = cachingLayer.GetConnectionsForUser("Kevin");
await hubContext.Clients.Clients(connectionIds).SendAsync("notify", "Hello World");

This might work. But there is a fundamental flaw. It will not work in scaled solutions (load-balanced set, basically).

If you're running multiple versions of your server at any moment, I'm going to assume you're using Redis backplane or Azure SignalR Server. Either way, you're going to run into issues.

Note: A lot of my experience is with Redis backplane, where I have replicated this issue many times. I didn't consider it a bug. Azure SignalR Server might not have the same behavior, but still, you shouldn't do this.

Additionally, you're writing A LOT of unnecessary code when you could simply use the built-in mechanisms of SignalR to do this work for you.

SignalR Mastery: Become a Pro in Real-Time Web Development

Join the thousands of developers who have already taken their first steps into building real-time web applications with SignalR.

The best SignalR Course I've ever watched ever. explaining all the details you need to know and even more about SignalR technology.

Exactly what I am looking for. Really appreciate the real world scenario. Thank you.

The author of the course really shows he knows what he is talking about. Very awesome!

Learn More

What's a better alternative?

I'm glad you asked! I want you to use Groups because it's perfectly suited JUST FOR THIS, and disconnections are cleaned up for you automatically!

Here is a sample of something I've written in the past for a project:

public override async Task OnConnectedAsync()
{
  var userName = GetUserName(Context); // get the username of the connected user

  await Groups.AddToGroupAsync(Context.ConnectionId, $"user_{userName}");
  await base.OnConnectedAsync();
}

There isn't a limit on the number of groups you can have, nor the number of connections in that group. So when a NEW connection comes in, we figure out the user (if we can) and add them to their own group.

In the future, when I want to notify all clients for user "Kevin", the process is easier.

var userName = "Kevin";
await hubContext.Clients.Group($"user_{userName}").SendAsync("notify", "Hello World");

This saves me a roundtrip to my cache or database, and also saves me the cycles of having to process all of the connectionIds.

SignalR will automatically clean up these connections too.

And better yet, using groups works perfectly in scaled solutions.

Users

Groups aren't the only way to solve this problem.

This isn't my favorite method, but you can send messages directly to users based on user principle name.

public Task NotifyUser(string user, string message)
{
    return Clients.User(user).SendAsync("notify", message);
}

Why is it not my favorite? Eh, I just think it's more difficult to use and requires more upfront planning to ensure you have the correct user principle name for sending the message. I don't use

User()

in any of my projects.

Wrapping Up

As engineers, it is easy to complicate solutions in our heads. I've made the same mistakes myself but thinking that the best solution needs to be the most complicated. Sometimes when you take a step back, a better, easier solution can present itself.

I hope this guide was useful to you. If you have further questions about SignalR, please feel free to reach out.