Understanding the Go Scheduler

__turbobrew__ · 2025-05-21T20:11:14 1747858274

Make sure you set GOMAXPROCS when the runtime is cgroup limited.

I once profiled a slow go program running on a node with 168 cores, but cpu.max was 2 cores for the cgroup. The runtime defaults to set GOMAXPROCS to the number of visible cores which was 168 in this case. Over half the runtime was the scheduler bouncing goroutines between 168 processes despite cpu.max being 2 CPU.

The JRE is smart enough to figure out if it is running in a resource limited cgroup and make sane decisions based upon that, but golang has no such thing.

xyzzy_plugh · 2025-05-21T20:17:05 1747858625

Relevant proposal to make GOMAXPROCS cgroup-aware: https://github.com/golang/go/issues/73193

yencabulator · 2025-05-21T20:42:51 1747860171

This should be automatic these days (for the basic scenarios).

https://github.com/golang/go/blob/a1a151496503cafa5e4c672e0e...

formerly_proven · 2025-05-21T22:31:42 1747866702

This is probably going to save quadrillions of CPU cycles by making an untold number of deployed Go applications a bit more CPU efficient. Since Go is the "lingua franca" of containers, many ops people assume the Go runtime is container-aware - it's not (well not in any released version, yet).

If they'd now also make the GC respect memory cgroup limits (i.e. automatic GOMEMLIMIT), we'd probably be freeing up a couple petabytes of memory across the globe.

Java has been doing these things for a while, even OpenJDK 8 has had those patches since probably before covid.

mappu · 2025-05-21T23:24:01 1747869841

GOMEMLIMIT is not as easy, you may have other processes in the same container/cgroup also using memory.

jasonthorsness · 2025-05-21T20:48:17 1747860497

uh isn't that change 3 hours old?

yencabulator · 2025-05-21T20:59:37 1747861177

Oh heh yes it is. I just remembered the original discussion from 2019 (https://github.com/golang/go/issues/33803) and grepped the source tree for cgroup to see if that got done or not, but didn't check when it got done.

As said in 2019, import https://github.com/uber-go/automaxprocs to get the functionality ASAP.

williamdclt · 2025-05-21T22:50:52 1747867852

I honestly can’t count on my fingers and toes how many times something very precisely relevant to me was brought up or sorted out hours-to-days before I looked it up. And more often than once, by people I personally knew!

Always a weird feeling, it’s a small world

jasonthorsness · 2025-05-21T21:16:02 1747862162

super-weird coincidence but welcome, I have been waiting for this for a long time!

01HNNWZ0MV43FF · 2025-05-22T02:17:15 1747880235

Trying to see if Rust and Tokio have the same problem. I don't know enough about cgroups to be sure. Tokio at this line [1] ends up delegating to `std::thread::available_parallelism` [2] which says

> It may overcount the amount of parallelism available when limited by a process-wide affinity mask or cgroup quotas and sched_getaffinity() or cgroup fs can’t be queried, e.g. due to sandboxing.

[1] https://docs.rs/tokio/1.45.0/src/tokio/loom/std/mod.rs.html#...

[2] https://doc.rust-lang.org/stable/std/thread/fn.available_par...

weiwenhao · 2025-05-22T02:56:18 1747882578

Your write-up is so detailed that I even feel like I could implement a complete golang scheduler myself

jasonthorsness · 2025-05-21T20:02:43 1747857763

It's always a sign of good design when something as complex as the scheduler described "just works" with the simple abstraction of the goroutine. What a great article.

"1/61 of the time, check the global run queue." Stuff like this is a little odd; I would have thought this would be a variable dependent on the number of physical cores.

01HNNWZ0MV43FF · 2025-05-22T02:18:51 1747880331

That's so funny. I just saw `61` in the Tokio code with a comment "copied this from Go"

kortex · 2025-05-21T21:00:27 1747861227

Fantastic writeup! Visualizations are great, the writeup is thorough but readable.

90s_dev · 2025-05-21T18:16:31 1747851391

I heard that the scheduler is a huge obstacle to many potential optimizations, is that true?

NAHWheatCracker · 2025-05-21T19:42:33 1747856553

In some ways, yes. If you want to optimize at that level you ought to use another language.

I'm not a low level optimization guy, but I've had occasions where I wanted control over which threads my goroutines are running on or prioritizing important goroutines. It's a trade off for making things less complex, which is standard for Go.

I suppose there's always hope that the Go developers can change things.

silisili · 2025-05-21T20:06:35 1747857995

You can kinda work around this though. runtime package has a LockOSThread that pins a goroutine to its current thread and prevents others from using it.

If you model it in a way where you have one goroutine per os thread that receives and does work, it gets you close. But in many cases that means rearching the entire code base, as it's not a style I typically reach for.

naikrovek · 2025-05-22T00:46:55 1747874815

That sounds a lot like just using another language.

silisili · 2025-05-22T01:25:29 1747877129

It's really not that bad. If you have a codebase in Go you can speed up, it's fine.

That said, if you're greenfielding and see this as a limitation to begin with, picking another language is probably the right way.