The two factions of C++

The dream of a single dialect-free C++ has probably been dead for many years, anyway.

Zero Ranger.
Zero Ranger.
Toggle original/dithered image

There seems to be a lot of fighting and arguing over the future of C++.

On Reddit and a certain orange website, definitely, but also surely at the official C++ standard committee meetings. You don’t need to look very far.

The Absolute State (of C++)

It looks like we’re in the following situation:

  • C++’s Evolution Working Group (EWG) just achieved consensus on adopting P3466 R0 - (Re)affirm design principles for future C++ evolution:
    • This means no ABI breaks, retain link compatibility with C and previous C++.
    • It also means no ‘viral annotations’ (no lifetime annotations, for example).1
    • It heavily doubles down on a set of incompatible goals, ie. no ABI break and the zero-overhead-principle.2
    • Whether this is good or bad, it is a (literal) doubling down on the current trajectory of the C++ language.

In the meantime:

I don’t know about you, but if I were to look at all of this as an outsider, it sure would look as if C++ is basically falling apart, and as if a vast amount of people lost faith in the ability of C++’s committee to somehow stay on top of this.4

Two Cultures

People seem to be looking for other solutions.

Say, Google. Google evidently lost faith in ’the process’ ever since the ABI-vote. This isn’t a loss of faith in the language itself, Google has one of the largest C++ codebases in the world, and it has served them incredibly well. It’s a loss of faith in the language’s ability to evolve as pressure mounts from different angles (potential government regulations, competing languages, a desire for better performance and safety guarantees from key players, etc.).

So what’s the problem? Why doesn’t C++ just…change?

Well, figuring that out is easy. Just look at what Herb Sutter said in his paper on profiles:

We must minimize the need to change existing code. For adoption in existing code, decades of experience has consistently shown that most customers with large code bases cannot and will not change even 1% of their lines of code in order to satisfy strictness rules, not even for safety reasons unless regulatory requirements compel them to do so.” – Herb Sutter

Cool. Is anyone surprised by this? I don’t think so.

Now, let’s contrast this with Chandler Carruth’s biography on the WG21 member page:

I led the design of C++ tooling and automated refactoring systems built on top of Clang and now part of the Clang project. […]
Within Google, I led the effort to scale the automated Clang-based refactoring tools up to our entire codebase, over 100 million lines of C++ code. We can analyze and apply refactorings across the entire codebase in 20 minutes.

Oh. Do you see it? (Yes you do, I highlighted it.)

It’s “automated migration tooling”. Except it’s not just that, automated migration tooling is just the peak, the single brightly glowing example.

We’re basically seeing a conflict between two starkly different camps of C++-users:

  • Nimble, modern, highly capable tech corporations that understand that their code is an asset. (This isn’t strictly big tech. Any sane greenfield C++ startup will also fall into this category.)
  • Everyone else. Every ancient corporation where people are still fighting over how to indent their code, and some young engineer is begging management to allow him to set up a linter.

One of these groups will be capable of handling a migration somewhat gracefully, and it’s the group that is capable of building their C++ stack from versioned source, not the group that still uses ancient pre-built libraries from 1998.

In practice, of course, this is a gradient. I can only imagine how much sweat, tears, bills and blood must’ve flown to turn big tech codebases from terrifying balls of mud into semi-manageable, buildable, linted, properly versioned, slightly-less-terrifying balls of mud.

With the bias of hindsight, it’s easy to think of all of this as inevitable: There was a clear disconnect between the needs of corporations such as Google (who use highly modern C++, have automated tooling and testing, and modern infrastructure), and the (very strong) desire for backwards compatibility.

To go out on a limb, the notion of a single, dialect-free and unified C++ seems like it’s been dead for years.5 We have, at the very least, two major flavors of C++:

  • Any remotely modern C++. Probably at least C++17. uniqe_ptr, constexpr, lambdas, optional. Everything can be built from versioned source using some sort of dedicated, clean and unified build process that’s at least slightly more sophisticated than raw CMake, and sort of looks like it just works if you squint a little. Some sort of static analyzers, formatter, linter. Any sort of agreement that keeping a codebase clean and modern is worthwhile.
  • Anything that’s not that. Any C++ that’s been sitting in ancient, dusted-up servers of a medium-sized bank. Any C++ that relies on some utterly ancient chunk of compiled code, whose source has been lost, and whose original authors are unreachable. Any C++ that sits deployed on pet-type servers, to the point that spinning it up anywhere else would take an engineer a full month just to figure out all of the implicit dependencies, configs, and environment variables. Any codebase which is primarily classified as a cost-center.

You’ll notice that the main difference isn’t about C++ itself at all. The difference is tooling and the ability to build from versioned source in any clean, well-defined manner. Ideally, even the ability to deploy without needing to remember that one flag or environment variable the previous guy usually set to keep everything from imploding.

A lot of people will tell you that tooling isn’t the responsibility of the C++ standard committee, and they are right. Tooling isn’t the responsibility of the C++ standard committee, because the C++ standard committee abdicates any responsibility for it (it focuses on specifications for the C++ language, not on concrete implementations). This is by design, and it’s hard to blame them considering the legacy baggage. C++ is a standard unifying different implementations.

That said, if there’s one thing which Go got right, it’s that tooling matters. C++, in comparison, is from a prehistoric age before linters were invented. C++ has no unified build system, it has nothing even close to a unified package management system, it is incredibly hard to parse and analyze (this is terrible for tooling), and is fighting a horrifying uphill battle against Hyrum’s Law for every change that needs to be made.

There’s a massive, growing rift between those two factions, and I honestly don’t see it closing anytime soon. The C++ committee seems pretty committed (committeed, if you will) to maintaining backwards compatibility, no matter the cost.

Consequences

This is why profiles are the way they are: Safety Profiles are not intended to solve the problems of modern, tech-savvy C++ corporations. They’re intended to bring improvements without requiring any changes to old code.

Likewise, modules. You’re intended to be able to “just” import a header file as a module, and there should not be any sort of backwards compatibility issues.

Of course, everyone loves features which can just be dropped-in and bring improvements without requiring any changes to old code. But it’s pretty clear that these features are designed (first and foremost) with the goal of ’legacy C++’ in mind. Any feature that would require a migration from legacy C++ is a non-starter for the C++ committee since, as Herb Sutter said, you essentially cannot expect people to migrate.

This is something which I try to keep in mind when I look at C++ papers: There’s two large audiences here. One is that of modern C++, the other is that of legacy C++. These two camps disagree fiercely, and many papers are written with the needs of one specific group in mind.

The C++-committee is trying to keep this rift from widening. That’s, presumably, why anything in the direction of Safe C++ by Sean Baxter is a non-starter for them. This is a radical, sweeping change that could create a fundamentally new way of writing C++.

Of course, there’s also the question of whether specific C++ standard committee members are just being very, very stubborn, and grasping at straws to prevent an evolution which they personally aesthetically disagree with.

Far be it from to accuse anyone, but it wouldn’t be the first time I heard that the C++ committee applied double standards such as: “We expect a full, working implementation across several working compilers from you if you’d like to see this proposal approved, but we’re still happy to commit to certain vast projects (eg. modules, profiles) that have no functioning proof of concept implementation.”

If this were the case (I genuinely cannot say) then I really wouldn’t know for how much longer C++ could continue going down that road without a much more dramatic split.

And all of that that is not even getting into the massive can of worms and problems that’d be caused by breaking ABI compatibility.


  1. If you want to be cynical, you can interpret this as an explicit “Nuh-uh” towards Rust’s lifetime annotations, and Sean Baxter’s ‘Safe C++’ proposal. If you want to be less cynical, it’s at least a bitter awareness of the fact that the industry doesn’t care to refactor existing code. ↩︎

  2. “You don’t pay for what you don’t use.” Essentially, an existing C++ feature can only ever impact your runtime performance if you actively use it. This is not quite compatible with a stable ABI since a stable ABI (understood as a feature of C++) precludes certain performance improvements. ↩︎

  3. I think Carbon is a lot more interesting than most people give it credit for. Ask me about my opinion sometime. I might write a post on it. ↩︎

  4. Is it falling apart? Eh. Depends on what you mean. All that C++ code is not going to go anywhere. So in that sense, no. C++ code will continue to exist, at the very least. ↩︎

  5. Whether it was ever alive, with all sorts of different compilers, and their own compiler extensions is a different question. ↩︎