
Big bad decryption bug in OpenSSL – but no cause for alarm

The buggy code's in there, alright. Fortunately, it's hard to get OpenSSL to use it even if you want to, which mitigates the risk.

The well-known and widely-used encryption library OpenSSL released a security patch earlier this week.

Annoyingly for those who like lean, modern, sans serif typefaces, the new version is OpenSSL 1.1.1l, which is tricky to interpret if you use a font in which upper case EYE, lower case ELL and the digit ONE look at all similar.

To spell it out phonetically, you’re after OpenSSL version ONE dot ONE dot ONE LIMA.

(At the time of writing, Naked Security’s official typeface is Flama, a Bauhaus-inspired font family derived from DIN 1451, which itself arose out of early 20th century German railway and road lettering styles. Our lower case ELLs have a neat looking rightwards curl at the bottom to improve their legibility, and ONEs get a classically European look with a crossbar at the bottom and a little leftward flick at the top. But not all typefaces are made that way.)

The bugs

OpenSSL, as its name suggests, is mainly used by network software that uses the TLS protocol (transport layer security), formerly known as SSL (secure sockets layer), to protect data in transit.

Although TLS has now replaced SSL, removing a huge number of cryptographic flaws along the way, many of the popular open source programming libraries that support it, such as OpenSSL, LibreSSL and BoringSSL, have kept old-school product names for the sake of familiarity.

Despite having TLS support as its primary aim, OpenSSL also lets you access the lower-level functions on which TLS itself depends, so you can use the libcrypto part of OpenSSL to do standalone encryption, compute file hashes, verify digital signatures and even do arithmetic with numbers that are thousands of digits long.
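For example, here’s a minimal standalone sketch of libcrypto’s big-number arithmetic (BN_exp() and friends are genuine libcrypto functions; error handling is omitted for brevity):

#include <stdio.h>
#include <openssl/bn.h>

/* A taste of libcrypto on its own: compute 2^1000, a 302-digit
   number, using OpenSSL's arbitrary-precision arithmetic. */
int main(void)
{
    BN_CTX *ctx = BN_CTX_new();
    BIGNUM *base = BN_new(), *exp = BN_new(), *result = BN_new();
    char *dec;

    BN_set_word(base, 2);
    BN_set_word(exp, 1000);
    BN_exp(result, base, exp, ctx);   /* result = 2^1000 */

    dec = BN_bn2dec(result);          /* decimal string; free with OPENSSL_free */
    printf("2^1000 = %s\n", dec);

    OPENSSL_free(dec);
    BN_free(base); BN_free(exp); BN_free(result);
    BN_CTX_free(ctx);
    return 0;
}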

There are two bugs patched in the new version:

  • CVE-2021-3711: SM2 decryption buffer overflow.
  • CVE-2021-3712: Read buffer overruns processing ASN.1 strings.

Strings, long and short

The second of these bugs, CVE-2021-3712, is the less dangerous of the two, and ironically relates to how OpenSSL handles encoded cryptographic keys and certificates.

The raw data inside TLS key and certificate files is packaged up in a format called DER, short for Distinguished Encoding Rules, which is a form of ASN.1, short for Abstract Syntax Notation One, a structured way of representing binary data.

(Note that if you’ve ever looked at TLS keys or certificates, you’ve probably seen something like this:

-----BEGIN PUBLIC KEY-----
MFkwEwYHKoZIzj0CAQYIKoZIzj0DAQcDQgAE+LXZfjSOTE0cigDmC3Vlbm0VABgl
Zkmp1zbZsiN9ILxqSQy5Krrza94c/eVZORK03gteh9txboKKQOh6LyAftg==
-----END PUBLIC KEY-----

That’s just a topped-and-tailed, base64 encoded version of the raw DER data, used to make the file easier to recognise and less likely to get mangled in transit than a pure binary file.)

Those names are quite a mouthful, but the important part is not the jargon: it’s the fact that text strings in ASN.1 are stored in a similar way to how they are in programming languages like Pascal, namely with a length field followed by exactly that much data.

In C, however, strings are stored without any length field: you just get the raw text data, ended with a zero (NUL) byte.

That makes C strings much simpler to use, but it can be annoying for three reasons.

  • Firstly, you can never be sure how long a string is until you traverse the whole thing to find out where the NUL byte is.
  • Secondly, you can’t have a NUL byte in the middle of a string, even if you want to.
  • Thirdly, if the final NUL byte gets left out, then copying or printing out a string could go on and on for ages, and include way more information than you intended, assuming that the unterminated string is followed by a large block of non-zero data.

So, ASN.1 gives you structure and control, while C gives you simplicity and speed.
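To make the difference concrete, here’s a minimal C sketch (the lenstr struct below is our own illustration, not OpenSSL’s real ASN1_STRING type):

#include <stdio.h>
#include <string.h>

/* Pascal/ASN.1 style: an explicit length plus that many bytes of data.
   (Illustrative only; OpenSSL's real type is ASN1_STRING.) */
struct lenstr {
    size_t len;
    const char *data;
};

int main(void)
{
    /* C style: no length field; the string ends at the first NUL byte. */
    const char *cstr = "hello";
    printf("C string length: %zu\n", strlen(cstr));  /* must scan for the NUL */

    /* Length-prefixed style: the length is known up front, and the
       data may legitimately contain NUL bytes in the middle. */
    struct lenstr astr = { 5, "hello" };
    printf("Length-prefixed length: %zu\n", astr.len);

    /* The danger: a buffer with no trailing NUL is not a valid C string,
       so printing it with a bare "%s" would read past the end of the array. */
    char unterminated[5] = { 'h', 'e', 'l', 'l', 'o' };
    printf("%.*s\n", (int)sizeof(unterminated), unterminated);  /* safe: length-capped */
    return 0;
}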

To give you the best of both worlds, OpenSSL always adds a NUL byte to its ASN.1 strings, even though this is not necessary.

This means you can access those strings via OpenSSL’s special ASN.1 functions, as usual, but also safely read them out directly from C, for example if you want to print out a message that contains the relevant string.

Actually (and this is the problem), that’s not quite true.

You can safely treat OpenSSL’s ASN.1 strings as C strings, but only if they were generated by OpenSSL’s special “always add the NUL byte” functions, otherwise you could end up with an unterminated string, and all the problems that can cause.

If you construct the ASN.1 data yourself, maliciously or otherwise, you don’t need to include those pesky NUL bytes, and the resulting strings will not be safe to use directly from C – you will need to use OpenSSL’s special access functions all the time.

Unfortunately, a few of OpenSSL’s own functions were found to be taking shortcuts and relying on directly accessing ASN.1 strings from C, even when they couldn’t be sure that the original data had been created with those all-important NUL bytes tacked on the end.
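As a sketch of the safe and unsafe approaches (ASN1_STRING_length() and ASN1_STRING_get0_data() are genuine OpenSSL accessor functions; the surrounding code is our own illustration):

#include <stdio.h>
#include <openssl/asn1.h>

/* Print an ASN.1 string by honouring its explicit length field. */
void print_asn1_string(const ASN1_STRING *s)
{
    const unsigned char *data = ASN1_STRING_get0_data(s);  /* raw bytes */
    int len = ASN1_STRING_length(s);                       /* explicit length */

    /* Safe: the "%.*s" precision caps how many bytes printf may read. */
    printf("%.*s\n", len, data);

    /* Unsafe shortcut (the CVE-2021-3712 pattern): this assumes a
       trailing NUL byte that attacker-supplied ASN.1 data need not have.

       printf("%s\n", data);
    */
}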

As a result, with clever shenanigans, it might be possible for an attacker to trick OpenSSL into printing out data that goes beyond the end of the memory buffer.

This sort of read buffer overflow could lead to data leaks, where private data accidentally gets revealed in output such as a log file or a system message.

Alternatively, read overflows could lead to memory access errors, where OpenSSL reads so much extra data beyond the missing NUL that an access violation occurs, leading to a software crash.

A data breach due to a read overflow is exactly what happened in the infamous Heartbleed bug, where code that was supposed to send a reply containing just a few bytes from memory inadvertently copied up to 64Kbytes every time instead.

This leaked out whatever else was adjacent in memory at the time, possibly including passwords or decryption keys that just happened to be nearby.

Decrypting too much

The more serious bug of the two, CVE-2021-3711, also involves a buffer overflow, but this time it’s a write overflow, making it much more dangerous.

If you can write past the end of an officially allocated block of memory and modify data that controls some other part of a program, then you may be able to manipulate the behaviour of the program in the future.

For example, you may be able to trick it into thinking that something succeeded (or failed) when it didn’t, or even to take over the flow of program execution entirely.

Using booby trapped data to take over a running program is known as RCE, short for remote code execution, which means exactly what it says: someone else, perhaps even on the other side of the world, gets to control your computer without so much as a popup dialog or a warning.

The CVE-2021-3711 bug relies on a common programming idiom used in software code that generates output data.

That idiom involves using the same data output function twice in succession: first, you run the function but say “don’t actually generate the data, just tell me how much there will be when I do it for real”; then, after setting aside a buffer of the right size, you run the function again to produce the actual data.

That way, in theory, you can reliably avoid buffer overflows by making sure that you have enough memory space before you start.

Except that in one specific case in OpenSSL, namely when using the Chinese government’s cryptographic algorithm known as ShangMi (SM), the software may report a required buffer size that is up to 62 bytes too small.

That means that booby-trapped encrypted data sent in for decryption could trigger a significant buffer write overflow inside OpenSSL.
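Here’s a sketch of how the two-call idiom looks with OpenSSL’s EVP decryption interface, which is the API surface involved here (the context is assumed to be already initialised; error handling is pared down for brevity):

#include <stdlib.h>
#include <openssl/evp.h>

/* The classic two-call idiom: ask for the size first, then decrypt. */
unsigned char *decrypt_blob(EVP_PKEY_CTX *ctx,
                            const unsigned char *in, size_t inlen,
                            size_t *outlen)
{
    unsigned char *out;

    /* Call 1: a NULL output buffer means "just tell me how much
       room the decrypted data will need". */
    if (EVP_PKEY_decrypt(ctx, NULL, outlen, in, inlen) <= 0)
        return NULL;

    /* Set aside a buffer of exactly the promised size... */
    if ((out = malloc(*outlen)) == NULL)
        return NULL;

    /* Call 2: decrypt for real. CVE-2021-3711 broke the contract for
       SM2: call 1 could promise up to 62 bytes less than call 2
       actually wrote, overflowing the buffer allocated above. */
    if (EVP_PKEY_decrypt(ctx, out, outlen, in, inlen) <= 0) {
        free(out);
        return NULL;
    }
    return out;
}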

In particular, it looks as though the buggy code could end up getting called if OpenSSL is asked to set up a TLS connection using the ShangMi ciphersuite and then to validate the web certificate presented by the other end during the TLS handshake.

So far, we’re not aware of any working exploits using this vulnerability.

With a bit of luck

The good news here is that official TLS support for ShangMi was only introduced in RFC 8998, dated March 2021, so it’s a newcomer to the world’s cryptographic stable.

So, although OpenSSL includes implementations of the SM algorithms (SM2 for key agreement and digital signatures, SM3 for hashing, and SM4 for block encryption)…

…it doesn’t yet include the code needed to allow you to choose these algorithms as a ciphersuite for use in TLS connections.

You can’t ask your TLS client code to request a ShangMi connection to someone else’s server, as far as we can see; and you can’t get your TLS server code to accept a ShangMi connection from someone else’s client.

So the bug is in there, down in the low-level OpenSSL libcrypto code, but if you use OpenSSL at the TLS level to make or accept secure connections, we don’t think you can open up a session in which the buggy code could be triggered.

In our opinion, that greatly reduces the likelihood of criminals abusing this flaw to implant malware on your laptop, for example by luring you to a booby-trapped website and presenting you with a rogue certificate during connection setup.

What to do?

  • Upgrade to OpenSSL 1.1.1l if you can. Although most software on Windows, Mac, iOS and Android will not be using OpenSSL, because those platforms have their own alternative TLS implementations, some software may include an OpenSSL build of its own and will need updating independently. If in doubt, consult your vendor. Most Linux distros will have a system-wide version of OpenSSL, so check with your distro for an update. (Note: Firefox doesn’t use OpenSSL on any platform.)
  • Consider rebuilding OpenSSL without ShangMi support if you can’t upgrade. Passing the options no-sm2 no-sm3 no-sm4 to the OpenSSL config script before rebuilding will do the trick. Given that ShangMi can’t yet be selected for use in TLS connections, you should find this to be a non-disruptive change.
  • If you’re a programmer, always assume the worst about data. Never assume that data you are called upon to deconstruct was created by the matching construction functions that you carefully coded into your own library. Crafting booby-trapped data packets that don’t look the way you expect (and weren’t covered in your own testing) is what a lot of cybersecurity researchers do for a living. Unfortunately, not all of them work for the Good Guys.

11 Comments

For 25 years the FOSS fans have been telling us of the virtues of Open Source. But now you’re telling us that two bugs have hidden in OpenSSL for years!

I’m beginning to believe that most productive programmers don’t have the time to study others’ complex code looking for bugs.

Well, the infamous Heartbleed bug in OpenSSL, back in 2014, was a good reminder that the saying “with many eyes, all bugs are shallow” was a myth:
https://nakedsecurity.sophos.com/2014/04/08/anatomy-of-a-data-leak-bug-openssl-heartbleed/


Bug in OpenSSH for about two decades:
https://nakedsecurity.sophos.com/2018/08/23/vulnerability-in-openssh-for-two-decades-no-the-sky-isnt-falling/

Here’s a visualisation of how long bugs later deemed critical seem to have been in the Linux kernel:
https://nakedsecurity.sophos.com/2016/10/19/linux-kernel-bugs-we-add-them-in-and-then-take-years-to-get-them-out/

A bug that was in QEMU for about 10 years, and ended up faithfully reproduced in numerous Linux virtualisation projects that used its code:
https://nakedsecurity.sophos.com/2015/05/14/the-venom-virtual-machine-escape-bug-what-you-need-to-know/

This isn’t just an open source problem, of course – closed source projects also sometimes winkle out bugs that have lasted years or decades, and even though closed source generally can’t be viewed by just *anyone*, lots of closed source projects do get viewed “by many eyes”, often by eyes that are formally employed to look at that code closely over a sustained period, rather than by a rolling band of volunteers.

The hard part isn’t finding and fixing bugs… it’s knowing where to look in the first place :-)

Care to do a similar analysis for NON open source software?

It’s not an analysis or a comparison. It’s just some counter-examples to the specific idea that bugs will inevitably be found quickly in FLOSS because lots of people are bound to be looking at it.

As I said in my comment above, the opacity of bugs is also a problem in closed source code, even when a dedicated, full-time team has been working on the same codebase for years. I’m not trying to decide here whether open or closed source code is better or worse on principle (I have seen at least two studies that have found that the defect rates and the time that bugs lie unnoticed are much more similar than different, which seems very likely to me).

We’ve written many times before about Windows bugs that were patched in the last few years where Microsoft took the unusual step of publishing updates for Windows versions that had been out of support for ages, which implies in each case that the bug dated back at the *latest* to the last official release of that unsupported version, but probably well before that. We’ve also written about bugs found in recent times that only existed because of design decisions made in the olden days, and never revisited or revised since then.

Examples:
https://nakedsecurity.sophos.com/2019/05/15/update-now-critical-remote-wormable-windows-vulnerability/
https://nakedsecurity.sophos.com/2015/02/11/jasbug-windows-vulnerability-beyond-the-hype-what-you-need-to-know/

Also, if you remember, the Stuxnet virus spread far and wide using an obvious-once-you-know-what-to-look-for zero day vulnerability that was only uncovered and patched *after Stuxnet was noticed in the wild*, which itself took several years.

Indeed, I wasn’t trying to examine whether there was a difference in bug huntability between FLOSS and closed source. I was merely musing about the popular assumption that bugs *must surely* get found fast in FLOSS because more people are likely to look at it, stands to reason, QED. (Perhaps the aphorism should read, “All other things being equal, shallow bugs are more likely to be found if more people have access to the code.” But the truth seems to be that just looking at code, no matter how many people do it, isn’t enough on its own to bring bugs closer to the surface where they magically become easier to see.)

Thanks, but do “bug-hunters” put FLOSS code through “parsers” to identify “lax programming” – which they can’t do with closed source?

Instinctively I would expect open source bugs to become public quicker than closed source – but the corollary is that exploitation of open source bugs is likely to be quicker than with closed source.

We are very reliant on the idea that closed source software companies have teams trawling code (active, LTS and legacy) for bugs. But when push comes to shove where do commercial companies put their resources – major re-writes and new fancy features for the next competitive release (and hope old issues remain undiscovered) – or examining code that has previously passed quality testing?

Static analysers can find some bugs, and many (most?) major software makers, open or closed, will use tools of this sort. But there are many classes of bug that this sort of checking has trouble finding.

One bit of good news for closed source projects (and an increasing amount of open source) is bug bounties, where companies offer payments to third parties to find real-world runtime bugs that are (or could be) exploitable. Numerous tools exist to help with this sort of bug-hunting, including fuzzing, dynamic instrumentation and more.

At Sophos we use a mixture of bug-hunting techniques including internal code reviews, external code reviews, fuzzing, internal bug hunting using dynamic techniques, bug bounties, and more, for all the code we use (open and closed) and all the products we build from that source. Continual review (of what we build from and what we build into) aims to avoid the trap of “it passed QA last year so this part can be ignored from now on”. Our dedicated bug-hunting experts don’t just look at new code that’s being added but at the full products with all the legacy code in there too. They don’t stop looking for any specific sort of bug or stop hunting bugs in older parts of the codebase just because new code is coming along. (Importantly, though, they may hasten the adoption of new code if they think old parts need replacing or improving!)

Interesting as always! Your article and subsequent comments raise the question – surely these types of buffer overflow bugs highlight the need to move away from fast, but potentially unsafe, languages like C? Programmers are only human, after all.
With cars, we enforce seatbelts, rather than just telling people to drive more carefully.
I’ve heard Rust is fast, but generally safer to use. I’d be fascinated to hear your take on ditching languages like C for safer / stricter ones.

Bugs are possible in any language (and no matter how correct your code, you then have to rely on compilers, linkers and the OS itself both to “do what you say” and to “do what you mean”). You don’t need Rust, though it’s a popular “cure-all” these days… there are also systems like F* and Clight (which aim to produce crypto code that is provably correct), and other approaches you could use, too, if you are understandably wary of C and of C++.

If anything, part of the problem in contemporary online crypto libraries is simply that there is so much choice and so many places to get things wrong, not just with buggy code but also with incorrect usage.

Libraries such as NaCl and libsodium, which aren’t in Rust, are trying to fix this by offering only a few algorithms that can be accessed only in a few ways. But OpenSSL and all the other mainstream libraries remain popular because they offer variety and choice (for example, you can use authenticated encryption algorithms but without ever checking the authentication part, or choose outdated or unsuitable algorithms and configurations because legacy code demands it).

And the world, sadly, makes that necessary because a significant enough majority (or perhaps sometimes an outright majority) simply won’t move forward.

TLS 1.3, which is complex enough yet way simpler than its predecessors, is widely supported, but not widely enough that those of us who care about these things can afford to say, “I will speak and listen only for TLS 1.3, with no fallback to any previous version”, at least not without losing a massive amount of goodwill and business.

And even TLS 1.3, which supported just 5 ciphersuites when it came out, now has two new ones to support, and three new cryptographic algorithms to implement, because of ShangMi.

In other words, TLS 1.3 just got a new elliptic curve, a new block cipher, a new hash function *and* two new ways of combining them… and when the next government or nation-state bloc decides it can’t trust the Americans *or* the Chinese, and wields enough influence, we shall presumably get more ciphers, and more, until TLS 3.0 comes out (it seems traditional these days to skip Version 2) at which point we now have N+1 standards to worry about, not just 1 new standard to rule them all.

ObXKCD: https://xkcd.com/927/

Can these be exploited on a DD-WRT router? If not, then I should be secure.

I don’t know. Does the version of DD-WRT you are using have OpenSSL as its crypto library? If so, how was that library compiled? If it contains either or both of these bugs, has it been patched?

There are lots of different DD-WRT builds (from memory – it’s a while since I used it) because each build gets tweaked for the hardware it will run on, given that most routers don’t have enough space to include all possible options and multiple library choices. (My desktop Linux comes with OpenSSL, gnutls, NSS and yet more crypto libraries, for the various apps that use them.)

You’ll need to review the build for yourself, or ask on the relevant DD-WRT forum…

