fix: Prevent undefined behaviour from malicious AsyncRead impl #2030

Marwes · 2019-12-25T14:34:19Z

AsyncRead is safe to implement but can be implemented so that it
reports that it read more bytes than it actually did. poll_read_buf on
the other hand implicitly trusts that the returned length is actually
correct which makes it possible to advance the buffer past what has
actually been initialized.

An alternative fix could be to avoid the panic and instead advance by
n.min(b.len())

`AsyncRead` is safe to implement but can be implemented so that it reports that it read more bytes than it actually did. `poll_read_buf` on the other head implicitly trusts that the returned length is actually correct which makes it possible to advance the buffer past what has actually been initialized. An alternative fix could be to avoid the panic and instead advance by `n.min(b.len())`

DoumanAsh · 2019-12-27T08:00:55Z

tokio/src/io/async_read.rs

@@ -123,7 +123,9 @@ pub trait AsyncRead {
                // Convert to `&mut [u8]`
                let b = &mut *(b as *mut [MaybeUninit<u8>] as *mut [u8]);

-                ready!(self.poll_read(cx, b))?
+                let n = ready!(self.poll_read(cx, b))?;
+                assert!(n <= b.len(), "Bad AsyncRead implementation, more bytes were reported as read than the buffer can hold");


This probably should be debug assert.

Writing bad implemention is unlikely to be intentional

We need it to be enforced at all times, since it needs to be upheld to prevent UB. Similar to bounds checking slices. Conversely though it might be optimized away for many cases!

no, we do not.
If we do not trust the contract, then this trait itself is UB and should be marked as unsafe

It specifically needs to be checked here because it's being passed to advance_mut below, which itself is an unsafe fn similar to Vec::set_len. Consider if this using a slice instead:

let n = ready!(self.poll_read(cx, &mut buf))?; let data = &buf[..n];

Accessing the data from the slice (&buf[..n]) would be similarly bounds checked, even if the implementation of poll_read was incorrect.

Actually, the contract of BufMut::advance_mut does seem to prevent this scenario:

Panics

This function may panic if cnt > self.remaining_mut().

Implementer notes

It is recommended for implementations of advance_mut to panic if cnt > self.remaining_mut(). If the implementation does not panic, the call must behave as if cnt == self.remaining_mut().

A call with cnt == 0 should never panic and be a no-op.

@Marwes ?

It won't panic and the Vec passed here will call reserve to expand to the correct length. But it will also follow that by calling set_len https://docs.rs/bytes/0.5.3/src/bytes/buf/buf_mut.rs.html#999 which then allows uninitialized bytes to be accessed.

Separately, the advance_mut implementation seems dubious. resize(cnt); set_len(len + cnt); will always result in the Vec exposing undefined values to safe callers.

Either panicking (as the doc suggests) or just trusting that the passed cnt is correct and not calling reserve seems better by either being safer or by avoiding the reserve overhead.

udoprog · 2019-12-27T08:42:32Z

This is a really interesting PR. On a similar note, is an implementation that incorrectly reports the length (without overflowing) to be considered sound?

It's fairly straight forward to reason about zero-initialized buffers, because even if the read underflows, the contents of the buffer is well-defined at all points. Or an unsafe implementation which uses an uninitialized buffer and then makes sure that the length is appropriately updated. But here we have a situation where safe code has a fair amount of control over the argument to Vec::set_len, but constrained to whatever implements BufMut.

Marwes · 2019-12-27T10:27:16Z

@DoumanAsh Reading uninitialized bytes from safe code without the assert! (which is why it can't be a debug_assert!).

https://play.rust-lang.org/?version=stable&mode=debug&edition=2018&gist=5adef81554ca3c3174739ddaa43fb763

DoumanAsh · 2019-12-27T10:32:05Z

@Marwes Defensive programming is all good, but your scenario implies that trait itself is broken.
And you only attempt to work-around by adding assert, which is likely to be useless in 99.9% implementations.
No one in sane mind will implement such bad reader intentionally, it is likely to be unintentional mistake.

I know there were like tons of useless convo about broken AsyncRead and Read traits, but I feel like this PR is just work-around for it.
This assert should be optional at least

Marwes · 2019-12-27T10:41:58Z

The trait isn't broken, but an implementation of it can be. Since neither the implementation of the trait, nor the caller of read_buf involves any unsafe code it should be impossible to invoke undefined behavior regardless of how bad or wrong the code otherwise is.

I am aware that this may not be the best way to fix this but fact of the matter is that it needs to be fixed in one way or another so I'd prefer suggestions for other, better fixes instead of sweeping it away.

The current fixes that I am aware of is:

This PR
AsyncRead could be an unsafe trait instead, which would then put the responsibility on the implementator in the same way as if unsafe fn prepare_uninitialized_buffer were explicitly implemented.
BufMut could be changed to assert that advance_mut to never do a set_len that isn't larger than remaining_mut (which implies no reserve call). All other implementors of BufMut should also be audited if this is the case.
read_buf could advance by n.min(b.len()) instead.

udoprog · 2019-12-27T11:09:48Z

@Marwes

BufMut could be changed to assert that advance_mut to never do a set_len that isn't larger than remaining_mut (which implies no reserve call). All other implementors of BufMut should also be audited if this is the case.

Since reading the panic note on the contract to advance_mut, I don't see how an UB because of an overflow is possible. In fact, the implementation for Vec<u8> does reserve (which doesn't seem to follow the contract, but I'll open a separate issue on that!).

Worst case it seems like advance_mut will either:

panic
silently truncate cnt to remaining_buf()
grow the underlying buffer to accommodate (undocumented? behavior of Vec<u8>).

This would however cause uninitialized arrays of bytes to be propagated through the application, which might be UB, but of this I'm not sure. It's certainly a bug though. But this could also happen if an implementer of AsyncRead returns a value greater than the number of bytes it's filled to the buffer (but doesn't overflow). See #2030 (comment)

So in my view, the question is if returning wholly or partially uninitialized arrays of u8 is UB or not? If yes, it seems to me as if poll_read must be marked unsafe, or an unsafe alternative provided.

udoprog · 2019-12-27T11:26:16Z

Opened tokio-rs/bytes#354

udoprog · 2019-12-27T23:20:55Z

I was thinking about how this problem generalizes to std types and uninitialized buffers, and found this internals thread which is very much relevant: https://internals.rust-lang.org/t/uninitialized-memory/1652/10

@carllerche commented a fair bit in it, so maybe they can comment on this issue as well?

Marwes · 2019-12-28T11:01:40Z

This would however cause uninitialized arrays of bytes to be propagated through the application, which might be UB, but of this I'm not sure. It's certainly a bug though. But this could also happen if an implementer of AsyncRead returns a value greater than the number of bytes it's filled to the buffer (but doesn't overflow). See #2030 (comment)

This isn't an issue because prepare_uninitialized_buffer is called before hand. So the entire, passed buffer is either fully initialized (if prepare_uninitialized_buffer is the default implementation) or the trait implementor has explicitly implemented it which puts the responsibility on that implementation instead (since unsafe had to be written to implement the method)

udoprog · 2019-12-28T11:05:31Z

This isn't an issue because prepare_uninitialized_buffer is called before hand. So the entire, passed buffer is either fully initialized (if prepare_uninitialized_buffer is the default implementation) or the trait implementor has explicitly implemented it which puts the responsibility on that implementation instead (since unsafe had to be written to implement the method)

Ah, that's right. Thanks for pointing that out!

Marwes · 2020-01-12T18:08:28Z

Any movement on this? Does not feel great to leave a soundness hole open, even if the trait is likely going to change.

carllerche

Sorry for the delay, this got lost in my inbox...

DoumanAsh reviewed Dec 27, 2019

View reviewed changes

carllerche approved these changes Jan 21, 2020

View reviewed changes

carllerche merged commit fbe143b into tokio-rs:master Jan 21, 2020

Marwes deleted the async_buf_read_undefined_behaviour branch January 21, 2020 20:39

TimHambourger mentioned this pull request Mar 13, 2020

Make advance_mut impl of BufMut for Vec<u8> panic if cnt > remaining tokio-rs/bytes#377

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Prevent undefined behaviour from malicious AsyncRead impl #2030

fix: Prevent undefined behaviour from malicious AsyncRead impl #2030

Marwes commented Dec 25, 2019 •

edited

Loading

DoumanAsh Dec 27, 2019

udoprog Dec 27, 2019 •

edited

Loading

DoumanAsh Dec 27, 2019

udoprog Dec 27, 2019

udoprog Dec 27, 2019

Marwes Dec 27, 2019 •

edited

Loading

udoprog commented Dec 27, 2019 •

edited

Loading

Marwes commented Dec 27, 2019

DoumanAsh commented Dec 27, 2019

Marwes commented Dec 27, 2019 •

edited

Loading

udoprog commented Dec 27, 2019 •

edited

Loading

udoprog commented Dec 27, 2019

udoprog commented Dec 27, 2019

Marwes commented Dec 28, 2019

udoprog commented Dec 28, 2019

Marwes commented Jan 12, 2020

carllerche left a comment

fix: Prevent undefined behaviour from malicious AsyncRead impl #2030

fix: Prevent undefined behaviour from malicious AsyncRead impl #2030

Conversation

Marwes commented Dec 25, 2019 • edited Loading

DoumanAsh Dec 27, 2019

Choose a reason for hiding this comment

udoprog Dec 27, 2019 • edited Loading

Choose a reason for hiding this comment

DoumanAsh Dec 27, 2019

Choose a reason for hiding this comment

udoprog Dec 27, 2019

Choose a reason for hiding this comment

udoprog Dec 27, 2019

Choose a reason for hiding this comment

Panics

Implementer notes

Marwes Dec 27, 2019 • edited Loading

Choose a reason for hiding this comment

udoprog commented Dec 27, 2019 • edited Loading

Marwes commented Dec 27, 2019

DoumanAsh commented Dec 27, 2019

Marwes commented Dec 27, 2019 • edited Loading

udoprog commented Dec 27, 2019 • edited Loading

udoprog commented Dec 27, 2019

udoprog commented Dec 27, 2019

Marwes commented Dec 28, 2019

udoprog commented Dec 28, 2019

Marwes commented Jan 12, 2020

carllerche left a comment

Choose a reason for hiding this comment

Marwes commented Dec 25, 2019 •

edited

Loading

udoprog Dec 27, 2019 •

edited

Loading

Marwes Dec 27, 2019 •

edited

Loading

udoprog commented Dec 27, 2019 •

edited

Loading

Marwes commented Dec 27, 2019 •

edited

Loading

udoprog commented Dec 27, 2019 •

edited

Loading