Read has very surprising behaviour (does not match spec) #119

talex5 · 2018-12-05T17:44:26Z

A VChan endpoint claims to implement Mirage_flow_lwt.S. This says (https://github.com/mirage/mirage-flow/blob/master/src/mirage_flow.mli#L62):

  val read: flow -> (buffer or_eof, error) result io
  (** [read flow] blocks until some data is available and returns a
      fresh buffer containing it.

However, reading the code it seems that it does not return a fresh buffer. Instead, it returns a region of the underlying ring (which is under the control of the remote VM).

ocaml-vchan/lib/endpoint.ml

Lines 327 to 360 in 5fcd4e3

    
    (* Read a chunk in a blocking fashion. Note this returns a
       reference to the data in the ring. *)
    let rec _read_one vch event =
      (* wait until at least 1 byte is available or the connection has closed *)
      let avail = fast_get_data_ready vch 1 in
      let state = state vch in
      if avail = 0 && state = Connected
      then E.recv vch.evtchn event >>= fun event -> _read_one vch event
      else
        if avail = 0 && state <> Connected
        then Lwt.return `Eof
        else
          let real_idx = Int32.(logand (rd_cons vch) (of_int (rd_ring_size vch) - 1l) |> to_int) in
          let bytes_before_wraparound = rd_ring_size vch - real_idx in
          let buf =
            if bytes_before_wraparound = 0 then begin
              (* all bytes are in a contiguous block starting at 0 *)
              Cstruct.sub vch.read 0 avail
            end else begin
              (* we'll only consume the bytes before wraparound on this iteration *)
              Cstruct.sub vch.read real_idx (min avail bytes_before_wraparound)
            end in
          Lwt.return (`Ok buf)

    let read vch =
      (* signal the remote that we've consumed the last block of data it sent us *)
      set_rd_cons vch Int32.(of_int vch.ack_up_to);
      send_notify vch Read;
      (* get the fresh data *)
      _read_one vch E.initial >>= function
      | `Ok buf ->
        (* we'll signal the remote we've consumed this data on the next iteration *)
        vch.ack_up_to <- vch.ack_up_to + (Cstruct.len buf);
        Lwt.return @@ Ok (`Data buf)

This means that:

- The data in the returned buffer can change at any time if the remote VM is malicious. Users need to protect against this (e.g. by never reading the same byte more than once).
- The data is only valid until the next read, which seems to be when the library acks the read.
- Reading data is not sufficient to create more space in the ring buffer. In particular, the sender may be blocked waiting for space even after the receiver has read all of the data.
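
As an illustration of the first point, here is a minimal sketch of the aliasing hazard. It does not use vchan or Cstruct: the ring is modelled with a stdlib Bigarray, and `ring`, `fill`, `read_view` and `read_copy` are hypothetical names made up for this example. The only thing it relies on is that `Bigarray.Array1.sub`, like `Cstruct.sub`, returns a view that shares storage with its parent rather than a fresh buffer.

```ocaml
open Bigarray

type buf = (char, int8_unsigned_elt, c_layout) Array1.t

(* Hypothetical stand-in for the shared ring that the remote VM can write to. *)
let ring : buf = Array1.create char c_layout 8

(* Overwrite the start of [b] with the bytes of [s] (models a remote write). *)
let fill (b : buf) (s : string) =
  String.iteri (fun i c -> Array1.set b i c) s

let to_string (b : buf) =
  String.init (Array1.dim b) (fun i -> Array1.get b i)

(* What the current read effectively does: hand out a view into the ring. *)
let read_view ~off ~len : buf = Array1.sub ring off len

(* What the spec asks for: a fresh buffer holding a copy of the data. *)
let read_copy ~off ~len : buf =
  let fresh = Array1.create char c_layout len in
  Array1.blit (Array1.sub ring off len) fresh;
  fresh

let () =
  fill ring "AAAAAAAA";
  let view = read_view ~off:0 ~len:4 in
  let copy = read_copy ~off:0 ~len:4 in
  (* The "remote" rewrites the ring after the reader already holds its buffers. *)
  fill ring "BBBBBBBB";
  Printf.printf "view=%s copy=%s\n" (to_string view) (to_string copy)
```

The view follows whatever the writer does to the ring afterwards, while the copy keeps the bytes that were present at read time. A copying read would also let the library ack the data immediately instead of waiting for the next call.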

I think this needs documenting, at least. Perhaps consider renaming the current read to read_unsafe, and providing a copying alternative? (the C implementation copies)

djs55 · 2018-12-05T17:52:19Z

Thanks for highlighting this. I suspect the read semantics may have been clarified after the vchan code was written (or at least that's my excuse!) I agree that we should make read spec-compliant. I don't mind about the unsafe version -- it's a bit hard to use...

avsm · 2018-12-12T22:05:38Z

I'd be in favour of removing the non-copying version entirely, and implementing the safe version. It seems very hard to use the current one correctly.

talex5 · 2018-12-14T10:32:31Z

mirage-qubes actually uses this safely to fill a buffer of known size, which is a common thing to want to do, so I suggest that if we remove this read function, we provide a (safe) read_exactly function to fill a buffer of known size completely, without extra copies.
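
A rough sketch of what such a read_exactly could look like, to make the idea concrete. This is not the ocaml-vchan API: it is synchronous rather than Lwt-based, uses stdlib `Bytes`/`string` instead of `Cstruct.t`, and `next_chunk` and `source_of_chunks` are hypothetical stand-ins for "give me the next readable region of the ring, at most this many bytes". The point is only that each byte is copied once, straight into the caller's buffer.

```ocaml
(* [next_chunk max] is assumed to return the next available data, at most
   [max] bytes long, or [None] once the connection has closed. *)
let read_exactly ~(next_chunk : int -> string option) (dst : Bytes.t) =
  let len = Bytes.length dst in
  let rec loop off =
    if off = len then Ok ()
    else
      match next_chunk (len - off) with
      | None -> Error (`Eof_after off)  (* closed before [dst] was full *)
      | Some chunk ->
        let n = String.length chunk in
        (* single copy: from the chunk directly into the caller's buffer *)
        Bytes.blit_string chunk 0 dst off n;
        loop (off + n)
  in
  loop 0

(* Hypothetical test source: serves the given strings in order, splitting a
   string when the caller asks for fewer bytes than it contains. *)
let source_of_chunks chunks =
  let pending = ref chunks in
  fun max ->
    match !pending with
    | [] -> None
    | c :: rest ->
      let n = min max (String.length c) in
      let leftover = String.sub c n (String.length c - n) in
      pending := (if leftover = "" then rest else leftover :: rest);
      Some (String.sub c 0 n)

let () =
  let dst = Bytes.create 6 in
  match read_exactly ~next_chunk:(source_of_chunks ["ab"; "cdef"; "gh"]) dst with
  | Ok () -> print_endline (Bytes.to_string dst)
  | Error (`Eof_after n) -> Printf.printf "EOF after %d bytes\n" n
```

Because the caller supplies the destination, the function can consume and ack each region as soon as it has been copied, so nothing it returns aliases the ring.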
