Reconstructing request-response pairs from a recording

Michael-F-Bryan · February 20, 2020, 8:40am

I'm analysing an interaction where two devices communicate using the request-response pattern over a fairly unreliable network (RS232) and would like to turn streams of "Computer sent bytes XXXX at 13:01:02" and "Device sent bytes YYYY at 13:01:03" back into the original request-response pairs.

I've rolled my own in the past based on a couple assumptions I know about the traffic behaviour, but was wondering if there any formal algorithms for this sort of thing?

Here's a playground link with the rough API I'm trying to create, plus the various assumptions I can make.

najamelan · February 21, 2020, 8:42pm

Disclaimer: I have no experience with serial connections, but I see your question is unanswered for a while now.

I feel like the question doesn't give much information about where exactly you are stuck.

What layers are you putting on top of the RS232 connection? Are you using existing crates like mio-serial?

Can you get a reliable bytestream out of it? Eg. one that guarantees bytes arrive in order? If you can, you can leverage existing infrastructure, like implementing AsyncRead/AsyncWrite for it and use tokio-codec/futures_codec to frame your connection, which probably saves you a bunch of work.

btw, thumbs up for the clean code and detailed comments

Michael-F-Bryan · February 22, 2020, 9:13am

I guess that's because I'm not sure how best to articulate the problem, and I've been struggling to find better terminology on the web.

A good analogy of what I'm trying to do is the follow TCP stream feature in WireShark.

When looking at a bunch of incoming and outgoing TCP packets, you can click on a single packet and it'll show you a view which reassembles all related packets into the incoming/outgoing streams an application would normally see.

I'm trying to implement a simplified version of this which will reassemble request-response pairs given a set of recorded incoming/outgoing messages (which have been parsed).

For the purposes of this analysis, I don't think the details of how the data is transferred will be relevant, we're analysing a recording of the communication some time after it takes place. The raw data that's been recorded is a series of bytes as well as the time our application emitted the read/write call.

Screenshot from 2020-02-22 17-08-09

I've then done some preprocessing which turns the byte stream from computer to device into a Vec<Request>, and the byte stream from device to computer is turned into a Vec<Response>. Any bytes which can't be parsed into a Request or Response are assumed to be garbage and we skip past them until the start of the next valid message.

The problem I'm trying to solve is how to take a Vec<Request> and Vec<Response> and inspect the attached timestamps and IDs to turn the inputs into a Vec<Transfer> (using the definitions from that playground link). Is there a proper name for this sort of analysis?

najamelan · February 22, 2020, 10:23am

Ok, so it sounds like you have already done the hardest part of parsing the bytestream. What exactly makes you get stuck? What about?:

loop over the Requests
search the Responses for the corresponding ID,
create the transfer object
dump it in a Vec
return the Vec

system · May 22, 2020, 10:28am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Looking for advice on going from AsyncRead/AsyncWrite -> datagrams help	4	508	November 4, 2020
RTSP streaming API sanity check code review	2	821	September 2, 2021
Ssh transport & multiple request/response over same channel	1	509	July 5, 2022
What data structure (?) am i looking for here?	3	452	June 5, 2020
Is there something similar to Framed Codecs for creating Stream adapters? help	10	497	September 11, 2022

Reconstructing request-response pairs from a recording

Related Topics