Is it OK to loop over recv() / read() to read all data from a socket?

I'm building a multi-client<->server messaging application over TCP.
I created a non-blocking server using epoll to multiplex Linux file descriptors.

When an fd receives data, I read() or recv() into buf.
I know that I need to either specify a data length* at the start of the transmission, or use a delimiter** at the end of the transmission, to separate the messages.

*using a data length:

    char *buffer_ptr = buffer;

    do {
        switch (recvd_bytes = recv(new_socket, buffer_ptr, rem_bytes, 0)) {
            case -1: return SOCKET_ERR;    /* error; on a non-blocking socket this includes EAGAIN */
            case 0:  return CLOSE_SOCKET;  /* peer closed the connection */
            default: break;
        }
        buffer_ptr += recvd_bytes;
        rem_bytes -= recvd_bytes;
    } while (rem_bytes != 0);

**using a delimiter:

    void get_all_buf(int sock, std::string & inStr)
    {
        int n = 1, total = 0, found = 0;
        char temp[1024*1024];
        temp[0] = '\0';  // keep temp a valid string even if the first recv() fails

        // Keep reading up to a '\n'
        while (!found) {
            n = recv(sock, &temp[total], sizeof(temp) - total - 1, 0);
            if (n <= 0) {
                /* 0: peer closed (or buffer full); -1: error, check 'errno' for more details */
                break;
            }
            total += n;
            temp[total] = '\0';
            found = (strchr(temp, '\n') != 0);
        }
        inStr = temp;
    }

My question: Is it OK to loop over recv() until one of those conditions is met? What if a client sends a bogus message length, or no delimiter, or there is packet loss? Won't I be stuck looping over recv() in my program forever?

asked Oct 30 '18 at 22:05

localhost


Welcome to the joys of protocol design. Your choices are wait forever (like NFS3 does) or timeout (like http does).

– stark
Oct 30 '18 at 22:17

this is a low latency application - receiving thousands of msgs per second

– localhost
Oct 30 '18 at 22:25

was thinking to maybe call recv a couple more times then close the connection if nothing else was on the fd

– localhost
Oct 30 '18 at 22:37


Consider using a per-connection buffer with an incoming message size limit. You can see something similar in the facil.io WebSocket implementation (a dynamic message size limit) or its HTTP implementation (a hard-coded header line length limitation).

– Myst
Oct 31 '18 at 1:35

P.S. (Side Note): Using epoll in one-shot mode might be better than edge triggering if you're using multiple threads, since it could minimize lock contention (and avoid some locks altogether, depending on your design).

– Myst
Oct 31 '18 at 1:37


c linux sockets


1 Answer

Is it OK to loop over recv() until one of those conditions is met?

Probably not, at least not for production-quality code. As you suggested, the problem with looping until you get the full message is that it leaves your thread at the mercy of the client -- if a client decides to only send part of the message and then wait for a long time (or even forever) without sending the last part, then your thread will be blocked (or looping) indefinitely and unable to serve any other purpose -- usually not what you want.

What if a client sends a bogus message length

Then you're in trouble, although if you've chosen a maximum message size you can detect obviously bogus message lengths that exceed it, and defend yourself by e.g. forcibly closing the connection.
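As a rough illustration of that defence, here is a minimal sketch of validating a length prefix before trusting it. The wire format (a 4-byte big-endian length header), the MAX_MSG_LEN limit and the function name parse_length_prefix are all assumptions made for the example; the question does not specify any of them.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed protocol limit; pick whatever your application can tolerate. */
#define MAX_MSG_LEN 65536u

/* Decode a 4-byte big-endian length prefix (assumed wire format).
 * Returns 0 and stores the length in *out_len if it is within the limit,
 * or -1 if the advertised length exceeds MAX_MSG_LEN. */
static int parse_length_prefix(const unsigned char hdr[4], uint32_t *out_len)
{
    assert(hdr != NULL && out_len != NULL);

    uint32_t len = ((uint32_t)hdr[0] << 24) | ((uint32_t)hdr[1] << 16) |
                   ((uint32_t)hdr[2] << 8)  |  (uint32_t)hdr[3];

    if (len > MAX_MSG_LEN)
        return -1;  /* bogus or hostile length: caller should close the socket */

    *out_len = len;
    return 0;
}
```

A caller would run this as soon as the four header bytes have arrived, and close the connection on -1 instead of trying to receive the advertised number of bytes.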

or there is packet loss?

If there is a reasonably small amount of packet loss, the TCP layer will automatically retransmit the data, so your program won't notice the difference (other than the message officially "arriving" a bit later than it otherwise would have). If there is really bad packet loss (e.g. someone pulled the Ethernet cable out of the wall for 5 minutes), then the rest of the message might be delayed for several minutes or more (until connectivity recovers, or the TCP layer gives up and closes the TCP connection), trapping your thread in the loop.

So what is the industrial-grade, evil-client-and-awful-network-proof solution to this dilemma, so that your server can remain responsive to other clients even when a particular client is not behaving itself?

The answer is this: don't depend on receiving the entire message all at once. Instead, you need to set up a simple state-machine for each client, such that you can recv() as many (or as few) bytes from that client's TCP socket as it cares to send to you at any particular time, and save those bytes to a local (per-client) buffer that is associated with that client, and then go back to your normal event loop even though you haven't received the entire message yet. Keep careful track of how many valid received-bytes-of-data you currently have on-hand from each client, and after each recv() call has returned, check to see if the associated per-client incoming-data-buffer contains an entire message yet, or not -- if it does, parse the message, act on it, then remove it from the buffer. Lather, rinse, and repeat.
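The per-client buffering described above might look roughly like the following sketch. It assumes newline-delimited messages and a fixed per-message limit; struct client, client_fill, client_pop_message and CLIENT_BUF_SIZE are hypothetical names invented for this example, and MSG_DONTWAIT is used so that recv() never blocks even if the descriptor was not put into non-blocking mode.

```c
#include <assert.h>
#include <errno.h>
#include <string.h>
#include <sys/socket.h>
#include <sys/types.h>

#define CLIENT_BUF_SIZE 4096  /* assumed maximum message size, delimiter included */

enum fill_result { FILL_OK, FILL_AGAIN, FILL_CLOSED, FILL_ERROR, FILL_OVERFLOW };

/* Hypothetical per-connection state: bytes received so far that do not
 * yet form a complete message. */
struct client {
    int    fd;
    size_t used;
    char   buf[CLIENT_BUF_SIZE];
};

/* Do one non-blocking recv() into the client's buffer and return at once.
 * Call client_pop_message() until it returns 0 before calling this again;
 * under that rule a full buffer means no delimiter arrived within the limit. */
static enum fill_result client_fill(struct client *c)
{
    assert(c->used <= sizeof c->buf);
    if (c->used == sizeof c->buf)
        return FILL_OVERFLOW;

    ssize_t n = recv(c->fd, c->buf + c->used, sizeof c->buf - c->used,
                     MSG_DONTWAIT);
    if (n > 0) {
        c->used += (size_t)n;
        return FILL_OK;
    }
    if (n == 0)
        return FILL_CLOSED;               /* peer closed the connection */
    if (errno == EAGAIN || errno == EWOULDBLOCK || errno == EINTR)
        return FILL_AGAIN;                /* nothing to read right now */
    return FILL_ERROR;
}

/* If a complete '\n'-terminated message is buffered, copy it into out
 * (NUL-terminated, without the '\n'), remove it from the buffer and return 1.
 * Return 0 if the buffer holds only a partial message. */
static int client_pop_message(struct client *c, char out[CLIENT_BUF_SIZE])
{
    char *nl = memchr(c->buf, '\n', c->used);
    if (nl == NULL)
        return 0;

    size_t len = (size_t)(nl - c->buf);
    memcpy(out, c->buf, len);
    out[len] = '\0';

    c->used -= len + 1;                   /* drop the message and its '\n' */
    memmove(c->buf, nl + 1, c->used);     /* keep any partial next message */
    return 1;
}
```

In an epoll loop, a readable event for a client would trigger one client_fill() call followed by client_pop_message() in a loop; FILL_CLOSED, FILL_ERROR and FILL_OVERFLOW would all lead to closing that client's socket, while FILL_AGAIN simply returns to the event loop. With edge-triggered epoll the fill-then-pop step would need to repeat until FILL_AGAIN.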

edited Nov 16 '18 at 21:50

Russell


answered Oct 31 '18 at 3:05

Jeremy Friesner

