Warp logger slow. Why?

Hi guys! If I create a logger with pretty_env_logger and use it to write an error log whenever the response code is 500, my service's performance drops from 11,500 req/sec to 8,000 req/sec. Do you have any advice? This is my code:

let logger = warp::filters::log::custom(|info| {
    // Only log when the handler returned a 500.
    if info.status() == http::StatusCode::INTERNAL_SERVER_ERROR {
        log::error!(
            "log! {} {} - elapsed: {:?}",
            info.method(),
            info.path(),
            info.elapsed()
        );
    }
});
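(For context, the filter gets attached to the routes with .with(...), roughly like this; the route, port, and tokio setup below are placeholders and only the logging closure mirrors my real code:)

use warp::Filter;

#[tokio::main]
async fn main() {
    pretty_env_logger::init();

    // Same idea as above: only emit a log line for 500 responses.
    let logger = warp::filters::log::custom(|info| {
        if info.status() == warp::http::StatusCode::INTERNAL_SERVER_ERROR {
            log::error!(
                "log! {} {} - elapsed: {:?}",
                info.method(),
                info.path(),
                info.elapsed()
            );
        }
    });

    // Hypothetical route; the real project has its own routes.
    let hello = warp::path!("hello").map(|| "world");

    // The logging filter wraps the route(s) via .with(...).
    warp::serve(hello.with(logger))
        .run(([127, 0, 0, 1], 3030))
        .await;
}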

Does your performance only drop that much when all of those requests are internal server errors, or does the performance also drop just from having that condition in your code, even though none of the requests actually logs anything?

Actually, in my test I only get 200 OK responses, and the performance still dropped. But I have been trying to figure it out for a few hours, and one silly possibility is my laptop's battery power mode. So maybe it's just something different. I need to get a server to test it out. But thank you!

All in all, I do not know what the reason for my performance drop is. Now it varies between 6,000 and 9,000 req/sec, but early this morning, with the same code, it was 11,000 req/sec. So strange. But I guess it's not related to the logging.

Out of curiosity, how does the performance change if you use tracing instead of the pretty_env_logger logger?

I need to check. But without any logging or tracing at all, the performance still varies within the same range. And that's really annoying, because yesterday the same wrk test, using the same Docker containers, consistently showed 10,500-11,500 req/sec.

@alice I found something interesting. After my checks I completely removed all of the logging code; I even removed the pretty_env_logger and log crates from my Cargo.toml. So I removed everything related to logging from my code. And the performance was the same, dropped to ~8,000 req/sec.

Then I did a test: I used the same binary, but I started it with this:

RUST_LOG=trace ./target/release/api

And the performance went back up to 11,000 req/sec. Same code, same binary; there is no logging in the binary at all, not even any logging crates in my code. And somehow this still modifies the behavior of the crates my project depends on.

Here is my code: https://github.com/gardenzilla/api/blob/master/src/main.rs

So now, using this trick, I do not know how or why, but warp has the same amazing 11,000 req/sec performance.

Or maybe not... I checked again, and without that trick I now consistently get 11K req/sec. The only strange thing is that if I run the binary directly from bash, I get 11K req/sec, but if I put it in a Docker container, using the same port and so on, the same wrk test drops to 8K req/sec. Yet I saw 11K performance running it in a Docker container yesterday. I have no idea what's going on, but a 20-25% performance difference should mean something, and Docker should not burn that 20%.
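(For reference, the wrk test I keep mentioning is an invocation along these lines; the thread count, connection count, duration, and URL here are placeholders, not the exact values I used:)

wrk -t4 -c100 -d30s http://127.0.0.1:3030/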

To rule out Docker, can you run your test with the container's log driver set to none (--log-driver none)?

OK, I have done it. Setting the Docker log driver to none helped a bit, but it is still around 9K req/sec, instead of the 11K I get when I launch it without Docker.

Here is my docker script:

service="api"
service_name="api_service"
docker run -d \
        --network host \
        -v $BASEDIR/data/${service}_space:/usr/local/bin/data \
        --name $service_name \
        --env-file ./ENV.list \
        --log-driver none \
        --restart always \
        $service_name

So now, with 9K req/sec, host networking, and the none log driver, how much does the throughput change when you enable RUST_LOG=trace?

Do you see a similar drop when you run your test outside of Docker, with stderr redirected to /dev/null?

OK, so RUST_LOG=trace means 1.5K req/sec, so very, very low. RUST_LOG=error means 9K req/sec, and during the test there are no errors at all.

I have tried redirecting stderr to /dev/null, and the performance dropped a bit, to 9.5K-10.5K req/sec, based on 7 tests.

I used the following command:

RUST_LOG=error ./target/release/api 2> /dev/null

What does this mean?

To be consistent with the Docker environment, try it with RUST_LOG=trace ... so you're comparing apples to apples.

Yep, but after getting the 1.5K result with trace, I switched to RUST_LOG=error, and that gave me 9K. So it's apples to apples now.

To tell the truth, I find it a bit difficult to follow which number was produced in which scenario. :slight_smile: I just wanted to make sure that the drop you see is not related to Docker and the way you're viewing/collecting log messages. We still want to generate as many log messages as we can (trace) while discarding all of them, to measure how expensive it is to construct them.
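Concretely, that means running the same binary from earlier in the thread like this, so every trace message is constructed but immediately thrown away:

RUST_LOG=trace ./target/release/api 2> /dev/null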

There's definitely a performance hit. Depending on what you need, you can try to:

  • use tracing instead of logging and see if its sink implementation performs better than your logging backend (see the sketch after this list)
  • limit tracing to your program, ignoring traces from the dependencies (RUST_LOG=api=trace)
  • if you're in control of the logging, some log messages might be too heavyweight or produce too much data, so you can try to limit their size
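As a rough sketch of the first two points, assuming the tracing-subscriber crate with its env-filter feature (this is not the code from the linked repository):

use tracing_subscriber::EnvFilter;

fn main() {
    // Honors RUST_LOG, so e.g. RUST_LOG=api=trace keeps trace output
    // limited to your own crate and skips the dependencies.
    tracing_subscriber::fmt()
        .with_env_filter(EnvFilter::from_default_env())
        .init();

    // ... build and run the warp server as before ...
}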

And could you help me understand why the stderr redirection causes a performance drop?

I'm afraid not. I wouldn't expect the redirection to reduce performance. Perhaps it's the environment that's changing: background processes, etc. It's difficult to say without running multiple tests over a longer period of time.

