CheapClone vs ExpensiveClone?

zeroexcuses · July 2, 2019, 10:55am

I know that I can define my own traits -- but I am wondering if there is an idiomatic way to solve this problem. I also can't rely on Copy, as it involves Drop / destructors.
One of the great things about Rust is that it is so explicit with regards fo how much resources something takes.
However, when I see .clone(), it's not obvious to me whether it's cheap or expensive -- am I just cloning a few Rc's -- is these some O(n) Vec behind this? Not explicit when I look at the code statically.
Is there any idiomatic solution to this? This sounds silly, but I almost want to see distinctions of .this_is_constant_time_clone() vs .there_is_a_vec_here_clone() vs .probably_dont_want_to_do_this_clone()

17cupsofcoffee · July 2, 2019, 11:15am

One pattern I've seen is to clone 'Rc-like' types using a more explicit syntax:

let bar = foo.clone(); // no
let bar = Rc::clone(&foo); // yes

This makes it a bit clearer that you're cloning a pointer type, not the actual underlying data. There's a Clippy lint for this, but it's off by default since it's a stylistic choice rather than a correctness thing.

L0uisc · July 2, 2019, 11:53am

Indeed, the book also recommends this: read the paragraph just above the linked heading.

ExpHP · July 2, 2019, 1:14pm

This only helps so much, though. I can understand the frustration when a struct contains Rcs in it. Oftentimes you may have to look at a struct's definition to determine how expensive it is.

My code base currently has a type that can contain up to 30 GB of data. At some point it used to look like

struct Eigenbasis3(Vec<Ket3>);

I tried to remove all clones of it, but there was one that I just couldn't get rid of. So I finally changed it to

struct Eigenbasis3(Arc<[Ket3]>);

and had to document at the remaining clone site that we aren't actually copying the data, just for my sanity's sake.

Yandros · July 2, 2019, 1:20pm

You can also create you own custom trait for things like this:

pub(in crate) // or you can seal the trait
trait RefCounted : Clone {
    #[inline]
    fn inc_refcount (self: &'_ Self) -> Self
    {
        self.clone()
    }
}

impl<T : ?Sized> RefCounted for ::std::rc::Rc<T> {}
impl<T : ?Sized> RefCounted for ::std::sync::Arc<T> {}

and then use x.inc_refcount()

17cupsofcoffee · July 2, 2019, 2:01pm

I totally agree - my main project at the minute is a game engine, and there's no great way of exposing the fact that "hey you can clone Texture pretty much for free please don't tie yourself in knots trying to pass around references" other than adding it to the docs.

cuviper · July 2, 2019, 2:10pm

In this extreme, it probably shouldn't implement Clone at all, but rather just have a regular method that makes the cost clear.

zeroexcuses · July 2, 2019, 8:11pm

Would the general Rust community, when reading my code, be angry if I did the following:

For O(1) cost clone, pass argument to function BY REFERENCE -- function calls .clone() on its own.
For expensive clone, pass the argument as a CLONED argument.

I.e. something like:

pub struct CheapToCopyObj {};
pub struct ExpensiveToCopyObj {};

blahblah(&cheap_to_copy); // function itself does clone()
blah2blah(expensive_to_copy.clone()); // the calling function does the clone()

Does this make any sense, or just stupid/silly looking?

scottmcm · July 2, 2019, 10:35pm

Generally, if a function always needs to clone an input, it should just take ownership instead of taking a reference, regardless of whether the clone method is cheap.

zeroexcuses · July 2, 2019, 10:38pm

This might sound heretical -- what is the rationale behind this? I see a far greater distinction between "constant time clone vs huge time clone" than "pass by ref vs ownership"

RustyYato · July 2, 2019, 10:42pm

Because if the user already has a value that they aren't going to use anymore, then they still have to pay the cost of cloning. If this matters depends on context, but in the vast majority of cases it is better to pass in a value rather than force a clone.

scottmcm · July 3, 2019, 12:33am

If you take a &str and .to_owned() it immediately, it means that you're forcing a copy on the caller that they can do nothing about. If you take a String instead, then they might be able to move it in instead, and if they can't, the cost is properly attributed in the profile to the caller.

zeroexcuses · July 3, 2019, 1:08am

To the best of my knowledge, &str -> String is NOT O(1) ... so we wouldn't do this. I'm suggesting pass by ref + having called-function-clone only in situations where the clone() is guaranteed to be O(1).

kornel · July 3, 2019, 10:59am

O(1) may still be larger than move. Cheap clone can still be more work than no clone.

For example, if you have a function that takes fn foo(Arc<T>) by move, then the caller may do foo(existing_instance) without touching the refcount.

baumanj · July 3, 2019, 10:49pm

If cloning the type is so cheap you’re willing to do it unconditionally, why not just implement Copy and benefit from simpler semantics?

leudz · July 3, 2019, 10:56pm

Copy is a memcopy, most of the time that's not what you want in your Clone.

RustyYato · July 3, 2019, 10:56pm

In some cases you can't implement Copy, but you can implement Clone

zeroexcuses · July 3, 2019, 11:14pm

AFAIK, a single object can't implement both Copy and Drop.

zeroexcuses · July 3, 2019, 11:16pm

After reading all this, it seems the simplest approach is to just add a new trait CheapClone, then add a procedural macro where a struct can derive CheapClone iff all members are:

primitives (i.e. have copy)
Rc or Arc
im_rc:: some immutable data structure
already implement CheapClone

zeroexcuses · July 3, 2019, 11:17pm

FastClone probably sounds better than CheapClone.

Topic		Replies	Views
About retained ownership and `.clone()` _vs._ `{Ar,R}c::clone(&`?	13	1547	January 3, 2022
Why does Cell require Copy instead of Clone? help	12	6828	January 12, 2023
#[derive err_if_clone_is_not_O(1)]?	3	380	January 2, 2022
Starting to learn rust - review request help	6	475	December 1, 2019
How expensive is Rc<RefCell<...>> + .borrow_mut()?	5	1773	June 26, 2019

CheapClone vs ExpensiveClone?

Related Topics