How to wrap a non-object-safe Trait in an object-safe one?

Qqwy · October 23, 2019, 6:47am

I originally encountered this issue when attempting to create a Rayon ParallelIterator that a flat_map operation was applied to a dynamic number of times see this post on the Rayon issue tracker.

For standard, serialized Iterators the following works:

fn fancy<Input, Output, Item>(input: Input, depth: usize) -> Output
where
    Input : IntoIterator<Item = Item>,
    Output : std::iter::FromIterator<Item>,
    Item : Copy + std::ops::Add<Output = Item>,
{
    let mut iter: Box<dyn Iterator<Item = Item>> = Box::new(input.into_iter());
    for _ in 0..depth {
        iter = Box::new(iter.flat_map(|x| vec![x, x + x].into_iter()));
    }
    iter.collect()
}

When working with Rayon's ParallelIterators, however, things get iffy: ParallelIterator requires types that implement it to be Sized, making them non-object safe and thus impossible to pass around as trait object. Thus the following straight translation does not work:

use rayon::prelude::*;
fn fancy2<Input, Output, Item>(input: Input, depth: usize) -> Output
where
    Input : IntoParallelIterator<Item = Item>,
    Output : FromParallelIterator<Item>,
    Item : Copy + std::ops::Add<Output = Item> + Send,
{
    let mut iter: Box<dyn rayon::iter::ParallelIterator<Item = Item>> = Box::new(input.into_par_iter());
    for _ in 0..depth {
        iter = Box::new(iter.flat_map(|x| vec![x, x + x].into_par_iter()));
    }
    iter.collect()
}

My question is: How can we wrap a non-object-safe trait like ParallelIterator in a new trait that is object-safe?

cuviper · October 24, 2019, 11:04pm

I don't think it's generally possible.

If the possible types are known at compile time, you can use an enum wrapper. Rayon already implements its traits for the Either type, with Left and Right variants. If you need more than two, you can create your own enum for this without much trouble.

That won't work for your 0..depth dynamic nesting though. You might be able to implement a custom ParallelIterator type that approximates this, with an UnindexedProducer that splits your x and x + x parts. There might be other ways to refactor the code to do what you want too.

Qqwy · October 25, 2019, 9:03pm

Is there a way to make a wrapper struct/enum that can be wrapped in a Box<dyn ...>, or will it effectively get 'tainted' by the Sized requirement of its field that is an instance of ParallelIterator?

alice · October 25, 2019, 9:18pm

This is equivalent:

fn make_vec<I>(x: I, out: &mut [I])
where
    I: Copy + std::ops::Add<Output = I>,
{
    if out.len() == 1 {
        out[0] = x;
    } else {
        let (left, right) = out.split_at_mut(out.len() / 2);
        make_vec(x, left);
        make_vec(x + x, right)
    }
}
fn fancy2<Input, Output, Item>(input: Input, depth: usize) -> Output
where
    Input : IntoParallelIterator<Item = Item>,
    Output : FromParallelIterator<Item>,
    Item : Copy + std::ops::Add<Output = Item> + Send,
{
    input.into_par_iter()
        .flat_map(|x| {
            let mut vec = vec![x; 1 << depth];
            make_vec(x, &mut vec);
            vec
        })
        .collect()
}

except that it isn't as parallel as you might want.

alice · October 25, 2019, 9:39pm

You can also create a struct and manually implement ParallelIterator on it, for example see this. This will be completely parallel. Though I'm not sure it'd be faster..

cuviper · October 25, 2019, 10:05pm

You could use rayon::join on the pair of recursive make_vec calls to squeeze out more parallelism. I like your custom ParallelIterator too.

However, I hesitate to focus on this particular example. I bet the CPU time will be dominated by allocation and data movement, rather than computation, which isn't a great use of parallelism in the first place. More realistic scenarios might be helped by seeing your solutions, but they'll probably need something custom.

Qqwy · October 26, 2019, 1:44pm

@alice Thank you! That solution, where you create a new concrete struct that implements ParallelIterator and because it is a single type we do not need any dynamic dispatch, seems very clean. Hereby marking it as a solution.

@cuviper The real application I am working on does L-system expansion, which mostly means that the resulting vector in the flat_map will be much longer most of the time.

As for speed, we'll probably only be able to know for sure by benchmarking .

alice · October 26, 2019, 1:54pm

To increase the efficiency, you could probably change the if to check if depth is less than, say, 10 and use the recursive function i posted above to take care of the shorter cases without doing it in parallel, as very small tasks are usually faster to do in one thread.

system · January 24, 2020, 2:03pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Using Rayon with trait objects tutorials	3	2927	March 3, 2021
Rayon: how implement into_par_iter() and par_iter() help	6	3249	May 25, 2020
Problem with understanding Rayon IntoParallelRefIterator help	3	1537	January 12, 2023
Rayon flat_map not matching regular flat_map help	4	1282	January 12, 2023
Calling a trait object within a Rayon par_iter() closure? help	4	874	November 10, 2021

How to wrap a non-object-safe Trait in an object-safe one?

Related topics