For some reason there is no rand compatible crate that implements the Xorwow algorithm. So I decided to give it a try, and now I am here to kindly ask for a code review.
You should measure that. Compilers are often better at optimizing code than humans, and inlining is a very basic optimization. It's not as simple as "slapping inline on every function magically makes code fast". I would not add it by default.
If you want it to be widely available, yes – people generally don't want to depend on git repositories.
Another comment I have is about the surface syntax – you are using way too much vertical whitespace, you don't need to add empty lines after every closing brace. The way you are placing closing parentheses/brackets/braces is also non-idiomatic, try running your code through rustfmt.
You also seem to be extensively using for_each where it's not really warranted. This:
And make sure to measure it from another crate — like a benchmark target, not a test you write inside the library crate. Cross-crate inlining is more conditional than same-crate inlining, so in order to understand the behavior that your library’s users will see, you have to use a separate crate.
I have refactored the code and I also gave rustfmt a try. The code looks a lot cleaner now.
A quick benchmark without external dependencies didn't show much difference with or without #[inline]. I think I will let the compiler decide, at least for now.