What is AsRef<Path> + ToString>?

LegionMammal978 · April 30, 2022, 1:26pm

I guess the idea is that an OsStr is a platform-defined superset of valid UTF-8. This is an arbitrary byte string in the case of Unix, and UTF-8 + surrogate codepoints (also known as WTF-8) in the case of Windows.

If you have a str, call str::encode_utf16() to get the code units. If you have an OsStr on Windows, call OsStrExt::encode_wide() to get the code units. Collect either of these iterators into your fixed-size working buffer of choice.

In fact, RFC 2295 (os_str_pattern) aims to extend the API surface of OsStr, moving its Windows representation from WTF-8 to OMG-WTF-8 in the process. Incidentally, a recent issue regarding OsStr notes the same thing that you do, that &str to &OsStr must always work.

Yes, WTF-8 is a strict superset of UTF-8, so all valid UTF-8 byte sequences are also valid WTF-8.

You know, you might be right there. Slice::from_str() depends on Slice and Wtf8 having the same layout:

github.com

rust-lang/rust/blob/013fbc61877c8b1ca964274f171bd79952247fc3/library/std/src/sys/windows/os_str.rs#L153-L155


      
          pub fn from_str(s: &str) -> &Slice {
              unsafe { mem::transmute(Wtf8::from_str(s)) }
          }

Topic		Replies	Views
Best practices for string argument types help	8	5342	March 7, 2022
When to use AsRef<T> vs &T help	7	10612	January 12, 2023
Passing AsRef<str> parameters to function help	5	2085	March 2, 2021
What is the purpose of AsRef? help	6	2920	August 29, 2019
How to call a closure with AsRef<T> type? help	4	74	October 21, 2024

What is AsRef<Path> + ToString>?

Related topics