I think UTF-8 is not the worst choice. Indexing codepoints doesn't fix all potential problems either, as shown in this post.
That said, I feel all options have drawbacks when Unicode is involved.
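
To make the code point caveat concrete, here is a minimal Python sketch. It is my own illustration and not taken from the linked post; the helper name `counts` and the sample strings are made up for this example. It shows that one visible character can span several code points, so indexing by code point can still split it.

```python
import unicodedata

def counts(s):
    """Return (code points, UTF-8 bytes, UTF-16 code units) for s (hypothetical helper)."""
    return (len(s), len(s.encode("utf-8")), len(s.encode("utf-16-le")) // 2)

# "e" followed by COMBINING ACUTE ACCENT: one visible character, two code points.
decomposed = "e\u0301"
# NFC normalization folds the pair into the single precomposed code point U+00E9.
composed = unicodedata.normalize("NFC", decomposed)

# Man, woman and girl emoji joined by ZERO WIDTH JOINER: usually drawn as one glyph.
family = "\U0001F468\u200D\U0001F469\u200D\U0001F467"

print(counts(decomposed))  # (2, 3, 2)
print(counts(composed))    # (1, 2, 1)
print(counts(family))      # (5, 18, 8)

# Slicing by code point keeps the base letter and silently drops the accent.
print(decomposed[:1] == "e")  # True
```

None of the three counts matches what a reader would call "one character" in every case. Getting that right needs grapheme cluster segmentation, which Python's standard library does not provide.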