Clarification on the `str::pattern::SearchStep`

arkrde · October 31, 2022, 3:54pm

I was going thrrough the Searcher api and came across this.

I was curious about the following line: "Note that there might be more than one Reject between two Matches, there is no requirement for them to be combined into one."

My confusion is this: if there are >1 Rejects between two Matches, what would that be ? In other words, the enum SearchStep contains only two variants: Match and Reject. In case a specific span in between two Matches is not a Reject, which is indicated by >1 Rejects, is there any state that specific span could occupy other than a Match ?

Cerber-Ursi · October 31, 2022, 5:16pm

If I understand correctly, this simply means that this outcome is possible (in pseudocode):

let (m1, m2, m3, m4) = (
    searcher.next(),
    searcher.next(), 
    searcher.next(),
    searcher.next()
)
assert!(matches!((m1, m2, m3, m4), (
    SearchStep::Match(..),
    SearchStep::Reject(..),
    SearchStep::Reject(..),
    SearchStep::Match(..)
);

I.e., if you got a Reject, you can't assume that the next call will not yield Reject too.

arkrde · October 31, 2022, 6:21pm

Ok. Thanks for your reply. I think I have understood it. I think the wordings are a bit nuanced, and correct.

I was assuming that the results of each next call would produce contiguous spans. That assumption is not correct. That is the only possible explanation. In other words, the start and end points of the spans m1, m2... need not be contiguous. They can jumping all around, depending on the algorithm in use. For a normal case, like Boyer Moore, or KMP, it's quite likely that they would be contiguous. However, the specs are a bit more generic than that, which is great !

system · January 29, 2023, 6:21pm

This topic was automatically closed 90 days after the last reply. We invite you to open a new topic if you have further questions or comments.

Topic		Replies	Views
Compiler says match appears to be inexhaustive? help	3	358	July 29, 2021
[Feature Request] Allow multiple `..` per pattern	5	503	December 6, 2021
[Solved] Match on u8, '0' to '9'	5	566	January 12, 2023
Blog post: Mixing matching, mutation, and moves in Rust	2	998	January 12, 2023
No warning on overlapping match patterns help	3	611	September 28, 2020

Clarification on the `str::pattern::SearchStep`

Related Topics