Skip to content

Keyword repeat #4072

Answered by fulmicoton
mustafa0x asked this question in Q&A
Nov 2, 2023 · 1 comments · 1 reply
Discussion options

You must be logged in to vote

Same thing as for lucene. Tokenizer can emit token with the a notion of position and token length.
https://blog.mikemccandless.com/2012/04/lucenes-tokenstreams-are-actually.html

But you would have to modify the stemmer tokenizer to keep the original token to have that work.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@mustafa0x
Comment options

Answer selected by mustafa0x
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants