Evaluate different ElasticSearch features #19
Comments
Misspellings can be handled using 'fuzziness'. For example, Asprin would (in the current index) list Aspirin as the second hit using the default 'AUTO' setting for fuzziness. See https://www.elastic.co/guide/en/elasticsearch/guide/current/fuzzy-query.html for more details.
Setting the fuzziness to '1' makes Aspirin the top hit for Asprin.
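A minimal sketch of the two request bodies being compared, assuming a `match` query against a field called `name` (the field name and the helper function are hypothetical; the real index may use a different field or query type). With 'AUTO', Elasticsearch allows up to two edits for terms longer than five characters, which may be why the six-letter Asprin pulls in more candidates than a fixed fuzziness of 1.

```python
def fuzzy_match_query(field, text, fuzziness="AUTO"):
    """Build a match-query body with fuzzy matching enabled.

    `field` is a placeholder for whichever indexed field holds the names.
    """
    return {
        "query": {
            "match": {
                field: {
                    "query": text,
                    "fuzziness": fuzziness,  # "AUTO" or a fixed edit distance
                }
            }
        }
    }


# Default behaviour: edit distance chosen from the term length.
auto_body = fuzzy_match_query("name", "Asprin")

# Fixed edit distance of 1, as tried in the comment above.
strict_body = fuzzy_match_query("name", "Asprin", fuzziness=1)
```

Either dictionary can be sent as the JSON body of a `_search` request, so the two settings can be compared side by side on the same index.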
Is there a list of life sciences/pharmaceutical/chemistry synonyms?
I suppose there are different options. Not sure about common typos.
For stemming we need to index a different field at load time that contains the stemmed version, and then include it in the searched fields via the API.
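One possible way to set this up, sketched under assumptions: the stemmed copy is defined as a multi-field named `stemmed` using the built-in `english` analyzer, and the search uses `multi_match` over both the original and stemmed fields. The field name `name`, the sub-field name, and both helper functions are hypothetical, and the `text` type assumes a reasonably recent Elasticsearch version (older versions use `string`).

```python
def stemmed_field_mapping(field, analyzer="english"):
    """Mapping fragment: index `field` as-is plus a stemmed sub-field."""
    return {
        "properties": {
            field: {
                "type": "text",
                "fields": {
                    # Populated at index time from the same source value.
                    "stemmed": {"type": "text", "analyzer": analyzer}
                },
            }
        }
    }


def search_with_stemming(field, text):
    """Query body that searches both the exact and the stemmed field."""
    return {
        "query": {
            "multi_match": {
                "query": text,
                "fields": [field, f"{field}.stemmed"],
                "type": "most_fields",  # combine scores from both fields
            }
        }
    }


mapping = stemmed_field_mapping("name")
body = search_with_stemming("name", "tablets")
```

Because the mapping has to exist before documents are loaded, changing it means reindexing, which matches the "at load time" constraint noted above.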
So this is about things like stemming, synonyms, misspellings...
I guess many of these are nice to have, but the more we allow "non-exact" matches, the more false positives we might introduce in the results.
So we'll need to experiment to find a good balance.