Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Quicksearch often does not find correct string composed of two words #1486

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
shimizukawa opened this issue Jan 3, 2015 · 8 comments
Closed

Comments

@shimizukawa
Copy link
Member

Steps to reproduce:

  1. Go to http://sphinx-doc.org/index.html
  2. Try searching "Output formats", "Hierarchical structure" or "Automatic indices" (this strings of two words are present right there at index page, formatted bold).
  3. Search will not find any of them, only single words, scattered thru the docs.

@shimizukawa
Copy link
Member Author

From Viacheslav Kobylinskyi on 2014-06-11 14:34:28+00:00

Issue version corrected (1.2.2 from 1.2).

@shimizukawa
Copy link
Member Author

From Viacheslav Kobylinskyi on 2014-07-08 12:22:15+00:00

Has anyone else had this issue? Is this a bug, or am i just doing something wrong?

@stefanzweig
Copy link

I have this issue for a very long time. From the javascript behind the search it searches the index made by sphinx itself. If there is no entries in the index the search return no results.

I come here from google search this symptom. :) I am seeking a solution, too.

@lakshmi-kannan
Copy link

Just to add, hyphentaed words like what-in-the-world aren't indexed either. Search returns nothing for hyphenated searches. I am seeing the same behavior as you for two words.

@davidfraser
Copy link

Also, it seems that the Japanese search does not have a js_stemmer defined, and so doesn't do word separation on the submitted searches...

@Lingnik
Copy link

Lingnik commented Aug 30, 2017

@shibukawa @shimizukawa Just curious, did the snowballstemmer work improve this? I am trying to understand stemmer limitations to see if switching to PyStemmer from PorterStemmer will improve our results, or if other improvements are required.

@shibukawa
Copy link
Contributor

  • snowballstemmer: stemming algorithm collection (including porter stemmer) for C/Java/JavaScript/Pure Python
  • PyStemmer: libsnowball wrapper for Python
  • PorterStemmer: One of the algorithms of stemming for English and its Python implementation

snowball stemmer includes two algorithms for English, but Porter stemmer is a default one.
Even if you select PyStemmer, internal logic is as same as PorterStemmer for English.

@zcorpan
Copy link

zcorpan commented Sep 9, 2019

Two consecutive words seem to work now. However, searching for words with hyphen seems to still be an issue: web-platform-tests/wpt#18943

I found #2818 (from 2016) which claims

Sample regex above allows users to search for strings containing these punctuation characters: \, /, :, ., and -.

@tk0miya would that PR fix the issue with hyphenated words?

@AA-Turner AA-Turner added this to the some future version milestone Sep 29, 2022
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Feb 6, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

8 participants