Quicksearch often does not find correct string composed of two words #1486

shimizukawa · 2015-01-03T10:03:42Z

Steps to reproduce:

Go to http://sphinx-doc.org/index.html
Try searching "Output formats", "Hierarchical structure" or "Automatic indices" (this strings of two words are present right there at index page, formatted bold).
Search will not find any of them, only single words, scattered thru the docs.

Bitbucket: https://bitbucket.org/birkenfeld/sphinx/issue/1486
Originally reported by: Viacheslav Kobylinskyi
Originally created at: 2014-06-11T16:32:58.442

shimizukawa · 2015-01-03T10:03:43Z

From Viacheslav Kobylinskyi on 2014-06-11 14:34:28+00:00

Issue version corrected (1.2.2 from 1.2).

shimizukawa · 2015-01-03T10:03:44Z

From Viacheslav Kobylinskyi on 2014-07-08 12:22:15+00:00

Has anyone else had this issue? Is this a bug, or am i just doing something wrong?

stefanzweig · 2015-04-13T06:56:20Z

I have this issue for a very long time. From the javascript behind the search it searches the index made by sphinx itself. If there is no entries in the index the search return no results.

I come here from google search this symptom. :) I am seeking a solution, too.

lakshmi-kannan · 2015-07-16T21:55:24Z

Just to add, hyphentaed words like what-in-the-world aren't indexed either. Search returns nothing for hyphenated searches. I am seeing the same behavior as you for two words.

davidfraser · 2015-08-12T10:01:00Z

Also, it seems that the Japanese search does not have a js_stemmer defined, and so doesn't do word separation on the submitted searches...

Lingnik · 2017-08-30T00:37:44Z

@shibukawa @shimizukawa Just curious, did the snowballstemmer work improve this? I am trying to understand stemmer limitations to see if switching to PyStemmer from PorterStemmer will improve our results, or if other improvements are required.

shibukawa · 2017-08-30T02:28:29Z

snowballstemmer: stemming algorithm collection (including porter stemmer) for C/Java/JavaScript/Pure Python
PyStemmer: libsnowball wrapper for Python
PorterStemmer: One of the algorithms of stemming for English and its Python implementation

snowball stemmer includes two algorithms for English, but Porter stemmer is a default one.
Even if you select PyStemmer, internal logic is as same as PorterStemmer for English.

zcorpan · 2019-09-09T20:25:09Z

Two consecutive words seem to work now. However, searching for words with hyphen seems to still be an issue: web-platform-tests/wpt#18943

I found #2818 (from 2016) which claims

Sample regex above allows users to search for strings containing these punctuation characters: \, /, :, ., and -.

@tk0miya would that PR fix the issue with hyphenated words?

shimizukawa added type:bug html search labels Jan 3, 2015

zcorpan mentioned this issue Sep 9, 2019

Searching for "wpt-pr-bot" in docs doesn't find the page mentioning it web-platform-tests/wpt#18943

Open

modelmat mentioned this issue Oct 19, 2019

Search Improvements wpilibsuite/frc-docs#279

Closed

AA-Turner added this to the some future version milestone Sep 29, 2022

AA-Turner closed this as completed Jan 8, 2025

github-actions bot locked as resolved and limited conversation to collaborators Feb 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Quicksearch often does not find correct string composed of two words #1486

Quicksearch often does not find correct string composed of two words #1486

shimizukawa commented Jan 3, 2015

shimizukawa commented Jan 3, 2015

Uh oh!

shimizukawa commented Jan 3, 2015

Uh oh!

stefanzweig commented Apr 13, 2015

Uh oh!

lakshmi-kannan commented Jul 16, 2015

Uh oh!

davidfraser commented Aug 12, 2015

Uh oh!

Lingnik commented Aug 30, 2017

Uh oh!

shibukawa commented Aug 30, 2017

Uh oh!

zcorpan commented Sep 9, 2019

Uh oh!

Uh oh!

Quicksearch often does not find correct string composed of two words #1486

Quicksearch often does not find correct string composed of two words #1486

Comments

shimizukawa commented Jan 3, 2015

shimizukawa commented Jan 3, 2015

Uh oh!

shimizukawa commented Jan 3, 2015

Uh oh!

stefanzweig commented Apr 13, 2015

Uh oh!

lakshmi-kannan commented Jul 16, 2015

Uh oh!

davidfraser commented Aug 12, 2015

Uh oh!

Lingnik commented Aug 30, 2017

Uh oh!

shibukawa commented Aug 30, 2017

Uh oh!

zcorpan commented Sep 9, 2019

Uh oh!