Last modified: 2014-02-10 18:33:05 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T36518, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 34518 - Search engine is unstable, producing false "not found" result
Search engine is unstable, producing false "not found" result
Status: RESOLVED WONTFIX
Product: Wikimedia
Classification: Unclassified
lucene-search-2 (Other open bugs)
unspecified
All All
: Low normal (vote)
: ---
Assigned To: Robert Stojnic
: testme
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2012-02-19 16:12 UTC by folengo
Modified: 2014-02-10 18:33 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments
screenshot of false "not found" search result (40.87 KB, image/jpeg)
2012-02-19 16:12 UTC, folengo
Details

Description folengo 2012-02-19 16:12:43 UTC
Created attachment 10044 [details]
screenshot of false "not found" search result

The following search request is unstable :

http://ja.wikipedia.org/w/index.php?title=%E7%89%B9%E5%88%A5%3A%E6%A4%9C%E7%B4%A2&profile=default&search=%E7%A0%B4%E9%82%AA%E9%A1%95%E6%AD%A3%E9%88%94&fulltext=Search

This string is available in one article. Every now and then the search result is that nothing is found, which is a wrong result.

When the article is found, the result is displayed quite quickly (less than one second). When it is not found, the server takes a long time (about ten seconds) to process the request. 

Perhaps this is because the servers are too busy. In that case, the search engine should reply with a "sorry, we can't process your request now, try again later" error message rather than make the user believe that the searched string is absent.

Attachment : screenshot
Comment 1 Robert Stojnic 2012-02-20 21:39:07 UTC
The search rate that reaches search pool2 hosts (search6, search15) is unusually low in the last 3 days. Search6 seems to be fine, while search15 comes up with weird errors of not being able to contact itself when trying to bind RMI instances (Connection refused to host: 10.0.3.15). 

Repeating queries on search6 gives expected results, but going through ja.wikipedia.org gives results only in about 50% of the cases, as suggested in the bug report.
Comment 2 Andre Klapper 2013-03-06 14:58:33 UTC
Hi folengo,

(In reply to comment #0)
> The following search request is unstable :
> http://ja.wikipedia.org/w/index.
> php?title=%E7%89%B9%E5%88%A5%3A%E6%A4%9C%E7%B4%A2&profile=default&search=%E7%
> A0%B4%E9%82%AA%E9%A1%95%E6%AD%A3%E9%88%94&fulltext=Search

I've tried to reproduce this a few times but I always get one result instead of zero. Is this still an issue?
Comment 3 Dan Garry 2014-02-10 18:33:05 UTC
Issues such as this one should be fixed in CirrusSearch. As we're in the process of migrating over from Lucene to Cirrus, I'm marking this bug as RESOLVED WONTFIX.

If this bug persists in CirrusSearch post-migration, please don't hesitate to refile this bug in MediaWiki extensions -> CirrusSearch.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links