Last modified: 2014-04-10 16:59:16 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T62151, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 60151 - CirrusSearch: Do something about queries that run scripts across a bajillion documents
CirrusSearch: Do something about queries that run scripts across a bajillion ...
Status: RESOLVED FIXED
Product: MediaWiki extensions
Classification: Unclassified
CirrusSearch (Other open bugs)
unspecified
All All
: High normal (vote)
: ---
Assigned To: Nik Everett
Elasticsearch_1.1
:
: 57113 (view as bug list)
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-01-16 21:29 UTC by Nik Everett
Modified: 2014-04-10 16:59 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Nik Everett 2014-01-16 21:29:53 UTC
Do something about queries that run scripts across a bajillion documents.  They cause load spikes and take forever.  They, unforunately, include prefix searches for very few letters.  One idea is to move all the script scoring into the rescore.
Comment 1 Nik Everett 2014-01-29 15:16:06 UTC
I don't think we can push all script scoring into the rescore until we get multiple rescores from Elasticsearch.  BoostingQuery doesn't work for combining the phrase match because it only multiplies the scores of documents that match the negative query rather then adds them.  Sad.
Comment 2 Nik Everett 2014-02-04 16:49:19 UTC
*** Bug 57113 has been marked as a duplicate of this bug. ***
Comment 3 Nik Everett 2014-02-11 16:24:07 UTC
https://gerrit.wikimedia.org/r/#/c/112695/ is going to help a ton by speeding up those queries that run script across a ton of documents.  We've also made a bunch of changes to run the rescore across a ton of documents less frequently.  This should help.  I still want Elasticsearch 1.0's multiple rescores, but this might just make it less important.
Comment 4 Nik Everett 2014-03-07 19:06:02 UTC
It looks like this wasn't merged to 1.0 as marked in the upstream bug but instead just 1.1+.  So we wait....
Comment 5 Nik Everett 2014-04-09 14:58:52 UTC
Going to implement this now.
Comment 6 Gerrit Notification Bot 2014-04-09 19:56:18 UTC
Change 124994 had a related patch set uploaded by Manybubbles:
WIP: Use multiple rescores to apply script scoring

https://gerrit.wikimedia.org/r/124994
Comment 7 Gerrit Notification Bot 2014-04-10 16:58:01 UTC
Change 124994 merged by jenkins-bot:
Use multiple rescores to apply script scoring

https://gerrit.wikimedia.org/r/124994

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links