Last modified: 2013-10-07 21:03:22 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T56918, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 54918 - Optimize CirrusSearch in place reindexing
Optimize CirrusSearch in place reindexing
Status: RESOLVED FIXED
Product: MediaWiki extensions
Classification: Unclassified
CirrusSearch (Other open bugs)
unspecified
All All
: High normal (vote)
: ---
Assigned To: Nik Everett
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2013-10-03 13:07 UTC by Nik Everett
Modified: 2013-10-07 21:03 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Nik Everett 2013-10-03 13:07:58 UTC
Right now CirrusSearch's in place reindexing is pretty slow.  We're actually able to overwhelm our small Elasticsearch cluster using a single threaded single process in place reindex.  So, we should be more efficient about these reindexes.

I see two angles of attack:
1.  Optimize the client config: store.throttle.max_bytes_per_sec and its brothers.
2.  Optimize the process of in place reindexing:
2a.  Populate the new index with no shard replicas - just masters - then add replicas.
2b.  Raise the refresh_interval on the index to something big or turn it off all together.
2c.  Other stuff?


The optimizations in 2 are _probably_ not required for initial index builds as MediaWiki is our bottleneck there.
Comment 1 Nik Everett 2013-10-03 15:15:04 UTC
We could also increase performance by throwing more machines at the problem and using more shards.  When we get our bigger cluster we'll probably do that as well.
Comment 2 Gerrit Notification Bot 2013-10-07 16:29:14 UTC
Change 88131 had a related patch set uploaded by Manybubbles:
Optimize in place reindexing.

https://gerrit.wikimedia.org/r/88131
Comment 3 Gerrit Notification Bot 2013-10-07 20:58:06 UTC
Change 88131 merged by jenkins-bot:
Optimize in place reindexing.

https://gerrit.wikimedia.org/r/88131

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links