Last modified: 2013-10-07 21:03:22 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T56918, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 54918 - Optimize CirrusSearch in place reindexing


Summary:	Optimize CirrusSearch in place reindexing

Status:	RESOLVED FIXED

Product:	MediaWiki extensions
Classification:	Unclassified
Component:	CirrusSearch (Other open bugs)
Version:	unspecified
Hardware:	All All

Importance:	High normal (vote)
Target Milestone:	---
Assigned To:	Nik Everett

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:
	Show dependency tree / graph

Reported:	2013-10-03 13:07 UTC by Nik Everett
Modified:	2013-10-07 21:03 UTC (History)
CC List:	2 users (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description Nik Everett 2013-10-03 13:07:58 UTC

Right now CirrusSearch's in place reindexing is pretty slow.  We're actually able to overwhelm our small Elasticsearch cluster using a single threaded single process in place reindex.  So, we should be more efficient about these reindexes.

I see two angles of attack:
1.  Optimize the client config: store.throttle.max_bytes_per_sec and its brothers.
2.  Optimize the process of in place reindexing:
2a.  Populate the new index with no shard replicas - just masters - then add replicas.
2b.  Raise the refresh_interval on the index to something big or turn it off all together.
2c.  Other stuff?


The optimizations in 2 are _probably_ not required for initial index builds as MediaWiki is our bottleneck there.

Comment 1 Nik Everett 2013-10-03 15:15:04 UTC

We could also increase performance by throwing more machines at the problem and using more shards.  When we get our bigger cluster we'll probably do that as well.

Comment 2 Gerrit Notification Bot 2013-10-07 16:29:14 UTC

Change 88131 had a related patch set uploaded by Manybubbles:
Optimize in place reindexing.

https://gerrit.wikimedia.org/r/88131

Comment 3 Gerrit Notification Bot 2013-10-07 20:58:06 UTC

Change 88131 merged by jenkins-bot:
Optimize in place reindexing.

https://gerrit.wikimedia.org/r/88131

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links