Last modified: 2014-11-13 23:09:48 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T75357, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 73357 - deployment-parsoidcache02 is not using betalabs puppetmaster
deployment-parsoidcache02 is not using betalabs puppetmaster
Status: RESOLVED FIXED
Product: Wikimedia Labs
Classification: Unclassified
deployment-prep (beta) (Other open bugs)
unspecified
All All
: Normal normal
: ---
Assigned To: Antoine "hashar" Musso (WMF)
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2014-11-13 11:28 UTC by Yuvi Panda
Modified: 2014-11-13 23:09 UTC (History)
10 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Yuvi Panda 2014-11-13 11:28:29 UTC
It is still using virt1000, which is the general labs puppetmaster. I don't think this is intentional :) This also prevents testing puppet patches on betalabs on this machine.
Comment 1 Antoine "hashar" Musso (WMF) 2014-11-13 22:47:14 UTC
Indeed:

deployment-parsoidcache02:~$ grep server /etc/puppet/puppet.conf
server = virt1000.wikimedia.org

Although the wikitech page shows that the instance has role::puppet::self with puppetmaster=deployment-salt.eqiad.wmflabs

puppet fails to run though due to an error with the varnish ganglia monitoring. Yuvi has been working on it.

I have removed the role::cache::parsoid class from the instance and ran again puppet. That unlocked puppet and:

$ grep server /etc/puppet/puppet.conf
server = deployment-salt.eqiad.wmflabs
$

The next run of puppet complains about certificate exchange with the deployment-salt puppet master:

Info: Creating a new SSL key for i-000005bf.eqiad.wmflabs
Info: Caching certificate for ca
Info: csr_attributes file loading from /etc/puppet/csr_attributes.yaml
Info: Creating a new SSL certificate request for i-000005bf.eqiad.wmflabs
Info: Certificate Request fingerprint (SHA256): ...
Info: Caching certificate for ca
Exiting; no certificate found and waitforcert is disabled


On deployment-salt:

# puppet ca list
  i-000005bf.eqiad.wmflabs  (SHA256) ...
#

# puppet ca sign i-000005bf.eqiad.wmflabs
Notice: Signed certificate request for i-000005bf.eqiad.wmflabs
...
#

# salt-key --list=unaccepted
Unaccepted Keys:
i-000005bf.eqiad.wmflabs
# 

# salt-key --accept=i-000005bf.eqiad.wmflabs
The following keys are going to be accepted:
Unaccepted Keys:
i-000005bf.eqiad.wmflabs
Proceed? [n/Y] Y
Key for minion i-000005bf.eqiad.wmflabs accepted.
#

Reran puppet. Starting the service of salt-minion fails, looking at the upstart log in /var/log/upstart/salt-minion.log there is:

  [CRITICAL] The Salt Master server's public key did not authenticate!
  The master public key can be found at:
  /etc/salt/pki/minion/minion_master.pub

# rm /etc/salt/pki/minion/minion_master.pub
# rm /var/run/salt-minion.pid
# killall salt-minion
# /etc/init.d/salt-minion start
# sleep 10; /etc/init.d/salt-minion status
 * salt-minion is running
#

\O/

Another puppet run confirms everything is fine. I then reapplied the role::cache::parsoid class which finished the configuration.


The whole process is explained at https://wikitech.wikimedia.org/wiki/Nova_Resource:Deployment-prep/How_code_is_updated#Converting_a_host_to_use_local_puppetmaster_and_salt_master


My assumption is that when the instance has been created, puppet failed and never applied the switch to beta cluster puppet and salt masters.


NOTE: I have no clue whether the service actually work, but the underlying infrastructure serving it seems to be fine.
Comment 2 Antoine "hashar" Musso (WMF) 2014-11-13 22:53:18 UTC
I have upgraded varnish while at it (apt-get upgrade  + restart of the varnish* services):

 Unpacking varnish (3.0.6plus~x-wm3) over (3.0.5plus~x-wm7trusty1) ...

On production, the varnish upgrade is done manually, so we have to do it manually as well on beta cluster.
Comment 3 Antoine "hashar" Musso (WMF) 2014-11-13 23:09:48 UTC
deployment-parsoidcache02 is now using betalabs puppetmaster and salt master. Varnish has been upgraded.

I have NOT verified whether the parsoid cache works as expected. I guess that would need another bug report.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links