Grab bag of fixes #36

gwklok · 2018-03-19T05:03:11Z

Remove iPXE replace with pxelinux file served via tftpd, let the initrd load the SFS layers, cuts build time, and often worker boot time. Should result in broader compatibility i.e. this fixes broadcom gigabit ethernet issues and VirtualBox fails to boot on reset bug.
Fix typo preventing credentials for kube from getting loaded for ceph volume detach.
Wire up Prometheus to collect ceph metrics.
Make worker disk partitioning a bit more robust.

No longer chainload iPXE, lots of issues with e.g. broadcom gigabit ethernet cards, virtualbox unable to boot after VM reset etc. Use simpler tftpboot/pxelinux boot, the initrd can handle getting the operos SFS files over HTTP. Cuts good amount of time out of the build and workers boot slightly quicker.

When we encounter recycled disks: Remove all signatures from the disk. pvcreate gives up if there is linux raid metadata blocks on a partition so zero some blocks in that partition.

- Fix bug with kube credentials that prevented volume detach. - Wire up prometheus to collect ceph-mgr metrics.

rlisagor

In my branch (https://github.com/rlisagor/operos/tree/upgrade), certain files are very different - e.g. the installer scripts, start-addons, etc. So some of these changes might have to be adapted/rewritten.

rlisagor · 2018-03-19T15:50:36Z

iso/controller/airootfs/etc/systemd/scripts/ceph-mon-init

 cat /etc/ceph/ceph.conf | etcd_cmd put "cluster/$OPEROS_INSTALL_ID/ceph-config"
 /usr/bin/ceph auth get client.kube | etcd_cmd put "cluster/$OPEROS_INSTALL_ID/secret-ceph-kube-keyring"

+wait_for_unit 5 ceph-mgr@${CHOSTNAME}


I believe systemctl start already waits for the service to become active, so this shouldn't be necessary.

you're right, going to have to think of something else or go back to the sleep hammer, this works because it inserts a delay long enough between systemd starting the service and it actually becoming ready, might be able to check with commands but they usually have dozens of seconds timeouts

rlisagor · 2018-03-19T15:51:06Z

iso/controller/airootfs/etc/systemd/scripts/ceph-mon-init

+/usr/bin/ceph config-key set mgr/prometheus/server_addr ${OPEROS_CONTROLLER_IP}

+systemctl enable ceph-mgr@${CHOSTNAME}.service
+systemctl start ceph-mgr@${CHOSTNAME}.service


FYI: another way to enable and start at the same time is systemctl enable --now ceph-mgr@${CHOSTNAME}.service

rlisagor · 2018-03-19T15:52:27Z

iso/installer/airootfs/root/install/104-worker-boot.sh

 # tftp
 cat > /mnt/etc/conf.d/tftpd <<EOF
-TFTPD_ARGS="--verbose --address ${OPEROS_CONTROLLER_IP} -m /tftpboot/mapfile -u ftp --secure /tftpboot"
+TFTPD_ARGS="--verbose --address ${OPEROS_CONTROLLER_IP} -m /etc//tftpd.mapfile -u ftp --secure /boot"


/etc//tftpd.mapfile -> /etc/tftpd.mapfile ?

gwklok added 7 commits March 18, 2018 16:31

Make worker disk partitiioning a little more robust

8a85608

When we encounter recycled disks: Remove all signatures from the disk. pvcreate gives up if there is linux raid metadata blocks on a partition so zero some blocks in that partition.

Ceph setup fixes.

10acdb2

- Fix bug with kube credentials that prevented volume detach. - Wire up prometheus to collect ceph-mgr metrics.

Typo

83de809

Remove iPXE from list

f89a8c7

Whitespace and spelling

cd309c8

Rearange export

edf7fb3

rlisagor reviewed Mar 19, 2018

View reviewed changes

gwklok added 3 commits March 19, 2018 10:13

Extra slash

48c3332

Better aproach wait for the mgr&mon sockets to appear

4393dda

ISD*

a2abaf4

gwklok merged commit 4c4f604 into PaxAutoma:master Mar 19, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Grab bag of fixes #36

Grab bag of fixes #36

Uh oh!

gwklok commented Mar 19, 2018

Uh oh!

rlisagor left a comment

Uh oh!

rlisagor Mar 19, 2018

Uh oh!

gwklok Mar 19, 2018

Uh oh!

rlisagor Mar 19, 2018

Uh oh!

rlisagor Mar 19, 2018

Uh oh!

gwklok Mar 19, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Grab bag of fixes #36

Grab bag of fixes #36

Uh oh!

Conversation

gwklok commented Mar 19, 2018

Uh oh!

rlisagor left a comment

Choose a reason for hiding this comment

Uh oh!

rlisagor Mar 19, 2018

Choose a reason for hiding this comment

Uh oh!

gwklok Mar 19, 2018

Choose a reason for hiding this comment

Uh oh!

rlisagor Mar 19, 2018

Choose a reason for hiding this comment

Uh oh!

rlisagor Mar 19, 2018

Choose a reason for hiding this comment

Uh oh!

gwklok Mar 19, 2018

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants