2019-07-31 §
18:07 <bstorm_> drained tools-worker-1015, 1005, 1003 and 1017 to rebalance load
17:41 <bstorm_> drained tools-worker-1025 and 1026 to rebalance load
17:32 <bstorm_> drained tools-worker-1028 to rebalance load
17:29 <bstorm_> drained tools-worker-1008 to rebalance load
17:23 <bstorm_> drained tools-worker-1021 to rebalance load
17:17 <bstorm_> drained tools-worker-1007 to rebalance load
17:07 <bstorm_> drained tools-worker-1004 to rebalance load
16:27 <andrewbogott> moving tools-static-12 to cloudvirt1018
15:33 <bstorm_> T228573 spinning up 5 worker nodes for kubernetes cluster (tools-worker-1035 to 1039)
2019-07-27 §
23:00 <zhuyifei1999_> a past probably related ticket: T194859
22:57 <zhuyifei1999_> maintain-kubeusers seems stuck. Traceback: https://phabricator.wikimedia.org/P8812, core dump: /root/core.17898. Restarting
2019-07-26 §
17:39 <bstorm_> restarted maintain-kubeusers because it was suspiciously tardy and quiet
17:14 <bstorm_> drained tools-worker-1013.tools.eqiad.wmflabs to rebalance load
17:09 <bstorm_> draining tools-worker-1020.tools.eqiad.wmflabs to rebalance load
16:32 <bstorm_> created tools-worker-1034 - T228573
15:57 <bstorm_> created tools-worker-1032 and 1033 - T228573
15:54 <bstorm_> created tools-worker-1031 - T228573
2019-07-25 §
22:01 <bstorm_> T228573 created tools-worker-1030
21:22 <jeh> rebooting tools-worker-1016 unresponsive
2019-07-24 §
10:14 <arturo> reallocating tools-puppetmaster-01 from cloudvirt1027 to cloudvirt1028 (T227539)
10:12 <arturo> reallocating tools-docker-registry-04 from cloudvirt1027 to cloudvirt1028 (T227539)
2019-07-22 §
18:39 <bstorm_> repooled tools-sgeexec-0905 after reboot
18:33 <bstorm_> depooled tools-sgeexec-0905 because it's acting kind of weird and not responding to prometheus
18:32 <bstorm_> repooled tools-sgewebgrid-lighttpd-0902 after restarting the grid-exec service
18:28 <bstorm_> depooled tools-sgewebgrid-lighttpd-0902 to find out why it is behaving weird
17:55 <bstorm_> draining tools-worker-1023 since it is having issues
17:38 <bstorm_> Adding the prometheus servers to the ferm rules via wikitech hiera for kubelet stats T228573
2019-07-20 §
19:52 <andrewbogott> rebooting tools-worker-1023
2019-07-17 §
20:23 <andrewbogott> migrating tools-sgegrid-shadow to cloudvirt1014
2019-07-15 §
14:50 <bstorm_> cleared error state from tools-sgeexec-0911 which went offline after error from job 5190035
2019-06-25 §
09:30 <arturo> detected puppet issue in all VMs: T226480
2019-06-24 §
17:42 <andrewbogott> moving tools-sgeexec-0905 to cloudvirt1015
2019-06-17 §
14:07 <andrewbogott> moving tools-sgewebgrid-lighttpd-0903 to cloudvirt1015
13:59 <andrewbogott> moving tools-sgewebgrid-generic-0902 and tools-sgewebgrid-lighttpd-0902 to cloudvirt1015 (optimistic re: T220853)
2019-06-11 §
18:03 <bstorm_> deleted anomalous kubernetes node tools-worker-1019.eqiad.wmflabs
2019-06-05 §
18:33 <andrewbogott> repooled tools-sgeexec-0921 and tools-sgeexec-0929
18:16 <andrewbogott> depooling and moving tools-sgeexec-0921 and tools-sgeexec-0929
2019-05-30 §
13:01 <arturo> uncordon/repool tools-worker-1001/2/3. They should be fine now. I'm only leaving 1029 cordoned for testing purposes
13:01 <arturo> reboot tools-worker-1003 to clean up sssd config and let nslcd/nscd start fresh
12:47 <arturo> reboot tools-worker-1002 to clean up sssd config and let nslcd/nscd start fresh
12:42 <arturo> reboot tools-worker-1001 to clean up sssd config and let nslcd/nscd start fresh
12:35 <arturo> enable puppet in tools-worker nodes
12:29 <arturo> switch hiera setting back to classic/sudoldap for tools-worker because T224651 (T224558)
12:25 <arturo> cordon/drain tools-worker-1002 because T224651
12:23 <arturo> cordon/drain tools-worker-1001 because T224651
12:22 <arturo> cordon/drain tools-worker-1029 because T224651
12:20 <arturo> cordon/drain tools-worker-1003 because T224651
11:59 <arturo> T224558 repool tools-worker-1003 (using sssd/sudo now!)
11:23 <arturo> T224558 depool tools-worker-1003
10:48 <arturo> T224558 drop/build a VM for tools-worker-1002. It didn't like the sssd/sudo change :-(