Friday, October 12, 2018

Account Creation and Password Resets Temporarily Down, October 12

Due to ongoing maintenance, account creation and password resets are down today.

At roughly 6:15PM, there will be a brief NFS downtime as we attempt to fix the issue.

Thanks for flying OCF!

Scheduled maintenance on the night of 2018-10-12

The OCF is anticipating a short period of intermittent service unavailability while we perform some additional maintenance on our hypervisors as a follow-up to last week's maintenance event. Specifically, we intend to migrate NFS to our new fileserver, reinstall our hypervisors onto new disks, and possibly migrate our mirrors server to new hardware. We are scheduling this event for low-utilization hours at night to minimize any disruption to our users.

Thanks for flying OCF, and send us an email if you have any questions!

Monday, October 08, 2018

IPv6 Connectivity Issues on October 8

Starting last night, the OCF has been experiencing some connectivity issues to our public SSH server over IPv6. If you are having trouble logging in to ssh.ocf.berkeley.edu, please try using IPv4 to connect. To do this, you can add -4 to your SSH connection command, like so:

ssh -4 <OCF username>@ssh.ocf.berkeley.edu

If your SSH client does not support the -4 flag, you can also connect directly to our server's IPv4 address. To do this, just connect to `169.229.226.25` instead of `ssh.ocf.berkeley.edu`.
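If you would rather not type -4 every time, OpenSSH clients can also be told to use IPv4 for this host in the client config file. This is a minimal sketch, assuming an OpenSSH client that reads `~/.ssh/config`; other SSH clients may need a different setting:

```
# ~/.ssh/config (assumes an OpenSSH client)
# Force IPv4 for connections to the OCF public SSH server only.
Host ssh.ocf.berkeley.edu
    AddressFamily inet
```

With this entry in place, a plain `ssh <OCF username>@ssh.ocf.berkeley.edu` connects over IPv4. You can remove the entry once IPv6 connectivity is restored.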

Some other services such as MySQL may also experience issues over IPv6. If neither IPv6 nor IPv4 is working for you, please let us know.

Thank you for being patient while we restore full connectivity.

UPDATE 2018-10-09 1:45AM: IPv6 connectivity should be restored to all our user-facing services.

Wednesday, October 03, 2018

Downtime on October 6

The OCF will be experiencing downtime due to scheduled maintenance on October 6th, from 9PM to 12AM. Hosted websites will experience downtime as we briefly reboot the servers to apply critical security updates. Once our servers are rebooted, users accessing our public login server and our apphosting server will not be able to write files while we move NFS to a new host. This read-only period will last no longer than 15 minutes, and all operations should behave as normal by midnight at the latest.

Thanks for flying with the OCF!

8:53 Update: We've powered off the servers to do networking hardware and kernel updates.
11:03 Update: We've decided not to do the NFS migration tonight, but networking updates have been performed and services should be back up in the next hour.
11:42 Update: Most of our public servers should be back now, apart from our public mirrors and our own website (vhosts should be fine).
12:17 Update: Our website is back, but our software mirrors are not back yet.
12:51 Update: Everything should be back and working now except our HPC control server, which we are still debugging.
1:35 Update: This is the all clear, everything should be working now! Feel free to let us know by emailing help@ocf.berkeley.edu if you notice anything wrong.