Senior Site Reliability Engineer, Wikimedia Cloud Services

nairobi cityKE

full-time

bachelor

3 months ago02/22/202403/23/2024

- closed

You'll work remotely with a full-time distributed team, with members spread between Europe and North America, and need to overlap (UTC-5 to UTC+1) working hours. Some examples of the type of work you'll be doing include:

  • Expanding the capabilities of our toolforge platform
  • Expanding and refining our storage offerings, backed by Ceph and NFS
  • Scaling our team via automation
  • Providing a curated Jupyter notebook environment for data analysis and queries of Wikimedia data
  • Upgrading, customizing, and adding new services like terraform support, and Database as a service to Openstack
  • Developing new webservices for our technical community, like Quarry and PAWS
  • And the backlog has even more details!

You are responsible for:

  • Helping to create a repeatable Openstack cloud deployment
  • Implementing a network topology using Open vSwitch, providing per tenant networking, load balancing, and IPv6
  • Performing day-to-day operational tasks on Wikimedia's Cloud Services infrastructure (deployment, maintenance, configuration, troubleshooting). Develop and support automation tools and processes in support of these tasks.
  • Participating in on-call rotation and support in a 24x7 environment

Skills and Experience:

  • Comfortable working and thriving within a Linux ecosystem
  • Understand networking in the physical domain of switches and servers
  • Software development skills in at least one of the following languages: Python, Go, Javascript, and/or Ruby
  • B.S. or M.S. in Computer Science or related field or equivalent in related work experience.

Qualities that are important to us:

  • Share our values, appreciate our code of conduct, support our team norms, and work in accordance with all three
  • Strong English language skills and ability to work independently, as an effective part of a globally distributed team
  • Support of our users (volunteer and staff developers) using our service offerings
  • Passionate about the value of learning and growing together

Additionally, we'd love it if you have:

  • Utilized configuration management tools such as Puppet, Ansible, Chef, and SaltStack
  • Used Kubernetes, Docker Swarm, Mesos, or similar container orchestration platforms
  • Operated an elastic computing environment such as OpenStack or Cloudstack
  • Operated a multi-tenant capable software defined network (SDN)
  • Experience in serverless computing environments
  • Linux systems troubleshooting and debugging skills
  • Interest in open source software projects and communities

Interested and qualified? Go to Wikimedia Foundation on boards.greenhouse.io to apply

Elevolt does not charge job seekers any fees for job applications or consideration. Do not make any payments without doing your due diligence. If you think this posting is not genuine, please flag it below orcontact us

Sorry, this job is closed and is no longer accepting applications.

View Other Jobs
Wikimedia Foundation

Wikimedia Foundation

The Wikimedia Foundation is the nonprofit that hosts Wikipedia and our other free knowledge projects. We want to make it easier for everyone to share what they know. To do this, we keep Wikipedia and ...