Senior Site Reliability Engineer – CloudOps

תאור התפקיד

We are looking for a devops ninja to join our globally expanding site reliability engineering team and establish a 24/7 production reliability routine.
Responsibilities
  • work with cutting edge technology in the cloud and hardware computing space
  • install, configure, update and troubleshoot services such as web servers, relational and nonrelational databases, cm and csm tools, application servers, engage with docker and kubernetes and much more
  • monitor, troubleshoot and resolve production grade issues, troubleshoot and configure system and applicative aspects of our saas platform and applications
  • collaborate in a “devops” environment where you will work closely with our global support, solution engineering, r&d, qa and devops teams worldwide
  • maintaining a knowledge base of known issues and solutions
 

דרישות התפקיד

Desired Skills and Experience
  • Excellent problem solving skills with a desire to take on responsibility
  • Excellent written and verbal communication skills with ability to communicate technical issues to both technical and nontechnical audiences
  • A deep understanding and familiarity with:
    • Linux – CentOS, Ubuntu, Other
    • Networking knowledge – Firewalls, VPNs, proxies & Load balancers
    • Web/Application servers – Apache, Nginx, Tomcat, JVM environments
    • Monitoring and logging systems familiarity – experience with tools like Graphite, LogicMonitor, Logentries, SumoLogic, ELK stack – Advantage
    • Virtualization and containers – Xen, KVM, Qemu, Docker etc.
    • Storage, any of the following – NFS, SANs, RAID, lvm
  • 2-3 years of relevant work experience, hands-on Linux experience and preferably using languages like Shell/bash, Ruby, Python, Java, Perl
  • Background in NOC / SOC operations – great advantage
  • Experience using and administering software version control systems (SVN, Git etc.) – advantage
  • Familiarity with Atlassian Suite (Confluence, JIRA, BitBucket etc.) – advantage
  • Knowledge of the following is a big plus: Artifactory & Bintray, Build tools, CI servers
  • Knowledge with Docker & Kubernetes – great advantage
  • Experience with public clouds (AWS, GCP etc) – Great advantage
  • Ability to work independently, learn quickly and be proacti.ve
  • Ability to join on call 24×7 roster (follow the Sun model)
  • Ability to work off routine hours occasionally.