Cloud Operations/ Site Reliability Engineer

  • Next Gen Cyber LLC
  • BlackBerry - San Mateo, CA
  • Apr 29, 2018
Full time Risk Management Systems Architecture Technology R&D Systems Requirements Planning Systems Development

Job Description


  • Leads all Microsoft Server strategy, design, troubleshooting, and operations
  • Responsible for maximizing system uptime and availability, ensuring functional and performance SLAs
  • Responsible for establishing end-to-end monitoring and alerting on all critical aspects to ensure SLAs and get proactive notifications of possible issues for all systems
  • Development of an automation program for MS server virtualization and builds
  • Creation of playbooks to run when responding to alerts or incidents
  • Initiate and lead scripting and automation to streamline system updates and upgrades
  • Work with cloud operations team to resolve trouble tickets, developing and running scripts, creating SQL jobs, and troubleshooting IIS in a hosted environment
  • Works well independently and requires little or no supervision
  • Microsoft Certification (MCSE) or equivalent practical knowledge managing at least 200 servers
  • Working knowledge of Windows Server 2008-2016 and SQL 2008-2016 require
  • Working knowledge of VMware; VM management and provisioning; Vblock technology a plus
  • Expertise in Microsoft Deployment Toolkit or similar automation technologies
  • Understanding of Microsoft security concepts and Patching Automation
  • Experience with Application Monitoring (APM)
  • Demonstrated technical experience in 2 or more of the following areas

LAN/WAN networking, load balancing, and firewalls

NAS/SAN concepts and administration

SQL database administration

Advanced knowledge of IIS

Writing and developing scripts

Working experience with deployment automation frameworks (Chef, Puppet, Salt, Ansible)

  • Excellent troubleshooting and analytics skills
  • Familiar or certified with ITIL
  • Experience in AWS Cloud Service a plus
  • US Citizen

Job Type: Full-time

Job Location:

  • San Mateo, CA

Required experience:

  • ITIL: 5 years
  • SQL 2008-2016: 4 years
  • AWS Cloud: 4 years
  • Puppet: 4 years
  • Windows Server 2008-2016: 4 years
  • Chef: 4 years

Required language:

  • English

Required licenses or certifications:

  • ITIL v3
  • MCSE
  • authorized to work in the United States as a citizen no green card
  • AWS Certifications