Job Description
Site Reliability Support Engineer
As a key member of RMS Cloud Operations, you will be responsible for building, deploying, and supporting all RMS hosted products and SaaS offerings. This position works with RMS clients, Software Development, QA, Product Management, and other operational teams to engineer and manage the Continuous Delivery Platform.
Essential Responsibilities:
* Provide 24x7 operational support (L3) when on-call
* Perform complex troubleshooting of multiple applications
* Monitor and improve operations performance, security, and resource usage
* Respond to inquiries and incidents
* Submit and implement change requests
* Share operations support knowledge by delivering training and by writing technical documentation and KB articles on the team wiki
* Build and deploy software and application services, and automate their deployment and testing
Technical Skills:
* Experience in Windows system administration, troubleshooting, and application deployment (IIS, MSI, Active Directory, MS SQL, etc.)
* Experience in Linux system administration and troubleshooting
* Experience in scripting: batch, PowerShell, shell, and SQL; Python/Perl desirable
* Understanding of network elements such as firewalls, load balancers, DNS, NAT, TLS/SSL
* Experience working with monitoring tools such as Zenoss or Prometheus/Grafana
* Experience working with Jenkins; Ansible/Terraform desirable
* Experience with the Azure and AWS cloud computing platforms
Business Skills:
* Able and willing to learn new technologies and tools independently
* Patience and dedication when working with clients, vendors, and internal teams
* Strong analytical, critical, and creative problem-solving skills
* Able to manage multiple tasks simultaneously and adapt when priorities change
* Proven customer service skills and a professional demeanor
* Able to drive results and set priorities independently
Education:
BS degree in Computer Science preferred, or related experience in a technical industry