Sr DevOps Engineer Hybrid

at UMass Med School
Published September 28, 2023
Location Shrewsbury, MA
Category Default  
Job Type Full-time  


Sr DevOps Engineer Hybrid

Minimum Salary


Job Location

1 week ago (9/19/2023 11:34 AM)

Requisition Number


# of Openings


Posted Date





Under the supervision of the DevOps Manager or designee, the Senior DevOps Engineer is responsible for serving as a Subject Matter Expert (SME) for the deployment and administration of traditional and cloud-based infrastructure and resources in a multi-cloud environment. They will maintain services once they are live by measuring and monitoring availability, latency, and overall system health. They will scale systems sustainably through mechanisms such as automation, infrastructure as code using IaaC tools, and evolve systems by pushing for changes that improve reliability and velocity. The build and release procedures must be developed and implemented to guarantee that releases are properly packaged and delivered to development, test, staging, and production environments. They will oversee source code management, as well as design/implement best practices and standards for version control and configuration management. To successfully deliver client solutions, the senior DevOps Engineer will bridge the gap between the development, infrastructure, and network teams. Additionally, this SME will be responsible for bringing others up to speed on cloud and other DevOps related services, pipelines, and cloud management on a regular basis.


Engineer, automate, and orchestrate system provisioning and maintenance processes using modern cloud tools and technologies. Design and implement scalable and resilient solutions to solve complex problems and priority issues.
Actively troubleshoot complex problems and provide thoughtful solutions and root cause analysis with technical and non-technical teams.
Make design recommendations for on-premises, cloud native, and hybrid cloud application integrations solutions incorporating industry best practices.
Implement CICD standards, design and coordinate source control versioning strategies, and ensure compliance to source control, versioning, branching, and test automation policies, as well as manage and configure project dashboards.
Manage application and database monitoring services utilizing UMass Chan's standard monitoring tools.
Manage, deploy, and monitor cloud solutions including applications, cloud native services, cloud infrastructure, and container services.
Design and Implement IaC standard templates for cloud and non-cloud solutions, ensuring availability, scalability, and security and compliance.
Manage IaC tools such as Terraform Cloud, Ansible, and Vault Services
Automate day two operations utilizing programming and scripting technologies (e.g., Ansible, PowerShell, Bash, HCL, Python, and JavaScript).
Manage Linux and Windows hosting platforms within on-premises and cloud environments.
Manage highly available and scalable on-premises and cloud solutions.
Optimize all DevOps aspects of on-premises and cloud service delivery.
Manage and deploy sensitive data such as PHI, PII.
Educate teams on the implementation of new cloud-based initiatives, providing associated training as required.
Participate in operational support to ensure service reliability.
Participate in on-call and deployment rotation.
Perform other duties as required.


Bachelor's degree in Computer Science, IT related field, or equivalent experience.
5+ years of DevOps, CloudOps, or SRE work experience.
4+ years of demonstrated, progressively responsible experience implementing and managing continuous integration, build management, automated deployments, release engineering, and/or configuration management.
5+ years of IT in software development and/or technology experience with at least 3+ years designing, building cloud services, and providing operational support for cloud.
1+ years of experience with Terraform or equivalent IaC platform.
3+ years of hands-on experience and an understanding of best practices with Azure and AWS cloud services, including infrastructure, security, networking, database, code pipeline and cost optimization.
Ability to code/script using either Ansible, PowerShell, Bash, HCL, Python, and JavaScript or similar.
Understanding and prior experience designing and implementing infrastructure as code IaC templates for on-premises and cloud solutions.
Understanding and prior experience with operating cloud services, including monitoring/alerting, cost reporting and optimization, cloud security, and problem resolution.
Understanding and prior experience with software configuration management systems and/or source code version control systems, Subversion, Git, TFS, and Azure DevOps.
Understanding and prior experience with Unix/Linux and Microsoft Servers, Apache/Tomcat, IIS, and Weblogic webservers.
Understanding and prior experience with on-premises and cloud database technologies including PaaS database solutions.
Expertise to propose continuous integration, source control, and automated deployment solutions/strategies, implement the solutions, establish policies, quickly solve problems within these environments, and instruct team members in the use of these solutions.
Understanding and prior experience in development languages such as C#, Java, JavaScript, React, Angular, or similar.
Understanding of SDLC best practices and experience with code management.
Understanding of Agile, Scrum, or SAFe development methodologies.
Solid interpersonal skills, with ability to work well with people of all levels of information technology expertise.
Proven interest in IT and passion in furthering the mission of UMass Chan.
Experience in providing technical expertise for hybrid application and database infrastructure.
Experience in providing technical expertise on the automation of the deployment, scaling, and overall management of the software applications in a hybrid and multi-cloud environment.
Experience in building and guiding the design and development of reusable Infrastructure as Code (IaC) components; create IaC test plans and execute testing across the lifecycle.
Deep knowledge and Experience in Analyzing data from production monitoring tools to determine areas of improvement, then design and implement the necessary changes
Experience in collaborating with peers to assure all designs meet the University's standards, compliance, and security requirements.
Experience collaborating with multiple teams in an agile environment.
Advanced knowledge in automation and orchestration frameworks relating to the arranging and coordination of tasks that are automated to create a consolidated workflow or process.
Advanced knowledge in application hosting solutions including (IIS, Tomcat, Apache) .
Experience with Microsoft technology, including, but not limited to: Windows Server and Client OS, Active Directory, Remote Desktop Services, SQL, IIS, DNS, and DHCP.
Experience with serverless and microservice design including (AWS Lambda, SQS, SNS, Kinesis).
Advanced knowledge of network engineering infrastructure (e.g., protocols, routing, switching, firewalls) as well as operational understanding of IP-based computer networking within LAN and WAN environments including implementations in a public cloud environment.
Knowledge of developing and maintaining proper technical and user documentation in a consistent and structured manner.
Experience in best practices and IT operations.
Experience in Agile practices.
Solid interpersonal skills with ability to work well with people of all levels of information technology expertise and comfort in varied organizational relationships.
Ability to learn new technical skills/services quickly through comprehensive research and analysis, and the ability to produce clear and concise recommendations and guidance.