Resource will be responsible for the following:
Drive formulation and implementation of a “never down” strategy aimed at identifying opportunities to improve the overall stability and resiliency of critical business functions and applications. Partner with infrastructure and application development teams to implement enhancements.
Ensure the design and development of new capabilities incorporate best practices that make such capabilities highly resilient and stable.
Partner with production support, infrastructure and application teams to review disaster recovery and contingency capabilities for all critical business functions, applications and individual components. Identify opportunities to optimize such capabilities, specifically recovery time and recovery point objectives, and partner with appropriate teams to implement such enhancements.
Ensure the design and development of new capabilities incorporate best practices that allow optimal recovery time and recovery point objectives to be achieved. Partner with production support, infrastructure and development organizations to ensure robust disaster recovery and contingency plans and capabilities are implemented and operationalized.
Apply extensive technical experience and skill set to drive the triaging of complex, high-impact production incidents to quickly restore service.
Partner with application and product managers to identify root cause and actions to correct complex, high-impact production problems. Also, work with those teams to identify other opportunities to improve overall production stability, including actions to mitigate the recurrence of any problem as well as opportunities to improve overall monitoring.
Socialize best practice design patterns for highly available and resilient applications with production support, infrastructure and development partners.
Function as a subject matter expert for the team on stability and resiliency.
Required Job Skills
Resource will have the following skills:
7+ years of experience in information technology
Knowledgeable in best practice design patterns aimed at highly available and resilient applications
Experience formulating and driving enterprise strategy across a large-scale organization
Previous experience as an architect working with business partners and application development teams to understand business requirements and identify technology solutions best positioned to meet such needs in a highly resilient and stable manner
Experience as a system administrator, database administrator, middleware administrator and/or network administrator. Experience in more than one of these roles is preferred.
Experience as an application developer and/or production support
Experience using advanced monitoring tools such as Splunk, AppDynamics, SiteScope, Glassbox, and NetScout
Experience troubleshooting network related incidents
Strong, courageous communicator capable of communicating effectively, verbally and via email and instant messaging, with both technical and business teams
Capable of periodically providing on-call support outside of normal working hours
Capable of working in high-pressure situations
Bachelor’s degree in business, computer science, MIS or related field
Experience working for a large cap technology company
Experience supporting/developing applications that utilize SAN and NAS storage. Any experience with Dell EMC Centera and Hitachi HCP storage is a plus.
Experience streamlining and automating processes to improve the overall efficiency and quality of the work product
Familiarity with the ITIL framework
1st shift (United States of America)
Hours Per Week: