Back to search results

Sr Site Reliability Engineer

Plano, Texas;

Job Description:

Resource will be responsible for the following:
 Drive formulation and implementation of “never down” strategy aimed at identifying opportunities to improve overall stability and resiliency of critical business functions and applications. Partner with infrastructure and application development teams to implement enhancements.
 Ensure the design and development of new capabilities incorporate best practices aimed at ensuring such capabilities are highly resilient and stable.
 Partner with production support, infrastructure and application teams to review disaster and contingency capabilities for all critical business functions, applications and individual components. Identify opportunities to optimize such capabilities, specifically recovery time and recovery point objectives, and partner with appropriate teams to implement such enhancements.
 Ensure the design and development of new capabilities incorporate best practices aimed at ensuring optimal recovery time and point objectives can be achieved. Partner with production support, infrastructure and development organizations to ensure robust disaster recovery and contingency plans and capabilities are implemented and operationalized.
 Apply extensive technical experience and skill set to drive the triaging of complex, high impact Production incidents to quickly restore service
 Partner with application and product managers to identify root cause and actions to correct complex, high impact Production problems. Also, work with those teams to identify other opportunities to improve overall Production stability, including actions to mitigate the reoccurrence of any problem as well as opportunities to improve overall monitoring.
 Socialize best practice design patterns for highly available and resilient applications with production support, infrastructure and development partners.
 Function as a subject matter expert for the team on stability and resiliency

Required Job Skills

Resource will have the following skills:
  7+ years of experience in information technology
 Knowledgeable in best practice design patterns aimed at highly available and resilient applications
 Experience formulating and driving enterprise strategy across a large-scale organization
 Previous experience as an architect working with business partners and application development teams to understand business requirements and identify technology solutions best positioned to meet such needs in a highly resilient and stable manner
 Experience as a system administrator, database administrator, middleware administrator and/or network administrator.  Ideally, experience in more than one role preferred.
 Experience as an application developer and/or production support
 Experience using advanced monitoring tools such as Splunk, AppDynamics, SiteScope, Glassbox, and NetScout
 Experience troubleshooting network related incidents
 Strong, courageous communicator capable of effectively communicating, verbally, via emails and instant messaging, to both technical and business teams
 Capable of periodically providing on call support outside of normal working hours
 Capable of working in high pressure situations

Desired Skills

Bachelor’s degree in business, computer science, MIS or related field

Experience working for a large cap technology company
 Experience supporting/development applications that utilize SAN and NAS storage.  Any experience with Dell EMC Centera and Hitachi HCP storage a plus.
 Experience leaning out and automating processes aimed at improving overall efficiency and quality of the work product
 Familiarity with the ITIL framework
 

Shift:

1st shift (United States of America)

Hours Per Week: 

40

Learn more about this role

Full time

JR-20002589

Manages People: No

Travel: No

Manager:

Talent Acquisition Contact:

Referral Bonus: