
Network Reliability Engineer

Richmond, Virginia

Job Description:

About Us:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities, and shareholders every day.

One of the keys to driving Responsible Growth is being a great place to work for our teammates around the world. We’re devoted to being a diverse and inclusive workplace for everyone. We hire individuals with a broad range of backgrounds and experiences and invest heavily in our teammates and their families by offering competitive benefits to support their physical, emotional, and financial well-being.

Bank of America believes both in the importance of working together and offering flexibility to our employees. We use a multi-faceted approach for flexibility, depending on the various roles in our organization.

Working at Bank of America will give you a great career with opportunities to learn, grow and make an impact, along with the power to make a difference. Join us!

Position Summary

As we embark on a journey to transform our Network Operations Center, we are looking for a highly skilled Network Reliability Engineer to join us. We are a global team across the USA, the UK, India, and Singapore, made up of people from varied backgrounds who each bring unique skill sets and perspectives. The team is responsible for building a suite of observability tools and developing our self-healing capabilities, while working closely with other members of the Network Services team to ensure that one of the largest network infrastructures in the world is highly available, resilient, and secure.


Responsibilities

  • Build and maintain network monitoring, orchestration, and automation solutions, including automated inventory reconciliation and remediation, workflow automation, automated network configuration validation, automated network health monitoring, and automated alert handling and incident remediation.
  • Monitor the performance of our network infrastructure and develop solutions to address any issues.
  • Perform regular network infrastructure audits to ensure compliance with best practices and industry standards.
  • Collaborate with other teams to troubleshoot and resolve network-related issues.
  • Build services with an API-driven approach so that network tools integrate seamlessly with other network-related services and are easy for end users to consume. Contribute to the API economy.
  • Identify opportunities to automate repetitive tasks and help enhance quality of internal processes. Automate everything possible.
  • Collaborate with other teams to design and implement tools that will help automate end-to-end processes within network infrastructure.
  • Develop automated test suites and maintain clear documentation of solutions developed.
  • Develop and implement build and release pipelines, with personal accountability for managing deployment schedules, issues, risks, and impediments.
  • Work with product owners and stakeholders to prioritize backlogs and deliver solutions through environments and into production.
  • Lead estimation efforts and formalize release plans, implementation schedules, and dependencies.
  • Track infrastructure delivery and dependencies to implementation.
  • Communicate implementation issues, delays, and mitigation plans.
  • Innovate to improve future processes and deployments.
  • Interface and communicate with the project team, management, and stakeholders.
  • Ensure the correctness of network inventory across tools by developing useful reports on inventory gaps, and help network teams identify and remediate gaps quickly with the help of insightful dashboards.
  • Ensure that network device configurations comply with defined standards and best practices by using automated configuration validation scripts. Proactively identify vulnerabilities in the environment and work with stakeholders to remediate them.

Required Skills

  • Strong programming skills with a minimum of 5 years of hands-on Python experience
  • Hands-on experience with automation and orchestration tools such as Ansible, Itential, or similar
  • Hands-on experience with network monitoring tools
  • Ability to build API-based services
  • Strong understanding of network domain fundamentals and good knowledge of network asset and configuration management processes
  • Good understanding of the Software Development Life Cycle (SDLC) and experience using Agile methodologies and tools such as Bitbucket, JIRA, and Jenkins
  • Analytical and problem-solving skills needed to manage multiple factors on a project simultaneously
  • Excellent communication skills (verbal and written)
  • Education: Bachelor’s Degree in Information Technology, Engineering or Computer Science is highly preferred
  • Experience in application development or network infrastructure design (preferred)


1st shift (United States of America)

Hours Per Week: 



Full time


Manages People: No

Travel: No