Company Description

Visa is a world leader in payments and technology, with over 259 billion payment transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose - to uplift everyone, everywhere by being the best way to pay and be paid.

Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.

The Platform Products Technology group at Visa is one team that works towards next-gen payments and believes in its slogan, "It's Everywhere You Want to Be", making payments accessible everywhere and for everyone. This group builds technology for the payment ecosystem that improves the lives of millions of people around the world. The desired candidate will be part of this journey and will contribute to achieving these goals. This role is in the Site Reliability Engineering (SRE) team, which focuses on digital products from a reliability, availability, performance, and efficiency perspective.

Responsibilities:
Engage with product, architects, developers, Certification, Project Management, Operations & Infrastructure teams from the start of the SDLC phase.
Become a subject matter expert for the assigned product verticals.
Analyze complex systems from a reliability and resilience perspective.
Run the production environment by monitoring availability and taking a holistic view of system health.
Use ELK, Grafana, and Splunk to monitor application-specific logs, visualize metrics, and create dashboards and alerts.
Understand the end-to-end product topology from an infrastructure and application perspective.
Build and design automation scripts for manual processes.
Build and design test scripts using ML/Python for API-based products.
Identify sources of instability in large-scale distributed systems and drive operational excellence.
Dive deep into every issue that occurs and own it completely through end-to-end closure.
Perform functional analysis of products by gathering and analyzing metrics from both operating systems and applications to assist in performance tuning and fault finding - integration/operational challenges.
Fix code bugs in production and recommend architectural improvements during issue/incident analysis.
Work closely with development and product teams to suggest new features and enhancements based on live issues.
Drive down the burden of toil with tooling and automation to achieve operational efficiency and a smoother customer experience.
Apply AI techniques to improve system reliability and efficiency.
Provide technical consultancy for monitoring, incident, and problem management.
Lead technical bridges and interact with both technical staff and management during the incident and change management process.
Participate in on-call support.
Engage with tech and non-tech partners on a regular basis to analyze functional and technical solutions in depth.
Understand new changes in production systems and assess their risk from an application perspective to drive reliability and availability.
Have some level of network engineering understanding to assist in incident/issue triaging.
Provide guidance and technical expertise to junior team members.
Excellent problem-solving skills and attention to detail.
Strong communication skills and ability to work effectively in a team.
Collaborate with the team to define SRE practices and identify areas for improvement.

This is a hybrid position. Hybrid employees can alternate time between remote and office work. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Qualifications

Basic Qualifications: