skillindiajobs
Hyderabad Jobs
Banglore Jobs
Chennai Jobs
Delhi Jobs
Ahmedabad Jobs
Mumbai Jobs
Pune Jobs
Vijayawada Jobs
Gurgaon Jobs
Noida Jobs
Oil & Gas Jobs
Banking Jobs
Construction Jobs
Top Management Jobs
IT - Software Jobs
Medical Healthcare Jobs
Purchase / Logistics Jobs
Sales
Ajax Jobs
Designing Jobs
ASP .NET Jobs
Java Jobs
MySQL Jobs
Sap hr Jobs
Software Testing Jobs
Html Jobs
IT Jobs
Logistics Jobs
Customer Service Jobs
Airport Jobs
Banking Jobs
Driver Jobs
Part Time Jobs
Civil Engineering Jobs
Accountant Jobs
Safety Officer Jobs
Nursing Jobs
Civil Engineering Jobs
Hospitality Jobs
Part Time Jobs
Security Jobs
Finance Jobs
Marketing Jobs
Shipping Jobs
Real Estate Jobs
Telecom Jobs

Senior Site Reliability Engineer

2.00 to 6.00 Years   Mumbai City   24 Dec, 2020
Job LocationMumbai City
EducationNot Mentioned
SalaryNot Disclosed
IndustryInternet / E-Commerce
Functional AreaGeneral / Other Software
EmploymentTypeFull-time

Job Description

DescriptionZycus is hiring Seasoned Application Troubleshooters, with a Technical experience of about minimum 2-6 years, and a hands-on experience in End to end Production support, Application monitoring and java based application troubleshooting.About UsZycus is a leading global provider of A.I. powered Source-to-Pay suite for procurement, finance, and AP organizations. Our comprehensive product portfolio includes eProcurement, eInvoicing, Spend Analysis, eSourcing, Contract Management, Supplier Management, Financial Savings Management, Project Management, Request Management, Supplier Network, Insight Studio, and Merlin A.I. Suite.The Merlin A.I. Suite is a unique platform of pre-packaged intelligent BOTs to automate run-of-the-mill procurement and A.P. tasks with intelligent and predictive suggestions. It enables teams to improve productivity through optimal efforts, enhance accuracy with minimal human intervention, and focus on strategic activities. Driven by Artificial Intelligence, Zycus Merlin A.I. BOTs introduce cutting edge technologies in procurement operations, making it truly autonomous and cognitive.Our spirit of innovation and passion to help organizations create a more significant business impact is reflected among the hundreds of procurement solution deployments that we have undertaken over the years.

  • Resource trending, capacity planning of overall Infrastructure through automation tools like grafana, logstash, prometheus.
  • Enable SRE team interaction / integration with other stakeholders (Development and Infrastructure)
  • Lead discussion and negotiate on Error Budgets, change management, down time management with appropriate stakeholders
  • Ability to solution and deliver all of the Operations/SRE services and processes including managing L2 Environment Support
  • You will serve as a leader and coach of a team of L2 Engineers responsible for ongoing operation and monitoring of our AWS Cloud infrastructure, working closely with the development teams.
  • Analyze reliability challenges and develop automated solutions for auto-healing and incident resolution
  • Work with development teams to improve applications operational features for faster MTTD and MTTR and auto recovery
  • Continuously analyze the current Site Reliability capabilities and identify areas of improvements
  • You will drive reliability and supportability aspects of Cloud service, including change management, triage of customer escalations, remediation plans, Devops Ansible playbooks and automations.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall distributed system health.
, Requirements
  • Ability to solution & deliver all of Operations/SRE services & processes including managing L2 Environment Support
  • 2-6years of overall environment support experience with 2+ years of experience as support / SRE engineer
  • Experience in implementing Monitoring solutions using APM tools( Example: AppDynamics, Graylog, Dynatrace, Datadog etc.) set up and test proactive monitoring alerts
  • Have a broad knowledge profile and really excel in some areas, such as HTTP/TLS, DNS, networking or containerization
  • Comfortable with large scale production systems and technologies, for example load balancing, monitoring, distributed systems, microservices, and configuration management.
Process Skills
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
  • Interest in designing, analyzing and troubleshooting large-scale distributed systems.
Behavioral Skills
  • Practice sustainable incident response and blameless postmortems.
  • Proven ability in developing relationships with stakeholders, communicating project/program status, and understanding detailed business requirements across multiple project initiatives
  • This role requires candidates to work in rotational shifts. 24*7 support

Keyskills :
androidload balancingtime managementautomation toolsalgorithmsjavaspend analysisproblem solvingproduct portfolioacademicsstrong communication skillscapacity planningacpchange management

Senior Site Reliability Engineer Related Jobs

© 2020 Skillindia All Rights Reserved