skillindiajobs
Hyderabad Jobs
Banglore Jobs
Chennai Jobs
Delhi Jobs
Ahmedabad Jobs
Mumbai Jobs
Pune Jobs
Vijayawada Jobs
Gurgaon Jobs
Noida Jobs
Oil & Gas Jobs
Banking Jobs
Construction Jobs
Top Management Jobs
IT - Software Jobs
Medical Healthcare Jobs
Purchase / Logistics Jobs
Sales
Ajax Jobs
Designing Jobs
ASP .NET Jobs
Java Jobs
MySQL Jobs
Sap hr Jobs
Software Testing Jobs
Html Jobs
IT Jobs
Logistics Jobs
Customer Service Jobs
Airport Jobs
Banking Jobs
Driver Jobs
Part Time Jobs
Civil Engineering Jobs
Accountant Jobs
Safety Officer Jobs
Nursing Jobs
Civil Engineering Jobs
Hospitality Jobs
Part Time Jobs
Security Jobs
Finance Jobs
Marketing Jobs
Shipping Jobs
Real Estate Jobs
Telecom Jobs

SRE_Architect

2.00 to 4.00 Years   Bangalore   03 Nov, 2021
Job LocationBangalore
EducationNot Mentioned
SalaryNot Disclosed
IndustryManufacturing
Functional AreaGeneral / Other Software
EmploymentTypeFull-time

Job Description

Job DescriptionSRE architect will play the mission-critical role of ensuring that critical systems are healthy, monitored, automated, and designed to scale. This role requires a thoughtful problem solver with excellent organizational skills. The Site Reliability engineering team is responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. This role will be responsible for responding to production problems, investigating their causes, and engineering and advising on permanent solutions.Tasks and ResponsibilitiesDesign and implement secure, highly available, and scalable infrastructureEngage in and improve the whole lifecycle of services from inception and design, through deployment, operation and refinement.Guide reliability practices through the entire software development lifecycle through activities like architecture reviews, code reviews, creating platforms and frameworks, capacity planning.Plug into software release cycle. Work closely with developers to ensure software releases are well designed, planned, implemented, released, and monitoredWork with senior engineering and testing team members to build tools and testing strategies for problem prevention, detection, and chaos testing.Design and create centralized logging and monitoring systems.Design and create robust logging, monitoring, and alerting systems.Troubleshoot production incidents in real time.Lead root cause investigations.Improve service reliability through blameless post-incident reviews and using code to prevent or respond to problem recurrence.Proactively identify system anomalies.Recommend and execute testing strategies.Recognize automation opportunitieso Develop tools to automate routine jobs through knowledge learned on the job.o Automate system and application infrastructure using IaC solutionDesign and implement DevOps tools to enable software development teamsCoach teams on a variety of SRE practices (e.g., applying Error Budget visualizations, conduct Blameless Postmortems)Technical readiness reviews to improve reliability, scalability, performance, and securityShould be able travel to customer location for business needsWillingness to participate in on-call rotation SRE Job DescExpected SkillsExperience Cloud technologies and solutioning. (AWS, GCP Preferred)Experience with IAC tools (Terraform, CloudFormation)Experience with configuration management tools like Ansible.Experience with container technology and orchestration (Kubernetes, Docker).Proficiency with tools like Git, BitbucketExperience in one or more of the following: Java, JS, Duck creek, Python, MicroservicesExperience in System monitoring + Application Performance Monitoring (APM) implementationExperience with Log management and ELK Stack. (Elastic Search, Logstash, Kibana)Understanding of the Application servers, Network and Databases.Excellent understanding of Scalability processes and techniques.Understanding of Jenkins or other build tools.Hands on experience in administering high availability and high-performance environments, as well as managing large-scale deployments of traffic-heavy applications.Someone who can handle multiple complex systems and not shy away from the challenge of improving them.The willingness to try new technologies and make them harmonize with existing systems to achieve better operations overall.Experience in the automated setup of cloud infrastructure - Preferably setups in Azure, CloudFoundry with terraform, pulumi or ansibleExperience in design, implement and manage CICD pipelines using Jenkins, Concourse or Azure DevOpsKnowledge in Scripting PowerShell / Shell / PerlStrong knowledge of the Linux operating system & any DatabaseExperience in source code Management for distributed development & release coordinationBasic knowledge in ITILBasic knowledge of (virtual) network setups and (virtual) networking components,

Keyskills :
software development life cycleroot causebuild toolslog managementcomplex systemscritical systemschange managementsystem monitoringhigh availabilityemergency response

SRE_Architect Related Jobs

© 2020 Skillindia All Rights Reserved