Site Reliability Engineer - System/Application Design

Staffio HR
  • Bangalore
  • 10-15 lakh
  • 7-10 years
  • 19 Oct 2016

  • IT/ Information Technology

  • IT/ Technology - Software/ Services
Job Description

Minimum 7 years of experience managing services in an internet-scale *nix environment

Responsibilities:

- Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes

- Troubleshoot issues across the entire stack: hardware, software, application and network

- Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization

- Mentor SREs across the organization on best practices for everything from monitoring to troubleshooting complex code issues

- Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services

- Participate in code reviews for projects primarily written in Java and Scala, built on open source libraries such as Finagle, and running on both physical and virtualized platforms

- Represent the SRE organization in design reviews and operational readiness exercises for new and existing services

Requirements:

- Solid understanding of systems and application design, including the operational trade-offs of various designs

- Strong practical expertise building and supporting event-driven frontend and/or backend systems on the JVM (Java and/or Scala)

- Practical knowledge of various aspects of service design, including messaging protocols & behavior, caching strategies and software design practices

- Demonstrable knowledge of TCP/IP, HTTP, web application security, and experience supporting multi-tier web application architectures

- Must work well with and be able to influence myriad personalities at all levels

- Practical, solid knowledge of shell scripting and at least one scripting language (Python preferred)

- Minimum 7 years of experience managing services in an internet-scale *nix environment

- Ability to prioritize tasks and work independently

- Must be adaptable and able to focus on the simplest, most efficient & reliable solutions

- Track record of successful practical problem solving, excellent written and interpersonal communication, and documentation skills


Competencies/Skill sets for this job

Java, IP, Scala, Python

Job Posted By

HS Sandesh
Talent Evangelist

About Organisation

Staffio HR