HackerEarth
Bangalore
13582 Followers
Jobs / Site Reliability Engineer

Site Reliability Engineer
Posted on: 9th May, 2017 | Active

Skills Python, SQL, Linux, Apache, AWS
Experience 5-8 years exp
Job Location Bangalore
Openings 1
Eligibility Criteria
  • Profile must be atleast 50% complete.
  • An updated resume should be uploaded
Apply directly through your profile. Apply

HackerEarth provides enterprise software solutions that help organisations in their innovation management and talent assessment endeavours. HackerEarth Recruit is a talent assessment platform that helps in efficient technical talent screening allowing organisations to build strong, proficient teams. HackerEarth Sprint is an innovation management software that helps organisations drive innovation through internal and external talent pools, including HackerEarth’s global community of 1M+ developers.

Today, HackerEarth serves 750+ organisations, including leading Fortune 500 companies from around the world. General Electric, IBM, Amazon, Apple, Wipro, Walmart Labs and Bosch are some of the brands that trust HackerEarth in helping them drive growth.

Roles and Responsibilities:

  • SRE will be responsible for uptime/availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
  • She uses her experience in handling large complex enterprise production infrastructure and has developed or undertaken Production Readiness Review (PRR)
  • Has the ability to provide clear directives to the infrastructure team and helps increase or improve overall serviceability, performance, cost effectiveness and security
  • SRE can identify repeat problem areas and build monitoring and automation tools to mitigate them
  • Develop dashboards, visualizations and regular monitoring for our infrastructure components and core applications
  • Automate system capacity, uptime and other system related reports
  • Gain expert level knowledge of our applications and services
  • Provide operational support to our awesome development teams
  • Participate in a weekly on-call rotation, Conduct regular SRE quarterly service reviews to assess workload
  • An SRE should have strong technical expertise with a good sense for the intersection of technology and business functions/application knowledge.
  • Work with engineering teams to design and write code to create systems which are highly available and able to scale seamlessly.
  • Create methods, policies and processes to ensure seamless deployment
  • Plan for and eliminate any potential threats to product stability, availability or overall security.
  • Improve monitoring, alerting and resilience of systems.
  • Write/develop tools to assist work such as capacity planning or improving the ability to debug production issues over distributed systems.
  • Contribute to a culture of learning and responsibility by writing detailed postmortem RCA reports.
  • Tackle live issues on production when on-call with assistance from the rest of the teams.

Required Skills:

  • 5+ Years Experience in working both Software Development and Systems Engineering
  • Experience crafting, analysing, and troubleshooting distributed systems
  • Expertise in AWS is a must, a certification in AWS is a plus (AWS SA preferred)
  • Should be an expert of Redis, and MySQL. Can write programs, develop applications in the hour of need
  • Should be good at LINUX/UNIX, RDBMS, JMeter, Load Balancers, Certificates, DNS, Proxy, networking concepts, shell & Python Scripting etc.
  • Knowledge on any Config management tool, working understanding of Enterprise and Internet Security is a plus
  • Attention to detail and accuracy and ability to spot long term trends in production web environment
  • Outstanding interpersonal, analytical, and communication skills
  • Most importantly takes ownership and enjoys being an SRE
About HackerEarth

HackerEarth provides enterprise software solutions that help organizations in their talent assessment and innovation management endeavours.

HackerEarth Recruit is a talent assessment platform that helps in efficient technical talent screening thus allowing organizations to build strong, proficient teams. HackerEarth Sprint is an innovation management software application that helps organizations drive innovation through internal and external talent pools, including HackerEarth’s global community of 1M+ developers.

Today, HackerEarth serves 750+ organizations, including leading Fortune 500 companies from around the world. General Electric, IBM, Amazon, Apple, Wipro, Walmart Labs, and Bosch are some of the brands that trust HackerEarth in helping them drive growth.

Notifications
View All Notifications