HackerEarth
Bangalore

Site Reliability Engineer
Posted on: 9th May, 2017 | Active

Skills: Python, SQL, Linux, Apache, AWS
Experience: 5-8 years
Job Location: Bangalore
Openings: 1
Eligibility Criteria
  • Profile must be at least 50% complete.
  • An updated resume should be uploaded.
Apply directly through your profile.

The HackerEarth platform is the home for the largest and fastest-growing developer communities in the world. Developers from various technology domains solve problems on HackerEarth to improve their programming skills and compete with their peers. To businesses, HackerEarth provides the most effective mechanism to engage with the developer ecosystem and effectively evaluate technical skills.

Roles and Responsibilities:

  • The SRE will be responsible for uptime/availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
  • Draws on experience handling large, complex enterprise production infrastructure, and has developed or undertaken Production Readiness Reviews (PRRs).
  • Provides clear directives to the infrastructure team and helps improve overall serviceability, performance, cost effectiveness and security.
  • Identifies recurring problem areas and builds monitoring and automation tools to mitigate them.
  • Develop dashboards, visualizations and regular monitoring for our infrastructure components and core applications
  • Automate system capacity, uptime and other system related reports
  • Gain expert level knowledge of our applications and services
  • Provide operational support to our awesome development teams
  • Participate in a weekly on-call rotation and conduct quarterly SRE service reviews to assess workload
  • An SRE should have strong technical expertise with a good sense for the intersection of technology and business functions/application knowledge.
  • Work with engineering teams to design and write code to create systems which are highly available and able to scale seamlessly.
  • Create methods, policies and processes to ensure seamless deployment
  • Plan for and eliminate any potential threats to product stability, availability or overall security.
  • Improve monitoring, alerting and resilience of systems.
  • Write/develop tools to assist work such as capacity planning or improving the ability to debug production issues over distributed systems.
  • Contribute to a culture of learning and responsibility by writing detailed postmortem RCA reports.
  • Tackle live issues on production when on-call with assistance from the rest of the teams.

Required Skills:

  • 5+ years of experience in both software development and systems engineering
  • Experience crafting, analysing, and troubleshooting distributed systems
  • Expertise in AWS is a must; an AWS certification is a plus (AWS SA preferred)
  • Should be an expert in Redis and MySQL, and able to write programs and develop applications in the hour of need
  • Should be good at Linux/Unix, RDBMS, JMeter, load balancers, certificates, DNS, proxies, networking concepts, shell and Python scripting, etc.
  • Knowledge of any configuration management tool and a working understanding of enterprise and internet security are a plus
  • Attention to detail and accuracy, and the ability to spot long-term trends in a production web environment
  • Outstanding interpersonal, analytical, and communication skills
  • Most importantly, takes ownership and enjoys being an SRE

About HackerEarth

HackerEarth is the hub for programmers to practice and improve their programming skills, compete in coding challenges/hackathons and showcase their profile. Businesses use HackerEarth to build developer relations and create a talent pipeline.

HackerEarth has built a proprietary code evaluation engine that allows programmers to write code in the browser and automatically evaluates it in real time. HackerEarth regularly conducts a variety of hackathons ranging from basic domains like algorithms to advanced concepts like AI and machine learning. Based on the activity of the programmers, HackerEarth is able to create an accurate skill graph for them, which is used to recommend jobs and other opportunities to programmers.
