Job Details

Software Engineer - Monitoring Infrastructure

The Judge Group Inc.
Sunnyvale, California, United States
Location: Sunnyvale, CADescription: Our client is currently seeking a Software Engineer - Monitoring Infrastructure This job will have the following responsibilities: The SRE Monitoring Infrastructure is looking for a backend engineer with experience working with large-scale systems and an operational mindset to help scale our operational metrics platform. This is a fantastic opportunity to enable all engineers to monitor and keep our site up and running. In return, you will get to work with a world class team supporting a platform that serves Billions of metrics at Millions of QPS Site Reliability Engineers (SRE) fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying and resolving production issues. The ideal candidate will be passionate about an operations role that involves deep knowledge of both the application and the product, and will also believe that automation is a key component to operating large-scale systems. Responsibilities: Serve as a primary point responsible for the overall health, performance, and capacity of one or more of our Internet-facing services Gain deep knowledge of our complex applications. Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth. Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment. Work closely with development teams to ensure that platforms are designed with "operability" in mind. Function well in a fast-paced, rapidly-changing environment. Participate in a 24x7 rotation for second-tier escalations. Basic Qualifications: B.S. or higher in Computer Science or other technical discipline, or related practical experience. UNIX/Linux systems administration background. Programming skills (Golang, Python) Preferred Qualifications: 5+ years in a UNIX-based large-scale web operations role. Golang and/or Python experience Previous experience working with geographically-distributed coworkers. Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other SREs, Engineers, Product Managers, etc. Knowledge of most of these: data structures, relational and non-relational databases, networking, Linux internals, filesystems, web architecture, and related topics- basic knowledgeFor immediate consideration, please send an updated resume to @: job and many more are available through The Judge Group. Find us on the web at

Send application

Mail this job to me so I can apply later

Apply With CV

You are not logged in. If you have an account, log in to your account. If you do not have an account, why not sign up? It only takes a minute!

latest videos

Upcoming Events