Job Details

DevOps Engineer, AI Toolkits

NVIDIA Corporation
Santa Clara, California, United States

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology-and amazing people.Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. NVIDIA is hiring an excellent DevOps Engineer to work on our NeMo, NeMo Service and Riva teams. Our teams create building blocks to make generative AI easy to develop, integrate, and deploy. Your role is multifaceted: streamlining development, build, and releases with modern DevOps tools as well as maintaining cloud deployment infrastructure for our hosted services.

What you'll be doing:

  • Automating and optimizing build, test, integrate, and release processes for various models supported in NeMo and NeMo Service

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. GitHub, Gitlab, Docker, Jenkins, etc.)

  • Designing continuous integration, verification and deployment strategy for NeMo toolkit, service and corresponding pre-trained models

  • Lead best-practices for building, testing, and releasing software

  • Identifying and plan for infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 3+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Adept programming skills in Python and PyTorch or Tensorflow

  • Fluency in Git and GitHub/GitLab

  • Pragmatic approach to solving problems and collaboration

  • Real passion for multiplying productivity of others

Ways to stand out from the crowd:

  • Contribution to open-source software

  • Deep learning expertise

  • Deep knowledge of container and cluster technologies like Docker, slurm, kubernetes, and zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

The base salary range is $141,000 - $268,000. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits .

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Send application

Mail this job to me so I can apply later

Apply With CV

You are not logged in. If you have an account, log in to your account. If you do not have an account, why not sign up? It only takes a minute!

latest videos

Upcoming Events