Determined AI - Backend Engineer, SaaS

Hewlett Packard Enterprise Development LP
Washington, DC
May 21, 2022
May 24, 2022
Engineer, IT, QA Engineer
Full Time
Deep learning has enormous promise, but developing deep learning models at scale remains extremely complex, time-consuming, and expensive. At Determined AI, part of Hewlett Packard Enterprise, we are working to change that: our revolutionary open source deep learning training platform enables deep learning engineers to train better models in less time, to seamlessly share GPU clusters, and to collaborate more effectively. As a Back End Software Engineer, you will play a fundamental role building our core deep learning platform. You'll get the chance to tackle challenging problems at the cutting edge of deep learning research and development, and to collaborate with leading machine learning researchers and engineers. You will have the opportunity (and responsibility!) to define major aspects of our product: you'll be expected to take on a difficult problem without a clear solution, and to design, build, and iterate until we've reached an elegant solution that delights our customers. You will work on problems such as efficient cluster scheduling over heterogeneous GPUs, implementing cutting-edge algorithms for hyperparameter optimization, and designing systems for managing ETL pipelines and automated deployment of deep models. This work can done anywhere in the United States. Requirements Strong problem solving and analytical skills Excellent communication skills, both written and verbal An exceptional track record of designing, implementing and shipping scalable, reliable production-quality software Experience with distributed and/or concurrent software development Theoretical knowledge of statistics or machine learning is not required Preferred Experience building systems for large-scale data management, analytics, cluster scheduling, stream processing, or machine learning Familiarity with modern container-based cluster managers (eg, Kubernetes, DC/OS) Experience doing operations and being on-call for production systems Interest or experience in machine learning and/or deep learning Familiarity with hardware performance, HPC and/or scientific computing Our Engineering Process We use two-week engineering sprints to strike the right balance between execution and evaluating priorities. We structure our collaboration around projects: The project lead is an individual contributor directly responsible for the execution of the project, including planning the technical solution, coordinating work with other engineers, and communicating progress to stakeholders. Engineering managers ensure project leads at all levels of their career have the support and guidance they need to fulfill these responsibilities effectively. All code is reviewed by members of the team, following a specific set of guidelines to reduce ambiguity and drive to shipping efficiently. You can check out how we review code on our public repository . Technical Stack - Go - Python - React - TypeScript - TensorFlow - PyTorch - Keras - Docker - Kubernetes - PostgreSQL HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT and Affirmative Action employer. We are committed to diversity and building a team that represents a variety of backgrounds, perspectives, and skills. We do not discriminate and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global diverse team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. 1120706