Senior Machine Learning Kernel Performance Engineer Job in Cupertino

Company Size

200-500

Company Type

Product

Exp Level

Senior

Job Type

Full-Time

Language

English

Visa sponsorship

Requirements

Must:

- A minimum of 5 years of hands-on professional software development experience.
- Over 5 years of experience in programming with at least one software programming language.
- More than 5 years of experience in leading the design or architecture of new and existing systems, focusing on design patterns, reliability, and scalability.
- At least 5 years of full software development life cycle experience, which encompasses coding standards, code reviews, source control management, build processes, testing, and operations.
- Proven experience as a mentor, tech lead, or with leading an engineering team.

Technologies

Backbone

CUDA

Flow

GitHub

Hardware

Support

LLVM

Machine Learning

PyTorch

TensorFlow

Responsibilities

- Design and implement high-performance compute kernels for machine learning operations, using the Neuron architecture and programming models.
- Analyze and optimize kernel-level performance across multiple generations of Neuron hardware.
- Conduct in-depth performance analysis with profiling tools to identify and address bottlenecks.
- Apply compiler optimizations such as fusion, sharding, tiling, and scheduling.
- Work directly with customers to enable and optimize their machine learning models on AWS accelerators.
- Collaborate with teams to develop innovative techniques for kernel optimization.

Description

At Amazon Web Services (AWS), our Annapurna Labs team is engaged in developing AWS Neuron, a software development kit aimed at enhancing deep learning and generative AI workloads on Amazon’s specialized machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is central to maximizing performance for our unique ML accelerators. Our engineers, operating at the hardware-software interface, create high-performance kernels for machine learning functions, ensuring optimal performance for our customers’ complex workloads. We merge in-depth hardware insights with ML know-how to redefine the potential of AI acceleration.

The AWS Neuron SDK, created by our Annapurna Labs team, provides the foundation for expediting deep learning and GenAI workloads on Inferentia and Trainium. This extensive toolkit encompasses an ML compiler, runtime, and application framework that integrates smoothly with well-known ML frameworks like PyTorch to deliver exceptional ML inference and training performance.

As part of the larger Neuron Compiler organization, our team collaborates across various technology layers, from frameworks and compilers to runtime and collectives. We not only enhance current performance but also contribute to future architectural designs by closely collaborating with customers to implement their models and ensure peak performance. This position offers a distinct opportunity to work at the convergence of machine learning, high-performance computing, and distributed architectures, allowing you to influence the future of AI acceleration technology.

We value diversity in experiences at AWS and encourage all candidates to apply, even if they do not meet every single qualification. Our culture embodies inclusion, supported by numerous employee-led affinity groups and continuous learning opportunities. We prioritize work-life balance, offering flexible working hours, and are dedicated to mentorship and career growth within our team. The role promises to be a rewarding journey in terms of learning and professional development as we build innovative solutions together. We look forward to considering your application as we seek to enhance our exceptional culture and contributions to the cloud computing landscape!

How many Machine Learning Engineer jobs are in the United States?

Currently, there are 685 ML and AI openings. See also TensorFlow jobs, Python jobs, and Computer-Vision jobs, all with salary brackets.

Is the US a good place for Machine Learning Engineers?

The US is one of the best countries to work in as a Machine Learning Engineer. It has a vibrant startup community, growing tech hubs and, most importantly, lots of interesting jobs for people who work in tech.

Which companies are hiring for Machine Learning Engineer jobs in the United States?

Jump, Goway Travel Limited, Energy Transfer Partners, HelloGov, Sunoco, Energy Transfer, and National Futures Association, among others, are currently hiring for ML and AI roles in the United States.

The company with the most openings is Capital One, which is hiring for 47 different Machine Learning Engineer jobs in the United States. They are probably quite committed to finding good Machine Learning Engineers.