848 IT & Software Developer jobs in the US

Annapurna Labs (U.S.) Inc. jobs

Senior Machine Learning Kernel Performance Engineer

$151,300 - 261,500
Annapurna Labs (U.S.) Inc.
Torre Avenue 10201, Cupertino
$151,300 - 261,500
Company Size icon
Company Size
200-500
Company Type icon
Company Type
Product
Exp Level icon
Exp Level
Senior
Job Type icon
Job Type
Full-Time
Language icon
Language
English
Visa sponsorship icon
Visa sponsorship
No

Requirements

Must:
- A minimum of 5 years of hands-on professional software development experience. - Over 5 years of experience in programming with at least one software programming language. - More than 5 years of experience in leading the design or architecture of new and existing systems, focusing on design patterns, reliability, and scalability. - At least 5 years of full software development life cycle experience, which encompasses coding standards, code reviews, source control management, build processes, testing, and operations. - Proven experience as a mentor, tech lead, or with leading an engineering team.

Technologies

AI
Backbone
CUDA
Flow
GitHub
LLVM
Machine Learning
PyTorch

Responsibilities

- I will design and implement high-performance compute kernels for machine learning operations, using the Neuron architecture and programming models. - I will analyze and optimize kernel-level performance across multiple generations of Neuron hardware. - I will conduct in-depth performance analysis utilizing profiling tools to identify and address bottlenecks. - I will apply compiler optimizations such as fusion, sharding, tiling, and scheduling. - I will work directly with customers to enable and optimize their machine learning models on AWS accelerators. - I will collaborate with teams to develop innovative techniques for kernel optimization.

Description


At Amazon Web Services (AWS), our Annapurna Labs team is engaged in developing AWS Neuron, a software development kit aimed at enhancing deep learning and generative AI workloads on Amazon’s specialized machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is central to maximizing performance for our unique ML accelerators. Our engineers, operating at the hardware-software interface, create high-performance kernels for machine learning functions, ensuring optimal performance for our customers’ complex workloads. We merge in-depth hardware insights with ML know-how to redefine the potential of AI acceleration. The AWS Neuron SDK, created by our Annapurna Labs team, provides the foundation for expediting deep learning and GenAI workloads on Inferentia and Trainium. This extensive toolkit encompasses an ML compiler, runtime, and application framework that integrates smoothly with well-known ML frameworks like PyTorch to deliver exceptional ML inference and training performance. As part of the larger Neuron Compiler organization, our team collaborates across various technology layers—from frameworks and compilers to runtime and collectives. We not only enhance current performance but also contribute to future architectural designs by closely collaborating with customers to implement their models and ensure peak performance. This position offers a distinct opportunity to work at the convergence of machine learning, high-performance computing, and distributed architectures, allowing you to influence the future of AI acceleration technology. We value diversity in experiences at AWS and encourage all candidates to apply, even if they do not meet every single qualification. Our culture embodies inclusion, supported by numerous employee-led affinity groups and continuous learning opportunities. We prioritize work-life balance, offering flexible working hours, and are dedicated to mentorship and career growth within our team. The role promises to be a rewarding journey in terms of learning and professional development as we build innovative solutions together. We look forward to considering your application as we seek to enhance our exceptional culture and contributions to the cloud computing landscape!
Something wrong or incorrect with this job? Tell us in the chat 💬 on the right ➡️
You can find Machine Learning Engineer salaries in the United States here.

How many Machine Learning Engineer jobs are in the United States?

Currently, there are 848 ML, AI openings. Check also: TensorFlow jobs, Python jobs, Computer-Vision jobs - all with salary brackets.

Is the US a good place for Machine Learning Engineers?

The US is one of the best countries to work as a Machine Learning Engineer. It has a vibrant startup community, growing tech hubs and, most important: lots of interesting jobs for people who work in tech.

Which companies are hiring for Machine Learning Engineer jobs in the United States?

Sperasoft, New World Technologies Inc., Giesecke+Devrient, Archon Systems Inc, Puter Technologies Inc., HAPLY Robotics Inc., LGS, une Société IBM / an IBM Company among others, are currently hiring for ML, AI roles in the United States.

The company with most openings is Leidos as they are hiring for 89 different Machine Learning Engineer jobs in the United States. They are probably quite committed to find good Machine Learning Engineers.