AI Solutions Engineer Job in New York

784 IT & Software Developer jobs in the US

Company Size

5k+

Company Type

Services

Exp Level

Senior

Job Type

Full-Time

Language

English

Visa sponsorship

Requirements

Must:

We are seeking candidates with the following qualifications: - Over 5 years of software development experience in one or more programming languages (Python, C/C++, Go, Java); experience in building and managing large-scale Python applications is highly preferred. - A minimum of 3 years of experience in designing, architecting, testing, and deploying production machine learning systems, including model deployment, evaluation, monitoring, data processing pipelines, and model fine-tuning workflows. - Practical knowledge of Large Language Models (LLMs), including API integration, prompt engineering, fine-tuning, and developing applications that utilize RAG and tool-using agents. - Familiarity with various LLMs, both commercial and open-source, and their functionalities (e.g., OpenAI, Gemini, Llama, Qwen, Claude). - A solid understanding of applied statistics, machine learning concepts, algorithms, and data structures to create efficient and reliable solutions. - Strong analytical problem-solving skills, a sense of ownership, and a sense of urgency; the ability to communicate complex ideas in a straightforward manner and collaborate seamlessly across global teams with a focus on measurable business outcomes. - Preferred: Experience in building and operating on cloud infrastructure (preferably AWS), including containerized services (ECS/EKS), serverless computing (Lambda), data services (S3, DynamoDB, Redshift), orchestration (Step Functions), model serving (SageMaker), and infrastructure as code (Terraform/CloudFormation).

Technologies

Responsibilities

In this role, you will: - Design and develop agentic AI systems by creating tool-calling agents that incorporate retrieval, structured reasoning, and secure action execution while adhering to MCP protocol. Establish strong guardrails for safety, compliance, and minimal access permissions. - Productionize large language models by creating evaluation frameworks for open-source and foundational LLMs; develop retrieval pipelines, prompt synthesis, response validation, and self-correction loops uniquely suited for production settings. - Integrate agents within runtime ecosystems by connecting them to observability, incident management, and deployment systems to facilitate automated diagnostics, runbook execution, remediation, and incident summarization with comprehensive traceability. - Collaborate directly with production engineers and application teams to transform production challenges into agentic AI roadmaps, defining objective functions connected to reliability, risk mitigation, and cost efficiency while ensuring auditable and business-aligned results. - Implement safety, reliability, and governance measures by developing validator models, adversarial prompts, and policy checks; enforce deterministic fallbacks, circuit breakers, and rollback strategies; and carry out ongoing evaluations for usefulness, accuracy, and risk management. - Enhance scalability and performance by optimizing costs and latency through prompt engineering, context management, caching, model routing, and model distillation while utilizing batching, streaming, and parallel tool calls to meet stringent service level objectives under real-world conditions. - Create a Retrieval-Augmented Generation (RAG) pipeline by curating domain knowledge, building a data quality validation framework, and establishing feedback and milestone frameworks to maintain knowledge relevance. - Set high standards by driving design reviews, enforcing rigorous experimentation, and promoting high-quality engineering practices; mentor peers on agent architectures, evaluation methodologies, and safe deployment protocols.

Description

At Apexon, we take pride in being one of the fastest-growing companies in the digital engineering services sector. We are committed to nurturing your career aspirations and offering opportunities to enhance your skillset while engaging in innovative technologies and a variety of projects for our clients. Our global community spans 19 locations worldwide, with several bases in the U.S. including Silicon Valley, New York, Chicago, Princeton, Southfield, Bellevue, Dublin, Dallas, and Windsor, Ontario. We have also expanded our presence into Mexico! Join us on this thrilling journey and elevate your career to new levels. We are currently assembling a team of AI Engineers, and we invite you to be part of it.

Something wrong or incorrect with this job? Tell us in the chat 💬 on the right ➡️

IT & Software developer jobs in the USMachine-Learning Developer jobs in the USMachine-Learning Developer jobs New York City, NY

You can find Machine Learning Engineer salaries in the United States here.

How many Machine Learning Engineer jobs are in the United States?

Currently, there are 784 ML, AI openings. Check also: TensorFlow jobs, Python jobs, Computer-Vision jobs - all with salary brackets.

Is the US a good place for Machine Learning Engineers?

The US is one of the best countries to work as a Machine Learning Engineer. It has a vibrant startup community, growing tech hubs and, most important: lots of interesting jobs for people who work in tech.

Which companies are hiring for Machine Learning Engineer jobs in the United States?

bunny.net, Asset Inventories Inc., AI (Artificial Intelligence) Incorporated, Allied Technical Services Inc, Diploma Healthcare Group, Confidential, Consumer Cellular among others, are currently hiring for ML, AI roles in the United States.

The company with most openings is Leidos as they are hiring for 138 different Machine Learning Engineer jobs in the United States. They are probably quite committed to find good Machine Learning Engineers.