776 IT & Software Developer jobs in the US

Ellofant jobs

Site Reliability Engineer (SRE) - Pittsburgh

$53,000 - 81,000
Ellofant
Childrens Way, Pittsburgh
$53,000 - 81,000
Company Size icon
Company Size
<50
Company Type icon
Company Type
Services
Exp Level icon
Exp Level
Junior
Job Type icon
Job Type
Full-Time
Language icon
Language
English
Visa sponsorship icon
Visa sponsorship
No

Requirements

Must:
- 3-5 years of relevant experience in site reliability, infrastructure, or DevOps engineering - Strong expertise in monitoring and observability tools such as Dynatrace, Grafana, Prometheus, or Splunk - Experience with incident management and event correlation platforms like BigPanda or ServiceNow - Proficient in Linux/Unix systems (RHEL) and Windows Server environments - Hands-on experience with cloud platforms including AWS, Azure, or OpenShift - Strong knowledge of containerization technologies and orchestration tools like Kubernetes and Docker - Familiarity with chaos engineering frameworks (Litmus, Gremlin, AWS FIS) - Solid understanding of networking fundamentals and distributed architectures - Experience with event streaming platforms such as Kafka and service mesh technologies like Istio - Familiarity with mainframe systems and legacy infrastructures - Experience with automation tools and infrastructure as code - Knowledge of job scheduling systems and middleware tools - Proficiency with Jira, Confluence, and ITSM software - Preferred experience in financial services or highly regulated environments - Relevant certifications such as AWS/Azure architecture, RHCE, or Kubernetes (CKA/CKAD) - Strong analytical skills and ability to troubleshoot complex systems - Excellent communication skills for cross-functional collaboration

Technologies

Lambda
Confluence
Datadog
Dynatrace
Istio
ITSM

Responsibilities

- Coordinate responses to critical incidents with application support teams and the Site Reliability Center - Triage and respond to alerts from the BigPanda event correlation platform - Evaluate cross-domain impacts and engage relevant support teams or escalate as needed - Participate in on-call rotations to ensure 24/7 coverage for critical systems - Conduct blameless post-mortems and root cause analyses for continuous improvement - Design and implement automated monitoring and alerting systems - Create robust dashboards and establish SLAs/SLOs through comprehensive monitoring - Analyze metrics to assist in performance optimization and fault detection - Develop and execute chaos engineering practices using various tools - Conduct fault injection experiments to validate system resilience - Build self-healing capabilities and automated remediation workflows - Implement health checks and autoscaling solutions using AWS Lambda, Kubernetes, and OpenShift - Manage infrastructure across mainframe systems, Windows, RHEL, and cloud platforms - Work with containerized environments and database systems - Maintain virtualization infrastructure and storage systems - Leverage incident management, issue tracking, and job scheduling tools - Identify areas for improving application stability and promote SRE best practices - Maintain documentation in knowledge bases and runbooks - Mentor junior team members on resiliency patterns and operational excellence

Description


At Ellofant, we are a forward-thinking consulting firm dedicated to delivering impactful solutions that drive significant outcomes for our clients. Our workplace promotes clarity over complexity and values people over rigid processes. We are on the lookout for motivated individuals who are eager to tackle real challenges. As part of our infrastructure resiliency team, you will play a vital role ensuring the performance and reliability of critical systems across various technologies. Located in Pittsburgh, a city celebrated for its innovation and vibrant tech scene, you will enjoy the benefits of competitive compensation, comprehensive medical and dental plans, retirement savings options, and the opportunity for professional development. We embrace diversity and inclusivity, ensuring a welcoming environment for all backgrounds.
Something wrong or incorrect with this job? Tell us in the chat 💬 on the right ➡️
You can find DevOps salaries in the United States here.

How many DevOps jobs are in the United States?

Currently, there are 776 DevOps openings. Check also: Cloud jobs, AWS jobs, Azure jobs, GCP jobs, Kubernetes jobs, Docker jobs, Terraform jobs - all with salary brackets.

Is the US a good place for DevOps?

The US is one of the best countries to work as a DevOps. It has a vibrant startup community, growing tech hubs and, most important: lots of interesting jobs for people who work in tech.

Which companies are hiring for DevOps jobs in the United States?

D3 Security Management Systems, Nurse Next Door, Snaplii, LYNKED Inc., Clarence Farm Services Ltd., DataAnnotation, Studio 3 Marketing among others, are currently hiring for DevOps roles in the United States.

The company with most openings is Peraton as they are hiring for 43 different DevOps jobs in the United States. They are probably quite committed to find good DevOps.