Site Reliability Engineer (SRE) Job at Cognizant, Phoenix, AZ

cktWTFNQcWR3TlJIdHVicjRMNEZKTW9TTWc9PQ==
  • Cognizant
  • Phoenix, AZ

Job Description

**About the role** As a Site Reliability Engineer (SRE), you will make an impact by designing and implementing advanced observability solutions for edge computing environments. You will be a valued member of our Infrastructure & Operations team, collaborating with engineering and platform teams to ensure high availability, reliability, and performance across distributed systems. **In this role, you will:** + Design and implement observability frameworks for edge environments, including monitoring, logging, tracing, and metrics collection. + Define and maintain SLIs, SLOs, and business KPIs to improve system reliability across edge and centralized infrastructure. + Build and optimize dashboards, visualizations, and alerting systems for real-time insights and rapid incident response. + Implement distributed tracing and log aggregation systems to troubleshoot complex issues in edge computing. + Collaborate with engineering teams to embed observability best practices into applications and infrastructure. + Drive proactive issue detection and resolution, reducing MTTD and MTTR across distributed systems. + Lead incident postmortems and implement observability-driven improvements to prevent recurrence. + Develop automation scripts and tools to enhance observability pipelines, addressing edge-specific challenges like bandwidth and connectivity. **What you need to have to be considered** + 3-5 years of experience in service reliability/operations for large-scale, high-performance applications in hybrid environments (on-prem and cloud). + Strong scripting and automation skills for building dashboards and managing application performance. + Proficiency in programming languages such as Go, Python, Java, or Rust. + Hands-on experience with databases (Oracle, SQL Server, Redis, Clickhouse, Postgres, MongoDB, or time-series DBs). + 2+ years of experience transitioning platforms to cloud and containerization (GCP, AWS, Rancher, or similar). + Experience maintaining containerized applications in GKE/RKE/AKE environments. + Expertise in implementing cloud observability using OpenTelemetry (OTEL) for monitoring and distributed tracing. + Knowledge of networking protocols (TCP/IP, DNS) and troubleshooting in high-pressure scenarios. **These will help you stand out** + Experience managing application availability for 24x7 high-availability platforms. + Familiarity with monitoring tools like Splunk, AppDynamics, Grafana/Prometheus, and Dynatrace. + Hands-on experience with CI/CD tools and Rally, Confluence. + Knowledge of in-memory caching solutions (Redis preferred). + Strong debugging skills across integrated technical platforms and API gateways. + Exposure to GCS, Cloud SQL, Spanner, Firestore, and enterprise-level infrastructure operations. + Experience with HashiCorp Vault, Vertex AI, Gen AI, and BigQuery. **Work model: On-site** This is an onsite position requiring presence at a Cognizant or client location in Arizona City, Arizona and/or Scottsdale, Arizona. We strive to provide flexibility wherever possible and support a healthy work-life balance through our wellbeing programs. The working arrangements for this role are accurate as of the date of posting. This may change based on the project you're engaged in, as well as business and client requirements. Rest assured; we will always be clear about role expectations. Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government issued ID during each interview. **Salary and Other Compensation:** The annual salary for this position is between $60,000 - $93,500 depending on experience and other qualifications of the successful candidate. This position is also eligible for Cognizant's discretionary annual incentive program, based on performance and subject to the terms of Cognizant's applicable plans. **Benefits:** Cognizant offers the following benefits for this position, subject to applicable eligibility requirements: - Medical/Dental/Vision/Life Insurance - Paid holidays plus Paid Time Off - 401(k) plan and contributions - Long-term/Short-term Disability - Paid Parental Leave - Employee Stock Purchase Plan **Disclaimer:** The salary, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law. Cognizant is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law.

Job Tags

Temporary work,

Similar Jobs

Promise

Cloud Site Reliability Engineer (SRE) Job at Promise

 ...recognized as one of Fast Company's "World's Most Innovative Companies of 2022." Role Overview Were looking for a Cloud Site Reliability Engineer (SRE) to build, operate, and optimize the infrastructure that powers our products. Youll be responsible for ensuring high... 

PwC

Acceleration Center - Healthcare Business Analyst - Experienced Associate Save for Later Remove job Job at PwC

 ...essential advantages by working alongside business leaders to solve their toughest problems...  ...you need to lead and deliver value at this level include but are not limited to:...  ...PwC does not intend to hire experienced or entry level job seekers who will need, now or in... 

Next Level Delivery Solutions LLC

Amazon Delivery Driver $21.75 - $23 /hour Job at Next Level Delivery Solutions LLC

 ...Next Level Delivery Solutions LLC is an Amazon DSP known for its exceptional delivery performance operating out of Amazon Warehouse...  ...Types: Full-time, Part-time Benefits: ~401(k)~ Company truck ~ Dental insurance ~ Flexible schedule ~ Health insurance... 

The TEA Center

3 Days on Weekdays/ Weekend Administrative Assistant for a Childcare Center Job at The TEA Center

 ...be part of a growing business while making a difference in the lives of children. Prior experience in a childcare setting a plus on weekdays but a MUST for weekend admin. The program values great leadership and opportunity for growthDay to day operations Operations.... 

PwC

Digital Assurance & Transparency - IT Audit Senior Associate Products & Services Save for Later Remove job Job at PwC

 ...At PwC, our people in audit and assurance focus on providing independent and objective assessments of financial statements, internal controls...  ..., cyber security measures, data and AI systems, and their associated governance, to help organisations and their stakeholders build...