Comcast Senior Engineer, Engineering Operations in Philadelphia, Pennsylvania

Comcast brings together the best in media and technology. We drive innovation to create the world's best entertainment and online experiences. As a Fortune 50 leader, we set the pace in a variety of innovative and fascinating businesses and create career opportunities across a wide range of locations and disciplines. We are at the forefront of change and move at an amazing pace, thanks to our remarkable people, who bring cutting-edge products and services to life for millions of customers every day. If you share in our passion for teamwork, our vision to revolutionize industries and our goal to lead the future in media and technology, we want you to fast-forward your career at Comcast.

Site Reliability Engineer, Comcast Compass Operations

Job Overview:

The CoMPASS (Comcast Metadata Products and Search Services) team is supporting an ever expanding platform for delivering entertainment metadata to Comcast's array of new television, web and mobile products. We are looking for talented engineers who want to work on a sophisticated system to join our team in downtown Philadelphia.

Job Responsibilities:

- Maintain a virtual machine deployment library using an "infrastructure as code" methodology

- Support and evolve our application deployment software projects

- Develop new and use existing monitoring tools to reason about production application behavior

- Lead the debugging of production issues, which affect performance and reliability

- Participate in the building of tools and processes to support the production application ecosystem

- Provide technical guidance and leadership to fellow team members

Preferred Qualifications:

- Software development experience with an interest in using that experience to solve operations problems

- Experience with at least one of these languages: Python, Ruby, Go, Java

- Exposure to domain specific languages built on top of the above languages

- Linux systems experience (e.g. RedHat/CentOS)

- AWS experience

- Understanding of protocols and data formats like HTTP, JDBC, mongo wire protocol, protobuf, JSON, Yaml

- Understanding of technologies like Mongodb, Ansible, Kafka, Slack, Jenkins, Docker, Redis

- Application clustering, load balancing, high availability, and reliability concepts and supporting technologies

- Clear written and verbal communication skills

- Some level of participation in an on-call escalation path

- A passion for providing excellent service to all internal and external customers


- Experience with Cloud technologies like Terraform, Kubernetes

- Writes documentation out of habit

- Experience with monitoring systems (e.g. Prometheus, Alertmanager, Grafana, Splunk, OpenTracing)

- Hands on experience with AB testing strategies and traffic shaping technologies ( Istio, Envoy )

- Capable of diagnosing complex performance and availability problems in a production environment

Job Specification:

- Masters Degree or Equivalent

- Engineering, Computer Science

- Generally requires 11 years related experience

Comcast is an EOE/Veterans/Disabled/LGBT employer