Comcast Engineer 3, Product Development Engineering in Philadelphia, Pennsylvania


Engineer 3: DevOps Storage Engineer – Private Cloud Storage Engineering

Comcast shapes the future at the intersection of media and technology. Comcast provides one-of-a-kind entertainment platforms, a High Speed Data platform, and Internet of Things services, including Home Security services. We create world-class experiences that people love and trust, and we drive innovation that builds value. We bring millions of people TV and entertainment, sports and news, communications and home management, theme parks, home security, Voice, and high-speed Internet access. Comcast brings to life the best of what's to come. Comcast brings Entertainment, Internet, Voice, and other services to customers via the largest network footprint in the country. Every day, 20M On-Demand videos and 8,200 Video and Audio Channels are streamed across the country, 167M phone calls are made, and 153M messages are sent on our network. Comcast supports more than 800K route miles of network pipeline.

* JOB DESCRIPTION *
The Private Cloud Engineering team is responsible for operating and managing large-scale storage environments that provide the underlying infrastructure services to various engineering teams within Comcast. We are currently seeking a motivated, career- and customer-oriented DevOps Storage Engineer to explore an exciting and challenging career. SDS (Software Defined Storage) is a cutting-edge cloud team aiming to build intelligent SDS solutions that create self-healing, self-managing distributed storage that can easily scale from terabytes to petabytes. This unique virtualized storage offering is a more cost-effective, performance-optimized solution for our critical internal customers. As part of that team, you will build creative engineering solutions to operational problems, deliver reliability and uptime appropriate to users' needs, and sustain a fast rate of improvement while keeping an ever-watchful eye on capacity and performance so that production systems run better. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. Practices such as limiting time spent on operational work, blameless postmortems, and proactive identification of potential outages factor into iterative improvement, which is key to both product quality and interesting, dynamic day-to-day work.
Responsibilities

- Be an exceptional team player and problem-solver who can work independently, demonstrate strong initiative, and organize daily tasks with minimal supervision

- Scale systems sustainably through mechanisms like automation, and evolve systems by contributing code that improves reliability and performance

- Work on array lifecycle management to migrate data to newer platforms

- Monitor availability, latency, and overall system health, and participate in on-call incident management

- Collaborate with team members to accomplish sprint goals by actively participating in the sprint cycle, in code reviews, and in helping to build a learning organization

_ Qualifications _

- BS / MS degree in Computer Science, Computer Engineering, or a related field

- Experience with algorithms, data structures, complexity analysis, and software design

- 2 years of development and Linux experience, with the majority in an agile environment (familiarity with one or more of the following: C, C++, Java, Python, Rust)

- 2 years of experience with private cloud storage environments (NFS and CIFS storage: GlusterFS/NetApp/Dell EMC Unity, OpenStack Ceph Software Defined Storage)

- Experience with source control repositories (e.g. Git) and CI/CD toolsets

- Flexibility, with the ability to handle escalations and to drive or lead triage calls and bridges

- Good understanding of Incident, Change, and Problem Management

- Good communication skills and the ability to clearly articulate complex issues and technologies

- Great design and problem-solving skills

- Willingness to take ownership of problems and see them through to resolution

- Ability to work comfortably in a fast-paced agile environment. Requirements change quickly, and our team needs to constantly adapt to moving targets

Nice to have

- Familiarity with cloud-native and microservice architectures and an understanding of design principles for scalability, performance, and reliability

- Understanding of distributed systems, asynchronous messaging, and networking protocols. Experience with open source applications, frameworks, and libraries

- Experience working with deployment and orchestration technologies (such as Docker, Kubernetes, Mesos, OpenStack, Puppet, Chef, Salt, Ansible, Jenkins)

- Understanding of open source server software (such as NGINX, RabbitMQ, Redis, Elasticsearch)

- Familiarity with standard IT security practices such as encryption, certificates, and key management


Comcast is an EOE/Veterans/Disabled/LGBT employer